It used to be easy enough to distinguish between human-made and AI-generated imagery — just two years ago, you couldn’t use image models to create a menu for a Mexican restaurant without inventing new culinary delights like “enchuita,” “churiros,” “burrto,” and “margartas.”
Now, when I ask the brand new ChatGPT Images 2.0 model for a menu of Mexican food, it creates something that could immediately be used in a restaurant without customers noticing that something’s off. (However, ceviche priced at $13.50 might make me question the quality of the fish.)
Image Credits:ChatGPT Images 2.0 For comparison, here’s the result I got from DALL-E 3 two years ago (at the time, ChatGPT did not generate images):
Image Credits:Microsoft Designer (DALL-E 3) AI image generators have historically struggled to spell because they generally used diffusion models, which work by reconstructing images from noise.