Results of the "Can you tell which images are AI generated?" survey

popcar2@programming.dev · edit-2 2 years ago

Results of the "Can you tell which images are AI generated?" survey

MooseBoys@lemmy.world · 2 years ago

I still don’t believe the avocado comic is one-shot AI-generated. Composited from multiple outputs, sure. But I have not once seen generative AI produce an image that includes properly rendered text like this.

deranger@sh.itjust.works · edit-2 2 years ago

Bing image creator uses the new DALL-E model which does hands and text pretty good.

generated this first try with the prompt a cartoon avocado holding a sign that says ‘help me’

dotMonkey@lemmy.world · 2 years ago

People forget just how fast this tech is evolving

S_H_K@lemmy.fmhy.net · 2 years ago

Absolutely SDXL with loras already can do a lot of what it was thought impossible.

seralth@lemmy.world · 2 years ago

deleted by creator

isildun@sh.itjust.works · 2 years ago

Image generation tech has gone crazy over the past year and a half or so. At the speed it’s improving I wouldn’t rule out the possibility.

Here’s a paper from this year discussing text generation within images (it’s very possible these methods aren’t SOTA anymore – that’s how fast this field is moving): https://openaccess.thecvf.com/content/WACV2023/html/Rodriguez_OCR-VQGAN_Taming_Text-Within-Image_Generation_WACV_2023_paper.html

b000urns@lemmy.world · 2 years ago

Yeah I’m sceptical too, what tool and prompt was used to produce this?

Mint@lemmy.one · edit-2 2 years ago

Its Dalle 3 its not that difficult to generate something like that using dalle 3 here’s some shreks I generated as a showcase Shrek 1 inage

Shrek 2 Image

Shrek 3 Image

All of these are just generated nothing else

b000urns@lemmy.world · 2 years ago

Huh interesting it handles text relatively well

kattenluik@feddit.nl · 2 years ago

I found the avocado comic the easiest to tell, since the missing eyebrow was so insanely out of place.

Mint@lemmy.one · edit-2 2 years ago

Its not that difficult to generate something like that using dalle 3 here’s some shreks I generated as a showcase Shrek 1 inage

Shrek 2 Image

Shrek 3 Image

All of these are just generated nothing else

MooseBoys@lemmy.world · 2 years ago

Prompt and tool links? I know there are tools that try to pick out label text in the prompt and composite it after the fact, but I don’t consider this one-shot AI generated, even if it’s a single tool from the user’s perspective.

Mint@lemmy.one · edit-2 2 years ago

Its Dalle 3 like I said. As far as in aware Dalle 3 doesn’t do that since the text isn’t always perfect still. Can’t really provide prompts since its been a bit, and the history on it isn’t great, but I was just mostly shrek in x style and saying “x” do mind you Dalle is very heavily censored now, so you’re now unlikely to be able to recreate that.

It’s on - https://bing.com/create