I just got a much better version using this command instead, which uses the maximum image size according to https://github.com/openai/openai-cookbook/blob/main/examples...

  OPENAI_API_KEY="$(llm keys get openai)" \
    uv run 'https://raw.githubusercontent.com/simonw/tools/refs/heads/main/python/openai_image.py' \
    -m gpt-image-2 \
    "Do a where's Waldo style image but it's where is the raccoon holding a ham radio" \
    --quality high --size 3840x2160

https://gist.github.com/simonw/88eecc65698a725d8a9c1c918478a... - I found the raccoon!

I think that image cost 40 cents.

▲ makira 7 hours ago | parent | next [-]

Fed into a clear Claude Code max effort session with : "Inspect waldo2.png, and give me the pixel location of a raccoon holding a ham radio.". It sliced the image into small sections and gave:

"Found the raccoon holding a ham radio in waldo2.png (3840×2160).

  - Raccoon center: roughly (460, 1680)                                                                                            
  - Ham radio (walkie-talkie) center: roughly (505, 1650) — antenna tip around (510, 1585)                                         
  - Bounding box (raccoon + radio): approx x: 370–540, y: 1550–1780                                                                
                                                                                                                                   
  It's in the lower-left area of the image, just right of the red-and-white striped souvenir umbrella, wearing a green vest. "

Which is correct!

▲

cwillu 6 hours ago | parent | next [-]

I had one problem: finding the raccoon. Now I have two: finding the red-and-white striped souvenir umbrella, and finding the raccoon.

▲

makira 6 hours ago | parent [-]

simonw posted 2 different images: make sure to look at the second one.

▲

cwillu 6 hours ago | parent [-]

Yeah, I noticed that just now, but too late to delete the comment :p

	▲	jaggederest 4 hours ago \| parent [-]
		You had a meta problem, and three, in total: find the raccoon, find the umbrella, find the right link in the comments.

▲

M3L0NM4N 4 hours ago | parent | prev [-]

We would need a larger sample size than just myself, but the raccoon was in the very first spot I looked. Found it literally immediately, as if that's where my eyes naturally gravitated to first. Hopefully that's just luck and not an indictment of the image-creating ability, as if there is some element missing from this "Where's Waldo" image, that would normally make Waldo hard to find.

	▲	nerdsniper 2 hours ago \| parent [-]
		There seemed to be more space around the raccoon than most other subjects. Zoomed out it appears as almost a “halo” highlighting the raccoon.

▲ wewtyflakes 4 hours ago | parent | prev | next [-]

A startling number of people either have no arms, one arm, a half of an arm, or a shrunken arm; how odd!

▲

rattlesnakedave an hour ago | parent | next [-]

To be fair, the average person has fewer than two arms.

	▲	floodfx 36 minutes ago \| parent [-]
		Haha. Underrated comment!

▲

cozzyd 40 minutes ago | parent | prev [-]

This is why they're congregating around the first aid and the lost and found

▲ davebren 7 hours ago | parent | prev | next [-]

The faces...that's nice that it turned a kid's book into an abomination

▲

keithnz 4 minutes ago | parent | next [-]

it's interesting, zoomed out it kind of looks ok, zoomed in.... oh my.

▲

Filligree 2 hours ago | parent | prev [-]

By image generation standards this is a ridiculously good result. No surprise that people instantly find the new limits, but they are new limits.

	▲	davebren 2 hours ago \| parent [-]
		It could already copy the art styles from its training data, what is the advancement here?

▲ mirekrusin 2 hours ago | parent | prev | next [-]

Can it generate non halloween version though?

This lower-is-better danse macabre, nightmares inducing ratio feels like interesting proxy for models capability.

▲ 2 hours ago | parent | prev | next [-]

[deleted]

▲ louiereederson 7 hours ago | parent | prev | next [-]

The people in this image remind me of early this person does not exist, in the best way

	▲	dfee 4 hours ago \| parent [-]
		fair point, also "this raccoon does not exist"

▲ gpt5 4 hours ago | parent | prev | next [-]

I tried it on the ChatGPT web UI and it also worked, although the ham radio looks like a handbag to me.

https://postimg.cc/wyxgCgNY

▲ ireadmevs 7 hours ago | parent | prev [-]

I found it on the 2nd image! On the 1st one not yet...