| ▲ | simonw a day ago | |||||||
I wonder if Gemini 3 Pro would do better at this particular test? They're very proud of its spatial awareness and vision abilities. | ||||||||
| ▲ | music4airports 11 hours ago | parent [-] | |||||||
>They're very proud of its spatial awareness and vision abilities. Suuuuuuuuure they are. I haven't found a single multimodal model, vision LLM, or any model at all that can segment and extract music charts/infographics. Can Gemini 3 Pro, in one shot, turn charts like these into lists of "artist - album" without choking on the visuals? https://reddit.com/r/citypop/comments/10fu1t5/city_pop_album... https://reddit.com/r/indieheads/comments/173o33z/the_new_ind... | ||||||||
| ||||||||