| ▲ | nigardev 9 hours ago | |
visual analysis is the right bottleneck to call out. most coding agents can read and write code fine because its just text. but identify a corroded valve from a photo and suggest the right fix? thats a different problem entirely. curious how your benchmark scores the gap between text-reasoning and visual-reasoning tasks | ||
| ▲ | Aeroi 9 hours ago | parent [-] | |
[dead] | ||