| ▲ | zozbot234 9 hours ago | |||||||||||||||||||||||||||||||
Obligatory reminder that today's so called "AGI" has trouble figuring out whether I should walk or drive to the car wash in order to get my dirty car washed. It has to think through the scenario step by step, whereas any human can instantly grok the right answer. | ||||||||||||||||||||||||||||||||
| ▲ | wongarsu 9 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||
The idea/hope is that a video model would answer the car wash problem correctly. There are exactly the kinds of issues you have to solve to avoid teleporting objects around in a video, so whenever we manage more than a couple seconds of coherent video we will have something that understands the real world much better than text-based models. Then we "just" have to somehow make a combined model that has this kind of understanding and can write text and make tool calls Yes, this is kind of like Tesla promising full self driving in 2016 | ||||||||||||||||||||||||||||||||
| ▲ | SpicyLemonZest 9 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
I just don't know how to engage with these criticisms anymore. Do you not see how increasingly convoluted the "simple question LLMs can't answer" bar has gotten since 2022? Do the human beings you know not have occasional brain farts where they recommend dumb things that don't make much sense? | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | reducesuffering 7 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||
What are you talking about? OpenAI's ChatGPT free tier (that everyone uses) answers this in the first sentence within a couple seconds. "If your goal is to get your dirty car washed… you should probably drive it to the car wash " | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||