energy123 4 hours ago

But they don't only operate on language? They operate on token sequences, which can be images, coordinates, time, language, etc.

kergonath 4 hours ago | parent | next [-]

It’s an interesting observation, but I think you have it backwards. The examples you give are all using discrete symbols to represent something real and communicating this description to other entities. I would argue that all your examples are languages.

samrus 3 hours ago | parent | prev [-]

What's the first L stand for? That's not just vestigial: their model of the world is formed almost exclusively from language, rather than from a range of things contributing significantly, as it is for humans.

The biggest thing that's missing is actual feedback on their decisions. They have no "idea" of that because transformers and embeddings don't model that yet. And language descriptions and image representations of feedback aren't enough. They are too disjointed. It needs more