| ▲ | vatsachak 4 hours ago | |||||||
All of these LLMs are getting better at being an LLM. But GPT-5.5 is as useful as an LLM can be; it has solved lemmas I've thought about for a year, it can implement typed STLCs in Rust when I give it a formal grammar, and it can help me analyze Postgres planner dumps. LLMs are great at tasks that have short solutions, but: - they cannot learn based on a project - their long-term planning capabilities are worse than a worm's - they are unconfident in decision making - their internal representations are disgusting compared to JEPA - they don't have any "system clock" like humans and computers do - LLM architecture is not modular like computer architecture or human brain architecture. There are so many issues with LLMs. I wish companies would start working on the next generation of architectures before the bubble pops. | ||||||||
| ▲ | derwiki 4 hours ago | |||||||
Totally agree! They also conflate things all the time (a major type of hallucination) and IIUC that can’t be solved with the current architecture, just patched over | ||||||||
| ▲ | esafak 2 hours ago | |||||||
> - their internal representations are disgusting compared to JEPA Do you say this based on a theoretical understanding, or did you inspect them? | ||||||||