Remix.run Logo
vatsachak 4 hours ago

All of these LLMs are getting better at being at an LLM

But GPT-5.5 is as useful an LLM can be; it has solved lemmas I've thought about for a year, it can implement typed STLCs in Rust when I give it a formal grammar, it can help me analyze Postgres planner dumps.

It's great at tasks that have short solutions but

- they cannot learn based on a project

- their long term planning capabilities are worse than worms

- they are unconfident in decision making

- their internal representations are disgusting compared to JEPA

- they don't have any "system clock" like humans and computers do

- LLM architecture is not modular like computer architecture or human brain architecture

There's so many issues with LLMs. I wish that companies can start working on the next generation of architectures before the bubble pops

derwiki 4 hours ago | parent | next [-]

Totally agree! They also conflate things all the time (a major type of hallucination) and IIUC that can’t be solved with the current architecture, just patched over

4 hours ago | parent | prev | next [-]
[deleted]
esafak 2 hours ago | parent | prev [-]

> - their internal representations are disgusting compared to JEPA

You say this based on a theoretical understanding or did you inspect them?

vatsachak 2 hours ago | parent [-]

Look at VLM mechanistic interpretability papers vs just pca on JEPA trained weights.

JEPA gives you interpretability for free.

I have not personally inspected them and my view is maybe a more exaggerated/dramatic claim of those working in the JEPA sphere