Remix.run Logo
skybrian 5 hours ago

He's saying that pre-training an LLM alone can't do it, but if you run an LLM in a loop with tools (like any coding agent) then it can. Also, the technique his group came up with should be used more:

> This is the weakness of deep learning that is alleviated with a new algorithm that my group presented in Nature a couple of years ago. Our “continual backpropagation” made one small change: every so often a less-used neuron would be re-initialized to small random weights. This allows the variation to continue and plasticity to be retained.

Here's the paper: https://www.nature.com/articles/s41586-024-07711-7

It has a fair number of citations, but I haven't looked into how much it's used.

simianwords 4 hours ago | parent [-]

Sorry this makes no sense — humans also use tools to evaluate their discoveries.

Kant said something like this: knowledge can’t be obtained by pure thinking, it needs interaction with the world.

This is obvious to me so why is the author making a claim that LLMs can make knowledge without access to environment but purely through thinking in aether