Remix.run Logo
andix 3 days ago

Exactly. Just look at what they are really useful right now. Running LLMs in feedback-loops (agents) so they can try out random-ish approaches until some verification function passes (tests).

It's like the infinite monkeys on typewrighters that will type whatever you are looking for, given infinite time. LLMs are just tuned to much better odds than the monkeys are. But it's still a lot of randomness, with random results.

roadside_picnic 3 days ago | parent | next [-]

> It's like the infinite monkeys on typewrighters that will type whatever you are looking for, given infinite time.

In the monkey example the infinite time is doing a lot of work there. The fact that LLMs can search through semantic space and find reasonably correct paths in a reasonable time is directly tied to the reason why they are valuable.

Saying "these two things are similar except one can be useful and one can't" is not a great comparison.

For me the real lesson learned isn't how "smart" LLMs are, but rather how much human work is basically reducible to repeating past work with minor variation. Human's believe they are "reasoning" but so much code writen is just the human brain doing the same autocomplete style work that LLMs can do now.

tikhonj 3 days ago | parent | next [-]

The point is that it's the same process with—much—better priors.

This seems like a reasonable view to me. It's surprising just how much better priors matter and how we can develop those priors by training on a bunch of text. But it also explains, or at least hints at an explanation, for why LLM capabilities are so jagged, and in such inhuman ways.

dpark 3 days ago | parent [-]

> The point is that it's the same process

Except it’s not at all the same process. The fact that LLM are non deterministic is not the same as churning out random garbage.

tsunamifury 3 days ago | parent [-]

The literally churn out random garbage and are trained over time for that garbage to look more and more like an acceptable outcome to humans.

It’s training monkeys at typewriters through reinforcement.

dpark 3 days ago | parent [-]

> trained over time

So not random.

> acceptable outcome to humans

And not garbage.

It’s real weird to see people argue that LLM output is no different than random gibberish and then handwave over the fact that it’s clearly not with terms like “training”, as if a steam of random garbage is trainable.

tsunamifury 2 days ago | parent [-]

Hey dpark.

I quite literally created and productized predictive linguistics and behavioral vectors at Google.

If you had stopped to consider what I explained; you’d understand that it’s the process of turning random garbage into increasingly acceptable outputs.

Ie training the monkeys.

The insight you are missing is the rule of networked scale. It turns out that any reactive node scaled enough can form sophisticated predictive system given reward over a training topography, even if it starts out at garbage or is literally made of monkeys.

So it is garbage. And you can turn garbage into semi-intelligence.

dpark 2 days ago | parent [-]

A human child is born with no ability to speak intelligibly. All they can do is babble. Through years of training they gain the ability to speak intelligibly and communicate in advanced ways.

The act of successful training means it’s not garbage anymore.

> So it is garbage.

This statement is ultimately meaningless and I continue to find it weird that someone who works in this space would support this view. If you fundamentally change the nature of a thing, it’s no longer that original thing. Is tan HDD still random garbage after you fill it with family photos just because that’s how it starts?

tsunamifury 2 days ago | parent [-]

Because you can do it with anything. Literally any reactive node physical or virtual can do it given scale or complexity.

You can start with garbage.

This is not human. Humans are billions of years in passed DNA learning. Babies are born to a sophistication level millions of times higher than this.

I’m pretty surprised you don’t know this.

dpark 2 days ago | parent [-]

If you start with a fire hose of literal sewage and install a series of filters culminating in a reverse osmosis step that pours clean drinking water out, the product is not shit even if the original input was.

I don’t believe that you can’t understand the distinction between “at one point this was garbage” and “at the present time this is still garbage”. You’re clearly smarter than that.

tsunamifury 20 hours ago | parent [-]

I’m saying the distinction doesn’t matter. This is what NNs proved. All matter can become semi intelligent.

andix 3 days ago | parent | prev | next [-]

> but so much code writen is just the human brain doing the same autocomplete style work that LLMs can do now.

That's the part they are really good at. But they are really bad at taking complex decisions. Most of them are just guesses from a finite amount of solutions they were trained on, or from options they have in context.

godwinson__4-8 3 days ago | parent [-]

Indeed. Humans are well known for being good at "taking complex decisions" for which they have no "training", "options" or "context".

andix 3 days ago | parent | next [-]

Humans have a much bigger "context window". They remember many things they did an hour ago, a week ago, or even years ago.

godwinson__4-8 3 days ago | parent [-]

Yes, and your ability to remember a relatively few things that happened years ago is predicated on your ability to also forget most things that happen to you - like what you had for dinner last week. Good thing we have technology to fill in the gaps.

And nothing about this makes your initial comment any less goofy. Anyone who has ever had to make a difficult decision knows more than half the battle is preparation. Where do you think complex decisions come from? Have current events left you with the impression that people just waltz into idk say the Situation Room and just big brain their way through world events? That's how the current administration seems to think the world works, with quite predictable results.

Society is already algorithmic. To optimize for humans being dumb. AI is nothing more than another advance along this continuum. No one is impressed by your ability to remember something years ago, many if not most mammals have the same capability. Human recall is also notoriously bad in many cases - see numerous studies on the reliability of eye witnesses testimony.

AI is smart because most people are dumb. Come to terms with the fact that your anthropocentrism need not be based on a notion of intellectual supremacy and you'll be a far less tedious person to deal with.

andix 3 days ago | parent [-]

You didn't convince me, that I'm the tedious person to deal with here.

godwinson__4-8 3 days ago | parent [-]

Clearly the LLMs lack of pride is also a deficiency in your view.

nkrisc 3 days ago | parent | prev [-]

Humans also generally have the will to live.

godwinson__4-8 3 days ago | parent [-]

Indeed. It's almost like the LLM was the one that invented the "tactical" nuke in the first place.

queenkjuul 3 days ago | parent [-]

No, that would be Douglas MacArthur

harry8 3 days ago | parent | prev | next [-]

>Saying "these two things are similar except one can be useful and one can't" is not a great comparison.

Launching a nuclear war is an interesting definition of "useful", not one I'd agree with and that exact scenario is what is being discussed.

So yes this is a perfectly valid and useful comparison in examining this particular, civilisation ending limitation.

Folcon 3 days ago | parent | prev [-]

I mean to a point?

You do have to successfully write something the first time

We already acknowledge this to a degree, what is experience other than having done something similar before?

That first time though, you've got to figure something out that time

mettamage 3 days ago | parent | prev [-]

Hmm saying it’s random-ish is doing it a disservice. I understand it’s a stochastic process but there’s definitely some level of understanding. Not at the level of lived experience but usually an LLM with vision capabilities can call a spade a spade and do something useful with it. And when a verification function shows how they are wrong then they usually come with a better and more informed approach.

So I can’t fully see how that’s related to the infinite monkeys. A typewriting monkey doesn’t have access to a verification function. And even if it did, it would not be the original concept anymore with infinite typewriting monkeys producing the works of Shakespeare.

Nevertheless, I upvoted your comment because it’s definitely insightful.

dwattttt 3 days ago | parent [-]

"understanding" is overstating it. Correlation between tokens embedded in the weights via training, yes.

anon84873628 3 days ago | parent | next [-]

Feedback loops certainly seem to give them some level of understanding.

Agent reads a skill file about how to use a CLI tool. It tries to use the tool but gets an error about the input format. It tries again with a different format based on the error message, and sees that command succeeded. It compares what worked to what was in the skill file and notes the difference. On future invocations it continues to use the new format.

Is that not "understanding" how to use the tool?

mountainriver 3 days ago | parent | prev | next [-]

What’s the difference? It’s clearly processing information and coming up with the right answer

hgoel 3 days ago | parent | prev | next [-]

What exactly would you call understanding? It's a correlation matrix of concepts.

varjag 3 days ago | parent | prev [-]

Training is a loan word used to describe human learning process. For a reason.

andix 3 days ago | parent [-]

Humans learn on the job. LLMs don't. Very important difference.

varjag 2 days ago | parent [-]

Where do LLMs learn?

andix 2 days ago | parent [-]

They get trained before release. On general information. But they don't improve while working on very specific tasks. Every new session is like an experienced human on their first day at a new job.

varjag 2 days ago | parent [-]

When they get to you it's only inference. You have basic misunderstanding of what ANNs are.

They train on a billion "jobs". Which is not terribly efficient but oh man they do train.

andix 2 days ago | parent [-]

Unique skills and jobs do exist. And LLMs can't gain additional knowledge "on the job" like humans can. They are generalists, that can only be steered by prompts, skills and context. Thats all I'm saying.

This fact is currently the most limiting factor for LLMs.