| ▲ | Rebuff5007 2 days ago |
| Heres a definition. How impressive is the output relative to the input. And by input, I don't just mean the prompt, but all the training data itself. Do you think someone who has only ever studied pre-calc would be able to work through a calculus book if they had sufficient time? how about a multi-variable calc book? How about grad level mathematics? IMO intelligence and thinking is strictly about this ratio; what can you extrapolate from the smallest amount of information possible, and why? From this perspective, I dont think any of our LLMs are remotely intelligent despite what our tech leaders say. |
|
| ▲ | kryogen1c 2 days ago | parent | next [-] |
| Hear, hear! I have long thought this, but not had as good way to put it as you did. If you think about geniuses like Einstein and ramanujen, they understood things before they had the mathematical language to express them. LLMs are the opposite; they fail to understand things after untold effort, training data, and training. So the question is, how intelligent are LLMs when you reduce their training data and training? Since they rapidly devolve into nonsense, the answer must be that they have no internal intelligence Ever had the experience of helping someone who's chronically doing the wrong thing, to eventually find they had an incorrect assumption, an incorrect reasoning generating deterministic wrong answers? LLMs dont do that; they just lack understanding. They'll hallucinate unrelated things because they dont know what they're talking about - you may have also had this experience with someone :) |
| |
| ▲ | hodgehog11 2 days ago | parent | next [-] | | > So the question is, how intelligent are LLMs when you reduce their training data and training? Since they rapidly devolve into nonsense, the answer must be that they have no internal intelligence This would be the equivalent of removing all senses of a human from birth and expecting them to somehow learn things. They will not. Therefore humans are not intelligent? > LLMs dont do that; they just lack understanding. You have no idea what they are doing. Since they are smaller than the dataset, they must have learned an internal algorithm. This algorithm is drawing patterns from somewhere - those are its internal, incorrect assumptions. It does not operate in the same way that a human does, but it seems ridiculous to say that it lacks intelligence because of that. It sounds like you've reached a conclusion, that LLMs cannot be intelligent because they have said really weird things before, and are trying to justify it in reverse. Sure, it may not have grasped that particular thing. But are you suggesting that you've never met a human that is feigning understanding in a particular topic say some really weird things akin to an LLM? I'm an educator, and I have heard the strangest things that I just cannot comprehend no matter how much I dig. It really feels like shifting goalposts. We need to do better than that. | | |
| ▲ | pka 2 days ago | parent [-] | | > and are trying to justify it in reverse In split-brain experiments this is exactly how one half of the brain retroactively justifies the action of the other half. Maybe it is the case in LLMs that an overpowered latent feature sets the overall direction of the "thought" and then inference just has to make the best of it. |
| |
| ▲ | nsagent 2 days ago | parent | prev [-] | | You might be interested in reading about the minimum description length (MDL) principle [1]. Despite all the dissenters to your argument, what your positing is quite similar to MDL. It's how you can fairly compare models (I did some research in this area for LLMs during my PhD). Simply put, to compare models, you describe both the model and training data using a code (usual reported as number of bits). The trained model that represents the data within the fewest number of bits is the more powerful model. This paper [2] from ICML 2021 shows a practical approach for attempting to estimate MDL for NLP models applied to text datasets. [1]: http://www.modelselection.org/mdl/ [2]: https://proceedings.mlr.press/v139/perez21a.html |
|
|
| ▲ | mycall 2 days ago | parent | prev | next [-] |
| Animals think but come with instincts which breaks the output relative to the input test you propose. Behaviors are essentially pre-programmed input from millions of years of evolution, stored in the DNA/neurology. The learning thus typically associative and domain-specific, not abstract extrapolation. A crow bending a piece of wire into a hook to retrieve food demonstrates a novel solution extrapolated from minimal, non-instinctive, environmental input. This kind of zero-shot problem-solving aligns better with your definition of intelligence. |
|
| ▲ | tremon 2 days ago | parent | prev | next [-] |
| I'm not sure I understand what you're getting at. You seem to be on purpose comparing apples and oranges here: for an AI, we're supposed to include the entire training set in the definition of its input, but for a human we don't include the entirety of that human's experience and only look at the prompt? |
| |
| ▲ | Rebuff5007 2 days ago | parent [-] | | > but for a human we don't include the entirety of that human's experience and only look at the prompt? When did I say that? Of course you look at a human's experience when you judge the quality of their output. And you also judge their output based on the context they did their work in. Newton wouldn't be Newton if he was the 14th guy to claim that the universe is governed by three laws of motion. Extending the example I used above, I would be more impressed if an art student aced a tough calc test than a math student, given that a math student probably has spent much more time with the material. "Intelligence and "thinking" are abstract concepts, and I'm simply putting forward a way that I think about them. It works very much outside the context of AI too. The "smartest" colleagues I've worked with are somehow able to solve a problem with less information or time than I need. Its usually not because they have more "training data" than me. |
|
|
| ▲ | lukebuehler 2 days ago | parent | prev | next [-] |
| That an okay-ish definition, but to me this is more about whether this kind of "intelligence" is worth it, not whether it is intelligence itself. The current AI boom clearly thinks it is worth to put that much input to get the current frontier-model-level of output. Also, don't forget the input scales across roughly 1B weekly users at inference time. I would say a good definition has to, minimally, take on the Turing test (even if you disagree, you should say why). Or in current vibe parlance, it does "feel" intelligent to many people--they see intelligence in it. In my book this allows us to call it intelligent, at least loosely. |
|
| ▲ | fragmede 2 days ago | parent | prev | next [-] |
| There are plenty of humans that will never "get"
calculus, despite numerous attempts at the class and countless hours of 1:1 tutoring. Are those people not intelligent? Do they not think? We could say yes they aren't, but by the metric of making money, plenty of people are smart enough to be rich, while college math professors aren't. And while that's a facile way of measuring someone's worth or their contribution to society (some might even say "bad"), it remains that even if someone cant understand calculus, some of them are intelligent enough to understand humans enough to be rich through some fashion that wasn't simply handed to them. |
| |
| ▲ | chipsrafferty 2 days ago | parent [-] | | I don't think it's actually true that someone with: 1. A desire to learn calculus
2. A good teacher
3. No mental impairments such as dementia or other major brain drainers could not learn calculus. Most people don't really care to try or don't get good resources. What you see as an intelligent mathematician is almost always someone born with better resources that was also encouraged to pursue math. | | |
| ▲ | fragmede 2 days ago | parent [-] | | 1
and 3 are loopholes large enough to drive a semi truck through. You could calculate how far the truck traveled if you have its acceleration with a double integral, however. |
|
|
|
| ▲ | hodgehog11 2 days ago | parent | prev | next [-] |
| Yeah, that's compression. Although your later comments neglect the many years of physical experience that humans have as well as the billions of years of evolution. And yes, by this definition, LLMs pass with flying colours. |
| |
| ▲ | saberience 2 days ago | parent [-] | | I hate when people bring up this “billions of years of evolution” idea. It’s completely wrong and deluded in my opinion. Firstly humans have not been evolving for “billions” of years. Homo sapiens have been around for maybe 300’000 years, and the “homo” genus has been 2/3 million years. Before that we were chimps etc and that’s 6/7 million years ago. If you want to look at the entire brain development, ie from mouse like creatures through to apes and then humans that’s 200M years. If you want to think about generations it’s only 50/75M generations, ie “training loops”. That’s really not very many. Also the bigger point is this, for 99.9999% of that time we had no writing, or any kind of complex thinking required. So our ability to reason about maths, writing, science etc is only in the last 2000-2500 years! Ie only roughly 200 or so generations. Our brain was not “evolved” to do science, maths etc. Most of evolution was us running around just killing stuff and eating and having sex. It’s only a tiny tiny amount of time that we’ve been working on maths, science, literature, philosophy. So actually, these models have a massive, massive amount of training more than humans had to do roughly the same thing but using insane amounts of computing power and energy. Our brains were evolved for a completely different world and environment and daily life that the life we lead now. So yes, LLMs are good, but they have been exposed to more data and training time than any human could have unless we lived for 100000 years and still perform worse than we do in most problems! | | |
| ▲ | hodgehog11 2 days ago | parent | next [-] | | Okay, fine, let's remove the evolution part. We still have an incredible amount of our lifetime spent visualising the world and coming to conclusions about the patterns within. Our analogies are often physical and we draw insights from that. To say that humans only draw their information from textbooks is foolhardy; at the very least, you have to agree there is much more. I realise upon reading the OP's comment again that they may have been referring to "extrapolation", which is hugely problematic from the statistical viewpoint when you actually try to break things down. My argument for compression asserts that LLMs see a lot of knowledge, but are actually quite small themselves. To output a vast amount of information in such a small space requires a large amount of pattern matching and underlying learned algorithms. I was arguing that humans are actually incredible compressors because we have many years of history in our composition. It's a moot point though, because it is the ratio of output to capacity that matters. | | |
| ▲ | vrighter 10 hours ago | parent [-] | | They can't learn iterative algorithms if they cannot execute loops. And blurting out an output which we then feed back in does not count as a loop. That's a separate invocation with fresh inputs, as far as the system is concerned. They can attempt to mimic the results for small instances of the problem, where there are a lot of worked examples in the dataset, but they will never ever be able to generalize and actually give the correct output for arbitrary sized instances of the problem. Not with current architectures. Some algorithms simply can't be expressed as a fixed-size matrix multiplication. |
| |
| ▲ | GoblinSlayer 2 days ago | parent | prev | next [-] | | >Most of evolution was us running around just killing stuff and eating and having sex. Tell Boston Dynamics how to do that. Mice inherited brain from their ancestors. You might think you don't need a working brain to reason about math, but that's because you don't know how thinking works, it's argument from ignorance. | | |
| ▲ | saberience 2 days ago | parent [-] | | You've missed the point entirely. People argue that humans have had the equivalent of training a frontier LLM for billions of years. But training a frontier LLM involves taking multiple petabytes of data, effectively all of recorded human knowledge and experience, every book ever written, every scientific publication ever written, all of known maths, science, encylopedias, podcasts, etc. And then training that for millions of years worth of GPU-core time. You cannot possibly equate human evolution with LLM training, it's ridiculous. Our "training" time didn't involve any books, maths, science, reading, 99.9999% of our time was just in the physical world. So you can quite rationally argue that our brains ability to learn without training is radically better and more efficient that the training we do for LLMs. Us running around in the jungle wasn't training our brain to write poetry or compose music. | | |
| ▲ | dwaltrip 2 days ago | parent [-] | | > Us running around in the jungle wasn't training our brain to write poetry or compose music. This is a crux of your argument, you need to justify it. It sounds way off base to me. Kinda reads like an argument from incredulity. | | |
| ▲ | KalMann 2 days ago | parent | next [-] | | No, I think what he said was true. Human brains have something about them that allow for the invention of poetry or music. It wasn't something learned through prior experience and observation because there aren't any poems in the wild. You might argue there's something akin to music, but human music goes far beyond anything in nature. | | |
| ▲ | hodgehog11 2 days ago | parent [-] | | We have an intrinsic (and strange) reward system for creating new things, and it's totally awesome. LLMs only started to become somewhat useful once researchers tried to tap in to that innate reward system and create proxies for it. We definitely have not succeeded in creating a perfect mimicry of that system though, as any alignment researcher would no doubt tell you. |
| |
| ▲ | saberience 2 days ago | parent | prev [-] | | So you're arguing that "running around in the jungle" is equivalent to feeding the entirety of human knowledge in LLM training? Are you suggesting that somehow there were books in the jungle, or perhaps boardgames? Perhaps there was a computer lab in the jungle? Were apes learning to conjugate verbs while munching on bananas? I don't think I'm suggesting anything crazy here... I think people who say LLM training is equivalent to "billions of years of evolution" need to justify that argument far more than I need to justify that running around in the jungle is equivalent to mass processing petabytes of highly rich and complex dense and VARIED information. One year of running around in the same patch of jungle, eating the same fruit, killing the same insects, and having sex with the same old group of monkeys isn't going to be equal to training with the super varied, complete, entirety of human knowledge, is it? If you somehow think it is though, I'd love to hear your reasoning. | | |
| ▲ | hodgehog11 2 days ago | parent | next [-] | | There is no equivalency, only contributing factors. One cannot deny that our evolutionary history has contributed to our current capacity, probably in ways that are difficult to perceive unless you're an anthropologist. Language is one mode of expression, and humans have many. This is another factor that makes humans so effective. To be honest, I would say that physical observation is far more powerful than all the bodies of text, because it is comprehensive and can respond to interaction. But that is merely my opinion. No-one should be arguing that an LLM training corpus is the same as evolution. But information comes in many forms. | |
| ▲ | chipsrafferty 2 days ago | parent | prev [-] | | You're comparing the hyper specific evolution of 1 individual (an AI system) to the more general evolution of the entire human species (billions of individuals). It's as if you're forgetting how evolution actually works - natural selection - and forgetting that when you have hundreds of billions of individuals over thousands of years that even small insights gained from "running around in the jungle" can compound in ways that are hard to conceptualize. I'm saying that LLM training is not equivalent to billions of years of evolution because LLMs aren't trained using evolutionary algorithms; there will always be fundamental differences. However, it seems reasonable to think that the effect of that "training" might be more or less around the same level. |
|
|
|
| |
| ▲ | Ajakks 2 days ago | parent | prev | next [-] | | Im so confused as to how you think you can cut an endless chain at the mouse. Were mammals the first thing? No. Earth was a ball of ice for a billion years - all life at that point existed solely around thermal vents at the bottom of the oceans... that's inside of you, too. Evolution doesn't forget - everything that all life has ever been "taught" (violently had programmed into us over incredible timelines) all that has ever been learned in the chain of DNA from the single cell to human beings - its ALL still there. | |
| ▲ | smohare 2 days ago | parent | prev [-] | | [dead] |
|
|
|
| ▲ | skeeter2020 2 days ago | parent | prev | next [-] |
| This feels too linear. Machines are great at ingesting huge volumes of data, following relatively simple rules and producing optimized output, but are LLMs sufficiently better than humans at finding windy, multi-step connections across seemingly unrelated topics & fields? Have they shown any penchant for novel conclusions from observational science? What I think your ratio misses is the value in making the targeted extrapolation or hypothesis that holds up out of a giant body of knowledge. |
| |
| ▲ | blks 2 days ago | parent [-] | | Are you aware of anything novel, produced by an LLM? |
|
|
| ▲ | jononor 2 days ago | parent | prev | next [-] |
| For more on this perspective, see the paper On the measure of intelligence (F. Chollet, 2019). And more recently, the ARC challenge/benchmarks, which are early attempts at using this kind of definition in practice to improve current systems. |
|
| ▲ | rolisz 2 days ago | parent | prev [-] |
| Is the millions of years of evolution part of the training data for humans? |
| |
| ▲ | Rebuff5007 2 days ago | parent [-] | | Millions of years of evolution have clearly equipped our brain with some kind of structure (or "inductive bias") that makes it possible for us to actively build a deep understanding for our world... In the context of AI I think this translates more to representations and architecture than it does with training data. | | |
| ▲ | goatlover 2 days ago | parent [-] | | Because genes don't encode the millions of years of experience from ancestors, despite how interesting that is in say the Dune Universe (with help of the spice melange). My understanding is genes don't even specifically encode for the exact structure of the brain. It's more of a recipe that gets generated than a blue print, with young brains doing a lot of pruning as they start experiencing the world. It's a malleable architecture that self-adjusts as needed. |
|
|