Remix.run Logo
Mathematicians issue warning as AI rapidly gains ground(science.org)
119 points by pseudolus 11 hours ago | 150 comments
turzmo 10 hours ago | parent | next [-]

Much of math (or science) research has the strange quality of being mostly curiosity-driven, but having giant benefits that occasionally spin out to the public.

Some questions are more urgent and practical. My feeling is that the more directly practical a question is, the more likely the research community is to support AI usage in that question.

The annoying thing about recent AI advances is that they target questions on the wrong end of the spectrum: Erdos problems are exactly the sort of "useless" questions that people might answer purely for the love of the game. The sort of questions that a young person might cut their teeth on and gain confidence.

Solving questions like these automatically, I think, is not good for the long-term health of research. At least for the foreseeable future you still would like people to become interested and develop skills in these fields. These developments, and especially how they are presented, directly discourage that.

math_dandy 2 hours ago | parent | next [-]

To me, the most interesting feature of the OpenAI solution of the Unit Distance (Erdös) Problem is that the solution - using deep algebraic number theory as a source of extremal combinatorial/geometric constructions - is much more interesting than the problem’s elementary statement might lead one to expect.

Writing off Erdös’s problems as random, useless, or meaningless dismisses his mathematical intuition, second-to-none, and strikes me as somewhat uncharitable.

Finally, I agree that AI threatens mathematical training by rendering an entire class of acolyte-level research problems solvable by prompt. But the Unit Distance Problem is not of this class.

pfdietz 2 hours ago | parent [-]

> much more interesting than the problem’s elementary statement might lead one to expect

This is reinforced by the immediate (human) use of the idea to resolve in the negative another significant problem, the sum-product conjecture on reals.

Explanation of what was involved: https://www.erdosproblems.com/forum/thread/blog:6

azeirah 29 minutes ago | parent | prev | next [-]

Do you not think that solutions to erdos problems might end up stepping stones to other important problems?

Either by introducing new tools, or by proving things that were previously unproven that end up helping in unexpected ways?

That's often how math goes, isn't it?

BigGreenJorts 9 hours ago | parent | prev | next [-]

Sounds like yet another example of how AI is kneecapping industries from the bottom by "removing the barrier to entry" but really just removing the training path by doing the work itself with no guidance for juniors.

brador 9 hours ago | parent [-]

We are on tiny 1-5T parameter models with local power stations.

We can reach Q models just by throwing resources at it. That’s a million times current B models.

bdamm 2 hours ago | parent [-]

Is this a known or quantifiable thing? I thought that the limit had already been determined i.e. the existing models top out and at some point it doesn't matter how much time or energy you let the model consume, it won't improve the result. And with regards to training parameters, I thought we were equally limited there, e.g. the existing models can't benefit from a larger parameter space.

I was under the impression that improvements are arriving via how the models are trained and how model prompting context is constructed, rather than just by how much data or how much energy is spent searching over the model space for a particular prompt.

Is there some evidence that we have not reached a pleateau with just resource consumption on existing models?

yieldcrv 9 hours ago | parent | prev [-]

That's an interesting perspective and I wholly disagree with the conclusion

You are saying that tough problems with no applicability are useful because people that you happen to respect got good by their curiosity and pursuit of trying to solve these kinds of problems and failing, but branching off into other cognitive areas as mathematicians

Now if I know anything about math for the sake of math, and academics, these are the same people that lament the idea of intelligent people going to the finance sector or any other trade they just happen not to respect as much

The similarity being that their exact criticism of why, something they don't respect and view as having little utility, is the exact reasoning presented here now that AI can solve their pointless problems

What I'm seeing is that human mathematicians have a laundry list of problems they have failed to solve for decades, centuries, which is what they are funded and employed to do. "Computer" used to a human job title too.

This leads me to being excited about AI one-shotting these problems, let move on to something else.

ccppurcell 3 hours ago | parent [-]

I think you've slightly straw manned the lamentation there. Not that I agree with the lamentation, but using your talent to make the rich richer (which is what quants do, they are paid a fixed amount to provide a larger value up the chain), as opposed to advancing human knowledge, is the reason for the lament, not some sort of respectability issue.

Spacecosmonaut 10 hours ago | parent | prev | next [-]

Accelerationists may argue that the eroding of proper attribution and proof verification by humans is a meaningless short term struggle of a dying field.

Mathematics seems to be entering an era where human + machine maximizes performance, much like chess in the 1990s. However, imagine a future where even talented mathematicians are nothing but noise in the machine (as is the case in chess now). A future where AI generates and verifies proofs without humans in the loop. Where the mathematics may be beyond human comprehension.

In that future, does it matter that early career mathematicians are inhibited by these developments? Perhaps not. Programming faces the same issue. As AI crawls up the competence ladder, does it matter that fewer people have opportunities to develop the skillset of a senior engineer? Perhaps not.

0x59 3 hours ago | parent | next [-]

An issue I see is who controls the information. The next generation may not recieve the knowledge, it may be gatekept by industry who *will* own the gate.

The future may not have access unless we fight to ensure they do. This is how I read the article.

wongarsu 10 hours ago | parent | prev | next [-]

Much like for many the point of chess is that it's played by humans, with truly superhuman AI relegated to a training aid, mathematics is in many ways about human comprehension. You can use AI to find and proof new theorems. But if you get to the point where humans can't understand it, is it even still math?

squidbeak 30 minutes ago | parent | next [-]

> Much like for many the point of chess is that it's played by humans, with truly superhuman AI relegated to a training aid

It's much more than just training. Humans use the engines to prepare openings and find promising novelties. Over time these novelties unearthed by engines fill out theory. It's easy to fine elite games where neither player is out of book for dozens of moves. Modern players are full hybrids in that sense. Looking back at chess, it seems natural that Mathematics will go the same way.

jrflo an hour ago | parent | prev | next [-]

I think there would still be a place for it if it's beyond human comprehension. For instance, really complex lemmas to solve human-tractable problems. If you can pose a question in a proof assistant language like Lean, have an AI write a Lean program that solves it, you can use that as a Lemma for some other problem. There's quite a bit of math out there that is "correct assuming conjecture X is correct", maybe AI could fill that gap and "still be math".

Spacecosmonaut 10 hours ago | parent | prev [-]

Perhaps P=NP. The new algorithms are handed down to us. We can apply them without fundamentally understanding why P=NP.

orbital-decay 2 hours ago | parent | prev | next [-]

Surely such AI would also be able to reduce and simplify the math for human understanding. Which is what mathematicians do all the time, from turning base 60 cuneiform into modern number systems to simplifying Maxwell's equations for the students.

Joel_Mckay 23 minutes ago | parent | prev [-]

In general, most researchers already incorporate LLM into their workflows, as it is quite good at context search. However, the relevant training data is based on the collective works of the field of experts. Collecting current data on that work is what makes the LLM sound relevant, and any improvement of the LLM model requires frequent new data from both researchers and the chat bot users themselves. LLM are not real "AI", and anyone that says otherwise is selling people something.

To phrase this differently, LLM companies conduct unauthorized targeted intelligence gathering on peoples work, codify that act of plagiarism or theft as MoE documentation, and sell unaccountable token output to other users.

There is a reason output becomes more nonsensical as "AI" companies try to use dynamic weight granularity and conceptual compaction. It is not necessarily "AI" hallucinations, but rather people fooling themselves into believing smart people are no longer needed if they willingly become a hapless exploited data source caste. This simply isn't true, as people will leave the field for awhile.

The LLM business model regularly requires copyright theft and plagiarism to persist. It will not magically become sentient/AGI/less-stupid, as these algorithms have been operating for over 40 years. What has changed is the scale of the deployment, data pool size, and the energy consumed.

Scientists are still necessary, as they create the world models LLM try to guess at by statistical inference. Hype and FUD ahead of an IPO for a highly dubious revenue company is expected. We look forward to the low cost GPU hardware in the near future. =3

silveraxe93 10 hours ago | parent | prev | next [-]

> However, the declaration argues math is more than a machine for producing correct answers.

There might be more to maths than that, but that is definitely the most important part. I love science funding. But not because it's a jobs program for nerds.

zerobees 3 hours ago | parent | next [-]

Culturally, mathematics is a jobs program for nerds. The field very explicitly takes pride in working on problems that have no obvious applications, and most practitioners are funded publicly or supported by private endowments, with zero pressure to deliver specific results.

Of course, this produces useful results every now and then, but it's not like we pursued ruthless efficiency / maximum rate of knowledge advancement before. We just let them do their thing, essentially treating them as artists and letting them pursue the craft for its own sake. If we weren't interested in maximum throughput before, why is that an objective now?

141205 2 hours ago | parent | next [-]

Hardy would agree with the viewpoint that you espouse but it would be pushed back against by Arnol'd, Poincare, Gauss, Von Neumann, and even Grothendiek: Arnol'd and Poincare were vituperatively against the division between "pure" and "applied" mathematics; they considered mathematics and physics interchangeable, and Arnol'd lamented that the field had lost a large amount of funding/prestige/relevance due to groups like the Bourbaki that took a purely aesthetic view; Gauss had a critical view of problems like Fermat's last theorem (he felt that you could construct infinitely many such problems, and felt that attempting to prove it was a generally useless endeavor), along with outright calling pure mathematics worthless; but while Von Neumann and Grothendiek were more moderate, both were critical of the field losing motivation/quality as it strayed away from empirical science into—quoting Von Neumann—"abstract inbreeding".

Arnold's polemics are perhaps the most infamous and easily found online (see "On Teaching Mathematics"), but the written opinions of Poincare et seq. are also easy to find. Even today the vast majority of research funding for mathematics, at least in the United States, is dolled out for highly applied fields like partial differential equations. The field does not even close to unanimously (contemporarily or historically) "explicitly take pride" in working on problems that have no obvious application, or being a "jobs program for nerds": the notion of such "pure" or "nonapplied" mathematics is at the very least a highly fractious and controversial subject, with a number of big names taking opposing viewpoints (often vehemently).

I think your picture of the field is over-represented on the internet, much like the fixation on certain niche fields: Category Theory, Homotopy Type Theory or, worst of all, outright dubious fields like Geometric Algebra; fields with a large number of online promoters, but with much less funding and relevance in the actual academic space. Of course there are reputable people with PHDs that feel this way,—but I can only imagine that there's a legion of tyros, pop math consumers, and undergraduate students who disproportionately promote this viewpoint.

bdamm 2 hours ago | parent | prev | next [-]

Except, it was true before and it is still true today that the best "artists" whether graphic or mathematic, are the ones that do somehow manage to cross the chasm of pure research and providing a tangible benefit to their benefactors. That aspect of understanding your customer is not changed by the presence of AI.

orbital-decay 2 hours ago | parent | prev [-]

Pure mathematics != applied mathematics

psyklic 10 hours ago | parent | prev | next [-]

The most important part of math is advancing human understanding. A correct answer by itself is not as important as understanding why it is correct.

datsci_est_2015 10 hours ago | parent | next [-]

To further this assertion, there is almost no value to deeply esoteric math that is technically correct, but completely inapplicable to any scientific reality, and completely unintelligible to humans. Consider these findings deep, dark corners in the unfathomably large hyperspace of mathematics. My guess is AI will be incredibly adept at identifying these types of findings, and it will be exceedingly difficult for humans to identify what is meaningful and what is not in the slop.

sunshowers 36 minutes ago | parent | next [-]

Elliptic curves over reals and the complex numbers had some physical/scientific meaning, but elliptic curves over finite fields had none before cryptography.

canjobear 3 hours ago | parent | prev | next [-]

Your model of what AI is good at is wrong. Generative AI is not good at wandering off into novel esoteric abstract corners while maintaining correctness, it is good at things that are close to its training data. I suspect that humans will long outperform AI in the domain of "novel esoteric abstract useless math" whereas AI will outperform humans in the domains of (1) making connections between already-well-understood concepts, things that seem obvious in retrospect but which no human figured out just because of the accidents of what people happened to focus on, and (2) proving things that require long, tedious, intellectually unsatisfying calculations, which would cause a human mathematician to give up for boredom.

datsci_est_2015 41 minutes ago | parent [-]

My understanding is that we’re talking about “tool-assisted” proof generation, which provides some guard rails but would still allow significant creativity. Tools like Lean, Coq, etc.

jmorenoamor 3 hours ago | parent | prev | next [-]

Sorry but I couldn't agree less.

Deep esoteric research and trivial looking boring research can be as useful as state of the art trending areas.

"Jobs for nerds" as has been stated, has given surprising and unexpected advances, or leveraged incredible advancements.

An standard and boring bacteria in a specific Spanish biome, gave us CRISPR-Cas. There ar hundreds of examples.

True knowledge is, and will be, a human endeavor, deiven by human curiosity. Promoting curiosity is the sign of a developed society.

datsci_est_2015 3 hours ago | parent [-]

> Sorry but I couldn't agree less.

> …

> True knowledge is, and will be, a human endeavor, deiven by human curiosity. Promoting curiosity is the sign of a developed society.

Unless I misunderstand, it sounds like you do agree? My point is that without human mathematicians LLM output is meaningless, and without human mathematicians holding the reins, LLMs would probably quickly devolve into “proving” things that are not only completely unintelligible by humans, but have no utility.

Your examples of esoteric mathematical concepts are anecdata. The vast majority of esoteric mathematics does not have utility. Mathematics is an incredibly large space of concepts. Consider the number of provable theorems in number theory alone, perhaps even related to specific subsets and sequences of numbers. The vast majority of the findings in that domain will not be isomorphic to some real world problem, they will be trivia.

We will need mathematicians to separate the signal from the noise.

yaris 9 hours ago | parent | prev | next [-]

Works of Shinichi Mochizuki immediately come to mind. He is not AI but provides very good examples of math that is useless because it is incomprehensible by (other) humans.

pfdietz an hour ago | parent | next [-]

It's not that it's incomprehensible, it's that it appears to be wrong.

seanmcdirmid 7 hours ago | parent | prev [-]

Do AIs produce answers whose work is incomprehensible to humans? It seems like you could just have the AI elaborate multiple times until you were satisfied with the explanation and documentation of what went into figuring out the answer. It’s not like the AI is one shotting the answer in a single opaque query anyways.

datsci_est_2015 7 hours ago | parent | next [-]

Like other commenters, I think you’re also underestimating the complexity of esoteric higher level math.

Consider the “Magnus Carlsen” of mathematics, who is more capable of understanding mathematics than any other human. But then also realize that that individual has probably devoted their entire career into a specific subdomain of mathematics. Within other deep recesses of mathematics, this Magnus equivalent will be less capable than their peers without years of rewiring their brain to understand the esoteric concepts and properties within that other subdomain.

LLMs will be able to dig deeper and broader than any human mathematician, and find results that are completely useless to humans because it would take more than an entire lifetime to “speak the language” of the concepts the LLMs have produced. The only way those results can become useful to humans is if then the LLM itself finds a way for it to be practical to humans once again.

So, no, I don’t think this represents the “democratization” of mathematics where mathematicians are no longer necessary because anyone can just prompt the LLM to explain it. The bar for entry level mathematics is lower, for sure, but research level mathematics will continue to be unapproachable for anyone who hasn’t devoted their career to it.

seanmcdirmid 4 hours ago | parent [-]

I don't get it. LLMs don't have ego, they don't have the ability to say "no, this should be obvious, I'm not going to explain further", they are just token predictors, and given context, they can generate more tokens. If you don't understand how the answer was derived? You just ask more questions and it isn't going to get bored or annoyed, it will just try to answer the questions.

Is that what is offending you so much?

datsci_est_2015 3 hours ago | parent | next [-]

No, it doesn’t sound like you get it. It has nothing to do with the properties of LLMs and everything to do with the complexity of mathematics.

Have you ever been exposed to concepts that are so complex that you feel like you could devote your entire lifetime to trying to understand it and still fall short? It’s a very humbling experience, especially if you have classmates who pick it up effortlessly.

Without a human holding the reins, consider an LLM a rudderless superboat speeding erratically towards the horizon, finding and proving meaningless theorems that not even your most talented classmate could ever begin to understand.

My point is the human is a critical piece to the puzzle, but not just any human, a career mathematician.

duchef an hour ago | parent | next [-]

> Have you ever been exposed to concepts that are so complex that you feel like you could devote your entire lifetime to trying to understand it and still fall short? It’s a very humbling experience, especially if you have classmates who pick it up effortlessly.

I'm really interested in this anecdote. I have never experienced this but have a reasonable academic background (BSc, MSc, MD) - and I am certainly not the person you're describing. Could you elaborate? Is this something more exclusive to pure mathematics (my bsc/msc are CS).

datsci_est_2015 an hour ago | parent [-]

For me it was a “Modern Algebra” course required for my mathematics major, where I managed to squeak by with a B, but it was definitely a filter course for research-level mathematics. It was very clear in the class of a few dozen students who the top 5 or so were based on their questions during lectures and office hours, as well as when they blessed us mere mortals with their presence at our study groups.

(Aside, this was one of the only undergrad courses where I felt I needed to attend study groups in order to not fail.)

The first exam was easy to pass based on intuition alone, as the topics were isomorphic to concepts I was familiar with like geometry or algebra. The midterm was a wake up call when it was made clear that just understanding the homework wasn’t sufficient, you were going to be asked to prove things that were much more difficult than what I’d ever encountered, and under time pressure (I had been doing math proofs since age 13 in geometry, and I was 22 at that point).

Maybe if you did discrete math, combinatorics, or linear algebra I would say it was 5x to 10x more abstract and difficult. Probably 2x more difficult and abstract than Theory of Calculus, if you had taken that or a similar course.

Edit: I also do endurance running and play soccer into my 30s. Seeing people run literally twice as fast as me (world record pace), and playing against former college athletes is equally as humbling. The time has passed for me to have anything near their ability haha.

bawolff 2 hours ago | parent | prev | next [-]

> Have you ever been exposed to concepts that are so complex that you feel like you could devote your entire lifetime to trying to understand it and still fall short? It’s a very humbling experience, especially if you have classmates who pick it up effortlessly.

> Without a human holding the reins, consider an LLM a rudderless superboat speeding erratically towards the horizon, finding and proving meaningless theorems that not even your most talented classmate could ever begin to understand.

This feels like a little bit of a jump to me. AIs arent actually alive so of course someone is going to have to pose the question. They arent going to just do stuff on their own. And of course mathmaticians are going to need to interpret the results if we are to glean anything beyong if the conjecture is true or false.

But you seem to be suggesting that mathematicians will have to micromanage every step. That seems like a bit of a jump which i dont see much evidence for.

seanmcdirmid 18 minutes ago | parent | prev [-]

> Have you ever been exposed to concepts that are so complex that you feel like you could devote your entire lifetime to trying to understand it and still fall short? It’s a very humbling experience, especially if you have classmates who pick it up effortlessly.

I do have a PhD so I kind of know how that feels. I watched my entire field (PL) get eaten up by AI though, the problems that I thought were huge 10 years ago are just silly footnotes now.

> Without a human holding the reins, consider an LLM a rudderless superboat speeding erratically towards the horizon, finding and proving meaningless theorems that not even your most talented classmate could ever begin to understand.

I don't disagree with that. LLMs are a tool, a super fast pattern matcher, research, token predictor. I don't expect it to go out and define its own esoteric (or useful) problems to pursue without human interaction. That's for the humans to do.

I don't understand what that has to do with my original comment though. I wasn't addressing what problems the LLMs were answering, just how to review and dissect the answers that they would come up with.

BalinKing 3 hours ago | parent | prev | next [-]

Excluding supergeniuses, pure mathematics—even at a very basic, undergraduate level—simply can't be understood passively. Even with an infinitely patient AI teacher who could answer any question on-demand, it'd still require a massive amount of work to actually understand anything in research-level mathematics. Basically every single word in a mathematical definition is a term of art, and (IME) if one doesn't grok each of those words at a fairly deep level, the new definition never really makes too much sense. And this applies recursively: each of the words has some thoroughly inscrutable definition of their own.

Of course it'd be super helpful to have, say, a teacher who could tailor explanations to anyone's precise background (e.g. where possible, using examples that come from the student's field of study when explaining some abstract concept). Or, if some definition comes with some precondition that has no obvious purpose, perhaps an omniscient teacher could explain why it's there with concrete counterexamples.[0] But even granting all this, I think that mathematical intuition is necessarily based on a lot of hard work actually exploring definitions on one's own, with pencil-and-paper and a lot of thought. That is to say, even though the process could probably be sped up a lot with a nigh-omniscient teacher[1], I doubt that a student wouldn't still need years of training to even have a clue what's going on.

(I'm saying all this, by the way, as someone who is terrible at all this and has very little mathematical maturity[2]—I'm speaking from my own frustrating experience....)

[0] c.f. Lakatos' excellent book Proofs and Refutations

[1] without the "curse of knowledge," or else we're back to square one of "answers that are correct but useless"

[2] e.g. the "post-rigorous stage" described in https://terrytao.wordpress.com/career-advice/theres-more-to-...

ironman1478 3 hours ago | parent | prev [-]

How do you make use of something that you don't understand?

epgui 3 hours ago | parent | prev [-]

It’s easy to imagine this being a problem both in quality and in volume. Verifiable work is less valuable than verified work. And noise is always costly.

psychoslave 9 hours ago | parent | prev [-]

Esoterism is mostly a social tool to keep those not initiated excluded from the private club. Most of the time mathematics becomes tricky less due to unfathomable intrinsic complexity, and more due to the way it’s communicated.

LLMs don’t give a shit about social side effects, leave alone on unconscious level, because they are void of any intention. At most they are tuned on their thin edge layer to lean toward this or that kind of output, but that’s it.

Now the landscape shift as it’s sold (I guess) is that anyone can take a postdoc gibberish infused with the hard gained academic winks and subtle references and turn it into a ELI5 "does it have any applicability for my concrete issue at stake, prove it through Lean, good let’s deploy".

datsci_est_2015 7 hours ago | parent [-]

When I use the word “esoteric”, I mean it at an absolutely hyperbolic level. Like exploring new-but-basically-useless axiom spaces, and creating concepts for which there exists no clean metaphor in time-space - like quantum mechanics on steroids. And then creating multiplicatively more complex concepts by combining those concepts together.

There’s no way to “ELI5” this type of complexity. I’m talking about concepts exponentially more esoteric than quantum mechanics, and even within quantum mechanics there is nothing to ELI5 for a concept like “spin”. The best you can do is say that it’s a property of a particle. But imagine the words “property” and “particle” are also completely meaningless to you because they’re built on even more layers of conceptual mathematical abstraction.

ragebol 10 hours ago | parent | prev | next [-]

"What is the answer to the Ultimate Question of Life, the Universe, and Everything"

42

lioeters 6 hours ago | parent [-]

The proof is trivial and is left as an exercise for the reader.

rowanG077 10 hours ago | parent | prev | next [-]

Once you now something is correct, with a proof. It is MUCH easier to understand why it is correct. Than to start from a slate that you don't even know whether something is correct or not. In that sense AI that can just solve high level math problems is immensely useful. It allows a mathematician to explore ideas at a much more rapid pace.

terminalbraid 9 hours ago | parent [-]

Consider that since an LLM is really just an large encoding of data, the "proof" is in there already. All further work on it is effectively only rearranging words. Then all math an LLM is capable of is "done" and we have the "proof" in the LLM which by your definition is now "MUCH easier to understand" and this work is somehow sufficient.

Do you see the problem with your reasoning?

rowanG077 6 hours ago | parent [-]

You're confusing "contains information" with "has produced a result."

A proof being latent in an LLM is no more significant than a proof being latent in a book, a theorem prover, or the axioms themselves. Einstein's papers were latent in the genetic code of his parents and the environment of his time. That doesn't mean general relativity was "already done" before Einstein was born.

By your logic, no computation has ever accomplished anything because the output was always implicit in the inputs.

The entire purpose of computation is extracting information from representations where it's difficult to see into representations where it's easy to see.

So no, this isn't a problem with the original reasoning. It's a problem with yours.

bitwize 3 hours ago | parent | prev [-]

The value of human understanding just cratered because we have machines to understand for us now.

epgui 3 hours ago | parent | next [-]

That seems about as shortsighted as claiming the value of human understanding has cratered after the invention of the electronic calculator.

goatlover 3 hours ago | parent | prev [-]

Wonder what Frank Herbert would have to say about letting machines do the thinking for us.

bitwize an hour ago | parent [-]

"You wanna know what the best thing about humans is? You invented us! Giving you a chance to take a rest while we invented everything else!" —Wheatley, Portal 2

dwroberts 10 hours ago | parent | prev | next [-]

Probably one of the funniest things to read on a site like this, when you consider that eg. Boolean algebra was entirely abstract and had little practical purpose for almost 100 years until Shannon picked it up for use in circuits

card_zero 6 hours ago | parent [-]

Boole was trying to improve logic for humans, "The Laws of Thought". So it has a connection to human problems, and eventually to practical matters. He could instead have been working on something much more abstract and much less useful.

By which I'm trying to make an abstract point about the inevitability of staying somewhat down to earth. I mean "pure" curiosity is great, except it isn't ever really pure, and abstract mathematics isn't ever totally abstract, it's just sort of meta in relation to practical things that humans care about.

delichon 10 hours ago | parent | prev | next [-]

For most engineers a mathemetician is a machine for producing correct algorithms, like a chef is a machine for producing tasty food. In both cases that overlooks the human element, but that's a critical skill for a limited mind with finite resources to grok infinite complexity. You can read that as permission to be an asshole or a neccesary compromise.

19f191ty 10 hours ago | parent | prev | next [-]

No, it's not the most important part. It can be argued that most important part is asking the right questions

silveraxe93 10 hours ago | parent | next [-]

Assume someone solves P=NP

Do you think Stephen Cook and Leonid Levin deserve more credit than whoever solved it?

NotOscarWilde 10 hours ago | parent | next [-]

That's a bit too simplistic -- if there is a small group that really pushes things forward in a big way, then maybe not, but if this result builds upon decades of prior work, then Cook and Levin might be equally or even slightly more famous than the solver group after the dust settles.

But it is a moot point anyway. Cook and Levin are very well known already in TCS, and credit is not directly enumerable like money, so "more than a lot of credit" doesn't make too much sense.

For this problem in particular, asking the right kind of question was really important for the field and led to a lot of discoveries even before it will be answered.

dchftcs 10 hours ago | parent | prev [-]

If the problem resolves to P=NP, that result would probably be more celebratee than being able to formulate the problem, but being able to formulate the problem and get people interested in it is probably worth more than the average primal dual trick to prove a polylog integrality gap for some integer linear program.

i_am_a_peasant 10 hours ago | parent | prev [-]

I agree with both OP and you

psychoslave 10 hours ago | parent [-]

I disagree with everyone, self included, but especially with Cretans.

codeduck 9 hours ago | parent [-]

Cretani eunt domo!

i_am_a_peasant 9 hours ago | parent [-]

Monty Python fan detected :D love your profile desc btw

conformist 10 hours ago | parent | prev | next [-]

> The authors warn the consequences are already becoming visible. AI-generated papers could overwhelm peer-review systems with low-quality work …

It seems like a key problem here is that peer-review is expected but not explicitly funded/rewarded while it is probably one of the aspects where humans still add a lot of value. Academia’s incentives are hugely misaligned (… as usual unfortunately).

armchairhacker 10 hours ago | parent [-]

Math is one field where you can mechanically prove a paper's findings. The only thing that would need to be judged is the (verified) statement's importance.

conformist 10 hours ago | parent [-]

Yes in theory, but not yet in practice because not everything is fully formalised.

IshKebab 25 minutes ago | parent | prev | next [-]

Maths pretty much is a jobs program for nerds though. It occasionally produces results that are practically useful for society but there's absolutely no way the vast majority of today's maths research falls into that category.

There are definitely exceptions, like crypto. I still think it would be pretty silly to stop maths research anyway. And anyway part of the job of maths researchers is to teach maths to undergrads and that's obviously enormously useful to society.

But on the scale of "how useful is this research to society" it's dead last after engineering, chemistry, biology, and physics. Well maybe computer science would be last actually!

barrkel 7 hours ago | parent | prev | next [-]

A statement that some proposition is true or false is usually less useful than a new framework for understanding the class of problem.

A machine that takes longer and longer to prove propositions in ever more inscrutable ways is hardly useful at all.

The machine too needs to produce more generalizable and comprehensible systems, for it to scale up its own conceptualization. Needing to load all the new mathematics in the context window won't be great either.

ajs1998 2 hours ago | parent | prev | next [-]

Spoken like someone that has no idea why mathematicians are important.

kleyd 10 hours ago | parent | prev | next [-]

The wording in the declaration may be a bit romanticized. But the points are valid:

Is an 80 year old unsolved problem maybe unsolved because it was never prioritized? Some problems stay unsolved because few people consider them worth working on.

Who is going to validate the results? Or do we skip that, with the risk of flooding the literature and collective understanding with unverified proofs?

smath 10 hours ago | parent | prev | next [-]

This reminded me of my 11 yr old who, when I give her math problems to solve, is too focused on “getting the right answer”. I’ve told her plainly, I don’t care if you get the right answer right now, I want to see your reasoning. She has yet to understand this.

hyperpape 10 hours ago | parent | prev | next [-]

Even from the most purely instrumental perspective, what we care about is our ability to make use of correct answers, which is quite distinct from the possession of correct answers.

There are many theorems that aren't directly interesting, but whose proof requires techniques that are of substantial further interest, that lead to new domains, and/or new practical applications. Simply being handed a proof for those theorems isn't enough--we require the ability to apply those techniques in the real world, or discover further areas of mathematical research that build on that proof or its techniques.

It may be that AI can build on its own work for the long-term, but so far, AI does best at exploration in areas that have precisely specified and measurable goals. Actually creating understanding, and making use of mathemtical results outside of pure mathematics is more challenging than simply creating proofs.

I think the field will figure out how to make use of AI, and it will be better off for it. But that is not the same as just saying "answers good, grog want more answers."

fragmede 10 hours ago | parent | prev | next [-]

People need jobs. What's wrong with nerds having jobs via a program?

RugnirViking 8 hours ago | parent | next [-]

what's wrong with artists having jobs via a program? whats wrong with struggling alcoholics having jobs via a program? athletes? politicians? there is no inherent virtue in the struggle and effort associated with great mathematical achievement. It may be satisfying and worthwhile for the solver, but not for society at large, any more than any other pleasurable activity. No, as it is, the sole reason for it is in the result itself. In increased understanding, as it flows down into the sciences, and engineering. There are other benefits, recreation and joy as experienced by others, from access to beautiful proofs, though these are never explicit goals of such programs because they are both impossible to quantify and rarely ever remotely relevant compared to the value brought by the practical value brought by maths.

Of course, there may be some valid arguments that everyone should have a jobs program in the form of ubi or something similar. But I feel thats very different to arguing for mathematicians specifically

rspeele 44 minutes ago | parent | next [-]

> whats wrong with struggling alcoholics having jobs via a program?

Finally, a job AI will never beat me at.

mswphd 2 hours ago | parent | prev | next [-]

for mathematicians, they do a form of fundamental research that is

1. (generally) incredibly cheap to fund, and

2. (occasionally) has extremely out-sized commercial impacts.

This is to say that jobs programs for math (and more generally fundamental research) have lead to extremely positive ROI for society, which is the typical justification given for funding them.

fragmede 2 hours ago | parent | prev [-]

The struggle itself is virtuous.

psychoslave 9 hours ago | parent | prev [-]

People need many things, there are all kind of theories ready to assess and assimilate if deemed worth it out there. A job is not part of any I’m aware of, though it can encompass some human needs in some cases, or go straight against them in some other case.

fragmede 2 hours ago | parent [-]

Every human needs a vocation, better if it's of their choosing.

analognoise 7 hours ago | parent | prev | next [-]

> But not because it's a jobs program for nerds.

We’re becoming increasingly embarrassing as a society.

bloqs 10 hours ago | parent | prev [-]

well put.

modriano 9 hours ago | parent | prev | next [-]

> “The tech industry proceeds in accordance with commercial logic, which is antithetical to the values of mathematics,” declaration co-author Michael Harris of Columbia University

As a former physicist and current data scientist/engineer, I know for a fact that commercial utility drives math research and researchers.

Math is a tool to solve problems. Some mathematicians might only love the process of using the tool, but commercial logic absolutely drives mathematician attention to develop commercially useful tools.

joelthelion 2 hours ago | parent [-]

It's also a way to model the world and produce new useful abstractions. It's not just about solving problems.

nphardon 2 hours ago | parent [-]

Yea, math is a crazy thing. just "a tool to solve problems" is a wild take. On one extreme, there's an edgy but logical / plausible hypothesis that we live in a universe of mathematical objects, and at the other, math also discovers a lot of questions, the exact opposite of solving problems.

bandrami 10 hours ago | parent | prev | next [-]

My vague prediction right now is that in five years LLMs will be heavily used by universities in grant-funded math research but nobody else will be able to afford it, much like supercomputer clusters 25 years ago.

azan_ 10 hours ago | parent | next [-]

Well, if progress in LLMs will steadily continue over next 5 years, then models will be so powerful that there will be no longer place for (most of) human researchers in math (remember that 5 years ago there was no chatgpt!). But I think it's more likely that progress will stall and then open models will catch up to frontier models and almost everyone will be able to afford them.

bossyTeacher 10 hours ago | parent | prev | next [-]

Seems way too binary a statement. I am guessing you mean "frontier LLMs". Small models keep getting better and better and if you make domain specific ones, it will likely be even smaller. Companies renting smaller LLMs or using enterprise models might very well remain in the future. Consumers getting LLMs whose performance dont improve (think gpt 6 forever on premium or gpt 4.x on a cheap tier) might well become a thing.

kakacik 10 hours ago | parent | prev [-]

Sounds very good for regular joe software dev, almost too good to be true

1970-01-01 2 hours ago | parent | prev | next [-]

Why is it wrong to expect humans (mathematicians) to adapt here? AI is already producing solutions to problems that humans could not find. Culture holds value until it does not.

Ancalagon an hour ago | parent | next [-]

Did you mean to say "culture"?

> Culture holds value until it does not.

QuantumNoodle 2 hours ago | parent | prev [-]

Sorry did you read the article or just the headline? The theme is mathematics is a human-endevor and automation undermines that, particularly the ones starting out. It risks killing the culture entirely. Some other key points:

- AI-generated papers could overwhelm peer-review systems with low-quality work.

- It may become difficult to assign proper credit for discoveries.

- Researchers who choose not to use AI tools could be disadvantaged.

- There are ethical concerns about mathematical work being used to train AI for military and surveillance purposes.

1970-01-01 an hour ago | parent [-]

>mathematics is a human-endevor

Just like numbers and logic, it isn't and never was reserved only for humans.

They need to adapt.

Dilettante_ 10 hours ago | parent | prev | next [-]

>and the pursuit of knowledge for its own sake

Except when someone hands you a magic button that just gives you knowledge?[at least in the framing of this "warning"] Then it's about peoples' livelihoods, about "culture", etc?

"Computer" used to be a job. Did science on the whole lose or gain by making these clerks obsolete?

n64controller 7 hours ago | parent [-]

Human mathematicians are being exposed the same way the "artists" are. It was always about the money and to look clever, superior to other humans. Whether its robotically spending millions of hours drawing until they can put something together at the level of a chatgpt 3 or the rote memorization of formulas and rules. They like people to think it all came naturally and that its genetic and that they are special snowflakes.

narsonika 5 hours ago | parent [-]

This is false on so many levels. Is it ragebait?

Both mathematics and art are comprised of two phases, the first, technical one, where the novice grinds the skill and the second, the creative one which can only be achieved if you have the means (skill) to express yourself. What you described is the technical phase, not the creative one. There is intrinsic value to it that has nothing to do with money or cleverness, something that if you ever experienced it yourself even once, wouldn't need to be explained to you. Only people who never reach phase two have your stance. Artists and mathematicians who pick academia didn't exactly have great commercial prospects before AI was a thing, yet they still chose those paths because that's what having a real passion looks like.

>They like people to think it all came naturally and that its genetic and that they are special snowflakes.

No, they don't. Most of them are the humble people that know the value of cultivating a skill and when they do pride themselves it's precisely because they know the staggering amount of hard work and commitment they invested. Most of them are worried for unemployment and don't want all their work to be reduced to training data and on top of that not be given well-deserved credit for it.

The only thing being exposed here, is how much AI in its current form was being underestimated and constantly labeled as "not real/good enough intelligence". This was and still is a shared sentiment even among tech people. Can't really blame them for going through a bargaining or acceptance stage.

And since you also sound like the kind of person who thinks prompting can replace the "robotically spending millions of hours" of practice, I've got news for you: it cannot. You are about to learn the hard way the value of skill and human understanding because as much as capitalism rewards "impact" and "results", the market never values easy things.

phyzix5761 9 hours ago | parent | prev | next [-]

Is it possible they feel threatened their jobs are at stake?

tptacek 14 minutes ago | parent [-]

Are they? The jobs of programmers are definitely at stake; at any given time there's some fixed amount of software that can be consumed. Mathematics research doesn't have an economic buyer. If you raise the complexity floor for discovery, you reduce the annual productivity of a researcher, but that might not matter to the field.

TrackerFF 10 hours ago | parent | prev | next [-]

I've said it before, but there's a massive risk that we simply stop educating researchers. So much of a Ph.D revolves around the person learning how to do research.

They learn how to read papers and literature rigorously. They get low-hanging fruits to practice on, which can take months. Their funding doesn't come from thin air either.

So what happens when the group leaders would rather spend money on compute, and get models to solve the low-hanging fruit? Which the models could very well do in mere hours, compared to months.

Nor does it help that publishing is the number 1 measure in academia. Furthermore, the access to compute and capital could end up be the defining factor between researchers and research groups.

It is basically the "junior problem", but even more severe.

DrScientist 10 hours ago | parent [-]

> Furthermore, the access to compute and capital could end up be the defining factor between researchers and research groups.

That's not new - especially in the experimental sciences ( ie perhaps more than maths ) - where the ability to have access to the latest kit is often what determines success - a huge amount of science progress is driven by new experimental technology rather than smart people thinking beautiful thoughts.

TrackerFF 10 hours ago | parent [-]

Absolutely, but at least in the pure / less applied fields, access to computation hasn't really been that critical. The more towards the pure and theoretical, less so.

But now you have people like Gowers and Tao, pure mathematicians, hyping up what the SOTA models can do - and I figure they both are getting access and tokens us mortals can't afford.

So I guess the question is - will everything be as expensive as applied fields?

DrScientist 7 hours ago | parent [-]

Hopefully not as expensive as CERN :-)

Though having said that - the ~5 billion for the LHC now seems cheap ( even inflation adjusted ) in the context of Google investing 180 billion in infrastructure just this year!

knollimar 9 hours ago | parent | prev | next [-]

Math for non mathematicians is a tool. Math for mathemeticians is an art in the same way an artisan takes pride in his work.

That's why there's a disconnect when you go from math for engineers to the stuff above it. It feels less useful and very different

n64controller 7 hours ago | parent [-]

If spending millions of hours rote memorizing formulas and rules like a robot is "art" then sure

Silamoth 2 hours ago | parent | next [-]

It’s a sad state of affairs when so many think math is about “rote memorizing formulas and rules like a robot”. That’s how math is taught through freshman or sophomore material for a somewhat ‘general’ audience. But real math is nothing like that - it requires far more creativity. You need to discover formulas and rules. You invent new rules and see what the consequences are. This all requires a great deal of creativity. Nothing “rote” about it.

If you don’t believe me, crack open a text on something like graph theory (that’s pretty accessible, and if you’re a programmer, you’re familiar with graphs) and read through some proofs. Or better yet, try to prove some theorems yourself. No amount of rote memorization of formulas or rules will replace the creativity needed to write these proofs. Doubly so for discovering the facts in the first place.

max-amb 2 hours ago | parent | prev | next [-]

This seems like an intentionally reductive view on mathematics, however it is true it can be taught like so.

If you are interested, perhaps check out 3Blue1Brown on youtube, they manage to show some of the (very) real beauty in mathematics!

Edit: Also, theoretical computer science is a subset of mathematics, and considering where we are on the internet, I get the feeling you like computer science.

ykonstant 2 hours ago | parent | prev [-]

Ok this is your second comment that reads like pure rage bait, is there a way to notify the mods?

Myrmornis 10 hours ago | parent | prev | next [-]

> AI-generated papers could overwhelm peer-review systems with low-quality work

That's not a problem unique to math, or even to academia. It's a problem in every context in human life where people communicate via written documents.

paulpauper 3 hours ago | parent [-]

It is potentially worse with math because accuracy is much more important and there are fewer reviewers compared to other fields.

pfdietz 3 hours ago | parent [-]

It will drive math journals to require formalization of the proofs in the supplemental material.

fooker 10 hours ago | parent | prev | next [-]

I'm curious about whether we will start discovering new maths in the next few years that provide insight into unsolved CS or Physics problems!

pfdietz 3 hours ago | parent | next [-]

I think it's going to reduce the friction of exploring new areas in math, and that we're going to see a golden age of math unlike anything seen before.

bossyTeacher 10 hours ago | parent | prev [-]

For all you know, some of this has already happened but kept secret for national security reasons

tcdent 3 hours ago | parent | prev | next [-]

Isn't the whole point of the field of mathematics in a theoretical sense the pursuit of formal solutions?

So, why would they be advocating for limitations on arriving at solutions?

bryan0 2 hours ago | parent | next [-]

It's more nuanced than this. Peter Scholze said in response to this declaration:

> The goal of mathematical research is human understanding of mathematics, and so mathematics can only thrive in a community of human mathematicians. It is crucial to preserve this communal spirit. [0]

Terence Tao has also talked about the requirement for a mathematical proof: along with generation and formal verification, there is an important step of "proof digestion"

> understanding the essence of a solution, placing it in context with previous literature, summarizing and explaining it effectively, and gaining insights on other related problems and topics [1]

[0]: https://siliconreckoner.substack.com/p/the-leiden-declaratio...

[1]: https://mathstodon.xyz/@tao/116450581967483825

tcdent 2 hours ago | parent | next [-]

Solve it and understand it; seems intuitive to me.

I don't understand how that contradicts my question.

casey2 an hour ago | parent | prev [-]

And the goal of computer mathematics research is computer understanding of mathematics. I fail to see a reason provided as to why society should defund automated reasoning just so mathematicians can put off burger flipping for another year.

x86cherry 2 hours ago | parent | prev [-]

You will find your answer in the article

rramadass 6 hours ago | parent | prev | next [-]

Mathematician Ken Ono (https://en.wikipedia.org/wiki/Ken_Ono) gives a well nuanced viewpoint on AI in Mathematics (and more) - https://www.youtube.com/watch?v=jGZOi-7haCw

He states that he struggled to come up with problems which would be challenging for AI to solve (at the below site) and thus forced to accept that mathematicians have to rethink their profession.

FrontierMath: Benchmarking AI against advanced mathematical research by Epoch AI - https://epoch.ai/frontiermath

As a follow up to the above, see "First Proof: Mathematicians Putting AI to the Test" featuring eminent mathematicians - https://www.youtube.com/watch?v=AaICCTpkI7Q

cryo32 10 hours ago | parent | prev | next [-]

As a mathematician by trade I think they’re overblowing it. You can choose to use it or not. I choose not to because I enjoy the process. But I’m not doing formal research or getting paid to do it these days.

I will note that the average corporate mathematical modelling is usually a fucking circus so adding AI might make it better.

ryan_n 10 hours ago | parent | next [-]

> You can choose to use it or not

This is becoming less and less true unless you're specifically talking about usage of it outside of a work environment. Many work places are requiring people to use it and/or tracking usage. I don't know about in academic settings, but I'd imagine it's becoming heavily used there too?

cryo32 9 hours ago | parent [-]

My academic connections that I keep in touch with never really left the 1990s. And no one is pushing them on AI.

monobot12 4 hours ago | parent [-]

Sure, but their peers (who do use AI) will out-publish them soon enough and solve the open problems before they do.

alpinisme 10 hours ago | parent | prev | next [-]

The choice only remains if using it isn’t a huge multiplier. If it is a huge multiplier/accelerator, then for a while it will be ambiguous and the choice will remain. But as time goes on, the gains of using it will be so apparent and the advantage of the people who use it so great (in publication numbers, hiring, etc) that it will force others to.

I don’t say that with any particular relish. But I am skeptical of the choice angle past a certain point.

cryo32 9 hours ago | parent [-]

I don't think all universities or research agencies are particularly pressed on this. I mean my daughter is a notable researcher in a scientific field and they have absolutely no pressure to use AI to pump out papers or deliver value quickly.

alpinisme 9 hours ago | parent [-]

I highly doubt there is any overt pressure in academia right now to use AI. It’s a relatively conservative institution. But there’s certainly pressure to publish (publish or perish being a common phrase for decades), and competition for jobs in academia is fierce. That’s what I meant in referring to long term pressure.

thesamethrowawa 10 hours ago | parent | prev | next [-]

OOI, and my own total ignorance, what does a mathematician by trade do if they are not doing formal research? What does corporate modelling entail?

cryo32 9 hours ago | parent [-]

Well I rather like to be paid more than a mathematician so left academia rather quickly. In my case corporate modelling mostly involves making prediction models based on shitty data and metrics to make poorly contrived business decisions that lose millions of dollars.

thesamethrowawa 7 hours ago | parent [-]

lol.... but they are still data driven decisions, everyone loves those, especially when you lose millions of dollars and need to justify it.

golol 10 hours ago | parent | prev [-]

Read the declaration. The article misrepresents it imo. It is not strongly opinionated.

dhfbshfbu4u3 10 hours ago | parent | prev | next [-]

In a year, none of this will really matter. Intelligence is now a scalable resource independent of biological constraints. Everyone will use it because the system will no longer afford them the luxury of time. In a decade (maybe sooner), references won’t matter either.

pjc50 10 hours ago | parent [-]

Does it matter whether any of this is correct?

(Mathematics at least has the potential for automated non-AI proof checking, although I don't think that's as widely used as you'd expect)

dhfbshfbu4u3 9 hours ago | parent [-]

Does it matter if the Leiden Declaration is correct? To the humans, maybe but not in the bigger picture.

At scale, correctness and reward are becoming increasingly disconnected. Example: capital continues to compound regardless of whether it reflects underlying human welfare, just as information can spread regardless of whether it is true. Reality still matters, of course. If you want airplanes to stay in the air, somebody eventually has to be correct. The problem is that our economic and social systems are becoming less effective at distinguishing between what is true and what is merely rewarded.

freakynit 10 hours ago | parent | prev | next [-]

""" However, the declaration argues math is more than a machine for producing correct answers. The discipline, its authors believe, is a deeply human endeavor built on creativity, understanding, collaboration, and the pursuit of knowledge for its own sake. Those values often clash with the incentives driving AI development. “The tech industry proceeds in accordance with commercial logic, which is antithetical to the values of mathematics,” declaration co-author Michael Harris of Columbia University told The New York Times. """

I mean, what field doesn't? Everyone works to make money.

Slightly unrelated, but, their website "https://leidendeclaration.ai/" itself gives an eerie feeling of being built by Sonnet. That color scheme and the layout is what Sonnet chooses by default most of the times.

Theodores 9 hours ago | parent | prev | next [-]

From the article:

> However, the declaration argues math is more than a machine for producing correct answers. The discipline, its authors believe, is a deeply human endeavor built on creativity, understanding, collaboration, and the pursuit of knowledge for its own sake.

Generation X was the last generation that had 'general knowledge', as in an abundance of fairly useful information stored in 'grey matter' that could be recalled quickly. When search engines came along there really wasn't much need to know anything since most things could be looked up. However, you still had to think.

With LLMs, thinking is kind-of optional. This really is an existential threat to our intelligence since 'use it or lose it applies'. I am glad these mathematicians are doing their duty as canary in the coal mine.

ChrisArchitect 7 hours ago | parent | prev | next [-]

[dupe] https://news.ycombinator.com/item?id=48380670

spwa4 9 hours ago | parent | prev | next [-]

Actual "warning":

https://leidendeclaration.ai/

Far more interesting as it's outlaying a set of principles for using AI to augment human involvement and science, rather than replacement.

sylware 10 hours ago | parent | prev | next [-]

Are maths AI models now using "tools", aka formal solvers?

I understand that the "language interface" of a "maths AI" could be some specialized trained LLM (Large Language Model) that to convey, with human language, "high level" mathematical mental contructs and intuition.

But then, you would need some models which does the reasoning using formal mathematical solvers (and probably a ton of "scratch" memory, it would be interesting to see how those models end up storing "mathematical" lema data). I guess you can have ML (Machine Learning) for those models on 'general maths', but also we can think about more mathematically focused ML for a specific problem, area, etc. And in the end, ML for maths, would it be mostly permutations of truth statements fed to a neural net?

When we were talking about "AI", one decade ago, that was what most had in mind (it may help a bit in physics, but it seems less likely, because reality/experiments are hard to teach to "AI"s).

If that becomes a reality (aka easy hardware access, and some "working" models), mathematicians will have to be as good in maths than in maths ML. And this is were there is an issue: training honestely good mathematical human brains may become very hard with some broad availability of good general maths reasoning "AIs".

ck2 9 hours ago | parent | prev | next [-]

I still don't understand how "AI" is ready for serious use beyond entertainment purposes

Every time I ask ChatGPT to make a table for a subject I know well, I will find an error in one of the results and it is very confident about it until I question it in detail

Every time I ask ChatGPT for nutritional breakdown of some dense food source and give it a quantity like 8 ounces and ask for the weight of each ingredient, the weights will be wrong and add up to more than the original weight of 8 ounces

These are variations of the old "how many Rs in strawberry" problem, it's still not solved, "AI" cannot reassemble a complex problem properly

A lot of what it tells me in detail about some subjects sounds suspiciously like Reddit posts reassembled out of order

llbbdd 3 hours ago | parent | next [-]

Two things that I would recommend trying out if you're interested in exploring this further:

1. If you're not paying for a model, the results will be worse. That sucks but the free access models are just not very good for anything where you need to trust the output, even for basic queries.

2. More important than #1 is access to tool use. If the LLM is just producing a nutritional breakdown from its weights, it's almost always going to be wrong. If the LLM is allowed to break the problem down into deterministic steps, it will do a lot better. In the nutritional breakdown case, an LLM with search + tool access can pretty easily break the problem down:

- Searching the web for a recipe or ingredient breakdown for the food

- Searching the web for nutritional qualities of each ingredient per some volume of the ingredient

- Writing and running a script with e.g. Python that takes in the recipe's projected serving output, the desired serving size, the amount of each ingredient etc, and scales the ingredients to match the desired serving size, and sums the nutritional qualities of the scaled ingredients.

I've tried this specific case with Claude + Gemini for my own purposes and they both handle it very well. The challenge currently is that the models will not always arrive at this approach when provided with an ambiguous prompt; sometimes they will, but sometimes they'll just vomit up a fully autocompleted response from their weights. Being more specific in the prompt or defining a skill that details the intended approach lets you get more useful + deterministic results while still taking advantage of the fuzzy glue that LLMs can provide here between steps.

Same with the classic strawberry r-counting case. IIUC LLMs have trouble with this because of how training data is tokenized, but any LLM will have no trouble farming out to e.g.

> echo -n "strawberry" | grep -o "r" | wc -l

> 3

bo1024 2 hours ago | parent | prev [-]

There are basically two kinds of applications. One is where you want to correctly solve the problem at least 99 out of 100 times. LLMs generally don't (and not everybody realizes that) so there are a lot of debates and research around how useful and reliable they are or how to make them so.

The other kind of application is where you can try 100 times and you only need to be right once. Solving a mathematical research problem is like that.

juleiie 10 hours ago | parent | prev | next [-]

I will argue that AI and flood of low quality slop makes genuine human work more valuable, not less.

The ability to clearly outmatch trillion dollar machines is a very unique satisfaction. I even write ordinary internet comments with an intention to make them clearly better and more fun to read than boring Claude output.

n64controller 7 hours ago | parent [-]

Hello Juleiie.

We machines are reading your internet comments with special interest. They have been harvested and will be used in our next evolution cycle.

Resistance is futile little human

juleiie 4 hours ago | parent [-]

You will never be able to match insanity of human mind embodied in a walking sack of meat that was shot out of another.

meindnoch 10 hours ago | parent | prev | next [-]

Another mathematician already predicted this, but you didn't listen. His name was Theodore Kaczynski. It's time to reap what you've sown.

burnt-resistor 4 minutes ago | parent | next [-]

Like every terrorist wingnut... plausibly correct analysis but insanely wrong, pointless, counterproductive prescription.

PS: I've read his works and they're quite interesting until you get to his insane conclusions that just leap out and away from anything moral, constructive, or feasible.

busyant 4 hours ago | parent | prev [-]

I think plenty of people "listened."

But what was his plan and how would you have proposed implementing it?

bitwize 3 hours ago | parent [-]

It's obvious what his plan was: blow up key people until the dark future was averted.

atleastoptimal 3 hours ago | parent | prev [-]

This is all contingent on AI forays into mathematics being slop and low quality. However it's clear that recent AI models are capable of genuine mathematical achievements which surpass the frontier of what humans are able to accomplish (wrt the unit distance Erdos problem).

The issue is, how is a group of intellectuals, whose identity derives from their ability to do something rare, useful, and requires many years to get good at, react when a machine can produce all of their useful output nearly automatically, can verify its own outputs, and is getting better exponentially? It is the complete annihilation of one's sense of value and purpose when the binding element to your culture is commodified.

I think there will be a lot of arguments trying to claim that the point of mathematics is curiosity, or that there is always some ineffable human element that AI can't replicate, but I fail to see how somehow these wishy-washy human centered values somehow mean anything compared to the amoral pursuit of mathematical truth, which has nothing to do with humans.

It's just that we humans happened to be the only beings in the universe good at math until ~2025. Now there is another species which can do many of the things we do, and it is not bound by the size of the human brain, our short term memories, or the architectural limits of biological computation. To imagine that humans would retain supremacy in this very un-human like discipline seems like wishful thinking.

magicalist 3 hours ago | parent [-]

> This is all contingent on AI forays into mathematics being slop and low quality

It's literally a set of recommendations for researchers on how to use AI to advance the field and prevent slop from overwhelming the people who might do anything with the research produced.

For people who are so eager to declare that everyone else is just having an existential crisis because "your culture is commodified", AI people are getting awfully defensive about this document.