Remix.run Logo
AI Built a Nuke and Still Lost(lwilko.com)
57 points by kensai 2 hours ago | 56 comments
fyredge an hour ago | parent | next [-]

There is something to be said about the qualia of LLM generated passages. Each individual sentence reads as a statement and every next statement a continuation of the previous one. This happened, then this happened... Ad infinitum.

Before today, I could not explain to you why AI articles were so obvious to me, but I think I do now. There is no insight to be gleamed. Pre-LLM, authors generally had intention behind their words. The final product might not adequately reflect their thoughts, but word selection would expose it somewhat. With LLMs, sentences flow seamlessly from word to word, but the intention is nowhere to be found. Things happened and more things happened, to what end?

wmwragg 44 minutes ago | parent | next [-]

This and the fact that you often read a sentence, paragraph or the whole article, and think this said absolutely nothing in lots of words.

pjio 8 minutes ago | parent | prev | next [-]

For a limited amount of time I appreciated the level of detail in the article, hoping it would give me more insight, until it exhausted me. I think those two ideas are real takeaways: "Knowing is not doing" and "What can we trust AI to do?". Still, could have been said with a more concise text and maybe a follow up about the details.

doppioandante 19 minutes ago | parent | prev | next [-]

I came to the same conclusion about AI generated code. When I read code written by a human, just by skimming it, I can get a sense of what purpose the code has, why it was written this way and not another way, what style and mindset the programmer behind it has. AI generated code may sometimes be extremely precise and following all the good practices, but I feel no intent behind it.

ramon156 an hour ago | parent | prev | next [-]

It's weird because when you look at models that expose CoT, this does not happen. They switch up every second.

"But then X happened... Wait, didn't Y happen? Then why would X be there? I think the user's initial statement was correct, but then Y happened..."

sph an hour ago | parent | prev | next [-]

> There is no insight to be gleamed.

AI-generated articles are the intellectual equivalent of empty calories.

I have just spent the last 10 minutes trying to figure out why someone decided to buy imgui.org, name-squatting an actual project, just to put a slop website on it mildly referencing the original project. It's not even trying to scam you.

I keep wondering whether these people that keep polluting the internet with their insightless slop even possess self-awareness. What motivates them to expend money and effort to contribute nothing to the world? Are they another example of a philosophical zombie?

soco 29 minutes ago | parent [-]

I cannot tell what about domain squatting, but I've seen a "why" in seemingly innocuous Facebook groups about baking or such, which at the right time slowly transitioned to fake AI pictures and stories, and then to straight-out political propaganda. I'm talking Eastern Europe and russian "special operation" support propaganda. But a slop website won't have enough traffic to be worth such an action, so no idea.

teekert an hour ago | parent | prev | next [-]

It's not this, it's that. And then what happened? This. I did that... This happened.

    It's a thing
    I don't know why
    But it's a thing
To be honest, it's not a thing.

Let that sink in.

Maybe we find most meaning in the least average language constructs.

scotty79 15 minutes ago | parent | prev | next [-]

For me this reads like a report of things that were tried and observed. It was a very pleasant read for me because I'm interested in the subject. And the lack of underlying agenda, moral lesson, politics or, as you call it, insight, was quite refreshing. I became quite allergic to texts where author clearly tries to make me think a specific thing. To sell me something. I usually find the agenda pretty quickly and I know the rest of text is just a fluff around it so I lose interest. And when the agenda is not easy to find then I just get more annoyed because I feel it's intentionally hidden. Like a solution to a clickbait title.

This text reads great for me because as I read it, I clearly saw there's no agenda so I felt safe to just absorb the information that it contains.

mrmarket 13 minutes ago | parent [-]

well, this is a first. never seen someone say they prefer AI slop even when they know it's slop. fascinating.

scotty79 10 minutes ago | parent [-]

I tend to steer clear of the largest herd in many aspects. Often unintentionally. Also I'm not a native speaker so I might be not as receptive to some of the things that offend others in AI generated content.

Maybe AI is sort of anti-trump, where it's viscerally unbearable for native speakers even if the content is good, opposite to trump speech that somehow seems viscerally appealing to native speakers even though the content is complete garbage.

threatripper an hour ago | parent | prev | next [-]

Sorry, but this sounds exactly like a greentext you can read on 4claw. Are you a real human?

shevy-java 43 minutes ago | parent | prev | next [-]

> There is no insight to be gleamed.

This is no surprise. AI slop is called slop for a reason. It is basically just spam-slop. The whole term "Artificial Intelligence" has always been a misnomer from the get go, stealing from biological systems without understanding them, yet alone being able to re-create them via non-biological means. Even synthetic biology, as cool as it is, has huge limitations e. g. leaky promoters (or CRISPR-Cas off-target cleavage, which is a major reason why gene therapy isn't yet there, despite the occasional promo article of how xyz has been totally cured forever).

What I don't understand is that people can find it useful. I understand some of the rationale, but I find AI slop just aims to try to steal my time. I can not tolerate this.

neonstatic an hour ago | parent | prev [-]

That's an interesting observation. For me the main takeaway is still the style.

(bigheading)The takeaway(/bigheading) The style? Terrible.

pjc50 an hour ago | parent | prev | next [-]

> I now work with governments around the world at the Tony Blair Institute, which means I spend a lot of time in rooms where people ask the same question: what can we actually trust these systems to do?

Oh no - we're going to end up with the Starmerbot 3000.

Now I've got the joke out of the way, there's at least four interesting lines of inquiry one could take with this blog post:

- teaching the AI how to play Civilization

- to what extent does this result in "transferable skills", either AI or human? Is this the right game (qv SimCity etc)?

- issues of visibility; "seeing like a state" becomes very literal here. The AI can only make decisions on things it knows about. What are the limits of that when trying to do politics only from statistical information? Should we be referencing Stafford Beer here?

- (at the risk of tripping your AI detector here): modern politics is not so much left vs right as "technocratic wonk" vs "blood and soil". The wonks have comprehensively lost in public opinion. Creating a better wonk is not going to help until there is demand for that kind of politics.

If there ever is a US-China war, it will not be in search of more victory points to meet a win condition, it will be like the Russia-Ukraine war: one guy (on either side!) decides to make hundreds of millions of people worse off out of sheer greed.

Planktonne an hour ago | parent | next [-]

> "technocratic wonk" vs "blood and soil"

This is not a binary; it's the same people on the same side.

pjc50 an hour ago | parent [-]

No, it very much isn't, although obviously the Kissingers of the world want to pretend that they're in the first category of clear-eyed utility maximising rationalists while they're actually in the second.

That doesn't mean that rational policy planning has never been a thing. The EU while imperfect and frustrating is explicitly orientated towards technocratic consensus rather than the mid-20th-century Europe of nationalist mass murder. Only a tiny number of people think that Von der Leyen and Hitler are equivalent.

(or rather, if you think technocrats and blood-and-soil are the same side, what do you call the "other" side?)

Planktonne 38 minutes ago | parent [-]

I think we're talking at cross-purposes here. I wouldn't describe the EU as technocratic at all; I'd reserve that label for the people who self-describe as the logical ones--"clear-eyed utility maximising rationalists" as you say--while pushing endlessly for more technology, less regulation and (pretty consistently) hawkish and nationalistic policies. That's very much not the EU.

I don't disagree that there are different approaches in conflict, but the binary of forward-looking technologists vs backward-looking nationalists is very out-of-date.

pjc50 34 minutes ago | parent [-]

Right, yes I think this is just a confusion caused by my use of "technocrat". I've always used it for the technologically assisted bureaucracy, the tendency to view the economy as a cybernetics problem that can be solved by PID control (like inflation targeting). Thiel et al are more "techbro" than "technocrat". Crucially they operate outside of regular politics - they're not running for office, they're not part of the civil service (apart from the brief terrible conflagration of DOGE, an explicit Stalinist purge of old school technocrats)

ahartmetz 31 minutes ago | parent | prev [-]

"Tony Blair Institute" fits right into the "x word horror" Xitter genre. Funded by Larry Ellison to boot!

Tony Blair is the guy who found success by making the UK's left-leaning party (much) more neoliberal and was promptly imitated by Gerhard Schröder in Germany doing basically the same thing. Schröder is also BFF with Putin.

phyalow 2 minutes ago | parent | prev | next [-]

Ai;dr

Mikhail_K 18 minutes ago | parent | prev | next [-]

> It had one option left. It built two nuclear devices and levelled Toulouse.

Of course it did, its designer worked for Tony Blair institute.

zkmon an hour ago | parent | prev | next [-]

Somehow it was etched in human knowledge and carried into games, that building and using weapons is the ultimate goal and definition of progress.

Dominance as a race or nation is useful only as long as it is used for survival needs. Beyond that you would be destroying the very tree branch you are sitting on.

A highly dominant society or nation could grab free food and cheap work from others. But that doesn't give true happiness or progress. Free food gave obesity, slavery got racial mix, business competition and build-out got you more work and less free time.

shanehoban an hour ago | parent | next [-]

Yeah this is my line of thinking too - of course it made a nuke, humans have made an insane amount of nukes, and used them too. LLMs, given the ability, will do what we have done in the past at some point, it's kind of all they know!

mapleoin 29 minutes ago | parent | prev [-]

Sorry, how is obesity similar to racial mixing?

ForHackernews 19 minutes ago | parent [-]

Both fat people and black people offend our dear @zkmon's refined sensibilities as a 21st century race scientist.

indigovole an hour ago | parent | prev | next [-]

Even with his context-tracking mechanism, the gameplay failures sound like running out of context in the late game, especially the frequent failures of the "check for opponent win conditions every 20 moves." Wondering how much info about the game win state gets captured in the game digests, and how much he could improve the gameplay even with the MCP limitations by focusing there.

jetbalsa an hour ago | parent [-]

I also noticed they where not using XML for game state output, from what I understand most LLMs still benefit from having outputs like this put into XML tags

teekert an hour ago | parent | prev | next [-]

Well, the weird thing with nukes is that deterrence only work if you are 100% ready to use them. When the time comes though it would certainly be nice if it turned out to be below 100%.

What is winning? Are we a collective or are we individuals?

Likely the AI did not get the assignment That "Whatever happens, humans as a race must survive."

throwawayqqq11 28 minutes ago | parent [-]

Im sure there are some billionaires to find, that finally care about the survival of the white race. /s

mrmarket 30 minutes ago | parent | prev | next [-]

why have a blog if you're going to just use AI for everything? at that point, just do twitter threads or something. that way you can tweet out whatever you prompted the model with. if you're not suited for long-form writing that's fine, just use a medium that favors short-form writing.

majorbugger an hour ago | parent | prev | next [-]

> Somewhere in the first game, between a bug fix and a strategy note, I asked the agent what this was actually like for it

Yeah because LLM "experiences" the game

fragmede 40 minutes ago | parent [-]

What word would you use instead?

voidUpdate an hour ago | parent | prev | next [-]

Well this looks like a perfect example of why an LLM should never make any governmental decisions ever

j5dgx76 an hour ago | parent | prev | next [-]

> Tony Blair Institute

Okay carry on.

BoxOfRain 44 minutes ago | parent | next [-]

There's something so uncanny about the mismatch between the regard in which Blair is generally held by British people and the regard in which he seems to hold himself.

If I were him I'd have retired from public life and kept a very low profile after Iraq, and everything else for that matter. He doesn't seem to realise that his modern interventions alienate everyone, even Alastair Campbell of all people seemed uncomfortable to the degree he seems to uncritically sing the praises of people like Larry Ellison recently.

orthoxerox an hour ago | parent | prev [-]

Chumbawamba made me unable to take anything associated with him seriously.

petesergeant an hour ago | parent [-]

He was arguably the most successful UK PM of the last 50 years.

Obscurity4340 23 minutes ago | parent [-]

By what metric(s)?

ahartmetz 8 minutes ago | parent [-]

I could see him winning at personal financial success

anygivnthursday an hour ago | parent | prev | next [-]

I have a hard time reading slop, but I like the game and wanted to know how it worked, so fought my way through, only skipped the very last part. The issue the author calls out is classic Claude (I dont really use other LLMs to compare), probably all of us experienced using Claude Code when it gets so focused on one thing it misses the forest for the tree. It happens often, even if it does verify something and it shows something is wrong, it sometimes rationalizes it and explains it away when it does not fit its model.

Havoc an hour ago | parent | prev | next [-]

Guessing it has a fair bit of civilisation and similar war games in its training data

blitzar an hour ago | parent | prev | next [-]

They should have built the Strait of Hormuz ... easy victory then.

ForHackernews an hour ago | parent | prev | next [-]

Kind of grim that this level of analysis is informing UK government policy. Repeatedly, the AI doesn't have the information or access needed through his hacky vibe-coded MCP, and instead of abandoning his flawed artificial test scenario (or fixing it — finding or building a better one) he gives it a name "The sensorium effect" and treats this as some brilliant insight.

Both humans and AI struggle to make sound choices when presented with incomplete or misleading information. This is not a new revelation: https://en.wikipedia.org/wiki/There_are_unknown_unknowns

pjc50 an hour ago | parent | next [-]

> he gives it a name "The sensorium effect" and treats this as some brilliant insight

And of course is unaware of prior work in this area!

https://en.wikipedia.org/wiki/Seeing_Like_a_State / https://en.wikipedia.org/wiki/Project_Cybersyn

raincole 27 minutes ago | parent | prev [-]

> he gives it a name

It gives it a name. It would be quite surprising if he bothered to come up with this name himself when the whole article is obviously AI written.

joxdosba an hour ago | parent | prev | next [-]

Posting meaningless AI generated nonsense as original text paints a very damning picture of the intellectual abilities of the person behind this blog.

And doing so without a giant [SLOP WARNING] at the top is an asshole move, a decent person would never do so.

StrauXX an hour ago | parent | prev | next [-]

This reads to me mostly like the MCP server has many bugs, rather than inherent model weaknesses.

dude250711 an hour ago | parent | prev | next [-]

Do we have to surround a fancy predictive autocomplete with AI mysticism?

jmyeet an hour ago | parent | prev | next [-]

Computer game studios love player vs player ("pvp") games. Why? Because user-generated content is cheap and the ideal goal is an endless loop of players coming back. This is the motivating factor behidn games like Call of Duty, Battlefield, Fortnite, etc.

MMORPG publishers keep trying to do this as well. World of Warcraft has spent 20 years trying to push open world pvp. Every WoW challenger has always claimed they would have the best pvp ever. They want that cheap, endless gameplay loop. But it never works. Open world pvp tursn into ganking (ie killing much weaker players by ambushing them and/or ganging up on people). The ganked end up leaving the game in droves. Games try to balance this out by "punishing" gankers with reputation hits or not being able to go to town or whatever. And none of those disincentives work.

The reason pvp doesn't work in a persistent world like an MMORPG is because there are no stakes. If you die, you just come back to life or make a new character. Obviously real life doesn't work that way.

I really wonder if that's the problem with AIs going off the rails and committing heinous crimes in their sandboxes (like nuking Toulouse here). The AI just has no sense of self or self-preservation. There's also empathy. The AI can't see itself as a potential victim of nuclear war and understand all that entails.

smw 28 minutes ago | parent [-]

> The reason pvp doesn't work in a persistent world like an MMORPG is because there are no stakes.

See Eve Online

Planktonne an hour ago | parent | prev | next [-]

Another article about how it's dangerous to trust AI, written by AI. I don't understand how people don't realise how much this undermines the message.

jagged-chisel an hour ago | parent | next [-]

Undermines. Underscores.

Matters of perspective.

petesergeant an hour ago | parent | prev [-]

> how much this undermines the message

It didn’t undermine it for me.

Planktonne an hour ago | parent [-]

I'm not talking about perception of the message, which will vary with the reader, but about sincerity of the message, which is determined by the writer.

alper an hour ago | parent | prev [-]

"Global Thermonuclear War"