Remix.run Logo
jdw64 3 hours ago

They claim the AI itself built an OS and demoed Doom running on it. Personally, rather than Doom, I think I'll only truly appreciate the real value of this advancement when flawless real-time YouTube subtitle translation actually becomes a reality.

lacewing 2 hours ago | parent | next [-]

> They claim the AI itself built an OS and demoed Doom running on it.

Doom ran on MS-DOS, which - by modern standards - provided a shockingly minimal set of abstractions for programs. I think about the only thing you need to run Doom is the int 21h "API" to access the FAT filesystem and perform keyboard I/O. Note that MS-DOS did not provide facilities such as memory virtualization / management, process management, video drivers, sound drivers, etc - that was all provided by the hardware itself, which had its own hardware interrupts handled by the code in the device's ROM. It's why Doom required you to choose the type of a sound card you have, the interrupt / DMA channel to use, etc.

So I think this is a lot less of a flex than it seems; in fact, Anthropic using agents to build a semi-unusable compiler is far more of an achievement. Providing enough of int 21h to run Doom is probably something that a human could do in a weekend, doubly so if they can peek at the source code of FreeDOS.

jdw64 2 hours ago | parent | next [-]

You have a real knack for explaining things. That makes total sense. Regarding the compiler part, I'm actually trying to build my own language using LLMs, and to be honest, it barely works. I wasn't very familiar with classic OS architectures like MS-DOS, but your breakdown really helped me understand the context

philistine 2 hours ago | parent | prev [-]

Wake me when the AI rewrites Doom completely from a single prompt.

port11 3 hours ago | parent | prev | next [-]

One can only dream of Gemini Spam-Detector 3.5, which will finally make YouTube useable again. I can’t believe the kind of crap that gets recommended in the Kids app.

Analemma_ 3 hours ago | parent | prev | next [-]

I don't think you can get much more real-time than what we already have? iOS and Android both do live translations through their respective earbuds at decent latency, you can't get much faster without running into fundamental linguistics limitations.

For instance, you can't translate a Japanese sentence into English until you reach the verb at the end; no amount of latency improvement can overcome the fact that languages have different word orders.

jdw64 3 hours ago | parent | next [-]

You're right, but the YouTube subtitle translation actually isn't working right now. The English captions show up fine, so reading them isn't an issue, but stil

TacticalCoder 3 hours ago | parent | prev [-]

> I don't think you can get much more real-time than what we already have?

The YouTube real-time translation are utter garbage. And that may the less bad of the real-time translations out there. Still pure garbage.

If you watch anything specific to one sport/hobby, the number of words that are incorrectly translated is just wild.

gowld 3 hours ago | parent | prev [-]

The first thing mentioned is copying something that exists. The second thing is not.