ivanovm 6 hours ago

You could just look it up on the leaderboard on their website? The newest Claude model makes over $10k in profit over a simulated year of operation, after starting with $500.

jeffreyrogers 6 hours ago | parent | next [-]

They've never translated it to the real world though. So saying the problem is "too easy" when they have no public (as far as I know) demonstration that they've solved that problem is a stretch.

ivanovm 6 hours ago | parent [-]

Yes, they did. You could also find this information easily. A company like Andon creates value by exposing interesting AI failure modes, so it makes perfect sense for them to move on to harder problems when the previous ones get saturated. I think you're just being overly cynical.

jeffreyrogers 6 hours ago | parent [-]

Can you point me to an example then? It's not linked in the article as far as I can tell and it's not easy to find on their website if it's there. I don't count simulations because I used to work with simulations regularly and they often fail to translate to the real world.

Tallain 3 hours ago | parent | prev | next [-]

Since when is a simulation equal to real world performance?

pocksuppet 6 hours ago | parent | prev [-]

So in other words, no, an LLM has never made a profit.