saberience 3 hours ago

Have you actually used LLMs for non trivial tasks? They are still incredibly bad at genuinely hard engineering work, and they still lie all the time. It has just gotten harder to notice, especially if you're letting a model run all night and generate reams of crap.

Most people are optimizing for terrible benchmarks, don't really understand what the model did anyway, and just assume it did something good. It's basically the blind leading the blind, and a lot of people have an AI psychosis or delusion.

nfg 3 hours ago | parent [-]

Do you realise who you’re replying to?

emp17344 an hour ago | parent | next [-]

Why should we care that he’s famous?

nfg an hour ago | parent [-]

Fame doesn’t enter into it - the point is Karpathy has about as strong a claim as anyone to having “actually used LLMs for non trivial tasks”.

CamperBob2 3 minutes ago | parent [-]

Reminds me of another famous HN footgun, where some people were arguing about math. One of them backed up his opinion by pointing out that he made it to the Putnam competition, or something like that. The other guy said, "Cool. I won it that year."

_menelaus 2 hours ago | parent | prev [-]

lolololol