imiric a day ago

Benchmark scores are marketing fluff, just like the rest of this article, with its alleged praise from early adopters and its highly scripted and edited videos.

AI companies are grasping at straws by selling us minor improvements to stale technology so they can pump up whatever valuation they have left.

j_timberlake a day ago | parent

The fact that people like you are still posting like this after Veo 3 is wild. Nothing could possibly be forcing you to hold onto that opinion, yet you come out in droves in every AI thread to repost it.

imiric 16 hours ago | parent

I concede that my last sentence was partly hyperbolic, particularly around "stale technology". But the rest of what I wrote is an accurate description of the state of the AI industry, at least from the perspective of an unbiased outsider.

What we've seen from Veo 3 is impressive, and the technology is indisputably advancing. But at the same time we're flooded with inflated announcements from companies that create their own benchmarks or optimize their models specifically to look good on benchmarks. Yet when faced with real-world tasks, the same models still produce garbage: they need continuous hand-holding to be useful, and they often simply waste my time. At least, this has been my experience with Sonnet 3.5, 3.7, Gemini, o1, o3, and all of the SOTA models I've tried so far. So there's this dissonance between marketing and reality that makes it really difficult to trust anything these companies say anymore.

Meanwhile, little thought is put into the harmful effects of these tools, and any alleged focus on "safety" is as fake as the hallucinations that plague them.

So, yes, I'm jaded by the state of the tech industry and where it's taking us, and I wish this bubble would burst already.