Remix.run Logo
bufferoverflow 2 days ago

This article makes no sense. It criticizes current LLMs and then without stopping for a second pretends future LLMs will have these problems. Even though hallucination levels have been going down with every generation. Even though every test and benchmark we can come up with, LLMs do better with every generation.

ToucanLoucan 2 days ago | parent | next [-]

> Even though hallucination levels have been going down with every generation.

Gonna need a BIG citation on that one, chief.

> Even though every test and benchmark we can come up with, LLMs do better with every generation.

Has it occurred to you the people making the tests and benchmarks are, more often than not, the same people making the LLM? Like yeah if I'm given carte blanche to make my own test cases and I'm accountable to no one and nothing else, my output quality would be steadily going up too.

The other day I tried asking Copilot for a good framework for accomplishing a task, and it made one up. I tried the query again, more specifically, and it referred me to a framework in another language. And yes, I specified.

financetechbro 2 days ago | parent [-]

> Gonna need a BIG citation on that one, chief.

OP has consumed so much LLM they’ve started to hallucinate themselves

ToucanLoucan 2 days ago | parent [-]

Perhaps the real hallucinations were the friends we made along the way

otabdeveloper4 2 days ago | parent | prev [-]

> Future bicycles will fly. Just two more rounds of venture capital investment, trust the plan.

sebastiennight 2 days ago | parent [-]

To be fair, we went from the invention of the bicycle to a (wheeled) flying apparatus within a shockingly short amount of time.

otabdeveloper4 2 days ago | parent [-]

Yeah, but not by dumping a billion bucks into bicycle venture capital funds.