▲ | ToucanLoucan 2 days ago | |||||||
> Even though hallucination levels have been going down with every generation. Gonna need a BIG citation on that one, chief. > Even though every test and benchmark we can come up with, LLMs do better with every generation. Has it occurred to you the people making the tests and benchmarks are, more often than not, the same people making the LLM? Like yeah if I'm given carte blanche to make my own test cases and I'm accountable to no one and nothing else, my output quality would be steadily going up too. The other day I tried asking Copilot for a good framework for accomplishing a task, and it made one up. I tried the query again, more specifically, and it referred me to a framework in another language. And yes, I specified. | ||||||||
▲ | financetechbro 2 days ago | parent [-] | |||||||
> Gonna need a BIG citation on that one, chief. OP has consumed so much LLM they’ve started to hallucinate themselves | ||||||||
|