▲ | thorum 4 days ago | ||||||||||||||||
> Six of the eleven picked the same movie This is surely the greatest weakness of current LLMs for any task needing a spark of creativity. | |||||||||||||||||
▲ | torginus 4 days ago | parent | next [-] | ||||||||||||||||
I have noticed this too - often when one model volunteered the wrong answer - such as making up a nonexistent API, I asked another, and it gave me the exact same thing! It's highly unlikely that two totally independent models would make up the same fictional thing. There must be something strange going on (most likely training on each others' wrong outputs, but I dunno) | |||||||||||||||||
| |||||||||||||||||
▲ | Timwi 4 days ago | parent | prev [-] | ||||||||||||||||
This is definitely something very early LLMs could do that has kind of gotten beat out of them. I used to ask ChatGPT to simulate a text adventure game, but now if you try that you always get exactly the same one. | |||||||||||||||||
|