| ▲ | Retr0id 3 hours ago | |||||||||||||||||||||||||||||||
I'm just saying it's epistemically unrigorous to the point of being equivalent to anecdata. | ||||||||||||||||||||||||||||||||
| ▲ | gchamonlive 3 hours ago | parent [-] | |||||||||||||||||||||||||||||||
How should one conduct such a rigourously reproducible experiment when LLMs by nature aren't deterministic and when you don't have access to the model you are comparing to from months ago? | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||