swyx 4 hours ago
Can you test it on, say, who won the 2024 US election?
ghurtado 4 hours ago
I can't really think of a less reliable test for anything at all than a random guess at something that had about 50/50 odds to begin with. Easiest Turing test ever...
WarmWash 4 hours ago
Usually the labs do some kind of post-training on major events so the model isn't totally lost. A better test is something like "what is the latest version of NumPy?"
czk 4 hours ago
with thinking off and tools disabled:
redsocksfan45 3 hours ago
[dead]