lemonish97 6 hours ago
What is your evidence for this claim?
fooker 6 hours ago
They call it hill climbing: https://microsoft.ai/news/building-a-hillclimbing-machine-la... Unless they specifically clarify that the testing and training benchmarks are completely separate, we have to assume they test on the same 'hill' the model climbs.