rco8786 5 hours ago
If they’re reaching the same results across a variety of the most popular public models, it doesn’t seem like that big a deal to know if it was Opus 4 or Opus 4.5.
hn_throwaway_99 4 hours ago
Reproducibility is (supposed to be) a cornerstone of science. Model versions are absolutely critical to understand what was actually tested and how to reproduce it.