olliepro 2 days ago
Do you have a better way to measure LLMs? Measurement implies quantitative evaluation... which is the same as benchmarks.
Wowfunhappy 2 days ago
I don’t have a good way to measure them, but I think they should be evaluated more like how we evaluate movies, or restaurants. Namely, experienced critics try them and write reviews.