| ▲ | mbesto a day ago | |||||||
How do you objectively tell whether a model "performs" better than another? | ||||||||
| ▲ | belval a day ago | parent [-] | |||||||
Not the original commenter but I work in the space and we have large annotated datasets with "gold" evidence that we want to retrieve, the evaluation of new models is actually very quantitative. | ||||||||
| ||||||||