| ▲ | MostlyStable 4 hours ago | |||||||||||||
How long have you been testing this? Have you noted a large improvement? I tested Opus for this quite a while ago (maybe 4.5? Whatever was out about a year ago), and it performed quite poorly on my use case. | ||||||||||||||
| ▲ | nik736 4 hours ago | parent [-] | |||||||||||||
I have put together an internal benchmark on 1000s of business documents with weird tables, structure, etc. that I run on every relevant model release. Opus 4.8 performs very very well. But it is obviously overkill for the task (and expensive at doing so). I just wanted to respond to the OP. | ||||||||||||||
| ||||||||||||||