ks2048 10 days ago
I've been doing some experiments with the OCR API on macOS lately and wonder how it compares to these LLMs. Overall, it's very impressive, but it makes some mistakes (on easy images, i.e. obviously wrong ones) that require human intervention. I would like to compare it to these models, but this benchmark goes beyond OCR: it extracts structured JSON.