ks2048 10 days ago

I've been doing some experiments with the OCR API on macOS lately and wonder how it compares to these LLMs.

Overall, it's very impressive, but it makes some obviously wrong mistakes, even on easy images, that require human intervention.

I would like to compare it to these models, but this benchmark goes beyond OCR: it covers extracting structured JSON.