The opus models seems pretty adept and extracting structured data from ocr https://www.ocrarena.ai/battle