Remix.run Logo
netdur 11 hours ago

I’ve tried that too, trying to detect the scan layout to get better OCR, but it didn’t really beat a fine-tuned Qwen 2.5 VLM 7B. I’d say fine-tuning is the way to go

rexreed 18 minutes ago | parent [-]

what fine tuning approach did you use?