▲ | netdur 11 hours ago | |
I’ve tried that too, trying to detect the scan layout to get better OCR, but it didn’t really beat a fine-tuned Qwen 2.5 VLM 7B. I’d say fine-tuning is the way to go | ||
▲ | rexreed 18 minutes ago | parent [-] | |
what fine tuning approach did you use? |