▲ | AndrewDucker 9 days ago | |
Tesseract can manage 99% accuracy on anything other than handwriting, without being an LLM. Is there an advantage to using an LLM here? | ||
▲ | jauntywundrkind 9 days ago | parent [-] | |
I'm really curious about this too! I don't know! I've run across some comments saying Qwen2.5-VL is really good at handwriting recognition. It'd also be interesting to see how Tesseract compares when trying to OCR more mixed text+graphic media. Some possible examples: high-design magazines with color backgrounds, TikTok posts, maps, cardboard hold-up signs at political gatherings. |