Remix.run Logo
chpatrick 5 hours ago

It absolutely hasn't been solved, it's just got pretty decent in recent years.

malfist 4 hours ago | parent [-]

Pretty decent might be quiet the stretch. I'd term it almost acceptable, but only if you're using commercial solutions like amazon's textract, doing it with open source tools is at best, extremely painful and vaguely accurate.

chpatrick 2 hours ago | parent [-]

PaddleOCR (also from Baidu) is pretty damn good actually.

__rito__ 37 minutes ago | parent [-]

I have shipped with PaddleOCR to prod. Works pretty well. (Usage limited to printed documents in Anglosphere). Runs fully offline, in CPU.