Remix.run Logo
embedding-shape 5 hours ago

Haven't seen anything particular about that, but lots of the documents with names that were half-redacted contain OCRd text that is completely garbled, but olmocr-2-7b seems to handle it just fine. Unsure if they just had sucky processes or if there is something else going on.

helterskelter 5 hours ago | parent [-]

Might be a good fit for uploading a git repo and crowdsourcing

embedding-shape 4 hours ago | parent [-]

Was my first impulse too but not sure I trust that unless I could gather a bunch of people I trust, which would mean I'd no longer be anonymous. Kind of a catch22.