| ▲ | embedding-shape 5 hours ago | |||||||
Haven't seen anything particular about that, but lots of the documents with names that were half-redacted contain OCRd text that is completely garbled, but olmocr-2-7b seems to handle it just fine. Unsure if they just had sucky processes or if there is something else going on. | ||||||||
| ▲ | helterskelter 5 hours ago | parent [-] | |||||||
Might be a good fit for uploading a git repo and crowdsourcing | ||||||||
| ||||||||