Remix.run Logo
constantinum 7 days ago

There is also Unstract(open-source) that helps process structured data extraction. Key differences:

1. Unstract has a Pre-processing layer(OCR). Which converts documents into LLM readable formats.(helps improve accuracy, and control costs)

2. Unstract also connects to your existing data sources, making it an out-of-the-box ETL tool.

https://github.com/Zipstack/unstract

ttul 7 days ago | parent | next [-]

I’d throw a vote in the column for Unstract. Making the code AGPL is a first class move for a company that is trying to make money from the hosted version of the same software.

fudged71 7 days ago | parent | prev | next [-]

Any idea how it compares with docetl?

oriettaxx 7 days ago | parent | prev [-]

impressive, really