▲ | favorited 2 hours ago | |
Docling primarily uses AI models to extract PDF content, this project looks like it uses a custom parser written in Java, built atop veraPDF. | ||
▲ | brumar 2 hours ago | parent [-] | |
Correct me if I am wrong, but Docling can do both. It has also, among other strategies, a non-AI pipeline to determine the layout (based on qpdf I believe). So these projects are not that different. |