| ▲ | layer8 3 hours ago |
| For low-level work, qpdf can be quite useful: https://github.com/qpdf/qpdf |
|
| ▲ | ripe 3 hours ago | parent [-] |
| Came here to say this. qpdf is my go-to for manipulating PDF files on the command line: encrypting, decrypting, extracting, and merging pages. It's Apache-licensed and written in C++. |
| |
| ▲ | voidmain0001 an hour ago | parent [-] |
| How do you use qpdf for extraction when its README states “qpdf does not render PDFs or perform text extraction, and it does not contain higher-level interfaces for working with page contents”? |
| ▲ | ratrocket 41 minutes ago | parent [-] |
| Not the person you're replying to, but I believe that by "extraction" they meant extracting pages from a PDF (splitting it apart page by page), not extracting text. At least that's what I've used qpdf for in the past. |
|
|
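To make the page-extraction reading above concrete, here is a minimal sketch in Python that only builds the qpdf command lines and does not run them. It assumes qpdf's `--empty`, `--pages ... --`, and `--split-pages` options behave as described in the qpdf manual. The helper names and the file names (`in.pdf`, `out.pdf`, `page-%d.pdf`) are hypothetical placeholders, not anything from the thread.

```python
import shlex


def qpdf_extract_pages_cmd(src, page_range, dest):
    """Build an argv that copies `page_range` of `src` into a new file `dest`.

    Hypothetical helper: it only assembles the command, it does not run qpdf.
    """
    # --empty starts from a blank document; the --pages ... -- group
    # names the source file and the page range to pull from it.
    return ["qpdf", "--empty", "--pages", src, page_range, "--", dest]


def qpdf_split_pages_cmd(src, dest_pattern):
    """Build an argv that writes each page of `src` to its own file.

    Hypothetical helper; qpdf replaces %d in `dest_pattern` with the page number.
    """
    return ["qpdf", "--split-pages", src, dest_pattern]


if __name__ == "__main__":
    # Print shell-quoted commands so they can be inspected or pasted into a terminal.
    print(shlex.join(qpdf_extract_pages_cmd("in.pdf", "1-3", "out.pdf")))
    print(shlex.join(qpdf_split_pages_cmd("in.pdf", "page-%d.pdf")))
```

To actually run one of these, the list could be passed to `subprocess.run(cmd, check=True)`, assuming qpdf is installed and on the `PATH`. Neither command touches text content, which is consistent with the README quote: the pages are copied as whole objects.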