| ▲ | sosojustdo 5 hours ago | |
I've been working on a tool specifically to handle these messy PDF-to-Markdown conversions because I ran into the same issues with tables and multi-column layouts. I’ve optimized https://markdownconverter.pro/pdf-to-markdown to handle complex PDFs, including those tricky tables that span multiple pages and 2-column formats that usually trip up tools like Docling. It also extracts embedded diagrams/images and links them properly in the output. Full disclosure: I'm the developer behind it. I’d love to see if it handles your specific datasheets better than the models you've tried. Feel free to give it a spin! | ||
| ▲ | bradfa 4 hours ago | parent [-] | |
Cool! But given that often electronics documentation is covered by NDAs, my preferred solution is local-first if at all possible. | ||