Remix clone Hacker News

new | show | ask | jobs Github

	▲	nojito a day ago
		The best way to parse pdfs is to convert them to images and feed them into the llm. This workflow is highly optimized.
	▲	wpasc a day ago \| parent \| next [-]
		For sure there are very optimized ways to do it. My point is that a non technical user will drag and drop a pdf into a chatbot. and from a UX/product perspective, they should have to think about it more than that IMO. but seemingly, that's very much an expensive, inefficient way of doing it (burning through a whole context window try to read it, reloading it multiple times per conversation, etc.).
	▲	seemaze a day ago \| parent \| prev [-]
		Absolutely this. Never try to parse a native PDF document with any expectation of coherence or consistency.