Remix clone Hacker News


	▲	jexp 2 hours ago
		Shouldn’t it have been possible since forever to put machine-readable source information into PDF metadata? It’s more a problem of the tools and programs generating the PDFs. We spend millions turning structured information into PDFs and billions extracting the same data from a printer rendering language.
	▲	neonmagenta 2 hours ago
		Exactly. But we have no real coordination or uniformity in how we create PDFs across all these programs, so we always end up with a fun mix of what will and won't be static, scalable, or searchable.
	▲	vjvjvjvjghv an hour ago
		Exactly. It’s pretty insane that we have converged on storing documents as PDF. And it looks like no work is being done on making PDF files machine-readable.