| ▲ | jexp 2 hours ago | |
Shouldn’t it have been possible all along to put machine-readable source information into PDF metadata? It’s more a problem of the tools and programs generating the PDFs. We spend millions turning structured information into PDFs and billions extracting the same data from a printer rendering language. | ||
| ▲ | neonmagenta 2 hours ago | parent | next [-] | |
Exactly. But we have no real coordination or uniform practice in how we create PDFs across all these programs, so we always end up with a fun mix of what will and won't be static, scalable, or searchable. | ||
| ▲ | vjvjvjvjghv an hour ago | parent | prev [-] | |
Exactly. It’s pretty insane that we have converged on storing documents as PDF. And it looks like no work is being done on making PDF files machine-readable. | ||