| ▲ | chaps 4 hours ago | |||||||
For my workflows, layout extraction has been so inconsistent that I've stopped attempting to use it. It's simpler to just throw everything into postgis and run intersection checks on size-normalized pages. | ||||||||
| ▲ | kergonath 3 hours ago | parent [-] | |||||||
Interesting. What kind of layout do you have? My documents have one or two-column layouts, often inconsistently across pages or even within a page (which tripped older layout detection methods). Most models seem to understand that well enough so they are good enough for my use case. | ||||||||
| ||||||||