I haven't tried it yet, but I bookmarked this recently: https://github.com/opendataloader-project/opendataloader-pdf