Remix.run Logo
Weves 3 hours ago

What format are the docs being uploaded as? By default, images uploaded into the chat would be directly passed through. PDFs would be parsed and fed to the LLM as text.

Writing is a really common use case, and something we'd like to explore more. Currently people often use Onyx for "write something combining X, Y, and Z documents", but I feel that's just scratching the surface.

gunalx 2 hours ago | parent [-]

I was mostly ranting about open-webui and hoping onyx would be better than the current state. My usecase involves pdfs with lots of complex figures, ocrd through mistral ocr witch gives text, and images for figures (have tried multiple other as well). I would really like to keep the figures as images, as ocr captions really struggles getting the full semantic meaning.

But stoked to get alternatives to the area, will try it out once i get time soon.