Remix.run Logo
dr_dshiv an hour ago

All the models used are shown with each page of translation and each book has a whole data provenance treatment.

You can add it up!

sgc 44 minutes ago | parent | next [-]

I don't see raw token counts, just a list of steps and page counts. For example, what is the rough average token count per page in the ocr and in the translation steps for a Greek book?

I have seen Gemini costs change quite a bit when processing very similar books from the same series lately, mainly because thinking tokens have increased about 5x. Has that has happened to you as well?

Edit: for ocr I am using about 15k-25k tokens per page, but I have a complex prompt.

mmargenot an hour ago | parent | prev | next [-]

How do you handle the more densely written pages in script ? I did a very similar exercise OCRing works from this exact collection, but I stuck with the English books for the first pass.

efilife 19 minutes ago | parent | prev [-]

Can't you just tell him?