Remix clone Hacker News

new | show | ask | jobs Github

	▲	dr_dshiv an hour ago
		All the models used are shown with each page of translation and each book has a whole data provenance treatment. You can add it up!
	▲	sgc 44 minutes ago \| parent \| next [-]
		I don't see raw token counts, just a list of steps and page counts. For example, what is the rough average token count per page in the ocr and in the translation steps for a Greek book? I have seen Gemini costs change quite a bit when processing very similar books from the same series lately, mainly because thinking tokens have increased about 5x. Has that has happened to you as well? Edit: for ocr I am using about 15k-25k tokens per page, but I have a complex prompt.
	▲	mmargenot an hour ago \| parent \| prev \| next [-]
		How do you handle the more densely written pages in script ? I did a very similar exercise OCRing works from this exact collection, but I stuck with the English books for the first pass.
	▲	efilife 19 minutes ago \| parent \| prev [-]
		Can't you just tell him?