cactusplant7374 4 days ago

Do you think ChatGPT could do the same work now? It would be interesting to try it.

fsniper 4 days ago

I used Gemini to decode and transcribe an old (and well-known) handwritten cursive letter that I couldn't read at all. It managed this in a few seconds. I am not sure whether it drew on an already available transcription. If it didn't, it was amazing work.

joshstrange 4 days ago

Almost 2 years ago I did this with ChatGPT. It was soon after you could feed it images as input, IIRC. It worked very well. I settled on AWS Textract + ChatGPT to save money and got the cost to well under 1 cent to take an image and turn it into a recipe you could export to Paprika (and others). I never pursued it further, but it was a fun little side project.

At this point I don’t think I’d do the Textract step since LLMs have gotten way better and cheaper. Also you lose some info/context when the model only gets the post-OCR data.
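
For anyone curious, here is a rough sketch of what the OCR-then-LLM step could look like. It is not the original code. The function names, the prompt wording, and the JSON recipe shape are all assumptions for illustration. `ocr_lines` stands in for the line text an OCR service such as Textract would return, and `ask_llm` is a hypothetical callable wrapping whichever chat model you use.

```python
import json
from typing import Callable

# Hypothetical recipe shape; the real export format (e.g. for Paprika) may differ.
REQUIRED_KEYS = ("title", "ingredients", "steps")


def build_prompt(ocr_lines: list[str]) -> str:
    """Wrap raw OCR lines in an instruction asking for a JSON recipe."""
    joined = "\n".join(line.strip() for line in ocr_lines if line.strip())
    return (
        "The text below was extracted from a photo of a recipe by OCR.\n"
        "Return only JSON with the keys title, ingredients, steps.\n\n"
        + joined
    )


def parse_recipe(reply: str) -> dict:
    """Parse the model reply and check that the expected keys are present."""
    data = json.loads(reply)
    missing = [key for key in REQUIRED_KEYS if key not in data]
    if missing:
        raise ValueError(f"recipe is missing keys: {missing}")
    return data


def ocr_text_to_recipe(ocr_lines: list[str], ask_llm: Callable[[str], str]) -> dict:
    """Run the second half of the pipeline: OCR text in, structured recipe out."""
    return parse_recipe(ask_llm(build_prompt(ocr_lines)))
```

Passing the model call in as a plain function keeps the sketch independent of any one vendor's SDK. Sending the image straight to a multimodal model, as suggested above, would replace `build_prompt` with an image upload and keep only the parsing step.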