| ▲ | cactusplant7374 4 days ago | |
Do you think ChatGPT could do the same work now? It would be interesting to try it. | ||
| ▲ | fsniper 4 days ago | parent | next [-] | |
I used Gemini to decode and transcribe an old (and well known) cursive hand written mail. I couldn't read it at all. It managed to do this in a few seconds. I am not sure if it used an already available transcription or not. However if not, it was amazing work. | ||
| ▲ | joshstrange 4 days ago | parent | prev [-] | |
Almost 2 years ago I did this with ChatGPT. It was soon after you could feed it images as input IIRC. It worked very well. I settled on AWS Textract + ChatGPT to save money and was able to get it to well under 1 cent to take an image and turn it into a recipe you could export to Paprika (and others). I never pursued it further but it was a fun little side project. At this point I don’t think I’d do the Textract step since LLMs have gotten way better and cheaper. Also you lose some info/context when the model only gets the post-OCR data. | ||