Remix.run Logo
abdusco 21 hours ago

I solved it by generating English subtitles, then passing those to an LLM in chunks that are ~20 entries in size. Include preceding and following subtitles as context for better translation. Make sure to replace the timestamps with simple integer ids, because LLMs like to mangle those, no matter how hard you prompt.

I could share a python script that is working pretty reliably for me.

vevoe 20 hours ago | parent [-]

I'd love to see that script, do you have a link?

abdusco 19 hours ago | parent [-]

https://gist.github.com/abdusco/5bd5c909547f5f9b935dbd2fb2fe...