▲ | MaxikCZ 21 hours ago | ||||||||||||||||
I tried to use whisper to generate non-english subs from english audio, but wasnt able to figure out. I know it can do english subs from non-english audio, and that earlier (less precise) versions could do any language audio -> any language subs, but latest whisper only to english subs. Anyone found a way? | |||||||||||||||||
▲ | abdusco 21 hours ago | parent | next [-] | ||||||||||||||||
I solved it by generating English subtitles, then passing those to an LLM in chunks that are ~20 entries in size. Include preceding and following subtitles as context for better translation. Make sure to replace the timestamps with simple integer ids, because LLMs like to mangle those, no matter how hard you prompt. I could share a python script that is working pretty reliably for me. | |||||||||||||||||
| |||||||||||||||||
▲ | 20 hours ago | parent | prev [-] | ||||||||||||||||
[deleted] |