Remix.run Logo
GaggiX 12 hours ago

There is also: https://github.com/linto-ai/whisper-timestamped

It doesn't use an extra model (so it supports every language that works with Whisper out of the box and use less memory), it works by applying Dynamic Time Warping to cross-attention weights.

oezi 10 hours ago | parent [-]

Just a warning that plain WhisperX is more accurate and Whisper-timestamped has many weird quirks.