Remix.run Logo
andai 5 days ago

Yeah, basically every 15 minute YouTube video, because the amount of actual content I care about is usually 1-2 sentences, and usually ends up being the first sentence of an LLM summary of the transcript.

If something has actual substance I'll watch the whole thing, but that's maybe 10% of videos I find in experience.

Terr_ 5 days ago | parent | next [-]

I'd wager there's 95% of the benefit for 0.1% of the CPU cycles just by having a "search transcript for term" feature, since in most of those cases I've already got a clear agenda for what kind of information I'm seeking.

Many years ago I make a little proof-of-concept for displaying the transcript (closed captions) of a YouTube video as text, and highlighting a word would navigate to that timestamp and vice-versa. Such a thing might be valuable as a browser extension, now that I think of it.

998244353 5 days ago | parent | next [-]

YouTube already supports that natively these days, although it's kind of hidden (and knowing Google, it might very well randomly disappear one day). Open the description of the video, scroll down and click "show transcript".

mrob 5 days ago | parent | prev | next [-]

Searching the transcript has the problem of missing synonyms. This can be solved by the one undeniably useful type of AI: embedding vector search. Embeddings for each line of the transcript can be calculated in advance and compared with the embeddings of the user's search. These models need only a few hundred million parameters for good results.

andai 4 days ago | parent [-]

Yeah, but they fail surprisingly hard on grepping. So the best systems use both simultaneously:

https://www.anthropic.com/engineering/contextual-retrieval

schoen 5 days ago | parent | prev [-]

https://reduct.video/ lets you edit (not just search!) videos that way. Kind of a different way to think about video content!

5 days ago | parent | prev | next [-]
[deleted]
mikkupikku 5 days ago | parent | prev | next [-]

One of the best features of SponsorBlock is crowd sourced timestamps for the meat of the video. Skip right over 20 minutes of rambling to see the cool thing in the thumbnail.

account42 4 days ago | parent | prev | next [-]

The problem here is that you are looking at a video in the first place when all you needed is short textual content.

cindyllm 5 days ago | parent | prev [-]

[dead]