avaer 2 hours ago

Efficient realtime video diffusion will revolutionize the way people use computers even more so than LLMs.

I actually think we are already there with quality, but nobody is going to wait 10 minutes to do a task with video that takes 2 seconds with text.

If Sora/Kling/whatever ran cool locally 24/7 at 60FPS, would anyone ever build a UI? Or a (traditional) OS?

I think it's worth watching the scaling graph.

IsTom 2 hours ago

> If Sora/Kling/whatever ran cool locally 24/7 at 60FPS, would anyone ever build a UI?

I like my buttons to stay where I left them.

pavlov an hour ago

Yeah, it’s like asking “why would anyone read a book today when LLMs can generate infinite streams of text?”

exe34 an hour ago

those streams of text are often conditioned on the prompts - people are using it to learn about new concepts, and as a hyperpersonalised version of search. it can not only tell you of tools you didn't know existed, but it can show you how to use them.

I do like my buttons to stay where I left them - but that can be conditioned. instead of gnome "designers" telling me the button needs to be wide enough to hit with my left foot, I could tell the system I want this button to be small and in that corner - and add it to my prompt.

andy99 27 minutes ago

There are already low-code UI tools. What’s the benefit here? It would just be like a really unreliable one.