From Noise to Image – interactive guide to diffusion(lighthousesoftware.co.uk)
37 points by simedw 2 days ago | 8 comments
whilefalse 2 days ago | parent | next [-]

Hey, I made this, thanks for posting!

It’s purposefully high level and non-technical for a general audience - my theory was that most people who aren’t into tech/AI don’t care too much about training, or how the system got to be the way that it is.

But they do have some interest in how it actually operates once you’ve typed in a prompt.

Happy to answer any questions or take on board feedback.

adampunk 7 minutes ago | parent | next [-]

It's quite clever and thoughtful. Thanks for making it!

BobbyTables2 2 hours ago | parent | prev | next [-]

Loved the writeup!

Found the manual latent space exploration part really interesting.

Too many LLM/diffusion explanations fall into the proverbial “how to draw an owl” meme without giving a taste of what’s going on.

plagiarist an hour ago | parent | prev [-]

I enjoyed this a lot.

The interpolations between butterfly and snail were pretty horrifying. But with something like Z-Image you could basically concatenate the text and end up with a normal image of both. Is the latent space for "butterfly and snail" just well off the path between the two individually?

It's hard to imagine what is nearby in latent space and how text contributes, so I did really like the section adding words to the prompt 1-by-1.

ibizaman 2 hours ago | parent | prev | next [-]

Oh I particularly loved that you made the prompts themselves interchangeable. Very well done!

K2h 3 hours ago | parent | prev | next [-]

Scrolling through pics on mobile is difficult. Wanted to see all 29 steps but couldn't scroll it reliably.

BobbyTables2 2 hours ago | parent [-]

Turning off the scroll mode worked very well for me on a mobile.

khazhoux 2 hours ago | parent | prev [-]

Amazing explanations!! I absolutely love this. In 10 minutes it’s given me a huge boost in my intuition on diffusion, which I’ve been missing for years.