Remix clone Hacker News

new | show | ask | jobs Github

	▲	in-silico 3 hours ago
		I think some of the visualizations would be much better if you used a pixel-space model instead of a latent diffusion model. Right now we are only seeing the denoising process after it's been morphed by the latent decoder, which looks a lot less intuitive than actual pixel diffusion. If you can't find a suitable pixel-space model, then you can just trivially generate a forward process and play it backwards.
	▲	whilefalse 2 hours ago \| parent [-]
		Thanks that’s a great suggestion.