Remix clone Hacker News

new | show | ask | jobs Github

	▲	addandsubtract 4 days ago
		Great work! Can this technique also be used to run image diffusion models on lower VRAM GPUs?
	▲	GTP 4 days ago \| parent \| next [-]
		Not an expert in machine learning, but AFAIK diffusion models use a completely different architecture, therefore you can't use the same code to run optimized versions of both. But maybe the core ideas can be adapted to diffusion somehow.
	▲	anuarsh 3 days ago \| parent \| prev [-]
		Thanks! I don't have much experience with diffusion models, but technically any multi-layer model could benefit from loading weights one by one