Thanks! I don't have much experience with diffusion models, but technically any multi-layer model could benefit from loading its weights one layer at a time, since the forward pass only needs the current layer's weights in memory.
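
For what it's worth, here's a minimal sketch of the idea in plain Python. It assumes a toy model where each layer is just a small weight matrix pickled to its own file; the helper names (`save_layers`, `apply_layer`, `forward_layer_by_layer`) are made up for illustration and don't come from any library.

```python
import os
import pickle
import tempfile


def save_layers(layers, directory):
    """Write each layer's weight matrix to its own file and return the paths."""
    paths = []
    for i, weights in enumerate(layers):
        path = os.path.join(directory, f"layer_{i}.pkl")
        with open(path, "wb") as f:
            pickle.dump(weights, f)
        paths.append(path)
    return paths


def apply_layer(weights, x):
    """Toy linear layer followed by ReLU: y = max(0, W @ x)."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]


def forward_layer_by_layer(paths, x):
    """Run the forward pass while holding only one layer's weights at a time."""
    for path in paths:
        with open(path, "rb") as f:
            weights = pickle.load(f)  # load just this layer
        x = apply_layer(weights, x)
        del weights                   # drop it before loading the next one
    return x


# Hypothetical 3-layer toy model with 2x2 weight matrices.
layers = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[2.0, 0.0], [0.0, 2.0]],
    [[1.0, 1.0], [1.0, -1.0]],
]

with tempfile.TemporaryDirectory() as tmp:
    layer_paths = save_layers(layers, tmp)
    output = forward_layer_by_layer(layer_paths, [1.0, 2.0])
```

Nothing here is specific to one architecture: the loop only assumes the layers run in sequence, so peak weight memory is roughly one layer instead of the whole model, at the cost of extra disk reads on every pass.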