> Despite the name, diffusion LMs have little to do with image diffusion and are much closer to BERT and old good masked language modeling.
Has anyone tried making text the way we do image diffusion? What happens?