Achieving 3X speedups on Google TPUs with diffusion-style speculative decoding (developers.googleblog.com)
4 points by simonpure 5 hours ago