In-Kernel Broadcast Optimization: Co-Designing Kernels for RecSys Inference (pytorch.org)
2 points by gmays 7 hours ago