FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels (arxiv.org)
9 points by PaulHoule 5 hours ago