Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
(
arxiv.org
)
9 points
by
PaulHoule
5 hours ago