Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
Accelerating LLM Inference on AMD GPUs with Low-Latency GEMMs
(
rocm.blogs.amd.com
)
2 points
by
matt_d
8 hours ago