Remix clone Hacker News
shironnnn_
5 hours ago
If you're on macOS, I recommend llm-mlx, which currently generates tokens 10-15% faster than llama.cpp.