shironnnn_ 5 hours ago

If you're on macOS, I recommend llm-mlx, which currently generates tokens 10%-15% faster than llama.cpp.