Remix clone Hacker News

new | show | ask | jobs Github

	▲	ericdotlee 4 days ago
		What is llama-swap? Been looking for more details about software configs on https://llamabuilds.ai
	▲	elsombrero 3 days ago \| parent [-]
		https://github.com/mostlygeek/llama-swap it's a transparent proxy that automatically launches your selected model with your preferred inference server so that you don't need to manually start/stop the server when you want to switch model so, let's say I have configured roo code to use qwen3 30ba3b as the orchestrator and glm4.5 air as coder, roo code would call the proxy server with model "qwen3" when using orchestrator mode and then kill llama.cpp with qwen3 and restart it with "glm4.5air"