Remix clone Hacker News

	cpburns2009 2 hours ago
		Looping is a common problem with the Qwen models. I've had good luck using --repeat-penalty=1.1 with llama.cpp and the 27B model. vLLM should have a similar option.
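		For anyone wondering what that flag does: a repeat penalty is typically applied to the logits of recently generated tokens before sampling, so tokens that were already emitted become a bit less likely. Here is a minimal Python sketch of the idea. It is not llama.cpp's actual code; the function name apply_repeat_penalty and the toy logits are made up for illustration.

```python
def apply_repeat_penalty(logits, recent_tokens, penalty=1.1):
    """Return a copy of logits with recently seen token ids made less likely."""
    out = list(logits)
    for tok in set(recent_tokens):
        # Positive logits are divided by the penalty and negative logits are
        # multiplied by it, so the token's score drops in both cases.
        if out[tok] > 0:
            out[tok] /= penalty
        else:
            out[tok] *= penalty
    return out


# Toy vocabulary of three tokens; tokens 0 and 1 were generated recently.
logits = [2.2, -1.0, 0.5]
penalized = apply_repeat_penalty(logits, recent_tokens=[0, 1, 0])
```

		With penalty=1.0 nothing changes, and larger values push harder against repeats. In vLLM the corresponding sampling parameter is, as far as I know, repetition_penalty.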