Remix clone Hacker News

new | show | ask | jobs Github

	▲	mv4 4 hours ago
		I've been using MiniMax M2.7 with vllm on my dual Nvidia Spark cluster. Slow (<20 tps) but functional for most of my use cases.