Remix clone Hacker News

new | show | ask | jobs Github

	▲	slashdave 4 hours ago
		Disk where? LLM requests are routed dynamically. You might not even land in the same data center.
	▲	FuckButtons 2 hours ago \| parent [-]
		But if you have a tiered cache, then waiting several seconds / minutes is still preferable to getting a cache miss. I suspect the larger problem is the amount of tinkering they are doing with the model makes that not viable.