| ▲ | mistercow 2 hours ago | |||||||
> Then the part that matters: where the KV lives When your abstract was clearly generated by an LLM and not curated to at least make it sound human, it does not make me want to read your paper. | ||||||||
| ▲ | numeri an hour ago | parent [-] | |||||||
especially because this is the most painfully glaring flaw in their plan. Their solution is for an inference provider to... store the KV cache (which they can compute!) on-premise, on their own disks, but pay some third party for it? | ||||||||
| ||||||||