am17an a day ago
Thank you. There are two things I would like to point out: 1) Google releasing something probably means they don't see it as important. 4-bit KV-cache quantization has been known for a long time. The fact that there is almost mass hysteria about this paper makes me think there is a lack of skepticism in this AI mania, even in a relatively tech-savvy crowd. 2) "But prices for memory companies are crashing!" Look around, the whole market is crashing.