Remix.run Logo
nickpsecurity a day ago

You're thinking in a provably-useful direction:

https://arxiv.org/pdf/2312.11514

Tuna-Fish 20 hours ago | parent [-]

HBF is not that. The paper you linked is about how to use flash memory that exists to boost LLM performance, with all kinds of optimization tricks. HBF is about making flash memory that doesn't require any of those tricks, and just has the read throughput that's needed for inference.