Remix.run Logo
Show HN: Turboquant.cpp – Quantize embeddings to 1-4 bits, no training (400 LoC)(github.com)
2 points by andrewmikhail 8 hours ago | 1 comments
andrewmikhail 8 hours ago | parent [-]

[flagged]