Remix.run Logo
paufernandez 7 hours ago

Yeah, but the xz algorithm is also not counted in the bytes... Here the "program" is the LLM, much like your brain remembers things by coding them compressed and then reconstructs them. It is a different type of compression: compression by "understanding", which requires the whole corpus of possible inputs in some representation. The comparison is not fair to classical algorithms yet that's how you can compress a lot more (given a particular language): by having a model of it.

wrs 7 hours ago | parent | next [-]

“Compressors are ranked by the compressed size of enwik9 (10^9 bytes) plus the size of a zip archive containing the decompresser.” [0]

[0] https://www.mattmahoney.net/dc/text.html

7 hours ago | parent | prev [-]
[deleted]