simlevesque 12 hours ago

But when indexing your JSON or CSV, if you have say 10 columns, each column is stored separately on your disk instead of all together. So a scan of one column only needs to read about a tenth of the disk space used for the data. Obviously this depends on the columns' content.
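
To make the arithmetic concrete, here is a rough sketch rather than how any particular index or file format works. It assumes a made-up table with 10 equal-width integer columns and simply counts the bytes a scan of one column has to touch in each layout. The table contents and all function names are hypothetical.

```python
import struct

NUM_COLUMNS = 10
NUM_ROWS = 1000
VALUE_FORMAT = "<q"  # assumption: every value is a fixed-width 8-byte integer
VALUE_SIZE = struct.calcsize(VALUE_FORMAT)

# Hypothetical data: the value at (row, col) is row * NUM_COLUMNS + col.
table = [[row * NUM_COLUMNS + col for col in range(NUM_COLUMNS)]
         for row in range(NUM_ROWS)]


def row_layout(rows):
    """Store whole records back to back, the way a CSV or JSON-lines file does."""
    return b"".join(struct.pack(VALUE_FORMAT, v) for row in rows for v in row)


def column_layout(rows):
    """Store each column as its own contiguous chunk."""
    return [b"".join(struct.pack(VALUE_FORMAT, row[col]) for row in rows)
            for col in range(len(rows[0]))]


def scan_column_from_rows(blob, col):
    """Read one column out of the row layout; returns (values, bytes_read).

    The whole blob is counted as read. That models text formats, where
    records have variable width and cannot be skipped without parsing them.
    """
    row_size = NUM_COLUMNS * VALUE_SIZE
    values = [struct.unpack_from(VALUE_FORMAT, blob, offset + col * VALUE_SIZE)[0]
              for offset in range(0, len(blob), row_size)]
    return values, len(blob)


def scan_column_from_columns(chunks, col):
    """Read one column out of the column layout; returns (values, bytes_read)."""
    chunk = chunks[col]
    values = [item[0] for item in struct.iter_unpack(VALUE_FORMAT, chunk)]
    return values, len(chunk)


row_values, row_bytes = scan_column_from_rows(row_layout(table), 3)
col_values, col_bytes = scan_column_from_columns(column_layout(table), 3)
print(row_bytes, col_bytes)  # 80000 8000
```

The ratio comes out at exactly a tenth only because every column here has the same width. With real data, a wide text column next to a few narrow numeric ones would shift it in either direction.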

gdulli 12 hours ago | parent [-]

But you can have a surprisingly large amount of data before the inefficiency you're talking about becomes untenable.