Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sorting, compression algorithm +level, and data types can all have an impact. I noted elsewhere that a Boolean is getting represented as an integer. That’s one bit vs 1-4 bytes.

There is also flexibility in what you define as the dataset. Skinnier, but more focused tables could be space saving vs a wide table that covers everything -will probably break compressible runs of data.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: