Oh god. I have another idea for a really dumb project that might end up as a dumb blog post.

And in this case, the compressed zip file of all the text files I need is 23GB and the index into that database is just under 1GB.

But it's so dumb, I just have to play with it.
Oh yeah, just over 440k files and 120GB. This is going to be a fun exercise. Think I'll need to break out the C++ for this.
Oh no, the files are XML.

TinyXML2? I think it's in vcpkg.
You can follow @olafurw.
Tip: mention @twtextapp on a Twitter thread with the keyword “unroll” to get a link to it.

Latest Threads Unrolled: