Mounting tar archives as a filesystem in WebAssembly

Fri, 24 Apr 2026 00:00:00 +0000

TLDR: instead of extracting a .tar.gz archive, we can generate a small index file which lists the size and offset of each file in the tar, and use this metadata to mount the tar blob directly via Emscripten’s WORKERFS without any copying.

For details see: https://github.com/jeroen/tar-vfs-index

The struggle with tarballs

Lots of data on the internet lives in tarballs, often distributed as gzipped .tar.gz files. To get to this data, we have to download the entire .tar.gz file, decompress it, and then iterate through the blob from beginning to end to make copies of the files we need. This is expensive and painful in memory constrained environments.

Tar on ʕ•ᴥ•ʔ Notes from Jeroen

Mounting tar archives as a filesystem in WebAssembly

The struggle with tarballs