There's good news on that front, if not for the hardware and bandwidth you need.
PDF readers and open-source libraries used in document processing will all need updating to handle the Brotli compression filter.
extract-audio --format parquet --input train-00000-of-00010.parquet --output files-parquet/ extract-audio --format arrow --input data-00000-of-01189.arrow --output ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results