I have many binary files (.wav format) stored in S3 and I would like to read + process them in a distributed fashion with Dask on a cluster.
In PySpark there is