Support multi-process/multi-node sharding for S3IterableDataset
#53
Labels
enhancement
New feature or request
We currently don't have a built-in way to do sharding for `S3IterableDataset`, so every worker process in a `DataLoader` will see the same stream of objects. We should provide a way to shard the stream across worker processes and nodes.

In the meantime, something like this from `torchdata` will work as a workaround:
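The `torchdata` snippet referred to above is not reproduced here. Separately, as a library-free illustration of the idea, here is a minimal sketch of round-robin sharding in plain Python. The helper names `global_shard` and `shard_iter` are hypothetical and are not part of `S3IterableDataset` or `torchdata`. It assumes each process knows its rank and world size (for example from `torch.distributed.get_rank()` and `torch.distributed.get_world_size()`) and each worker knows its id and the worker count (for example from `torch.utils.data.get_worker_info()`).

```python
from itertools import islice
from typing import Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def global_shard(rank: int, world_size: int, worker_id: int, num_workers: int) -> Tuple[int, int]:
    """Combine process rank and DataLoader worker id into (shard_index, num_shards).

    Hypothetical helper: assumes every process runs the same number of workers.
    """
    return rank * num_workers + worker_id, world_size * num_workers


def shard_iter(items: Iterable[T], shard_index: int, num_shards: int) -> Iterator[T]:
    """Yield every num_shards-th item of the stream, starting at shard_index."""
    if not 0 <= shard_index < num_shards:
        raise ValueError("shard_index must be in [0, num_shards)")
    return islice(items, shard_index, None, num_shards)


# Example: 2 processes with 2 workers each gives 4 disjoint shards.
keys = [f"obj-{i}" for i in range(10)]
index, total = global_shard(rank=1, world_size=2, worker_id=0, num_workers=2)
my_keys = list(shard_iter(keys, index, total))  # ["obj-2", "obj-6"]
```

Note that this only skips items after they have been listed, so every worker still iterates the full object listing. It avoids duplicate downloads only if the objects themselves are fetched lazily.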