Bug: Inconsistent Behavior in StreamingDataloader after loading states
Description
The StreamingDataloader behaves inconsistently when a saved state is loaded. Specifically, issues arise when iterating over the dataloader after loading a state that was saved either after a complete first epoch or partway through the first epoch.
bhimrazy changed the title from "Bug: Inconsistent Behavior with StreamingDataloader loading states" to "Bug: Inconsistent Behavior with StreamingDataloader loading states (specific for StreamingDataset)" on Aug 14, 2024
bhimrazy changed the title from "Bug: Inconsistent Behavior with StreamingDataloader loading states (specific for StreamingDataset)" to "Bug: Inconsistent Behavior with StreamingDataloader loading states (specific to StreamingDataset)" on Aug 14, 2024
🐛 Bug
To Reproduce
Create Optimized Dataset
Bugs
Iterating over the dataloader after loading a state saved after one complete epoch raises an error.
a. Without loading state -> [OK]
Output
b. Loading the state at an intermediate step raises an error -> [IndexError]
Output
Iterating over the dataloader after loading a state saved partway through the first epoch does not reset once that epoch completes.
Output:
Loading a state with num_workers=0 raises a num_workers error.
Output
current_epoch is not synchronized between the dataloader and dataset entries of the dataloader state when num_workers is not defined.
Output
{'dataset': {'num_samples_yielded': 100,
'num_workers': 0,
'batch_size': 4,
'current_epoch': 2,
'input_dir_path': '/Users/bhimrajyadav/litdata/my_optimized_dataset',
'input_dir_url': None,
'item_loader': None,
'drop_last': False,
'seed': 42,
'world_size': 1,
'shuffle': False,
'subsampled_files': ['chunk-0-0.bin', 'chunk-1-0.bin'],
'region_of_interest': [(0, 50), (0, 50)]},
'current_epoch': 1,
'num_samples_yielded': 100,
'latest_worker_idx': 0}
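The mismatch in the state above can be checked mechanically. The following is a minimal sketch, not part of litdata: `find_state_mismatches` is a hypothetical helper, and it assumes that the top-level (dataloader) and nested `dataset` entries are expected to agree on `current_epoch` and `num_samples_yielded`. The state is copied from the output above and trimmed to the relevant keys.

```python
from typing import Any, Dict, List


def find_state_mismatches(state: Dict[str, Any]) -> List[str]:
    """Return a description of each shared key whose value differs between
    the top-level (dataloader) state and the nested 'dataset' state.

    Hypothetical helper for illustration; the set of keys that should agree
    is an assumption, not something litdata documents.
    """
    dataset_state = state.get("dataset", {})
    shared_keys = ("current_epoch", "num_samples_yielded")
    mismatches = []
    for key in shared_keys:
        # Only compare keys that are present at both levels.
        if key in state and key in dataset_state and state[key] != dataset_state[key]:
            mismatches.append(
                f"{key}: dataloader={state[key]!r}, dataset={dataset_state[key]!r}"
            )
    return mismatches


# State from the reported output, trimmed to the keys that matter here.
reported_state = {
    "dataset": {
        "num_samples_yielded": 100,
        "num_workers": 0,
        "batch_size": 4,
        "current_epoch": 2,
    },
    "current_epoch": 1,
    "num_samples_yielded": 100,
    "latest_worker_idx": 0,
}

mismatches = find_state_mismatches(reported_state)
print(mismatches)  # ['current_epoch: dataloader=1, dataset=2']
```

A check like this could be run against `dataloader.state_dict()` before saving a checkpoint to catch the desynchronization early.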
Environment
How installed (conda, pip, source): pip
Additional context