
KeyError in multihead.py example when not using num_workers on MPS #1656

Open
guilhermegog opened this issue Jul 9, 2024 · 0 comments
Labels
bug Something isn't working

Comments

guilhermegog commented Jul 9, 2024

Hey everyone, first off thank you for developing this amazing library.

🐛 Describe the bug
There appears to be a bug, though I am not sure whether it is related to Avalanche or to MPS: if you run the multihead example without the num_workers parameter set (which presumably falls back to the default of 0), the training loop stops working and a KeyError is raised with a seemingly random key.

🐜 To Reproduce
In the multihead.py example, replace lines 71 and 72 with the following:
strategy.train(train_task)
strategy.eval(test_stream)
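
For a self-contained reproduction, the sketch below sets up a multi-head scenario roughly the way the example does; the benchmark, model, and hyperparameters are assumptions for illustration, not copied from multihead.py. Omitting num_workers means the DataLoader falls back to its default of 0, i.e. data loading in the main process.

```python
import torch
from torch.nn import CrossEntropyLoss
from torch.optim import SGD

from avalanche.benchmarks.classic import SplitMNIST
from avalanche.models import MTSimpleMLP
from avalanche.training.supervised import Naive

# Multi-task benchmark: each experience carries its own task label,
# so the multi-head model creates one classifier head per task.
benchmark = SplitMNIST(n_experiences=5, return_task_id=True)

model = MTSimpleMLP()  # multi-head MLP from avalanche.models
optimizer = SGD(model.parameters(), lr=0.001, momentum=0.9)
criterion = CrossEntropyLoss()

# Run on MPS when available; the failure is reportedly MPS-specific.
device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

strategy = Naive(
    model=model,
    optimizer=optimizer,
    criterion=criterion,
    train_mb_size=32,
    train_epochs=1,
    eval_mb_size=32,
    device=device,
)

for train_task in benchmark.train_stream:
    # Omitting num_workers here (instead of passing it explicitly, as the
    # original example does) is what reportedly triggers the KeyError on MPS.
    strategy.train(train_task)
    strategy.eval(benchmark.test_stream)
```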

🦋 Additional context
The issue only seems to occur when running the script on 'mps' devices; running the same piece of code on a server with CUDA, the issue does not persist.

@guilhermegog guilhermegog added the bug Something isn't working label Jul 9, 2024
@guilhermegog guilhermegog changed the title KeyError in Multihead example when not using num_workers on MPS KeyError in multihead.py example when not using num_workers on MPS Jul 9, 2024