enable option to disable pin_memory in pytorch #239

rayandrew · 2024-10-24T01:39:11Z

This PR gives us option to disable pin memory in pytorch dataloader.
I encountered error while emulating stormer when we increase the prefetch factor since the memory is being reserved and cannot be released until the dataloader is finished.
Also, pin_memory is beneficial for GPU memory pinning which we do not use at all for DLIO [1, 2]

I set the default to True for backward compatibility, let me know if I should put pin_memory=False as default in the configuration.

Reference:
[1] https://pytorch.org/tutorials/intermediate/pinmem_nonblock.html
[2] https://discuss.pytorch.org/t/when-to-set-pin-memory-to-true/19723/19

rayandrew · 2024-10-24T01:42:03Z

Along with this PR, just want to point out the prefetch factor below while having discussion with @hariharan-devarajan recently

# torch_data_loader.py
        if self._args.read_threads >= 1:
            prefetch_factor = math.ceil(self._args.prefetch_size / self._args.read_threads)
        else:
            prefetch_factor = self._args.prefetch_size

Here, we limit the prefetch factor based on number of threads. Is it intended?
Probably it is better from user perspective to put everything as prefetch factor without making assumption that this number will be divided by number of workers.

hariharan-devarajan · 2024-10-30T15:49:43Z

Along with this PR, just want to point out the prefetch factor below while having discussion with @hariharan-devarajan recently
# torch_data_loader.py
        if self._args.read_threads >= 1:
            prefetch_factor = math.ceil(self._args.prefetch_size / self._args.read_threads)
        else:
            prefetch_factor = self._args.prefetch_size
Here, we limit the prefetch factor based on number of threads. Is it intended? Probably it is better from user perspective to put everything as prefetch factor without making assumption that this number will be divided by number of workers.

@zhenghh04 I agree with Ray that this is confusing for users of the benchmark. I would recommend to not do this.

hariharan-devarajan

This looks good. Thanks

rayandrew added 2 commits October 24, 2024 01:34

enable option to disable pin_memory in pytorch

f3b64ab

add the docs for pytorch pin memory

87db407

hariharan-devarajan requested review from zhenghh04 and hariharan-devarajan October 24, 2024 17:42

hariharan-devarajan approved these changes Oct 30, 2024

View reviewed changes

zhenghh04 approved these changes Oct 30, 2024

View reviewed changes

zhenghh04 merged commit c9225fb into argonne-lcf:main Oct 30, 2024
6 checks passed

rayandrew deleted the feature/add-pin-memory-options branch November 4, 2024 21:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

enable option to disable pin_memory in pytorch #239

enable option to disable pin_memory in pytorch #239

rayandrew commented Oct 24, 2024

rayandrew commented Oct 24, 2024

hariharan-devarajan commented Oct 30, 2024

hariharan-devarajan left a comment

enable option to disable pin_memory in pytorch #239

enable option to disable pin_memory in pytorch #239

Conversation

rayandrew commented Oct 24, 2024

rayandrew commented Oct 24, 2024

hariharan-devarajan commented Oct 30, 2024

hariharan-devarajan left a comment

Choose a reason for hiding this comment