Motivation
A common situation in dataset generation/processing involves writing many tensors to disk from many processes/nodes in parallel, over a long duration. Shared storage is assumed, but that storage is often slow and subject to delays from NFS caching and the like, and many small file operations are inefficient. In addition, letting the user flush to disk manually helps surface file I/O bottlenecks, since it is clear exactly which call is blocking the code.
My current workflow with tensordicts is to generate one per process, periodically save it to disk (by deleting the old copy and creating a new one), and finally merge all the individual tensordicts with a cat, as sketched below.
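For concreteness, here is a minimal sketch of that workflow; the paths, tensor shapes and the worker/merge helpers are illustrative placeholders rather than anything from the issue:

```python
import shutil

import torch
from tensordict import TensorDict

SHARED_DIR = "/shared/storage"  # placeholder for the shared filesystem


def worker(worker_id: int, num_steps: int, save_every: int, feat: int = 128) -> None:
    """One tensordict per process, periodically rewritten to disk."""
    path = f"{SHARED_DIR}/worker_{worker_id}"
    buffer = []
    for step in range(num_steps):
        buffer.append(torch.randn(feat))
        if (step + 1) % save_every == 0:
            td = TensorDict({"data": torch.stack(buffer)}, batch_size=[len(buffer)])
            # Each periodic "save" deletes the previous on-disk copy and
            # rewrites everything, since there is no incremental flush.
            shutil.rmtree(path, ignore_errors=True)
            td.memmap_(prefix=path)


def merge(num_workers: int) -> TensorDict:
    """Final step: load every per-worker memmap and concatenate them."""
    parts = [TensorDict.load_memmap(f"{SHARED_DIR}/worker_{w}") for w in range(num_workers)]
    return torch.cat(parts, dim=0)
```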
Solution
Support incremental writing/saving of a memmap tensordict. Writes should stay in memory until a manual flush occurs, and a flush should not rewrite the entire file, so that other processes can write to other portions of the tensordict in parallel (see the usage sketch below).
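A hypothetical usage sketch of the requested behaviour; the flush() call and the slice-assignment semantics shown here do not exist in tensordict today and are only meant to illustrate the desired API:

```python
import torch
from tensordict import TensorDict

# Pre-allocate the full dataset once on shared storage (path is a placeholder).
td = TensorDict(
    {"data": torch.zeros(1_000_000, 128)},
    batch_size=[1_000_000],
).memmap_(prefix="/shared/storage/dataset")

# Each process writes only its own slice; writes accumulate in memory...
start, stop = 0, 10_000  # this process's shard, chosen for illustration
td[start:stop] = TensorDict(
    {"data": torch.randn(stop - start, 128)},
    batch_size=[stop - start],
)

# ...until the user explicitly flushes. Only the touched region would be
# written, so other processes can keep writing to their own portions in parallel.
td.flush()  # hypothetical API, not in tensordict today
```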
Checklist
I have checked that there is no similar issue in the repo (required)
In principle I don't see why it wouldn't be possible, but the way we work with memmap is through torch.from_file, which does not return a traditional mmap object with flush functionality.
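For context, a small illustration of that difference using throwaway local files (filenames are placeholders): numpy.memmap exposes an explicit flush(), while torch.from_file hands back a plain Tensor backed by the mapped file, with no flush handle.

```python
import numpy as np
import torch

# numpy: an mmap-backed array with a user-controlled flush to disk.
arr = np.memmap("np_buffer.bin", dtype=np.float32, mode="w+", shape=(1024,))
arr[:] = 1.0
arr.flush()  # explicit sync of dirty pages to the file

# torch: from_file maps a file and returns an ordinary Tensor.
n = 1024
with open("torch_buffer.bin", "wb") as f:
    f.write(b"\x00" * (n * 4))  # pre-size the file for 1024 float32 values
t = torch.from_file("torch_buffer.bin", shared=True, size=n, dtype=torch.float32)
t.fill_(1.0)
# There is no t.flush(); persistence depends on the OS writing dirty pages
# back on its own schedule (or when the mapping goes away).
```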