
[FEATURE] Update PyTorch dependency #884

Closed
sarthakpati opened this issue Jun 19, 2024 · 9 comments · Fixed by #919
@sarthakpati (Collaborator)

Is your feature request related to a problem? Please describe.

Now that PyTorch 2.3.0 has been out for a while, does it make sense to make the switch? There are a few backward-incompatible changes [ref] that potentially relate to the work being done by @Geeks-Sid, so I will definitely wait for his comments.

Describe the solution you'd like

N.A.

Describe alternatives you've considered

N.A.

Additional context

Comments/suggestions, @VukW, @szmazurek?
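
For illustration, a hypothetical sketch of what the bump might look like, assuming torch is pinned in GaNDLF's setup.py (the file location and the exact current pin are assumptions, not the actual change):

```python
# Hypothetical fragment of setup.py (names and current pin are assumptions)
requirements = [
    "torch==2.3.0",  # bumped from the previously pinned 2.x release
    # ... other dependencies unchanged ...
]
```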

@sarthakpati added the dependencies label on Jun 19, 2024
@VukW (Contributor) commented Jun 28, 2024

Sorry, I am not proficient enough in either the latest PyTorch changes or in how GaNDLF uses distributed training. Do we have any tests for multi-GPU training? I cannot find any. If yes, then maybe just running the tests should be enough to ensure the new version is OK for us.
In any case, we will have to update the dependency eventually, so why not now?
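
For illustration, a minimal sketch of the kind of guard such a test could use, assuming pytest; the test itself is hypothetical (GaNDLF currently has no such test), but the skip condition and APIs are standard:

```python
import pytest
import torch

# Hypothetical multi-GPU smoke test: skipped unless at least two CUDA
# devices are visible, so it is a no-op on CPU-only CI runners.
@pytest.mark.skipif(torch.cuda.device_count() < 2, reason="requires >= 2 GPUs")
def test_data_parallel_forward():
    model = torch.nn.DataParallel(torch.nn.Linear(8, 2)).cuda()
    out = model(torch.randn(4, 8).cuda())
    assert out.shape == (4, 2)
```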

@sarthakpati (Collaborator, Author)

Unfortunately, we do not have any GPU tests right now. 😞

I am fine with updating the dependency right now, but I would like to get the opinion of other developers/contributors/maintainers. 😄

@szmazurek (Collaborator)

Hey,
From my perspective, why not; it would probably be a matter of re-running the tests and making some corrections, which, as @VukW says, would need to happen anyway.

@sarthakpati (Collaborator, Author)

Sounds good, thanks!

Just waiting for @Geeks-Sid to respond and then we can start.

@Geeks-Sid (Collaborator)

Looks like the backward-compatibility issues do not affect us; however, it would be good to run the tests on GPUs. We are good to go, but is there any issue with staying on the current version?

@sarthakpati (Collaborator, Author)

> it would be good to run the tests on GPUs

Agreed - I am in discussion with a couple of CI providers about giving us some extremely limited free GPU compute. Let's see how it goes.

> is there any issue with staying on the current version?

Nothing specific. Moving to the latest stable release just ensures that we aren't too far behind on the latest bug fixes from PyTorch. And since we will be making a jump with the new API branch anyway, I figured it might make sense to go to the latest version.

@szmazurek (Collaborator)

Dears, regarding the torch version: from version 2.2, torch has a built-in flash attention mechanism; see https://pytorch.org/blog/pytorch2-2/. @sarthakpati mentioned that in the future we may integrate flash attention to speed up some models that employ attention; this would also be useful for the synthesis module, where some diffusion models use it too. So, considering version updates, we may look directly at 2.2, as that solves both the version update and flash attention.
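
For reference, a minimal sketch of how the built-in kernel is exposed: F.scaled_dot_product_attention is PyTorch's public fused-attention API (available since 2.0, with FlashAttention-2 dispatch on supported GPUs as of 2.2); the shapes here are arbitrary and the backend pinning is optional:

```python
import torch
import torch.nn.functional as F

# Shapes: (batch, heads, seq_len, head_dim); fp16 on CUDA is required
# for the flash backend to be eligible.
q = k = v = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float16)

# Optionally restrict dispatch to the flash kernel (context manager in
# torch 2.0-2.3; later versions prefer torch.nn.attention.sdpa_kernel).
with torch.backends.cuda.sdp_kernel(enable_flash=True,
                                    enable_math=False,
                                    enable_mem_efficient=False):
    out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
```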

@sarthakpati (Collaborator, Author)

Since #845 also involves a torch version update, I think it might be best to let it get merged and tagged before working on this update.

@sarthakpati (Collaborator, Author)

So, if there is no further issue with this, I am going to assign this to @scap3yvt to start work.
