
Bug fix for named nn.Sequential in pytorch parser #848

Merged
merged 10 commits into fastmachinelearning:main on Aug 28, 2023

Conversation

JanFSchulte (Contributor)

Parsing an nn.Sequential that is a named member of a model class produces a tensor naming convention in the model's state_dict that differs from what the parser expects, since the parser had so far been tested only on unnamed nn.Sequentials. This PR catches that case and adjusts the names of the tensors imported from the state_dict accordingly. A test is added to ensure that both cases keep parsing successfully.

Type of change


  • Bug fix (non-breaking change that fixes an issue)

Tests

To reproduce: without this PR, the following will fail:


import torch.nn as nn

from hls4ml.converters import convert_from_pytorch_model
from hls4ml.utils.config import config_from_pytorch_model



# simple model with a named nn.Sequential
class SeqModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer = nn.Sequential(
            nn.Conv2d(1, 20, 5),
            nn.ReLU(),
            nn.Conv2d(20, 64, 5),
            nn.ReLU(),
        )

    def forward(self, x):
        output = self.layer(x)
        return output

model = SeqModel()

config = config_from_pytorch_model(model)
output_dir = 'test_pytorch'

convert_from_pytorch_model(
        model, (None, 1, 5, 5), hls_config=config, output_dir=output_dir)
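
For contrast, the previously supported unnamed case discussed below would look like this (a minimal sketch; the variable names and output_dir here are illustrative):

# unnamed top-level nn.Sequential: the case the parser already handled
seq_model = nn.Sequential(
    nn.Conv2d(1, 20, 5),
    nn.ReLU(),
    nn.Conv2d(20, 64, 5),
    nn.ReLU(),
)

seq_config = config_from_pytorch_model(seq_model)
convert_from_pytorch_model(
    seq_model, (None, 1, 5, 5), hls_config=seq_config, output_dir='test_pytorch_unnamed')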

Pytest tests have been added to verify that both cases keep working.

Checklist

  • I have read the guidelines for contributing.
  • I have commented my code, particularly in hard-to-understand areas.
  • My changes generate no new warnings.
  • I have installed and run pre-commit on the files I edited or added.
  • I have added tests that prove my fix is effective or that my feature works.

@jmitrevs jmitrevs added this to the v0.8.0 milestone Aug 12, 2023
@jmitrevs jmitrevs added the please test Trigger testing by creating local PR branch label Aug 12, 2023
@JanFSchulte JanFSchulte mentioned this pull request Aug 16, 2023
if "layer." in key:
layerInKeyName = True

if '_' in layer_name:
Contributor

I am not sure I follow the logic here, but I am not a PyTorch expert. Can you double-check?

JanFSchulte (Contributor, Author)

I had another look and this is a bit convoluted, but it works. The issue is that the layers inside a torch.nn.Sequential get named in the pattern nameOfSequential_n, where n just numbers the layers inside the sequential. If the sequential does not have a name, which happens if you just do model = nn.Sequential(...), the layers get names that start with a bare underscore, which hls4ml doesn't like. So we add the prefix "layer" to those in the loop over the layers, and we have to remove it again when we load the tensors. The changes in this PR account for the fact that someone could create a named nn.Sequential that is itself named layer, which clashes with our previous assumption that any layer name starting with layer_ was added by hand to work around the leading-underscore issue.
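
A minimal sketch of the two naming conventions (the class and variable names here are illustrative):

import torch.nn as nn

# named member: state_dict keys are prefixed with the attribute name
class Named(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer = nn.Sequential(nn.Conv2d(1, 20, 5), nn.ReLU())

print(list(Named().state_dict().keys()))
# ['layer.0.weight', 'layer.0.bias']

# unnamed top-level nn.Sequential: keys start with the bare index, and
# torch.FX sanitizes the node names to _0, _1, ... (hence the 'layer'
# prefix that hls4ml adds by hand)
model = nn.Sequential(nn.Conv2d(1, 20, 5), nn.ReLU())
print(list(model.state_dict().keys()))
# ['0.weight', '0.bias']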

A further complication is that torch.FX reports these structured layer names with underscores, while the state_dict uses a '.', so we have to replace the '_' with '.' here. The last complication is layers that are used multiple times in the same model. torch.FX reports them as distinct layers, appending _n to the layer name for the n-th use, but the tensors in the state_dict do not carry those suffixes, so we also have to account for that.
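
A minimal sketch of the two translations just described (an illustration of the idea, not the PR's actual code):

import re

# torch.FX reports 'layer_0'; the corresponding state_dict prefix is 'layer.0'
fx_name = 'layer_0'
key_prefix = fx_name.replace('_', '.')       # -> 'layer.0'
# (the PR also has to guard against user-chosen names like 'layer' that make
# this naive replacement ambiguous)

# a module reused in forward() shows up in torch.FX as 'fc' and 'fc_1', but
# the state_dict only has 'fc.weight' and 'fc.bias'; strip the reuse suffix
node_name = 'fc_1'
base_name = re.sub(r'_\d+$', '', node_name)  # -> 'fc'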

Bit of a mess, but I have tested all these cases and this implementation catches all the edge cases that I'm aware of.

JanFSchulte (Contributor, Author)

This has now been solved much more nicely by Vladimir :)

JanFSchulte (Contributor, Author)

pre-commit.ci autofix

vloncar (Contributor) commented Aug 25, 2023

This now includes the changes from #840. There'll be a follow-up PR with the remaining bits we need to parse sPHENIX tracking GNN.

@jmitrevs jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Aug 27, 2023
@jmitrevs jmitrevs merged commit 5cac79c into fastmachinelearning:main Aug 28, 2023
4 checks passed
Labels: please test (Trigger testing by creating local PR branch)
3 participants