Adding more ConvNeXt variants + Speed optimizations #5253
Conversation
💊 CI failures summary and remediations

As of commit d708d06 (more details on the Dr. CI page):

🕵️ 1 new failure recognized by patterns. The following CI failure does not appear to be due to upstream breakages:

**CodeQL / build (1/1)** — Step: "Build TorchVision"

| Job | Step |
|---|---|
| cmake_macos_cpu | `curl -o conda.sh https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh` |
| | `sh conda.sh -b` |
| | `source $HOME/miniconda3/bin/activate` |
| | `conda install -yq conda-build cmake` |
| | `packaging/build_cmake.sh` |

This comment was automatically generated by Dr. CI. Please report bugs/suggestions to the (internal) Dr. CI Users group.
@s9xie @liuzhuang13 As discussed at #5197 (comment), here are some benchmarks from our side. This is the speed of TorchVision's current implementation 2bbb112 vs after applying this patch 290440b. I understand that your numbers were different, but I acknowledge that my benchmarks cover only a specific batch-size and hardware combination. The difference is big, but I was hoping we could close the speed gap by optimizing the underlying kernel and avoid needing workarounds in TorchVision. Have you spoken with Core about it? Overall, what are your thoughts?
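For context on what such a measurement involves, below is a minimal sketch of a forward-pass benchmark using `torch.utils.benchmark`; the batch size, input resolution, and run count are illustrative assumptions, not the settings behind the numbers above.

```python
import torch
from torch.utils import benchmark
from torchvision import models

device = "cuda" if torch.cuda.is_available() else "cpu"
model = models.convnext_small().eval().to(device)
# Batch size and resolution are illustrative assumptions.
x = torch.randn(32, 3, 224, 224, device=device)

timer = benchmark.Timer(
    stmt="model(x)",
    globals={"model": model, "x": x},
)
with torch.inference_mode():
    # torch.utils.benchmark handles CUDA synchronization for timing.
    print(timer.timeit(100))
```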
@datumbox Thanks! It seems that after your patch our implementations are really similar, both using permutes and linear layers. If I understand your results correctly, yours is somehow 15~20% faster than ours. Any idea why? We will benchmark your current patched version against ours too.
@liuzhuang13 I've assumed it's because I don't use a custom LayerNorm. I would love to know if your benchmarks confirm this. I've also notified the PyTorch performance team to see whether this is a known issue being worked on, or something worth improving in the future (I've tagged you and Saining on the post FYI).
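To make the comparison concrete, here is a rough sketch of the two LayerNorm strategies under discussion, assuming `(N, C, H, W)` inputs. The `ChannelsFirstLayerNorm` class below is a generic hand-rolled example of normalizing in channels-first layout, not the exact code from either repository.

```python
import torch
from torch import nn

class ChannelsFirstLayerNorm(nn.Module):
    """Hand-rolled LayerNorm over C of an (N, C, H, W) tensor."""
    def __init__(self, dim, eps=1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(dim))
        self.bias = nn.Parameter(torch.zeros(dim))
        self.eps = eps

    def forward(self, x):
        mean = x.mean(1, keepdim=True)
        var = (x - mean).pow(2).mean(1, keepdim=True)
        x = (x - mean) / torch.sqrt(var + self.eps)
        return self.weight[:, None, None] * x + self.bias[:, None, None]

class PermutedLayerNorm(nn.Module):
    """Permutes to channels-last so the built-in nn.LayerNorm kernel applies."""
    def __init__(self, dim, eps=1e-6):
        super().__init__()
        self.norm = nn.LayerNorm(dim, eps=eps)

    def forward(self, x):
        x = x.permute(0, 2, 3, 1)      # (N, C, H, W) -> (N, H, W, C)
        x = self.norm(x)
        return x.permute(0, 3, 1, 2)   # back to (N, C, H, W)
```

The hand-rolled version launches several elementwise kernels per call, whereas `nn.LayerNorm` dispatches to a single fused kernel, which is one plausible source of the gap.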
Thanks @datumbox
Thanks for the review @NicolasHug. I'm now going to move the class to main TorchVision so that we can include it in the upcoming release. Edit: On second thought, I'll handle the graduation to the core area in a separate PR.
Summary:

* Refactor model builder.
* Add 3 more convnext variants.
* Adding weights for convnext_small.
* Fix minor bug.
* Fix number of parameters for small model.
* Adding weights for the base variant.
* Adding weights for the large variant.
* Simplify LayerNorm2d implementation.
* Optimize speed of CNBlock.
* Repackage weights.

Reviewed By: kazhang

Differential Revision: D33927490

fbshipit-source-id: 569d9f752b1c5d5ba6f9a8f9721b4f91fac6663d
Adding `convnext_small`, `convnext_base` and `convnext_large`.

*(Accuracy tables for the Small, Base and Large variants appeared here.)*

Also switch from channels-first + Conv1x1 to channels-last + Linear layers, as it gets a 20% speed boost.
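A minimal sketch of what that switch looks like for the MLP part of a block; the module names and sizes here are hypothetical, and the real CNBlock also includes a depthwise convolution, normalization, and a residual connection.

```python
import torch
from torch import nn

dim = 96  # illustrative channel count

# Channels-first: 1x1 convolutions on (N, C, H, W).
mlp_conv = nn.Sequential(
    nn.Conv2d(dim, 4 * dim, kernel_size=1),
    nn.GELU(),
    nn.Conv2d(4 * dim, dim, kernel_size=1),
)

# Channels-last: permute once, then plain Linear layers on (N, H, W, C).
class MlpLinear(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.fc1 = nn.Linear(dim, 4 * dim)
        self.act = nn.GELU()
        self.fc2 = nn.Linear(4 * dim, dim)

    def forward(self, x):                 # x: (N, C, H, W)
        x = x.permute(0, 2, 3, 1)         # to channels-last
        x = self.fc2(self.act(self.fc1(x)))
        return x.permute(0, 3, 1, 2)      # back to channels-first

x = torch.randn(8, dim, 56, 56)
print(mlp_conv(x).shape, MlpLinear(dim)(x).shape)  # both torch.Size([8, 96, 56, 56])
```

Both paths are mathematically equivalent (a 1x1 convolution is a per-pixel linear layer), but the Linear path can hit better-optimized GEMM kernels, which is plausibly where the reported speedup comes from.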