Add ConvNeXt family of models to keras.applications #16321

sayakpaul · 2022-03-29T05:01:12Z

Describe the feature and the current behavior/state.

The ConvNeXt [1] family of models uses no attention mechanism and introduces no new components, yet it achieves strong performance on ImageNet-1k while remaining efficient. The models also perform well on a variety of downstream tasks.

The ConvNeXt models were trained with the recipes used for Vision Transformer-based models. Note also that their architecture was evolved to mirror the design choices of Swin Transformers [2].

Will this change the current api? How?

Yes:

keras.applications.ConvNeXtBase, keras.applications.ConvNeXtTiny, ...
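As a rough illustration of how the proposed entry points might be used, here is a minimal sketch. It assumes the new constructors accept the same keyword arguments as existing keras.applications models (`weights`, `include_top`, `input_shape`); `build_convnext` and `demo` are hypothetical helper names used only for this example and are not part of Keras.

```python
from typing import Any


def build_convnext(applications: Any, variant: str = "ConvNeXtTiny", **kwargs: Any) -> Any:
    """Look up a ConvNeXt constructor by name on an applications namespace and call it.

    `applications` is expected to be `keras.applications` (or any object exposing
    the same attributes); passing it in keeps this helper independent of the
    installed Keras version.
    """
    if not variant.startswith("ConvNeXt"):
        raise ValueError(f"{variant!r} is not a ConvNeXt variant name")
    constructor = getattr(applications, variant, None)
    if constructor is None:
        # Older Keras releases do not ship the ConvNeXt models.
        raise AttributeError(f"{variant} is not available in this applications namespace")
    return constructor(**kwargs)


def demo() -> None:
    # Assumption: a Keras release that includes the models requested in this issue.
    import keras

    # weights=None builds the architecture with random initialization and avoids
    # downloading the pre-trained parameters; include_top=False drops the classifier head.
    backbone = build_convnext(
        keras.applications,
        "ConvNeXtTiny",
        weights=None,
        include_top=False,
        input_shape=(224, 224, 3),
    )
    print(backbone.output_shape)
```

Calling `demo()` builds a randomly initialized ConvNeXtTiny backbone and prints its output shape; swapping in "ConvNeXtBase" would select the other variant named above.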

Who will benefit from this feature?

Keras users using keras.applications in their projects.

Contributing

Do you want to contribute a PR? (yes/no): Yes
If yes, please read this page for instructions
Briefly describe your candidate solution (if contributing): I have implemented the non-isotropic ConvNeXt variants here (with all the original pre-trained parameters ported). I also co-contributed the TensorFlow variant to Hugging Face transformers (Add TFConvNextModel, huggingface/transformers#15750).

References

[1] https://arxiv.org/abs/2201.03545
[2] https://arxiv.org/abs/2103.14030

@fchollet @LukeWood


hertschuh · 2022-03-31T17:19:38Z

Hi Sayak, this looks like a great addition. Would you be willing to put up a PR for this?

sayakpaul · 2022-03-31T17:31:19Z

Thanks. I think the following answers your question :)

Do you want to contribute a PR? (yes/no): Yes

LukeWood · 2022-03-31T18:11:24Z

Sweet - yes we are happy to take this, and we can take it in keras.applications. Cheers @sayakpaul and thanks!

LukeWood · 2022-03-31T18:11:38Z

Feel free to send PRs my way

sayakpaul · 2022-04-01T00:46:09Z

Alright! I will get started as soon as I can then.

sayakpaul · 2022-04-17T12:59:36Z

Join the conversation here: #16421.

sayakpaul added the type:feature label Mar 29, 2022

google-ml-butler bot assigned sushreebarsa Mar 29, 2022

sushreebarsa assigned sachinprasadhs and unassigned sushreebarsa Mar 29, 2022

sachinprasadhs added the keras-team-review-pending label Mar 31, 2022

hertschuh removed the keras-team-review-pending label Mar 31, 2022

hertschuh assigned LukeWood and unassigned sachinprasadhs Mar 31, 2022

sayakpaul mentioned this issue Apr 16, 2022

Add ConvNeXt models #16421

Merged

copybara-service bot closed this as completed in #16421 May 10, 2022
