Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ConvNeXt family of models to keras.applications #16321

Closed
sayakpaul opened this issue Mar 29, 2022 · 6 comments · Fixed by #16421
Closed

Add ConvNeXt family of models to keras.applications #16321

sayakpaul opened this issue Mar 29, 2022 · 6 comments · Fixed by #16421
Assignees
Labels
type:feature The user is asking for a new feature.

Comments

@sayakpaul
Copy link
Contributor

Describe the feature and the current behavior/state.

ConvNeXt [1] family of models doesn't use attention or any new components for that but still achieves really good performance on ImageNet-1k while being efficient. They also demonstrate good performance on a variety of downstream tasks.

ConvNeXt family of models was trained using recipes used for training Vision Transformer-based models. Another point to note is that these models were evolved to simulate the design choices of Swin Transformers [2].

Will this change the current api? How?

Yes:

keras.applications.ConvNeXtBase, keras.applications.ConvNeXtTiny, ...

Who will benefit from this feature?

Keras users using keras.applications in their projects.

Contributing

  • Do you want to contribute a PR? (yes/no): Yes
  • If yes, please read this page for instructions
  • Briefly describe your candidate solution(if contributing): I have implemented the non-isotropic ConvNeXt variants here (with all the original pre-trained params ported). I also co-contributed the TF variant of this to Hugging Face transformers (Add TFConvNextModel huggingface/transformers#15750).

References

[1] https://arxiv.org/abs/2201.03545
[2] https://arxiv.org/abs/2103.14030

@fchollet @LukeWood

@sayakpaul sayakpaul added the type:feature The user is asking for a new feature. label Mar 29, 2022
@sachinprasadhs sachinprasadhs added the keras-team-review-pending Pending review by a Keras team member. label Mar 31, 2022
@hertschuh
Copy link
Collaborator

Hi Sayak, this looks like a great addition. Would you be willing to put up a PR for this?

@hertschuh hertschuh removed the keras-team-review-pending Pending review by a Keras team member. label Mar 31, 2022
@hertschuh hertschuh assigned LukeWood and unassigned sachinprasadhs Mar 31, 2022
@sayakpaul
Copy link
Contributor Author

Thanks. I think the following answers your question :)

Do you want to contribute a PR? (yes/no): Yes

@LukeWood
Copy link
Contributor

Sweet - yes we are happy to take this, and we can take it in keras.applications. Cheers @sayakpaul and thanks!

@LukeWood
Copy link
Contributor

Feel free to send PRs my way

@sayakpaul
Copy link
Contributor Author

Alright! I will get started as soon as I can then.

@sayakpaul
Copy link
Contributor Author

Join the conversation here: #16421.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type:feature The user is asking for a new feature.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants