-
Notifications
You must be signed in to change notification settings - Fork 19.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add ConvNeXt family of models to keras.applications #16321
Comments
Hi Sayak, this looks like a great addition. Would you be willing to put up a PR for this? |
Thanks. I think the following answers your question :)
|
Sweet - yes we are happy to take this, and we can take it in keras.applications. Cheers @sayakpaul and thanks! |
Feel free to send PRs my way |
Alright! I will get started as soon as I can then. |
Join the conversation here: #16421. |
Describe the feature and the current behavior/state.
ConvNeXt [1] family of models doesn't use attention or any new components for that but still achieves really good performance on ImageNet-1k while being efficient. They also demonstrate good performance on a variety of downstream tasks.
ConvNeXt family of models was trained using recipes used for training Vision Transformer-based models. Another point to note is that these models were evolved to simulate the design choices of Swin Transformers [2].
Will this change the current api? How?
Yes:
Who will benefit from this feature?
Keras users using
keras.applications
in their projects.Contributing
transformers
(Add TFConvNextModel huggingface/transformers#15750).References
[1] https://arxiv.org/abs/2201.03545
[2] https://arxiv.org/abs/2103.14030
@fchollet @LukeWood
The text was updated successfully, but these errors were encountered: