ConvNeXt Architecture #68
Comments
I have implemented it: https://github.com/sayakpaul/ConvNeXt-TF/. Should be available here soon: https://tfhub.dev/sayakpaul/collections/convnext/1. |
I suppose that one took the initial approach of converting the PyTorch reference implementation's weights. |
Yeah, absolutely. If you serialize the weights of the converted models you will have something similar to what's expected here. |
More or less. I think that some of the differences in this repo could be:
|
I am not absolutely sure about any of these points. I am also not sure how individual contributors could run the models from scratch without any involved support from the maintainers. For implementing architectures, there could be specific bits while generic bits could still benefit from what the library already offers. If you see what I have implemented, you'd probably notice there aren't too many specific bits there. Also, it's helpful to have examples aiding your points. For example, if you could provide an example of what you meant in your first point in the context of this issue thread, that would be super helpful. |
We don't have this infra in place right now, so we cannot contribute a GitHub Action orchestrating the training job for reproducibility.
Yes, other than thinking about obvious reusable components related to a specific network, like new layers, optimizers, losses and metrics that it could introduce and that would be exposed here as API. There was also: So mainly it is just thinking of this as a library, with the network as an e2e integration example of the newly introduced (if needed) component APIs. |
Hey @sayakpaul just a heads up, we are planning to hold off on incorporating models for a little bit longer. @qlzh727 has some great ideas on changing the structure for Keras applications a bit, and we'd like to iron those out before adding any models. |
Thanks for letting me know. Does this also mean |
I'd guess so, this should only last a month or so before we have the new sample model ready though. |
Understood. Thank you. |
@LukeWood Is this issue open for contributions as well? Next to Swin Transformers, ConvNeXt boasts even higher performance and stats utilizing similar robustness of training datasets! I definitely want to learn more about ConvNeXt in KerasCV, as one of my research projects would directly benefit from having ConvNeXt integrated into Keras! |
It is not open for contributions as of now. I believe Sayak will be contributing these to Keras core. Is that correct @sayakpaul? As of now KerasCV won't be accepting models until some details are ironed out on our end. |
I am working on it, yes. |
@sayakpaul we can migrate this to KerasCV when you are ready. |
Yes sure. I will start working on it very soon. |
Sayak fixed this! |
Sayak is working on this