-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Fused Op causes MXNetError #16747
Comments
I suggest turn the fused_op off by default in the 1.6.0 release and announce it as experimental feature, or revert the PR. @szha @eric-haibin-lin @junrushao1994 @DickJC123 @wkcn @reminisce @haojin2 @TaoLv @marcoabreu What do you think? |
I agree to turn the fused_op off by default until fused_op is stable. |
+1 |
Isn't right now the period of finding those integration bugs and fixing them for 1.6 release? I will definitely look into this issue and fix it, not sure why you propose to turn the feature off by default? |
@ptrendx I think we are already in a code-freeze status and the simplest fix is to turn it off by default. We could easily turn it on in 1.6.1 once we have confirmed that it has no impact in all the training scripts (there are plenty of them) and some may take time to run. |
Ok, I sent a clarification email to dev@ as you are not actually the first person to reach out to me with this misunderstanding of code freeze. Code freeze is a period where bugs are found and fixed in order to polish the release and provide the best experience for the end users. I treat the bugs about fusion with highest priority and will do my best to fix them. If I fail to address all issues before the time to make RC, then I agree it should be turned off by default and marked experimental. |
I agree with @ptrendx, we should try to fix the bugs and ship the features if time allows. |
I received the clarification email about the meaning of code freeze and I agree with @ptrendx that we should try to fix it these days and consider to turn it off by default if we fail to do so. BTW, what's the expected date for 1.6 RC? |
I created a PR with a fix. @leezu, could you validate it? |
@ptrendx thanks for the fix. Just confirmed it works. |
Description
After #15167 is merged, GluonNLP CI broke.
Error Message
To Reproduce
git clone https://github.com/dmlc/gluon-nlp; cd gluon-nlp; pytest --color=yes -s scripts -k 'test_finetune_train[float16-WNLI-bert_12_768_12-2]'
Environment
https://pypi.org/project/mxnet-cu100/1.6.0b20191102/
The text was updated successfully, but these errors were encountered: