Sorry, I've triggered an error in the MXNet source code. How can I debug it? It looks like a check failure in custom operators. How can I find more details? #16651
-
So far I'm working in a Linux Docker container, and MXNet was built from source.
I've looked at the source code mentioned above; it looks like this:
Replies: 7 comments
-
@mxnet-label-bot, add [Question, Debug]
-
This may be related to #16365. As a workaround, can you please remove declare_backward_dependency from the custom op that is being called and check: https://github.com/dingjiansw101/RoITransformer_DOTA/blob/master/fpn/operator_py/
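To find which custom op property classes actually define declare_backward_dependency, a small helper like the one below might help. This is only a sketch: `overrides_backward_dependency` is a hypothetical helper, and `FakeBaseProp` is a stand-in so the snippet runs without MXNet installed (its default method body is illustrative, not MXNet's actual default). In real use you would presumably pass `mx.operator.CustomOpProp` as the base class.

```python
def overrides_backward_dependency(prop_cls, base_cls):
    """Return True if prop_cls, or any ancestor listed before base_cls in its
    MRO, defines declare_backward_dependency itself instead of inheriting it
    from base_cls."""
    for cls in prop_cls.__mro__:
        if cls is base_cls:
            break
        if "declare_backward_dependency" in vars(cls):
            return True
    return False


# Stand-in base class (assumption: plays the role of mx.operator.CustomOpProp).
class FakeBaseProp:
    def declare_backward_dependency(self, out_grad, in_data, out_data):
        # Illustrative default only.
        return list(out_grad) + list(in_data) + list(out_data)


# Hypothetical op property that overrides the callback.
class PropWithOverride(FakeBaseProp):
    def declare_backward_dependency(self, out_grad, in_data, out_data):
        return list(out_grad)


# Hypothetical op property after the override has been removed.
class PropWithoutOverride(FakeBaseProp):
    pass


print(overrides_backward_dependency(PropWithOverride, FakeBaseProp))
print(overrides_backward_dependency(PropWithoutOverride, FakeBaseProp))
```

Any class for which the helper returns True is a candidate for the workaround: delete (or comment out) its declare_backward_dependency method so that the base class behavior is used, then rerun.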
-
Hey, I'm on it!
I've received a report that the Docker container running the code cannot report GPU usage properly, so I'll redo the experiment later. Can you tell me how this happened?
-
It may be an issue with how the storage type is assigned after invoking the declare_backward_dependency callback: https://github.com/apache/incubator-mxnet/blob/master/src/operator/custom/custom.cc#L474. I haven't had time recently to dig deeper into this.
-
OK, now that I have enough memory on this machine, the problem has appeared again.
-
Was a workaround ever found for this issue?
-
@TBag-ui one workaround is mentioned here: #16365 (comment)