-
Notifications
You must be signed in to change notification settings - Fork 2.5k
An illegal memory access was encountered #45
Comments
Could you give more information? I suspect it happens because you used a too high learning rate, and training diverged, giving large indices. |
@fmassa , Thanks for you quick reply.
|
So, you have changed the |
Try following the learning rate adaptation rules that I mentioned in the README, they are necessary for training to not diverge |
@fmassa I still can not figure the problem. |
So, to double check:
Is that right? Note that the meaning of Let me know if you still have problems |
@fmassa Thanks for your kind help. I will update if I have got progress. |
hi @fmassa
do u have ideas to solve it? |
@zimenglan-sysu-512 difficult to say without more context. Is this COCO? Are you using a standard model or have you adapted one of the models? It might require some digging to understand where the problem might come from. |
hi @fmassa |
🐛 Bug
I just run the below script with 4 x P100.
Expected behavior
Here is the error information,
It seems that the first two few iterations are ok. (iter: 0, 20)
Then in the iter 40, the number in the bracket becomes nan. Then I got the error informing me that an illegal memory was encountered.
Environment
I just install all the enviroments follow the instructions
The text was updated successfully, but these errors were encountered: