You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Did you search issues to find if somebody asked this question before?
yes
If your question is about hang, did you read this doc?
yes
If your question is about docker, did you read this doc?
yes
Bug report:
I got an error as below. It works okay when the number of mpi processes is less than 12 but the error is shown with more than 12 processes. Can you let me know why it is happend?
I also checked hwloc is installed but no machine does not include it.
`[elsa-05:25920] [[9783,0],1] ORTE_ERROR_LOG: Data unpack would read past end of buffer in file grpcomm_direct.c at line 355
[elsa-02:18424] [[9783,0],2] ORTE_ERROR_LOG: Data unpack would read past end of buffer in file grpcomm_direct.c at line 355
An internal error has occurred in ORTE:
[[9783,0],1] FORCE-TERMINATE AT Data unpack would read past end of buffer:-26 - error grpcomm_direct.c(359)
This is something that should be reported to the developers.
An internal error has occurred in ORTE:
[[9783,0],2] FORCE-TERMINATE AT Data unpack would read past end of buffer:-26 - error grpcomm_direct.c(359)
This is something that should be reported to the developers.
--------------------------------------------------------------------------`
The text was updated successfully, but these errors were encountered:
Environment:
TensorFlow
1.11
0.16.4
4.0.1
10.0
2.4
3.6
Ubuntu 18.04
7.4.0
Checklist:
yes
yes
yes
Bug report:
I got an error as below. It works okay when the number of mpi processes is less than 12 but the error is shown with more than 12 processes. Can you let me know why it is happend?
I also checked hwloc is installed but no machine does not include it.
`[elsa-05:25920] [[9783,0],1] ORTE_ERROR_LOG: Data unpack would read past end of buffer in file grpcomm_direct.c at line 355
[elsa-02:18424] [[9783,0],2] ORTE_ERROR_LOG: Data unpack would read past end of buffer in file grpcomm_direct.c at line 355
An internal error has occurred in ORTE:
[[9783,0],1] FORCE-TERMINATE AT Data unpack would read past end of buffer:-26 - error grpcomm_direct.c(359)
This is something that should be reported to the developers.
An internal error has occurred in ORTE:
[[9783,0],2] FORCE-TERMINATE AT Data unpack would read past end of buffer:-26 - error grpcomm_direct.c(359)
This is something that should be reported to the developers.
--------------------------------------------------------------------------`
The text was updated successfully, but these errors were encountered: