This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

hadoop-ai can't detect ECC error #2146

Closed
mzmssg opened this issue Feb 15, 2019 · 1 comment

@mzmssg
Member

mzmssg commented Feb 15, 2019

Organization Name:

Microsoft

Short summary about the issue/question:

Jobs are scheduled onto GPUs that have ECC errors.

How to reproduce it:

The current ECC checker fails if the ECC error code is greater than 1.
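
The checker's code is not shown in this issue, so the following is only a hypothetical sketch of the kind of bug that would produce this behavior. It assumes the check compares the ECC value against exactly 1, so larger values slip through and the GPU is still offered to the scheduler. The function names and the sample values are illustrative and are not taken from hadoop-ai.

```python
from typing import Callable, Dict, List


def has_ecc_error_buggy(ecc_value: int) -> bool:
    # Hypothetical buggy check (an assumption, not the actual hadoop-ai code):
    # only an exact value of 1 is treated as an error.
    return ecc_value == 1


def has_ecc_error(ecc_value: int) -> bool:
    # Treat any nonzero value as an error, including values greater than 1.
    return ecc_value > 0


def schedulable_gpus(ecc_by_gpu: Dict[int, int],
                     check: Callable[[int], bool]) -> List[int]:
    # Return the GPU indices that the given check considers healthy.
    return sorted(gpu for gpu, value in ecc_by_gpu.items() if not check(value))


# Illustrative values: GPU 0 is healthy, GPU 1 reports 1, GPU 2 reports 3.
ecc_by_gpu = {0: 0, 1: 1, 2: 3}
buggy_result = schedulable_gpus(ecc_by_gpu, has_ecc_error_buggy)
fixed_result = schedulable_gpus(ecc_by_gpu, has_ecc_error)
```

Under these assumptions the buggy check still lists GPU 2 as schedulable, while the stricter check leaves only GPU 0.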

@mzmssg
Member Author

mzmssg commented Mar 20, 2019

Fixed in #2343.

@mzmssg mzmssg closed this as completed Mar 20, 2019