Incorrect status when server is powered off (Dell PowerEdge R740) #110
Comments
Hi, good point, I will have a look at it. |
Hi, when the server is powered off, the fan and memory checks go critical.
[OK]: INFO: HPE ProLiant BL460c Gen10 (CPU: 2, MEM: 64GB) - BIOS: I41 v1.46 (10/02/2018) - Serial: *** - Power: Off - Name: NOT SET |
Hey, I just pushed a change to the thank you. |
Hi, thanks! I tried the latest changes on the next-release branch: On the same Server it works great now:
I've found some other servers though:
For CPU it's not working yet, and I also receive a negative integer overflow for the fans. On another server I have problems with the temperature sensors when it is turned off:
|
Here I additionally receive a CRITICAL because of an Unknown Battery RAID Controller Status:
|
Uiui, this needs a more general approach then. Can you check whether the negative fan values are being sent directly by the iDRAC? |
Yes, sorry I didn't check that before:
|
😄 Well, this is quite something. I should add some sanity checks to the returned values: if they are out of range, they should default to 0. What do you think? |
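A minimal sketch of such a sanity check, for illustration only. The function name `sanitize_reading` and the bound `MAX_PLAUSIBLE_RPM` are hypothetical and not taken from the plugin; the only behavior assumed from the discussion is "out of range defaults to 0".

```python
# Hypothetical upper bound for a fan reading; a real plugin would choose
# limits per sensor type.
MAX_PLAUSIBLE_RPM = 100_000


def sanitize_reading(value, lower=0, upper=MAX_PLAUSIBLE_RPM):
    """Return the reading as a float if it is a number within
    [lower, upper]; otherwise return 0."""
    # Missing values and booleans are not valid sensor readings.
    if value is None or isinstance(value, bool):
        return 0
    try:
        number = float(value)
    except (TypeError, ValueError):
        return 0
    # NaN is the only float that is not equal to itself.
    if number != number:
        return 0
    # Negative overflow values and absurdly large values fall back to 0.
    if number < lower or number > upper:
        return 0
    return number
```

With this in place, a reading such as `-2147483648` reported by the BMC would be shown as 0 instead of a negative fan speed.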
Sounds good! |
This is better, but I have the same problem with the --proc check when the system is powered off: [CRITICAL]: Processor CPU.Socket.1 (Intel(R) Xeon(R) Gold 6226R CPU @ 2.90GHz) status is: None |
I will fix it in the next version, but that won't be until after the Easter holiday break. |
Hey, One thing I've noticed (on Hosts that are Power: on): Sometimes I randomly get |
Good to hear. But this behavior only occurs with the latest change? And not with older versions? |
I don't know yet, I couldn't reproduce it on the command line - I'll check after the weekend |
Now it's OK for --proc, but the problem now occurs with --power: |
Hi, I deliberately left out the power supply section. The BMC should monitor this correctly even if the server is switched off. I assume it would matter if a power supply failed while the server is in standby. What do you think? |
The BMC says the status is None, but you assign the CRITICAL status. |
Seems to be fine, didn't see any more Criticals |
Hi, I just pushed another commit regarding the power supply status when the server is switched off. Can you try again please? |
I will try today |
All seems to be fine now |
Thank you for testing, then I will close this issue. |
Hello,
My server is a Dell PowerEdge R740.
When the system is powered off, the status does not represent reality:
When the status of a sensor is unknown, why assign the CRITICAL status rather than the UNKNOWN status?
And is it possible to exclude unknown sensors when the server is powered off?
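To make the request concrete, here is a rough sketch of the mapping being asked for. It is not the plugin's actual code: the names `sensor_state` and `HEALTH_TO_STATE` are hypothetical. It assumes the sensor health arrives as a Redfish-style string ("OK", "Warning", "Critical") or as None, and it uses the standard Nagios plugin return codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
# Standard Nagios plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Hypothetical lookup from a Redfish-style health string to a plugin state.
HEALTH_TO_STATE = {"ok": OK, "warning": WARNING, "critical": CRITICAL}


def sensor_state(health, system_powered_on):
    """Map a sensor health value to a plugin state.

    Returns None when the sensor should be skipped: its health is
    missing and the system is powered off.
    """
    text = "" if health is None else str(health).strip().lower()
    if text in ("", "none", "unknown"):
        # A missing status is UNKNOWN, not CRITICAL; with the system
        # powered off it is expected, so the sensor is excluded.
        return UNKNOWN if system_powered_on else None
    # Unrecognized strings are reported as UNKNOWN rather than CRITICAL.
    return HEALTH_TO_STATE.get(text, UNKNOWN)
```

A caller would drop sensors for which `sensor_state` returns None and take the worst remaining state as the overall result. Components that the BMC still monitors in standby, such as power supplies, could bypass the exclusion by always passing `system_powered_on=True`.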