Recently, during a routine check of a server in BMC, we investigated a hardware error.
BMC means “Baseboard Management Controller” and it is a remote management web interface for servers. BMC is accessible with a web browser. Supermicro servers and Asus servers are providing BMC. The HP equivalent to BMC is “iLo” which stands for “Integrated Lights-Out”. In BMC and iLo one is able to start, turn off or reboot a server. With BMC (and iLo) the operating system can be accessed as well. BMC also shows detailed information about the hardware state of the server.
These remote management interfaces are necessary especially in data centers.
The ProblemDuring our routine check we saw in the BMC event log the following error message:
voltage logged a BMC Event : Upper Non-critical - going high(Reading value : 1.30 V And Threshold value : 1.32 V) was deasserted
We had the thought that parts of the costly server hardware are damaged and need to be replaced. No helpful information to this problem was found by us during an internet search.
The error could mean that the entire mainboard or a sensor on the mainboard is defect.
Before we informed the vendors customer service to organize a replacement we did the following:
We shut down the server hardware and unplugged all cables. Then we pressed the power button of the server several times. Then we waited several minutes and pressed the power button again.
Then we plugged in all the cables again and we started up the server.
The error was not shown during the next hours.
After this procedure we checked the server again and now, weeks later, the error seems to be persistently gone. Often the old tricks, well known from the PC era are still fixing some problems – also problems with modern expensive hardware.
Feel free to rate the blog post or even better leave a comment!
We are sorry that this post was not useful for you!
Let us improve this post!
Tell us how we can improve this post?