A "blood case" caused by a dell R420 power failure, dellr420
It is exaggerated to say that the "blood case" was written, but it took a long night to complete the operation and maintenance. As an O & M engineer, it's common to get up late in the middle of a sudden Server failure. It's strange to start talking about things:
A few days ago, at around midnight, I received a message about server downtime, and then remotely used the dell idrac card. As a result, I couldn't use it to boot normally and sent an email directly to the data center, ask them to reset the idrac Management Card (that is, unplug the power cord and insert it back in 2 minutes ),
After the machine room has been properly operated, you can connect to the server. The result is not good for 1 minute. The machine crashes again. I still just analyzed the problem and asked the machine room to handle the problem, the IDC staff responded that the machine could not be started. At that time, I had
A bad hunch may not be able to sleep tonight. After the phone asks me to know that the server is plugged in with a power cord, some "drips" sound may sound, and I suspect that the power may be faulty. I first changed a power cord, as a result, I found an idle server of the same model.
The power supply is intended to let the IDC room personnel change to look at it. As a result, the IDC room personnel are not powerful, saying that the power supply cannot be removed, and they do not dare to dismantle it. I am afraid that the power supply will be broken down, so I have to find another solution, I suddenly thought that I could switch the hard disk to a server of the same type.
Start the server and restore the online business as soon as possible. Because the server is dell's R420 and is still in the repair period, we called dell's official after-sales hotline: 400-886-8618, if you want dell engineers to repair the site, it's not realistic, it's too late, it's online
The business needs to be restored as soon as possible. Therefore, you can only check whether the hard disk replacement solution is feasible. After obtaining confirmation from dell Technical support, you can change the hard disk solution.
Here we will introduce the two servers (for the sake of convenience, we will record them as A and B, A as A power fault server, and B as A normal server ), two hard disks are used as raid 1 (two hard disks are backed up to each other). The raid card and machine configurations are the same,
The hard disk replacement solution is feasible only when such conditions are met. The specific replacement steps are as follows:
1. Unplug the two hard disks of machine A, shut down machine B, unplug the power cord, and insert the two hard disks of machine A into machine B.
2. Enable the B server to power on and start the server. An error message is displayed, indicating that there is external raid information. You need to import the information and press the on-screen prompt to enter the raid configuration tool.
3. In the "PD Mgmt" tab, we can see that the two hard disks of the same size and State are "Foreign.
4. Switch the label to "Foreige View" and check that two Disks under "Physical Disks" are "Online", but both are "Foreign" hard Disks.
5. move the cursor to PERC H310 Mini (Bus 1, Dev 0), press F2, "Foreign Config" --> "import", Press enter, and then confirm, you can import the raid information successfully. (Remember that after confirmation, it is equivalent to already imported and does not need to be saved)
Figure: