Dell Server hardware monitoring software Openmanager, can monitor the battery, motherboard, temperature, and hard disk and so on. In monitoring, you may experience the following error message:
Info:memory Module 6 [DIMM7, 2048 MB] needs attention:single-bit warning error rate exceeded, single-bit failure error RA Te exceeded
This indicates a problem with memory monitoring, may be loose memory, and so on, but the system can still recognize the memory. A shutdown is required to reseat the memory. Due to the need to stop the machine shutdown, will affect the business. But the problem will always be reported to the police. Everyone will feel annoyed when they receive the alarm message. Memory monitoring can be shielded by the following methods:
1
|
check_openmanage --check storage -b dimm=all
|
You can see that memory and voltage are not detected.
Memory is detected without dimm=all.
The related hardware detection can be shielded. such as temperature detection and so on. Such as:
The code is as follows |
Copy Code |
/usr/local/nagios/libexec/check_openmanage--check storage-b Ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all/bat_charge =all/encl=all/ps=all/fan=all/temp=all/volt=all |