We previously covered using OMSA to monitor Dell servers, but OMSA has to be accessed through its web interface. This time we combine Nagios with check_openmanage to implement hardware monitoring for Dell servers.
First, let's take a look at the following picture:
As the picture shows, monitoring can be set up in two ways:
1. The Nagios server uses check_nrpe to call check_openmanage on the monitored host. This requires OMSA and check_openmanage to be installed on the monitored side.
2. The Nagios server runs check_openmanage itself and monitors the host remotely over SNMP. This requires perl-Net-SNMP on the Nagios server, and OMSA plus the SNMP service on the monitored side. Install the SNMP service first and OMSA second, so that OMSA automatically updates the SNMP configuration file for monitoring.
Personal opinion: the second method is more convenient because it only requires installing packages, whereas the first also requires configuring the Nagios client on every monitored host. In addition, check_nrpe consumes server resources, so the rest of this article uses SNMP.
I. Configuring the Nagios server side
1. Install check_openmanage on the Nagios server
wget http://folk.uio.no/trondham/software/files/check_openmanage-3.7.12.tar.gz
tar -xvf check_openmanage-3.7.12.tar.gz
cd check_openmanage-3.7.12
cp check_openmanage /usr/local/nagios/libexec
chown nagios.nagios /usr/local/nagios/libexec/check_openmanage
cp man/check_openmanage.8 /usr/share/man/man8
cp man/check_openmanage.conf.5 /usr/share/man/man5
2. Install perl-Net-SNMP on the Nagios server
CentOS 5:
wget http://mirrors.zju.edu.cn/epel/5/i386/epel-release-5-4.noarch.rpm
rpm -ivh epel-release-5-4.noarch.rpm
CentOS 6:
wget http://mirrors.zju.edu.cn/epel/6/i386/epel-release-6-8.noarch.rpm
rpm -ivh epel-release-6-8.noarch.rpm
Then, on either release:
yum install perl-Net-SNMP perl-Config-Tiny
In SNMP mode check_openmanage requires perl-Net-SNMP; without it the following error occurs:
ERROR: You need perl module Net::SNMP to run check_openmanage in SNMP mode
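A quick way to confirm the module is present before running the plugin is to ask Perl to load it. This is only a minimal sketch; the helper name have_perl_module is made up for illustration and is not part of check_openmanage.

```shell
# Hypothetical helper: succeeds if the named Perl module can be loaded.
have_perl_module() {
    perl -M"$1" -e 1 >/dev/null 2>&1
}

# Report whether Net::SNMP is available on this machine.
if have_perl_module Net::SNMP; then
    echo "Net::SNMP is installed"
else
    echo "Net::SNMP is missing"
fi
```

If the second message is printed, rerun the yum install step above.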
The Nagios server side is now set up.
II. Configuring the monitored side
1. Installing the SNMP Service
yum -y install net-snmp net-snmp-devel net-snmp-utils
2. Installing OMSA
wget -q -O - http://linux.dell.com/repo/hardware/latest/bootstrap.cgi | bash
yum install srvadmin-all
3. Start the OMSA service
/opt/dell/srvadmin/sbin/srvadmin-services.sh start
4. View ports
netstat -ntlp | grep :1311
If this port is listening, srvadmin was installed successfully; if it is not, something may have gone wrong with the installation. After installing, start the services for the first time with /opt/dell/srvadmin/sbin/srvadmin-services.sh start. Otherwise "service dataeng start" may run into problems, leaving OMSA only partially started.
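The port check can also be wrapped in a small function for use in scripts. The sketch below assumes the usual Linux netstat -ntlp output format; the name port_is_listening is hypothetical.

```shell
# Hypothetical helper: reads `netstat -ntlp` output on stdin and succeeds
# if a TCP socket is listening on the given local port ($1).
port_is_listening() {
    grep -Eq ":$1[[:space:]].*LISTEN"
}

# Example usage on the monitored host (not run here):
#   netstat -ntlp | port_is_listening 1311 && echo "OMSA web service is up"
```

The leading colon in the pattern keeps port 1311 from matching a longer port number that merely ends in 1311.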
Note:
(1) Be sure to install the SNMP service before installing OMSA, so that OMSA can configure the SNMP service automatically. If the order is reversed, the check may report this error:
ERROR: (SNMP) OpenManage is not installed or is not working correctly
This happens because the /etc/snmp/snmpd.conf configuration file needs a few changes that OMSA would otherwise make itself. After correction, the relevant lines look like this:
view all included .1
access notConfigGroup "" any noauth exact all none none
smuxpeer .1.3.6.1.4.1.674.10892.1
(2) If the error "SNMP CRITICAL: No response from remote host 'x.x.x.x'" is reported, the SNMP service is not installed on the monitored side.
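To check whether note (1) applies to a host, one option is to look for the Dell smuxpeer line in its snmpd.conf. This is a minimal sketch: the function name is hypothetical, and it assumes the directive is written in lowercase with the OID shown above.

```shell
# Hypothetical helper: succeeds if the given snmpd.conf ($1) contains an
# active (uncommented) smuxpeer line for the Dell OID.
snmpd_conf_has_dell_smux() {
    grep -Eq '^[[:space:]]*smuxpeer[[:space:]]+\.1\.3\.6\.1\.4\.1\.674\.10892\.1([[:space:]]|$)' "$1"
}

# Example usage on the monitored host (not run here):
#   snmpd_conf_has_dell_smux /etc/snmp/snmpd.conf || echo "smuxpeer line missing"
```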
III. Configuring monitoring items
1. Configure the commands
# Check storage devices
define command {
    command_name    check_storage
    command_line    $USER1$/check_openmanage -H $HOSTADDRESS$ --only storage -p -s -b ctrl_fw=0
}
# Check CPU
define command {
    command_name    check_cpu
    command_line    $USER1$/check_openmanage -H $HOSTADDRESS$ --only cpu -p -s -b ctrl_fw=0
}
# Check memory
define command {
    command_name    check_memory
    command_line    $USER1$/check_openmanage -H $HOSTADDRESS$ --only memory -p -s -b ctrl_fw=0
}
# Check power supplies
define command {
    command_name    check_power
    command_line    $USER1$/check_openmanage -H $HOSTADDRESS$ --only power -p -s -b ctrl_fw=0
}
# Check temperature
define command {
    command_name    check_temp
    command_line    $USER1$/check_openmanage -H $HOSTADDRESS$ --only temp -p -s -w $ARG1$ -c $ARG2$ -b ctrl_fw=0
}
Here --only restricts the check to a single component, -p outputs performance data for graphing, -s adds the service state to the output, and -b is the blacklist. Because the firmware on our servers is old, we blacklist the controller firmware check (ctrl_fw=0) so that it does not affect the other monitoring items.
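Before wiring a command into Nagios it can help to see what the expanded command line looks like and run it by hand. The sketch below prints that command line; the function name openmanage_cmd and the address 192.0.2.10 are placeholders, the plugin directory is assumed to be the one used during installation, and -H is assumed to be the plugin's host option.

```shell
# Hypothetical helper: prints the command Nagios would run for one component
# once $USER1$ and $HOSTADDRESS$ are expanded.
# $1 = plugin directory, $2 = host address, $3 = component for --only
openmanage_cmd() {
    printf '%s/check_openmanage -H %s --only %s -p -s -b ctrl_fw=0\n' "$1" "$2" "$3"
}

# Print the storage check for an example host.
openmanage_cmd /usr/local/nagios/libexec 192.0.2.10 storage
```

Running the printed command as the nagios user from the Nagios server is a simple way to confirm that SNMP access works before reloading Nagios.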
2. Configure the monitoring service group
define servicegroup {
    servicegroup_name    dell-openmanage
    alias                Dell Server Health Status
}
3. Configure the monitoring services
define service {
    use                      local-service
    host_name                usvr-131,usvr-119,usvr-70,usvr-71,usvr-72,usvr-73,usvr-82,usvr-83,usvr-84,usvr-85,usvr-86,usvr-87
    service_description      omsa_storage
    check_command            check_storage
    servicegroups            dell-openmanage
    notifications_enabled    1
}
define service {
    use                      local-service
    host_name                usvr-131,usvr-119,usvr-70,usvr-71,usvr-72,usvr-73,usvr-82,usvr-83,usvr-84,usvr-85,usvr-86,usvr-87
    service_description      omsa_cpu
    check_command            check_cpu
    servicegroups            dell-openmanage
    notifications_enabled    1
}
define service {
    use                      local-service
    host_name                usvr-131,usvr-119,usvr-70,usvr-71,usvr-72,usvr-73,usvr-82,usvr-83,usvr-84,usvr-85,usvr-86,usvr-87
    service_description      omsa_memory
    check_command            check_memory
    servicegroups            dell-openmanage
    notifications_enabled    1
}
define service {
    use                      local-service
    host_name                usvr-131,usvr-119,usvr-70,usvr-71,usvr-72,usvr-73,usvr-82,usvr-83,usvr-84,usvr-85,usvr-86,usvr-87
    service_description      omsa_power
    check_command            check_power
    servicegroups            dell-openmanage
    notifications_enabled    1
}
define service {
    use                      local-service
    host_name                usvr-131,usvr-119,usvr-70,usvr-71,usvr-72,usvr-73,usvr-82,usvr-83,usvr-84,usvr-85,usvr-86,usvr-87
    service_description      omsa_temp
    check_command            check_temp!"0=30/15"!"0=40/10"
    servicegroups            dell-openmanage
    notifications_enabled    1
}
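Since the service definitions differ only in the component name, they can be generated instead of typed by hand. The following is a rough sketch, not the author's method: gen_omsa_services is a hypothetical function, it covers only the checks that take no arguments (the temperature service with its thresholds is left out), and it assumes the servicegroups directive name used by Nagios Core object definitions.

```shell
# Hypothetical generator: prints one service definition per component.
# $1 = comma-separated host list, remaining arguments = component names.
gen_omsa_services() {
    hosts="$1"
    shift
    for comp in "$@"; do
        cat <<EOF
define service {
    use                      local-service
    host_name                $hosts
    service_description      omsa_$comp
    check_command            check_$comp
    servicegroups            dell-openmanage
    notifications_enabled    1
}
EOF
    done
}

# Print definitions for two example hosts.
gen_omsa_services usvr-70,usvr-71 storage cpu memory power
```

The output can be redirected into a configuration file that Nagios already includes.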
4. Check the configuration file and reload it
nagioscheck
service nagios reload
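To avoid reloading a broken configuration, the two steps can be chained so that the reload only happens when the check passes. This is a minimal sketch; reload_if_valid is a hypothetical wrapper, and the paths in the usage comment assume the /usr/local/nagios prefix used earlier.

```shell
# Hypothetical wrapper: run the reload command only if validation succeeds.
# $1 = validation command, $2 = reload command (both passed as strings).
reload_if_valid() {
    if sh -c "$1"; then
        sh -c "$2"
    else
        echo "configuration check failed; not reloading" >&2
        return 1
    fi
}

# Example usage on the Nagios server (not run here):
#   reload_if_valid "/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg" \
#                   "service nagios reload"
```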
IV. Firewall configuration
Since we monitor over SNMP, the monitored side must allow the Nagios server to reach the SNMP port, UDP 161:
/sbin/iptables -A INPUT -i em1 -p udp -s 10.10.5.89 --dport 161 -m comment --comment "Nagios snmp" -j ACCEPT
OK, hardware monitoring for the Dell servers is now complete.