The nagiso client requires that the  OMSA (openmanage server administrator) nagios Client installation must be installed on the Dell server omsa (Refer to http://linux.dell.com/repo/hardware/OMSA_7.4.0/) 1, adding Dell's Yum library ( access to http:// linux.dell.com/repo/hardware View Latest version ) wget -q -o - http://linux.dell.com/ REPO/HARDWARE/OMSA_7.4.0/BOOTSTRAP.CGI | BASH 2, installation srvadmin yum install Srvadmin-all -y 3, start srvadmin/opt/dell/srvadmin/sbin/srvadmin-services.sh startnagios server configuration 1, Dell official Omsa monitoring script download (download to nagios server /usr/local/nagios/libexec, and assign Nagios Execute permissions) wget http://folk.uio.no/trondham/software/check_openmanage-3.7.11/check_openmanage also, due to the Check_ OpenManage is a Perl script, so Perl interpreter need to install  PERL-NET-SNMPYUM INSTALL PERL-NET-SNMP   2, The following is a manual execution of the script to get the hardware status #电压./check_openmanage -h 192.168.1.100 --only voltagevoltage ok - 20 voltage probes checked #cpu./check_openmanage -h 192.168.1.100 --only cpuprocessors OK - 1 processors checked #风扇转速./check_openmanage -h 192.168.1.100 --only fansfans ok - 12 fan probes checked# storage./check_openmanage -H 192.168.1.100 --only storageSTORAGE OK - 3 physical drives, 1 logical drives #内存./check_openmanage -h 192.168.1.100 -- only memorymemory ok - 2 memory modules, 32768 mb total memory #电池./check_openmanage -h 192.168.1.100 --only batteriesbatteries ok - 1 batteries checked check_openmanage script for more detailed usage please refer to: http://folk.uio.no/trondham/ software/check_openmanage.html if the above no error, it can be configured to nagios , the configuration method online more, here is not much to say # uninstall openmanage server Administratoryum erase $ (Rpm -qa | grep srvadmin)
Fault Handling
1. When the system log appears Server Administrator (shared library): data engine eventid: 0 a semaphore set has to be created but the system limit for the maximum number of semaphore sets has been exceeded When this thing comes up, it means that your surveillance is no longer available. is probably meant to say: Data engine failed to open successfully due to the maximum number of signals in the system. This requires modifying the system kernel's settings for semaphore sets . Here's how: ipcs -l ------ Shared Memory Limits -------- max number of segments = 4096 max seg size (Kbytes) = 67108864 max total shared memory (Kbytes) = 17179869184 min seg size ( bytes) = 1 ------ Semaphore Limits -------- max number of arrays = 128 max semaphores per array = 250 max semaphores system wide = 32000 max ops per semop call = 32 semaphore max value = 32767 ------ Messages: Limits -------- max queues system wide = 16 max size of message ( bytes) = 65536 default max size of queue ( bytes) = 65536 sysctl -a | grep shm vm.hugetlb_shm_group = 0 kernel.shmmni = 4096 kernel.shmall = 4294967296 kernel.shmmax = 68719476736 Solutions Adjustment max queues system wide max number of arrays sysctl -w kernel.msgmni=16384 sysctl -w kernel.sem= "250 32000 100 1024 " ################################################## echo " kernel.msgmni=16384 " >> /etc/sysctl.conf echo " kernel.sem=\ "250 32000 100 1024\ "" >> /etc/sysctl.conf view ipcs -l ------again shared memory limits --------Max number of segments = 4096max seg size (Kbytes) = 67108864max total shared memory (Kbytes) = 17179869184min seg size (bytes) = 1------ Semaphore Limits --------Max number of arraYs = 1024max semaphores per array = 250max semaphores system wide = 32000max ops per semop call = 100semaphore max value = 32767------ Messages: Limits --------Max queues system wide = 16384max size of message (bytes) = 65536default max size of queue (bytes) = 65536 Restart/opt/dell/srvadmin/sbin/srvadmin-services.sh restart
2
Refused Smux peer:oid snmpv2-smi::enterprises.674.10892.1, descr Systems Management SNMP MIB plug-in Manager SNMP denied
/ETC/INIT.D/SNMPD Restart if your configuration is correct
3
Ipmi_si:could not enable interrupts, failed set, using polled mode. cannot be interrupted, the wrong setting, the way polling is used.
We'll use our own methods to monitor the state of the machine.
get_dell_server_detail.py Collect Dell hardware information to the/tmp directory
cat/data/program/nagios-client/libexec/get_dell_server_detail.py #!/usr/bin/python2.7#-*-coding:utf-8-*-"" "The Dell Server Hardware detailauthor jastme "" "Import commands,ostry:if os.path.exists ('/tmp/dell_hardware_detail.txt '): Passexcept ioerror:f=open ('/tmp/dell_hardware_detail.txt ', ' W ') F.close () def dellserver (): Detail=commands . GetOutput ('/data/program/nagios-client/libexec/check_openmanage-s-d ') ff=open ('/tmp/dell_hardware_detail.txt ', ' W ') ff.write (detail) ff.close () if __name__ = = ' __main__ ': Dellserver ()
Dell Monitoring OMSA Monitoring