1.1.1. Introduction to DELL server hardware monitoring and DELL system management tool OMSA
This article describes how to use Nagios and OMSA to monitor the hardware health status of DELL servers. The Nagios monitoring mode is in NRPE mode. You need to configure the check_openmanage script and install the dell omsa tool.
Use OpenManage and Nagios to monitor DELL server hardware deployment manual:
Http://folk.uio.no/trondham/software/check_openmanage.html
1) What is OMSA?
OMSA is the abbreviation of Dell Openmanage Server Administrator.
Dell OpenManage Server Administrator (OMSA) provides acomprehensive, one-to-one systems management solution in two ways: from anintegrated, web browser-based graphical user interface (GUI) and from a commandline interface (CLI) through the operating system. server Administrator isdesigned for system administrators to manage systems locally and remotely on anetwork. it allows system administrators to focus on managing their entirenetwork by providing comprehensive one-to-one systems management.
2) install OMSA
Yum Source Address of DELL OpenManage:
Http://linux.dell.com/repo/hardware/Linux_Repository_14.04.00/
Configure the yum source of OMSA:
Create the/etc/yum. repos. d/dell-omsa-repository.repo source file:
Or
Execute the command to automatically configure:
The wget-q-O-http://linux.dell.com/repo/hardware/Linux_Repository_14.04.00/bootstrap.cgi | bash
Install OMSA:
Yum install srvadmin-all
Configure the OMSA service to automatically start:
/Opt/dell/srvadmin/sbin/srvadmin-services.sh enable.
Start the OMSA service:
/Opt/dell/srvadmin/sbin/srvadmin-services.sh start
View the omsa service status:
/Opt/dell/srvadmin/sbin/srvadmin-services.sh status
Dell_rbu (module) is running
Ipmidriver is running
Dsm_sa_datamgrd (pid 1331 1197) is running
Dsm_sa_eventmgrd (pid 1381) is running
Dsm_sa_snmpd (pid 1440) is running
Dsm_om_shrsvcd (pid 1508) is running...
Dsm_om_connsvcd (pid 1562) is running...
View the listening status of omsa services:
# Netstat-npae | egrep-iv 'mysql | ssh | xinetd | udevd | crond | syslogd | upstart | auditd'
We noticed that the dsm_om_connsvc service listens to port 1311 of the TCP protocol and provides the http access function.
3) uninstall the web Components in the OMSA toolkit.
In this case, you only need to use the OMSA toolkit to monitor the health status of hardware and do not need to use the Web management function provided by OMSA. To avoid problems caused by improper system firewall or web management, in this example, we uninstall the OMSA web components.
View the listening status of the omsa component:
# Netstat-npae | egrep-iv 'mysql | ssh | xinetd | udevd | crond | syslogd | upstart | auditd'
ActiveInternet connections (servers and established)
ProtoRecv-Q Send-Q Local Address Foreign Address State User Inode PID/Program name
Tcp 0 0: 1311: * LISTEN 0 656427 1563/dsm_om_connsvc
ActiveUNIX domain sockets (servers and established)
ProtoRefCnt Flags Type State I-Node PID/Program name Path
Unix 2 [ACC] stream listening 6555231197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/. ipc/dcsmilpipea
Unix 2 [ACC] stream listening 655525 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/. ipc/dcsmilpipep
Unix 2 [ACC] stream listening 655527 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/. ipc/dcsmilpipeu
Unix 2 [ACC] stream listening 655770 1508/dsm_om_shrsvcd/opt/dell/srvadmin/var/lib/openmanage/shrsvc/dsm_om_shrsvc
Unix 2 [ACC] stream listening 655772 1508/dsm_om_shrsvcd/opt/dell/srvadmin/var/lib/openmanage/shrsvc/omintf5e4
Unix 2 [] stream connected 656423 1563/dsm_om_connsvc
Unix 2 [] stream connected 656015 1563/dsm_om_connsvc
Unix 3 [] stream connected 655972 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/. ipc/dcsmilpipea
Unix 3 [] stream connected 655971 1563/dsm_om_connsvc
Unix 3 [] stream connected 655650 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/. ipc/dcsmilpipea
Unix 3 [] stream connected 655649 1440/dsm_sa_snmpd
Unix 3 [] stream connected 655589 1197/dsm_sa_datamgr/opt/dell/srvadmin/var/lib/openmanage/. ipc/dcsmilpipea
Unix 3 [] stream connected 655588 1381/dsm_sa_eventmg
The dsm_om_connsvc service of the OMSA component listens on TCP port 1311.
View the program called by the dsm_om_connsvc service:
Lsof-p 1563 #1563 is the pid of the dsm_om_connsvc service process
View the rpm package of the program file called by dsm_om_connsvc:
# Rpm-qf/opt/dell/srvadmin/lib64/openmanage/apache-tomcat/lib/tomcat-api.jar
Srvadmin-tomcat-7.4.0-4.97.1.el6.x86_64
# Rpm-qf/opt/dell/srvadmin/lib64/openmanage/jre/lib/jce. jar
Srvadmin-jre-7.4.0-4.98.1.el6.x86_64
Disable the OMSA service:
/Opt/dell/srvadmin/sbin/srvadmin-services.sh stop
Uninstall the srvadmin-tomcat and srvadmin-jre packages:
# Rpm e srvadmin-tomcat-7.4.0-4.97.1.el6.x86_64
Error: Failed dependencies:
Srvadmin-tomcat = 7.4.0 is needed by (installed) srvadmin-webserver-7.4.0-4.1.1.el6.x86_64
# Rpm e srvadmin-webserver-7.4.0-4.1.1.el6.x86_64
Error: Failed dependencies:
Srvadmin-webserver = 7.4.0 is needed by (installed) srvadmin-all-7.4.0-4.1.1.el6.x86_64
Uninstall the srvadmin-webserver package:
# Rpm-e -- nodeps srvadmin-webserver-7.4.0-4.1.1.el6.x86_64
# Rpm e srvadmin-tomcat-7.4.0-4.97.1.el6.x86_64
# Rpm e srvadmin-jre-7.4.0-4.98.1.el6.x86_64
Delete the directory of appache-tomcat:
# Rm-rf/opt/dell/srvadmin/lib64/openmanage/apache-tomcat
Start the OMSA service:
#/Opt/dell/srvadmin/sbin/srvadmin-services.sh start
StartingSystems Management Device Drivers:
Startingdell_rbu: [OK]
Startingipmi driver: Already started [OK]
StartingSystems Management Data Engine:
Startingdsm_sa_datamgrd: [OK]
Startingdsm_sa_eventmgrd: [OK]
Startingdsm_sa_snmpd: [OK]
Starting DSMSA Shared Services: [OK]
Test the check_openmanage script:
./Check_openmanage-d
The output check items are the same as those before the OMSA web component is uninstalled.
Check the OMSA Service Listening status after the OMSA web component is uninstalled:
# Netstat-npae | egrep-iv 'mysql | ssh | xinetd | udevd | crond | syslogd | upstart | auditd'
At this time, the OMSA service only listens for Unix domain sockets, while the Unix domain socket is only used for communication between processes in the current operating system, for example, the check_openmanage script calls the OMSA service to check the hardware health status of the DELL server.
So far, the OMSA Web component has been uninstalled.
4) install the check_openmanage installation package
:
Http://folk.uio.no/trondham/software/check_openmanage.html#download
Download the check_openmanage Toolkit:
Wget http://folk.uio.no/trondham/software/files/check_openmanage-3.7.11.tar.gz
Test the check_openmanage tool:
Tar zxf check_openmanage-3.7.11.tar.gz
Cd check_openmanage-3.7.11
./Check_openmanage-d
./Check_openmanage
If the message "Storage Error" is displayed, add the -- no-storage parameter:
./Check_openmanage -- no-storage
# Check_openmanage will check 50 ~ About 60 DELL services
5) Configure NRPE
Edit the/usr/local/nagios/etc/nrpc. cfg file and add:
Command [check_dell_openmanage] =/path/to/check_openmanage
Or
Command [check_dell_openmanage] =/path/to/check_openmanage -- no-storage
Copy the check_openmanage script to the/usr/local/nagios/libexec/directory:
Cp check_openmanage-3.7.11/check_openmanage/usr/local/nagios/libexec/
Test command:
Check_nrpe-h ip-c check_dell_openmanage
6) Precautions
Check_openmanage is a perl script, so the operating system must have perl installed.
7) What should I do if the server does not have an Internet connection?
If the server does not have an Internet, you can consider creating an iptables nat ing on an Internet-connected machine, map the yum configured by the server only on the Intranet to the Internet, or deploy a yum source in the IDC;
8) What other methods can be used?
If you do not use OMSA and check_openmanage to monitor the health status of hardware, you can also use ipmitool to monitor the health status, but you need to develop your own scripts.
9) applicable environment
If you are using a DELL server, we recommend that you use it.
10) Additional information on OMSA component security
The Dell OpenManage Server Administrator (OMSA) 7.1 and earlier versions have the XSS vulnerability, which allows remote attackers to inject Web scripts or HTML. The DELL manufacturer has released an upgrade patch to fix this security problem. Please download the patch from the vendor's homepage, for details, refer to the "OMSA component security patch" section in the "reference link for relevant materials in this article" at the end of this article ".
11) Other System Management tools provided by DELL vendors
DELL also provides Microsoft System Center management and configuration tools, OracleEntreprise Manager 12c plug-ins, and plug-ins that support HP and IBM. For details, log on to the DELL official website and choose Support> Drivers & Downloads> Select Server type> System Managements.
12) DELL-provided plug-ins for OracleEnterprise Manager 12c
Dell OpenManage Plug-in v1.0 for Oracle Enterprise Manager12c
Dell OpenManage Plug-in for Oracle Enterprise Managerprovides a proactive approach to data center management that delivers featuresfor monitoring Dell server, storage, and networking infrastructures inenvironment managed by Oracle Enterprise Manager (OEM ). it also supportsmapping of database workload to Dell hardware for quicker fault detection andconsole launch of Dell devices to perform troubleshooting, configuration, andmanagement activities. it protects customer's existing investigation in OEM consoleand helps in between of integration and management of Dell devices.
13) What is Unix domain socket?
A Unixdomain socket or IPC socket (inter-process communication socket) is a datacommunications endpoint for exchanging data between processes executing withinthe same host operating system. while similar in functionality to named pipes, Unix domain sockets may be created as connection? Mode (SOCK_STREAM or SOCK_SEQPACKET) or as connectionless (SOCK_DGRAM), while pipesare streams only. processes using Unix domain sockets do not need to share acommon ancestry. the API for Unix domain sockets is similar to that of anInternet socket, but it does not use an underlying network protocol forcommunication. the Unix domain socket facility is a standard component of POSIXoperating systems. unixdomain sockets use the file system as their address name space. they arereferenced by processes as inodes in the file system. this allows two processesto open the same socket in order to communicate. however, communication occursentirely within the operating system kernel. in addition to sending data, processes may send filedescriptors into ss a Unix domain socket connection using the sendmsg () andrecvmsg () system CILS.
14) References
Custom check_openmanage threshold:
Http://dreamway.blog.51cto.com/1281816/1048274
Use the omreport command:
Http://www.sxszjzx.com /~ T096/manual/SC/Dosa/CLI/report.htm
DELL vendor Website:
Http://www.dell.com/support/drivers/us/en/04/ProductSelector/Select/FamilySelection? CategoryPath = all-products % 2Fesuprt_ser_stor_net % 2Fesuprt_poweredge & Family = PowerEdge & DisplayCrumbs = Product % 2520 Type % 40% 2 CServers % 252C % 2520 Storage % 252C % 2520% 2526% 2520 Networking % 40% 2 CPowerEdge & rquery = na
Yum Source Address of the DELL vendor OMSA:
Http://linux.dell.com/repo/hardware/Linux_Repository_14.04.00/
Use OpenManage and Nagios to monitor DELL server hardware deployment manual:
Http://folk.uio.no/trondham/software/check_openmanage.html
OMSA component security patch:
Http://www.dell.com/support/drivers/us/en/19/DriverDetails/Product/poweredge-r710? DriverId = 5JDN0 & osCode = WNET & fileId = 3082293694.
Http://www.dell.com/support/drivers/us/en/19/DriverDetails/Product/poweredge-r710? DriverId = PCXMR & osCode = WNET & fileId = 3082295344.
Http://www.dell.com/support/drivers/us/en/19/DriverDetails/Product/poweredge-r710? DriverId = JJMWP & osCode = WNET & fileId = 3082295338.
The plug-in address provided by DELL for Oracle Enterprise Manager 12c:
Http://www.dell.com/support/drivers/us/en/04/DriverDetails/Product/poweredge-r710? DriverId = XKRM6 & osCode = WS8R2 & fileId = 3356540401 & agecode = en & categoryId = SM
URLs of other management tools provided by DELL vendors:
Http://www.dell.com/support/drivers/us/en/04/ProductSelector/Select/FamilySelection? CategoryPath = all-products % 2Fesuprt_ser_stor_net % 2Fesuprt_poweredge & Family = PowerEdge & DisplayCrumbs = Product % 2520 Type % 40% 2 CServers % 252C % 2520 Storage % 252C % 2520% 2526% 2520 Networking % 40% 2 CPowerEdge & rquery = na
Unix domain socket explanation:
Http://en.wikipedia.org/wiki/Unix_domain_socket