Several commonly used Linux operating system monitoring scripts:, China IT lab alias Keyword: Linux
This article describes several common Linux monitoring scripts that can automatically monitor and alert the traffic, system status, disk space, CPU and memory usage of the host Nic. The shell scripts written based on your needs can better meet your needs and refine the comprehensiveness of host monitoring.
Recently, a friend from the Internet asked me questions about server monitoring. I asked if common server monitoring software, such as cacti and Nagios, can I write shell scripts by myself? The shell scripts written based on your needs can better meet your needs and refine the comprehensiveness of host monitoring.
The following are some of my commonly used host monitoring scripts. You can modify them based on your situation and hope to help you.
1. View host Nic traffic
#! /Bin/bash # Network # Mike. xu while:; do time = 'date + % m "-" % d "" % K ": "% m'day = 'date + % m"-"% d' rx_before = 'ifconfig eth0 | sed-n" 8 "p | awk '{print \ $2 }' | cut-C7-'tx_before = 'ifconfig eth0 | sed-n "8" p | awk '{print \ $6}' | cut-C7-'sleep 2 rx_after =' ifconfig eth0 | SED
-N "8" p | awk '{print \ $2}' | cut-C7-'tx_after = 'ifconfig eth0 | sed-n "8" p | awk '{print \ $6} '| cut-C7-'rx_result = \ $ [(rx_after-rx_before) /256] tx_result = \ $ [(tx_after-tx_before)/256] echo "\ $ time now_in_speed:" \ $ rx_result "kbps now_out_speed:" \ $ tx_result "kbps"
Sleep 2 done
2. system status monitoring
#! /Bin/sh # systemstat. sh # Mike. xu IP = 192.168.1.227 top-N 2 | grep "CPU". /temp/cpu.txt free-M | grep "mem". /temp/mem.txt DF-k | grep "sda1". /temp/drive_sda1.txt # DF-k | grep sda2. /temp/drive_sda2.txt DF-k | grep "/mnt/storage_0". /temp/mnt_storage_0.txt
DF-k | grep "/mnt/storage_pic". /temp/mnt_storage_pic.txt time = 'date + % m ". "% d" "% K": "% m' connect = 'netstat-Na | grep" 219.238.148.30: 80 "| WC-l 'echo" \ $ time \ $ connect ". /temp/connect_count.txt
3. Monitor the disk space of the host. If the disk space exceeds 90%, send a mail to send a warning.
#! /Bin/bash # monitor available disk space = 'df | sed-n'/\ $/P' | gawk '{print \ $5}' | SED's // % // 'If [\ $ space-GE 90] Then fty89@163.com fi
4. Monitor CPU and memory usage
#! /Bin/bash # script to capture system statistics OUTFILE =/home/Xu/capstats.csv
Date = 'date + % m/% d/% y'
Time = 'date + % K: % m: % s'
Timeout = 'uptime'
Vmout = 'vmstat 1 2'
Users = 'echo \ $ timeout | gawk '{print \ $4 }''
Load = 'echo \ $ timeout | gawk '{print \ $9}' | sed "s /,//''
Free = 'echo \ $ vmout | sed-n'/[0-9]/P' | sed-N '2p' | gawk '{print \ $4 }''
Idle = 'echo \ $ vmout | sed-n'/[0-9]/P' | sed-N '2p' | gawk '{print \ $15 }''
Echo "\ $ date, \ $ time, \ $ users, \ $ load, \ $ free, \ $ idle" \ $ OUTFILE
5. Comprehensive host monitoring
#! /Bin/bash # check_xu.sh #0 *****/home/check_xu.sh dat = "'date + % Y % m % d'" hour = "'date + % H '" dir = "/home/oslog/host _ \$ {dat}/\$ {hour}" delay = 60 COUNT = 60 # Whether the responsible directory exist if! Test-d \ $ {dir} Then/bin/mkdir-p \ $ {dir} fi # General
Check export term = Linux/usr/bin/top-B-d \$ {delay}-N \n {count }>\$ {dir}/top _ \\ {dat }. log 2> & 1 & # CPU check/usr/bin/SAR-u \$ {delay }\$ {count }>\\ {dir}/CPU _ \\ {dat }. log 2> & 1 & #/usr/bin/mpstat-P 0 \0 {delay }\$ {count }>\\ {dir}/cpu_0 _ \$ {dat}. log 2> & 1 &
#/Usr/bin/mpstat-P 1 \1 {delay }\$ {count }>\$ {dir}/cpu_1 _ \$ {dat }. log 2> & 1 & # memory check/usr/bin/vmstat \$ {delay }\$ {count }>\\ {dir}/vmstat _ \\ {dat }. log 2> & 1 & # I/O check/usr/bin/iostat \$ {delay }\$ {count }>\\ {dir}/iostat _ \$ {dat }. log 2> & 1 & # Network
Check/usr/bin/SAR-N Dev \$ {delay }\$ {count }>\\ {dir}/NET _ \$ {dat }. log 2> & 1 & #/usr/bin/SAR-n edev \ $ {delay }\$ {count }>\$ {dir}/net_edev _ \$ {dat}. log 2> & 1 &
In crontab, automatic execution is performed hourly:
0 *****/home/check_xu.sh
The CPU, memory, network, and IO statistics of each hour are generated in the/home/oslog/host_yyyymmdd/HH directory.
If a problem occurs in a certain period of time, you can view the corresponding log information to see how the host performance was at that time.
[Edit responsibility: Ultimate] [I want to pick up the wrong one]