-
Notifications
You must be signed in to change notification settings - Fork 11
DAPNET Connectivity Monitoring
In order to check the connectivty between the DAPNET Cores and to track down the temporary cluster breakdowns, it is highly encouraged to install the Nagios NRPE daemon on each node and make it accessable from RWTH Aachen Ham Radio Group central Nagios monitoring instance.
In order to do so, please conduct the following steps:
- Install nagios nrpe-server with
sudo apt-get install nagios-nrpe-server nagios-plugins
- Go to /tmp and download special plugin
cd /tmp
wget http://dev.techno.holics.at/check_traceroute_pingplot/releases/check_traceroute_pingplot-0.1.2.tgz
- Unpack the tgz
tar vfzx check_traceroute_pingplot-0.1.2.tgz
- Copy new plugin to the default location
sudo cp /tmp/check_traceroute_pingplot-0.1.2/check_traceroute_pingplot.pl /usr/lib/nagios/plugins/check_traceroute_pingplot.pl
- Delete the temp files:
rm -r /tmp/check_traceroute_pingplot*
- Set owner
sudo chown root:root /usr/lib/nagios/plugins/check_traceroute_pingplot.pl
- Set permissions
sudo chmod 755 /usr/lib/nagios/plugins/check_traceroute_pingplot.pl
- Install needed perl packages
sudo apt-get install libfile-which-perl
- Test that the plugin is executable by
/usr/lib/nagios/plugins/check_traceroute_pingplot.pl
. You should see the help messages. Otherwise there is something wrong. Maybe you need to install some perl components. - Test that the plugin is working fine by
/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H dapnet.db0sda.ampr.org -w 300 -c 1000
. You should see something like
OK - Traceroute to IP dapnet.db0sda.ampr.org has 20 hops, ping min-avg-max 172.56-174.90-182.45 ms
- Edit the main nrpe-daemon config file
sudo nano /etc/nagios/
- Add in this file the IP of the RWTH Aachen Ham Radio Group Nagios instance to access the plugin's output be changing
allowed_hosts=127.0.0.1
toallowed_hosts=127.0.0.1,44.225.164.4
- Add the commands to be remotely executed in a new file
sudo nano /etc/nagios/nrpe.d/dapnet.cfg
File content:
command[check_traceroute_pingplot_db0fa]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.166.35 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_db0ffl]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.42.182 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_db0rta]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.111.5 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_db0sda]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.164.27 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_db0wa]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.164.162 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_dl5ml]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.166.41 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_dm0max]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.115.4 -p -v -a -w 800 -c 1500
command[check_traceroute_pingplot_on3dhc]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.166.42 -p -v -a -w 800 -c 1500
- If you like, delete the line that contains you own node
- Restart your nrpe server with
sudo systemctl restart nagios-nrpe-server
That's it. Now contact rwth-afu @ online.de
so that they can setup your nagios nrpe node in their config.
You can have a look at the performance of your node at http://monitoring.afu.rwth-aachen.de/nagios3/ with user anonymous
.