Skip to content

DAPNET Connectivity Monitoring

Ralf Wilke DH3WR edited this page Aug 1, 2017 · 3 revisions

Motivation

In order to check the connectivty between the DAPNET Cores and to track down the temporary cluster breakdowns, it is highly encouraged to install the Nagios NRPE daemon on each node and make it accessable from RWTH Aachen Ham Radio Group central Nagios monitoring instance.

Install

In order to do so, please conduct the following steps:

  • Install nagios nrpe-server with sudo apt-get install nagios-nrpe-server nagios-plugins
  • Go to /tmp and download special plugin cd /tmp wget http://dev.techno.holics.at/check_traceroute_pingplot/releases/check_traceroute_pingplot-0.1.2.tgz
  • Unpack the tgz tar vfzx check_traceroute_pingplot-0.1.2.tgz
  • Copy new plugin to the default location sudo cp /tmp/check_traceroute_pingplot-0.1.2/check_traceroute_pingplot.pl /usr/lib/nagios/plugins/check_traceroute_pingplot.pl
  • Delete the temp files: rm -r /tmp/check_traceroute_pingplot*
  • Set owner sudo chown root:root /usr/lib/nagios/plugins/check_traceroute_pingplot.pl
  • Set permissions sudo chmod 755 /usr/lib/nagios/plugins/check_traceroute_pingplot.pl
  • Install needed perl packages sudo apt-get install libfile-which-perl
  • Test that the plugin is executable by /usr/lib/nagios/plugins/check_traceroute_pingplot.pl. You should see the help messages. Otherwise there is something wrong. Maybe you need to install some perl components.
  • Test that the plugin is working fine by /usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H dapnet.db0sda.ampr.org -w 300 -c 1000 . You should see something like

OK - Traceroute to IP dapnet.db0sda.ampr.org has 20 hops, ping min-avg-max 172.56-174.90-182.45 ms

  • Edit the main nrpe-daemon config file sudo nano /etc/nagios/
  • Add in this file the IP of the RWTH Aachen Ham Radio Group Nagios instance to access the plugin's output be changing allowed_hosts=127.0.0.1 to allowed_hosts=127.0.0.1,44.225.164.4
  • Add the commands to be remotely executed in a new file sudo nano /etc/nagios/nrpe.d/dapnet.cfg

File content:

command[check_traceroute_pingplot_db0fa]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.166.35 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_db0ffl]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.42.182 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_db0rta]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.111.5 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_db0sda]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.164.27 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_db0wa]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.164.162 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_dl5ml]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.166.41 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_dm0max]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.115.4 -p -v -a -w 800 -c 1500

command[check_traceroute_pingplot_on3dhc]=/usr/lib/nagios/plugins/check_traceroute_pingplot.pl -H 44.225.166.42 -p -v -a -w 800 -c 1500

  • If you like, delete the line that contains you own node
  • Restart your nrpe server with sudo systemctl restart nagios-nrpe-server

Setup

That's it. Now contact rwth-afu @ online.de so that they can setup your nagios nrpe node in their config.

You can have a look at the performance of your node at http://monitoring.afu.rwth-aachen.de/nagios3/ with user anonymous.