Toolserver:Admin:Nagios

This page was moved from the Toolserver wiki.
Toolserver has been replaced by Toolforge. As such, the instructions here may no longer work, but may still be of historical interest.
Please help by updating examples, links, template links, etc. If a page is still relevant, move it to a normal title and leave a redirect.

Nagios is used to monitor services for problems. In case something breaks, Nagios will notify administrators. It also logs to the IRC channel (#wikimedia-toolserver).

Nagios runs on the HA cluster under the 'nagios' resource group. It has a strong positive resource affinity with the 'www' group, which means it tries to start on the same node as the HA web server. This is necessary for the web interface to work.

The configuration is in /global/misc/nagios/etc. To restart it:

(test the new configuration first)
# /opt/ts/nagios/bin/nagios -v /global/misc/nagios/etc/nagios.cfg
# clrs disable nagios
# clrs enable nagios

Category:Admin:Software