[Beowulf] using Nagios to monitor compute nodes: NPRE vs check_by_ssh

Rahul Nabar rpnabar at gmail.com
Mon Dec 22 20:28:51 EST 2008

I just installed Nagios to try and monitor my 256 compute nodes
centrally. It seems to work like a charm for all the public services
(ping, ssh etc.) but now I was getting more ambitious and wanted to
try to monitor the private services too (disk usage; process loads;
torque ; pbs etc.).

I was just confused whether (1) to use the NPRE plugin (seems like a
pain to deploy onto all 256 nodes) or (2) go via the check_by_ssh
route. (I already have paswordless logins from master-nodes to

I'd like (2) because it is more secure and seems easier to deploy but
I'm a bit afraid if this will overtax my central server.

Any suggestions? Are other users using Nagios here?

Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

More information about the Beowulf mailing list