[Beowulf] Re: recommendation on crash cart for a cluster room: full cluster KVM is not an option I suppose?

John Hearns hearnsj at googlemail.com
Fri Oct 9 01:54:44 EDT 2009


2009/10/8 Greg Lindahl <lindahl at pbm.com>:

> You haven't mentioned the other things you can use IPMI for.
>
> 1) Console logging. Your machine just crashed. No clue in
> /var/log/messages. "I wonder if it printed something on the console?"
> Answer: ipmi and conman (available in an rpm in Red Hat distros).
>
> 2) Monitoring. Temp, fan speeds, power supply state, events. Answers
> the "why is the little red light on the front of the case lit?"
> question.

At the risk of getting a reputation in these parts, both come as
standard on the SGI ICE cluster. Console logging for all nodes runs via
IPMI/conman on the rack leader, and the logs are then mounted across to
the admin node.
Temp, fan speed etc. are logged on the rack leaders and reported via ESP
monitoring. Ganglia is implemented too.
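
For anyone without that sort of integrated stack, here is a rough sketch of the monitoring side in Python. It assumes the pipe-separated layout that "ipmitool sensor" typically prints (name | value | unit | status | thresholds...). The sample readings, sensor names and the helper names parse_sensors and not_ok are made up for illustration; this is not what ICE or ESP actually runs.

```python
# Hypothetical sample in the pipe-separated style of `ipmitool sensor`.
# Sensor names and values are invented for illustration only.
SAMPLE = """\
CPU Temp         | 45.000     | degrees C  | ok    | na | na | na | 85.000 | 90.000 | na
FAN1             | 0.000      | RPM        | cr    | 300.000 | 500.000 | na | na | na | na
PS1 Status       | 0x1        | discrete   | 0x0100| na | na | na | na | na | na
"""


def parse_sensors(text):
    """Split each line on '|' and keep the first four columns."""
    readings = []
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        name, value, unit, status = fields[:4]
        readings.append(
            {"name": name, "value": value, "unit": unit, "status": status}
        )
    return readings


def not_ok(readings):
    """Names of threshold sensors whose status is neither 'ok' nor 'na'.

    Discrete sensors are skipped here because their status column is a
    bit mask rather than a threshold state (an assumption of this sketch).
    """
    return [
        r["name"]
        for r in readings
        if r["unit"] != "discrete" and r["status"] not in ("ok", "na")
    ]


if __name__ == "__main__":
    for name in not_ok(parse_sensors(SAMPLE)):
        print("check sensor:", name)
```

In practice you would feed it the real output captured per node (for example via subprocess) rather than SAMPLE, and hand the result to whatever alerting you already have.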