Environment monitoring
Leopold Palomo
lepalom at upc.es
Wed Oct 1 10:13:46 EDT 2003
On Wednesday 01 October 2003 14:37, Robert G. Brown wrote:
> On Wed, 1 Oct 2003, Leopold Palomo Avellaneda wrote:
> > On Tuesday 30 September 2003 22:23, Rocky McGaugh wrote:
> > > Don't overlook lm_sensors+cron
> >
> > Why?
>
> On a system equipped with an internal sensor, lm_sensors can often read
> e.g. core CPU temperature on the system itself. A polling cron script
> can then read this and take action, e.g. initiate a shutdown if it
> exceeds some threshold.
>
> There are good and bad things about this. A good thing is it addresses
> the real problem -- overheating in the system itself -- and not room
> temperature. CPUs can overheat because of a fan failure when the room
> remains cold, and a sensors-driven poweroff can then save your hardware
> on a node by node basis.
>
> The bad thing is that it does NOT give you any sort of measure of room
> temperature per se, although if you have the poweroff script send you
> mail first, getting deluged with N messages as the entire cluster shuts
> down would be a good clue that your room cooling failed:-). Also,
> lm_sensors has the API from hell. In fact, I would hardly call it an
> API. One has to pretty much craft a polling script on the basis of each
> supported sensor independently, which requires you to know WAY more than
> you ever wanted to about the particular sensor your system may or may
> not have.
>
> Alas, if only somebody would give the lm_sensors folks a copy of a good
> book on XML for Christmas, and they decided to take the monumental step
> of converting /proc/sensors into a single XML-based file with the
> RELEVANT information presented in toplevel tags like
>
> <cpu_temp id="0" units="C">50.4</cpu_temp>
>
> and the irrelevant information presented in tags like
>
> <hardware><name>lm78</name><version>1.22a</version></hardware>
>
> then we could ALL reap the fruits of their labor without needing a copy
> of the lm78 version 1.22a API manual and having to write an application
> that supports each of the sensors THROUGH THEIR INTERFACE one at a
> time...;-)
OK. I was a bit surprised by your remark. I know that lm_sensors is not
perfect, but it does its job. And OK, I do think that using lm_sensors to
try to work out the room temperature is a bit excessive.
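
For what it is worth, the polling check described in the quoted text can be quite small. The following is only a rough sketch, not a tested monitoring tool: the line shape it parses (something like "temp1: +50.4 C"), the names read_temps and too_hot, and the 70 C limit are all assumptions made for illustration, and the real plain-text output differs from chip to chip.

```python
import re

# Assumed line shape: "<label>: <number> C" or "<label>: <number>°C".
# Real sensor output varies by chip and driver, so adjust the pattern.
TEMP_LINE = re.compile(
    r"^[ \t]*([^:\n]+?):[ \t]*([+-]?\d+(?:\.\d+)?)[ \t]*\u00b0?C\b",
    re.MULTILINE,
)

def read_temps(text):
    """Return {label: degrees_C} for each line that looks like a temperature."""
    return {label: float(value) for label, value in TEMP_LINE.findall(text)}

def too_hot(temps, limit_c=70.0):
    """True if any reading is above the limit (70 C is an arbitrary example)."""
    return any(t > limit_c for t in temps.values())
```

A cron job could feed the text it gets from the sensors into read_temps, and send mail or start a shutdown when too_hot returns True. Which command to run at that point is left out on purpose, since it depends on the site.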
As for the XML... well, OK, it would be a nice feature, but plain text
is good enough too, as long as you know your hardware.
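
To be fair to the XML idea, reading it would need no knowledge of the chip at all. A minimal sketch, assuming the tags from the quoted example were wrapped in a single root element: the <sensors> root and the function name cpu_temps are hypothetical, and no such file is actually provided by lm_sensors.

```python
import xml.etree.ElementTree as ET

def cpu_temps(xml_text):
    """Return {cpu_id: temperature} from every <cpu_temp> element.

    Assumes the hypothetical layout from the quoted proposal, with an
    integer "id" attribute and the reading as the element text.
    """
    root = ET.fromstring(xml_text)
    return {int(el.get("id")): float(el.text) for el in root.iter("cpu_temp")}
```

The <hardware> block is simply ignored here, which is the point of the proposal: a generic script only looks at the tags it cares about.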
Best Regards.
P.S. How about the PDF, PS, etc.?
_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf