[Beowulf] how can I know that a hard disk died?

Joel Jaeggli joelja at darkwing.uoregon.edu
Fri Aug 12 17:21:40 EDT 2005


On Fri, 12 Aug 2005, Velu Erwan wrote:

> On Thursday, 11 August 2005 at 13:46 -0400, Dimitri Antoniou wrote:
>>  Hi,
>>
>>  We have a 16-node HP LC1000 cluster, with 3 hard disks
>>  managed by hardware RAID.
> Which hardware RAID are you using? Is it the NetRAID?
>
>>  When the disk died, the system didn't notify us,
>>  and we haven't found any message in log files,
>>  at least not anything obvious.
> On many hardware RAID controllers you can't :(
> On some you can, using proprietary tools.
> On a very few, like the 3ware ones, you can use smartmontools to reach each
> disk of your RAID and read its SMART attributes.

The 3ware driver happily logs events to syslog, so you can scrape your logs
for failed disks even if you aren't running 3dmd or tw_cli.
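
As a rough sketch of that kind of log scraping, something like the following
could be run from cron. The driver tags (3w-xxxx, 3w-9xxx) are the kernel
driver names; the keyword list and the helper names (find_raid_alerts,
scan_file) are my own assumptions for illustration, so check them against
what your controller actually writes to syslog before relying on it.

```python
import re

# The 3ware kernel drivers are named 3w-xxxx and 3w-9xxx, so their messages
# carry that tag. The keyword list below is an assumption, not a verified
# catalogue of controller messages.
DRIVER_TAG = re.compile(r"\b3w-(?:xxxx|9xxx)\b")
TROUBLE = re.compile(r"error|degraded|fail|timeout", re.IGNORECASE)


def find_raid_alerts(lines):
    """Return log lines tagged by a 3ware driver that contain a trouble keyword."""
    return [
        line.rstrip("\n")
        for line in lines
        if DRIVER_TAG.search(line) and TROUBLE.search(line)
    ]


def scan_file(path):
    """Scan one syslog file (hypothetical helper; the path is site-specific)."""
    with open(path, errors="replace") as log:
        return find_raid_alerts(log)
```

Calling scan_file("/var/log/messages") (or wherever your syslog lands) and
mailing any non-empty result to the admin is enough to get the notification
the original poster was missing.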

I hold that up as the shining example of how all RAID controllers should
work (at least internal ones).
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
>

-- 
--------------------------------------------------------------------------
Joel Jaeggli  	       Unix Consulting 	       joelja at darkwing.uoregon.edu
GPG Key Fingerprint:     5C6E 0104 BAF0 40B0 5BD3 C38B F000 35AB B67F 56B2