myrinet fault detection

Patrick Geoffray patrick at myri.com
Tue May 27 15:15:39 EDT 2003


Hi Christine,

On Mon, 2003-05-26 at 12:21, Tintin J Marapao wrote:
> -Myrinet has a built in fault /error detection

The hardware provides: 
1) CRC8 and CRC32 to detect bit corruption on the link.
2) SRAM parity to detect memory corruption on the NIC.
3) the PCIDMA chipset checks for PCI parity on DMA Reads (when the NIC
is the PCI target).

> -If there is a built in fault recovery system

2) and 3) are fatal errors, it should not happen in your lifetime unless
faulty hardware. 1) and other cases are recoverable if the firmware you
are using is reliable. GM is reliable, does segmentation/reassembly in
the NIC and ACKs each fragment. It retransmits the data if a packet is
lost or corrupted.

You can find more information on BER here:
http://www.myri.com/cgi-bin/fom?file=245

> -Power management system

What do you mean ?

Patrick
-- 

Patrick Geoffray, PhD
Myricom, Inc.
http://www.myri.com

_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf



More information about the Beowulf mailing list