myrinet fault detection
patrick at myri.com
Tue May 27 15:15:39 EDT 2003
On Mon, 2003-05-26 at 12:21, Tintin J Marapao wrote:
> -Myrinet has a built in fault /error detection
The hardware provides:
1) CRC8 and CRC32 to detect bit corruption on the link.
2) SRAM parity to detect memory corruption on the NIC.
3) the PCIDMA chipset checks for PCI parity on DMA Reads (when the NIC
is the PCI target).
> -If there is a built in fault recovery system
2) and 3) are fatal errors, it should not happen in your lifetime unless
faulty hardware. 1) and other cases are recoverable if the firmware you
are using is reliable. GM is reliable, does segmentation/reassembly in
the NIC and ACKs each fragment. It retransmits the data if a packet is
lost or corrupted.
You can find more information on BER here:
> -Power management system
What do you mean ?
Patrick Geoffray, PhD
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
More information about the Beowulf