[Beowulf] InfiniBand VL15 error

Greg Lindahl lindahl at pbm.com
Tue Dec 2 16:18:29 EST 2008


On Tue, Dec 02, 2008 at 10:24:15AM -0500, Prentice Bisbal wrote:

> #warn: counter VL15Dropped = 476        (threshold 100) lid 1 port 1
> Error check on lid 1 (aurora HCA-1) port 1:  FAILED

IB is blissfully fading from my brain, but I think this refers to
control packets (VL15 is the lane that carries subnet management
traffic) being dropped due to resource limits on the recipient.
That takes talent if you're using a Mellanox HCA, as pretty much all
of the VL15 packets are interpreted by the processor in the HCA.
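
If you want to keep an eye on whether the counter keeps climbing, a
small script can pull the numbers out of the warning line. This is only
a sketch: the line format is assumed from the single warning quoted
above, and the names WARN_RE, CounterWarning and parse_warning are made
up for illustration, not part of any IB tool.

```python
import re
from typing import NamedTuple, Optional

# Pattern inferred from the one quoted warning line; other tool versions
# may format the line differently (assumption).
WARN_RE = re.compile(
    r"#warn: counter (?P<counter>\w+) = (?P<value>\d+)\s+"
    r"\(threshold (?P<threshold>\d+)\) lid (?P<lid>\d+) port (?P<port>\d+)"
)


class CounterWarning(NamedTuple):
    counter: str
    value: int
    threshold: int
    lid: int
    port: int


def parse_warning(line: str) -> Optional[CounterWarning]:
    """Return the parsed fields of a '#warn: counter ...' line, or None."""
    m = WARN_RE.search(line)
    if m is None:
        return None
    return CounterWarning(
        counter=m["counter"],
        value=int(m["value"]),
        threshold=int(m["threshold"]),
        lid=int(m["lid"]),
        port=int(m["port"]),
    )


if __name__ == "__main__":
    sample = "#warn: counter VL15Dropped = 476        (threshold 100) lid 1 port 1"
    w = parse_warning(sample)
    if w is not None and w.value > w.threshold:
        print(f"{w.counter} on lid {w.lid} port {w.port}: "
              f"{w.value} exceeds threshold {w.threshold}")
```

Running the check twice, some minutes apart, and comparing the parsed
values would show whether the drops are still happening or are left
over from an earlier event.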

-- greg


