[Beowulf] SMP Nodes [still] Freezing w/ Scyld (follow-up, long)

Timothy R. Whitcomb twhitcomb at apl.washington.edu
Wed Jan 7 16:11:23 EST 2004


No, I'm not using Myrinet.  It's a Scyld 28cz4 with 100BaseT
interconnects and dual-processor Tyan AMD boards.

TRW
On Wed, 7 Jan 2004, Jag wrote:

> On Wed, 2004-01-07 at 15:32, Timothy R. Whitcomb wrote:
> > First of all, thanks to everyone who responded to my first message.  I
> > received many helpful suggestions to test, which I have documented
> > here.
> >
> > As a quick review of my (still unsolved) situation, we are running a
> > weather model on a 5-node, 10-processor Scyld Beowulf cluster.  As
> > long as we use a single processor per machine, performance is fine -
> > but when we use both processors on a single node then the node freezes
> > after a variable length of time (sometimes minutes, sometimes hours).
>
> Unfortunately I missed your first post.  Are you using Myrinet?  I seem
> to recall some past problems with two MPI jobs on one host running
> Myrinet on Scyld systems.
>