[Beowulf] MPICH error

William Gropp gropp at mcs.anl.gov
Wed Dec 10 13:38:46 EST 2003


At 12:12 PM 12/10/2003, you wrote:
>Hi Folks:
>
>   A customer is seeing
>
>         rm_1310:  p4_error: rm_start: net_conn_to_listener failed: 33220
>
>on an MPI job.  Used to work (just last week).  Updated the kernel was the 
>major
>change (added XFS support)
>
>   Any idea of what this is?  I assume a network change.  MPICH 1.2.4.  Do 
> I need
>to recompile MPICH to match the kernel?

No, you shouldn't need to recompile MPICH.  The most likely cause is a 
change in how TCP connections are handled.  See 
http://www-unix.mcs.anl.gov/mpi/mpich/docs/faq.htm#linux-redhat for some 
suggestions.

Bill 

_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf



More information about the Beowulf mailing list