[Beowulf] MPICH error
William Gropp
gropp at mcs.anl.gov
Wed Dec 10 13:38:46 EST 2003
At 12:12 PM 12/10/2003, you wrote:
>Hi Folks:
>
> A customer is seeing
>
> rm_1310: p4_error: rm_start: net_conn_to_listener failed: 33220
>
>on an MPI job. Used to work (just last week). Updated the kernel was the
>major
>change (added XFS support)
>
> Any idea of what this is? I assume a network change. MPICH 1.2.4. Do
> I need
>to recompile MPICH to match the kernel?
No, you shouldn't need to recompile MPICH. The most likely cause is a
change in how TCP connections are handled. See
http://www-unix.mcs.anl.gov/mpi/mpich/docs/faq.htm#linux-redhat for some
suggestions.
Bill
_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
More information about the Beowulf
mailing list