OpenPBS under RH7?

Galen Arnold arnoldg at
Thu Apr 26 11:08:50 EDT 2001


Define "pretty good sized".  I may have a patch for you to try if it's the
same problem I ran into.  See the NCSA Scaling Patch and README at:

We're testing another patch to PBS that also helps with scaling--it may be
added to that site soon.
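
In the meantime, the manual cleanup described in the quoted message below
(clearing a job's files out of /var/spool/PBS/mom_priv/jobs/ before
restarting pbs_mom) could be scripted roughly as follows. This is only a
sketch under stated assumptions: the helper name clear_stale_job_files and
its dry_run flag are made up for illustration, the directory is the one
given in the quoted message, and restarting the daemon is left out because
how that is done depends on the site.

```python
import glob
import os
import shutil

# Path taken from the quoted message; adjust if PBS is installed elsewhere.
MOM_JOBS_DIR = "/var/spool/PBS/mom_priv/jobs"


def clear_stale_job_files(jobnum, jobs_dir=MOM_JOBS_DIR, dry_run=True):
    """Find entries in jobs_dir whose names start with jobnum.

    Hypothetical helper, not part of PBS. Returns the sorted list of
    matching paths and deletes them only when dry_run is False.
    """
    pattern = os.path.join(glob.escape(jobs_dir),
                           glob.escape(str(jobnum)) + "*")
    matches = sorted(glob.glob(pattern))
    if not dry_run:
        for path in matches:
            if os.path.isdir(path) and not os.path.islink(path):
                shutil.rmtree(path)   # job entry is a directory
            else:
                os.remove(path)       # job entry is a plain file or symlink
    return matches


# Restarting pbs_mom after the cleanup is site-specific and not shown here.
```

Calling it with the default dry_run=True only lists what would be removed,
which is worth checking first: a prefix such as 12 also matches job 123.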


On Thu, 26 Apr 2001, Roger L. Smith wrote:

> Hello Folks,
> We've got a pretty good sized Linux cluster that we are finally about to
> release to our users.  During all of our testing and benchmarking, we've
> been using RedHat 7.0 and the 2.4.2 kernel.  We had worked out nearly all
> of the problems until we installed OpenPBS (v2.3.12).
> Our test users have been experiencing a problem where the pbs_mom daemon
> on the first node of a given job will die.  It typically seems to die
> either when the job finishes, or when the user tries to qdel the job.
> It's not consistent, however.  One of our users estimates that it fails 2
> out of 5 times.  The corrective action for this includes deleting
> everything out of /var/spool/PBS/mom_priv/jobs/<jobnum*> and restarting
> the daemon on the node.
> This is becoming a very serious issue for us, and may require that I
> downgrade the entire cluster to RedHat 6.2 and/or a 2.2 kernel.  I'm not
> anxious to do this for several reasons.
> Is anyone on this list running a similar configuration, or does anyone
> have any experience with this problem?
>  _\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_\|/_
> | Roger L. Smith            Phone:662-325-3625      roger at ERC.MsState.Edu |
> | Systems Administrator     FAX:  662-325-7692 WWW.ERC.MsState.Edu/~roger |
> |-------------------------------------------------------------------------|
> |         Mississippi State University/National Science Foundation        |
> |______Engineering Research Center for Computational Field Simulation_____|
> _______________________________________________
> Beowulf mailing list, Beowulf at
> To change your subscription (digest mode or unsubscribe) visit

Galen Arnold, system engineer--systems group       arnoldg at
National Center for Supercomputing Applications           (217) 244-3473
152 Computer Applications Bldg., 605 E. Spfld. Ave., Champaign, IL 61820

