[Beowulf] PBS : deleting jobs that were running on a crashed node
shriram1976 at yahoo.com
Mon Jan 26 01:03:30 EST 2004
This is with reference to the reply below. I deleted the
files from server_priv/jobs. But the node on which the jobs
were running has completely crashed - it doesn't even
reboot. So, I can't delete the files from mom_priv/jobs
on the corresponding node.
Hence the jobs are still showing up in "qstat". Any
suggestions?
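
For reference, the server-side cleanup mentioned above can be
sketched roughly as follows. This is only an illustration, not
an official PBS procedure. It assumes that the job files in
server_priv/jobs are named "<jobid>.<suffix>" (for example
.JB and .SC), that pbs_server is stopped while the files are
removed, and that the caller passes the right spool path. The
function names and the dry_run flag are made up for this sketch.

```python
from pathlib import Path


def job_files(jobs_dir, job_id):
    """List files in jobs_dir whose names start with '<job_id>.'.

    Assumption: PBS names job files '<jobid>.<suffix>'. The trailing
    dot in the prefix keeps job 101 from matching job 1010.
    """
    prefix = job_id + "."
    return sorted(
        p for p in Path(jobs_dir).iterdir()
        if p.is_file() and p.name.startswith(prefix)
    )


def remove_job_files(jobs_dir, job_id, dry_run=True):
    """Remove the files of one job and return their names.

    With dry_run=True (the default) nothing is deleted, so the
    list can be checked first.
    """
    names = []
    for path in job_files(jobs_dir, job_id):
        if not dry_run:
            path.unlink()
        names.append(path.name)
    return names
```

Calling remove_job_files(jobs_dir, job_id) with the default
dry_run=True only lists what would be removed. The path to
server_priv/jobs depends on the installation and is not
guessed here.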
--- "Brent M. Clements" <bclem at rice.edu> wrote:
> You have to remove the job files from both the
> mom_priv/jobs directory on
> the moms that the job was running on as well as the
> directory on the pbs server.
> Brent Clements
> Linux Technology Specialist
> Information Technology
> Rice University
> On Sun, 25 Jan 2004, Shriram R wrote:
> > Hi,
> > We have a 24 node/48 procs linux cluster running
> > Redhat. The queueing system that we use is PBS. One
> > of the nodes, node15, conked out completely and is
> > not restarting.
> > "pbsnodes -a" shows the state of the node15 as
> > "down". However, jobs which had been running on node15
> > still show up when I do a "qstat".
> > I tried to use "qdel" and "qsig" to delete these
> > jobs but the server complains that it is unable to
> > contact pbs_mom, which is obvious since node15 is down.
> > Can someone tell me how do I delete these jobs
> > from the output of "qstat" ?
> > TIA.
> > -shriram
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf