[Beowulf] PBS : deleting jobs that were running on a crashed node
Shriram R
shriram1976 at yahoo.com
Sun Jan 25 16:40:51 EST 2004
Hi,
We have a 24 node/48 procs linux cluster running
Redhat. The queueing system that we use is PBS. One
of the nodes, node15, conked out completely and is not
restarting.
"pbsnodes -a" shows the state of the node15 as "down".
However, jobs which had been running on node15 still
show up when I do a "qstat".
I tried to use "qdel" and "qsig" to delete these jobs,
but the server complains that it is unable to contact
pbs_mom, which is obvious since node15 is down.
Can someone tell me how do I delete these jobs from
the output of "qstat" ?
TIA.
-shriram
__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
More information about the Beowulf
mailing list