GenericNQS batch system
Lars Henriksen
lars at meshtechnologies.com
Tue Nov 18 04:41:56 EST 2003
Dear beowulfers
I'm having some problems with the Generic NQS batch system.
Creating and using queues on a single host works fine,, but when i try
to submit jobs to queues on remote hosts, it does not work. Does anyone
have experience with that kind of operation?
Here is what i've done:
On the scheduling host (host1):
# qmgr create pipe sched-queue destination = exe-in at host2
# qmgr set lb_out sched-queue
# qmgr enable queue sched-queue
On the host that has to do the job execution (host2):
# qmgr create batch exe-queue pipeonly
# qmgr create pipe exe-in pipeonly destination exe-queue
# qmgr set lb_in exe-in
# qmgr enable queue exe-queue
# qmgr enable queue run-in
# qmgr set scheduler host1
In 'nmapmgr' on both host, entries has been added both for principal
names and aliases.
/etc/hosts.nqs looks like this on both hosts:
* *
So when i try to submit at job to the system on host1:
(top of job description file:)
-------
#QSUB-q sched-queue
#QSUB-eo
#QSUB-r test
<snipped shell code to be executed>
-------
nothing happens :-(
edited syslog from the host where submission is made:
host1 NQS daemon[7467]: psc_spawn: Rqst not scheduled due to none there.
host1 NQS daemon[7467]: psc_spawn: Rqst not scheduled due to none there.
host1 NQS Pipeclient[5899]: Process logging started at Tue Nov 18
10:24:36 2003
host1 NQS Netdaemon[5900]: Netdaemon: Connection from host1
host1 NQS Pipeclient[5899]: Unable to deliver request 31 to a
destination
host1 NQS Pipeclient[5899]: Msg #2:Scheduling request for retry at a
later time
host1 NQS Pipeclient[5899]: Msg #2:Request rescheduled; exiting
A 'qstat -x' shows this:
<SNIP>
Destset = {exe-in at host2 [RETRY]
<Unreachable since: Wed Nov 12 12:35:10 CET 2003>
<Next retry at: Tue Nov 18 10:30:07 CET 2003>
};
</SNIP>
I'm kinda baffled by this...
Well thanks for your patience in reading this. I hope some of you can
give me some pointers...
best regards
Lars
--
Lars Henriksen | MESH-Technologies A/S
Systems Manager & Consultant | Forskerparken 10
www.meshtechnologies.com | DK-5230 Odense M, Denmark
lars at meshtechnologies.com | mobile: +45 2291 2904
direct: +45 6315 7310 | fax: +45 6315 7314
_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
More information about the Beowulf
mailing list