Rahmouni Abdelhak
Rahmouni Abdelhak

Reputation: 55

Jobs are stuck in queue forever ( Condor - Parallel Universe )

i've lately trying to configure HTcondor. I managed to submit serial (Vanilla Universe) jobs with no errors or problems. But when i try to submit parallel ( Parallel Universe ) jobs, it seems that the jobs get stucked in queue forever.

***Submit file Code :

universe = parallel
executable = SommeP
output = outputP.$(NODE)
error = errorP.$(NODE)
log = logP
machine_count = 4
should_transfer_files = yes
when_to_transfer_output = on_exit
queue

***Condor_q

enter image description here

***Condor_status ( 1 Master 2 Slaves 6 Slots in total ) enter image description here

Upvotes: 0

Views: 626

Answers (1)

jpatton
jpatton

Reputation: 403

The admin of your condor pool will have to configure execute nodes to see your submit node as the Dedicated Scheduler before Parallel Universe jobs can be scheduled. See the Parallel Applications section in the manual for more information: http://research.cs.wisc.edu/htcondor/manual/current/2_9Parallel_Applications.html#SECTION00392000000000000000

Upvotes: 1

Related Questions