Parallel job error on Neeshub

Posted: Fri Mar 16, 2012 7:59 am
by ozgura
Hello,

I submitted a parallel job using the OpenSees NEES resources (8 processors). The analyses started and ran well for about 15 minutes, then my job stopped. In the *.stderr file, I found this:

mpirun noticed that process rank 1 with PID 663 on node NEEShub exited on signal 24 (CPU time limit exceeded).

I believe this is caused by the wall-time limit on my job request. I can't control it, though, since we don't submit a qsub file.
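
For reference, the signal named in the message ("CPU time limit exceeded") is SIGXCPU, which on Linux is normally delivered when a process exceeds its CPU-time soft limit (RLIMIT_CPU) rather than a wall-clock limit. Below is a minimal Python sketch for inspecting that limit, assuming a Unix shell on the host is available; the helper name describe_cpu_limit is hypothetical, and whether NEEShub sets this limit through the scheduler or elsewhere is not known from this thread.

```python
import resource
import signal


def describe_cpu_limit():
    """Return the CPU-time limits of the current process (Unix only)."""
    # RLIMIT_CPU is expressed in seconds of CPU time, not wall-clock time.
    soft, hard = resource.getrlimit(resource.RLIMIT_CPU)

    def fmt(value):
        # RLIM_INFINITY means no limit is set at this level.
        return "unlimited" if value == resource.RLIM_INFINITY else f"{value} s"

    return {
        "signal": signal.SIGXCPU.name,  # signal sent when the soft limit is hit
        "soft": fmt(soft),
        "hard": fmt(hard),
    }


if __name__ == "__main__":
    print(describe_cpu_limit())
```

If the soft limit printed here is finite, a process that accumulates more CPU seconds than that value would be expected to stop with this signal.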

Please let me know about this problem.
Thanks,

Re: Parallel job error on Neeshub

Posted: Tue Mar 20, 2012 4:42 pm
by fmk
The default is set to 4 hours. Can you check whether it works with the hansen resource option?

Re: Parallel job error on Neeshub

Posted: Thu Jun 14, 2012 11:16 am
by bmobashe
Hi Frank,
I have the same problem. I switched to the hansen resource option, but my status is always "Submitted" and the job never starts running. Do you have any suggestions?
I am using OpenSeesSP, so I just changed the system command to system Mumps.

Thank you so much for your help,
Bahareh

Re: Parallel job error on Neeshub

Posted: Mon Jun 25, 2012 10:42 pm
by fuyunb