Help:HPCC Torque Queues: Difference between revisions

From CSDMS

Revision as of 16:41, 29 July 2009

Job Queues on the CSDMS HPCC

We have defined a set of queues to which you can submit your jobs on beach. Although you can submit a job to a queue directly, your job will generally be routed to an appropriate queue based on the resources you request. It is therefore important that you specify the resources you will need, in particular the amount of memory your job will use and the time you estimate it will take to run.

Please note that some of these queues have maximum runtimes associated with them. This means that your job will be killed if it runs over the queue's time limit.
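
As a minimal sketch of how a job might state its resource needs, the script below writes a small Torque job file whose `#PBS -l` directives request memory and wall time. The job name, the resource values, the file location, and the `./my_model` executable are illustrative placeholders, not settings specific to beach.

```shell
#!/bin/bash
# Sketch only: every value and name below is an illustrative assumption.
job_script="${TMPDIR:-/tmp}/example_job.sh"

# Lines beginning with "#PBS" are read by qsub as submission options.
cat > "$job_script" <<'EOF'
#!/bin/bash
#PBS -N example_job
#PBS -l nodes=1:ppn=1
#PBS -l mem=2gb
#PBS -l walltime=24:00:00

# Move to the directory the job was submitted from before running.
cd "$PBS_O_WORKDIR"
./my_model   # hypothetical executable
EOF

echo "Wrote $job_script; submit it with: qsub $job_script"
```

Requesting a realistic wall time matters here because a job that outlives its queue's limit is killed, as noted above.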

Cheat Sheet

Job queues that you can submit to directly:

Queue    Max Run Time  Max Memory  Max Nodes  Description
debug    2 hours       16 GB       54         Short jobs that might be used for testing
vip      1 day         16 GB       54         Important jobs that need to get done now
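
Direct submission uses the `-q` option of `qsub`. The helper below is a hypothetical sketch that only builds the command line and does not run it. It treats `debug` and `vip` as the only direct targets, which is a reading of the table above rather than a statement of how the scheduler is configured. The script name `my_job.sh` is a placeholder.

```shell
#!/bin/bash
# Sketch only: print (but do not run) a qsub command for a direct queue.
# Usage: direct_submit_cmd QUEUE JOB_SCRIPT
direct_submit_cmd() {
    local queue="$1" script="$2"
    case "$queue" in
        debug|vip)
            echo "qsub -q $queue $script"
            ;;
        *)
            echo "error: '$queue' is not listed as a direct queue" >&2
            return 1
            ;;
    esac
}

# my_job.sh is a hypothetical job script name.
direct_submit_cmd debug my_job.sh
```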

Queues that your job will be routed to, based on the resource needs that you specify:

Queue    Max Run Time  Max Memory  Max Nodes  Description
default  4 days        16 GB       54         Jobs that should be done within a few days
long     unlimited     16 GB       54         Jobs that will run for a long time
himem    unlimited     32 GB       8          Jobs that need to use memory-rich nodes
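
To get a rough idea of where a request might land, the limits in the table can be read as a simple decision rule. The function below is a hypothetical sketch of that reading only: the real routing is decided by the scheduler configuration, node counts are ignored here, and the inputs are whole gigabytes and whole hours.

```shell
#!/bin/bash
# Sketch only: guess a routed queue from the cheat-sheet limits.
# This is an assumption about routing, not the scheduler's actual rules.
# Usage: guess_queue MEM_GB WALLTIME_HOURS
guess_queue() {
    local mem_gb="$1" hours="$2"
    if [ "$mem_gb" -gt 32 ]; then
        echo "none"        # exceeds every memory limit in the table
    elif [ "$mem_gb" -gt 16 ]; then
        echo "himem"       # only himem lists more than 16 GB
    elif [ "$hours" -le 96 ]; then
        echo "default"     # 4 days = 96 hours
    else
        echo "long"        # no run-time limit listed
    fi
}

guess_queue 8 48     # small, short request
guess_queue 8 200    # small, long-running request
guess_queue 24 48    # memory-heavy request
```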

Queue Status

Use qstat to get an up-to-date summary of the queues.

> qstat -q
Queue            Memory CPU Time Walltime Node  Run Que Lm  State
---------------- ------ -------- -------- ----  --- --- --  -----
debug              --   02:00:00    --      --    0   0 --   E R
vip                --   24:00:00    --      --    0   0 --   E R
route              --      --       --      --    0   0 --   E R
himem              --      --       --      --    6   0 --   E R
long               15gb    --       --      --    1   0 12   E R
default            15gb 96:00:00 96:00:00   --    0   1 --   E R
                                               ----- -----
                                                   7     1
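
The listing can also be filtered with standard tools. The sketch below uses `awk` to print the queues that currently have running jobs. It assumes the ten-column layout of the sample listing above (with `Run` as the sixth field), and `busy_queues` is a hypothetical helper name. The sample rows are embedded so that the example runs without access to the cluster.

```shell
#!/bin/bash
# Sketch only: list queues with at least one running job.
# Assumes the column layout of the sample "qstat -q" listing above.
busy_queues() {
    # Keep ten-field rows whose sixth field (Run) is a number above zero.
    awk 'NF == 10 && $6 ~ /^[0-9]+$/ && $6 + 0 > 0 { print $1 }'
}

# Rows copied from the listing above.
sample='debug              --   02:00:00    --      --    0   0 --   E R
vip                --   24:00:00    --      --    0   0 --   E R
route              --      --       --      --    0   0 --   E R
himem              --      --       --      --    6   0 --   E R
long               15gb    --       --      --    1   0 12   E R
default            15gb 96:00:00 96:00:00   --    0   1 --   E R'

printf '%s\n' "$sample" | busy_queues
```

For the sample rows this prints `himem` and `long`. On the cluster, the same filter could be applied as `qstat -q | busy_queues`.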