abstract:mills:runjobs:job_status

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
abstract:mills:runjobs:job_status [2018-07-16 15:15] – created anitaabstract:mills:runjobs:job_status [2018-07-16 15:30] (current) – [Resource-management options] anita
Line 29: Line 29:
 The ''qstat'' command also allows the user to see job status information for any other cluster user by means of the ''-u'' flag.  The flag requires a single argument:  a username or the wildcard character (''\*''): The ''qstat'' command also allows the user to see job status information for any other cluster user by means of the ''-u'' flag.  The flag requires a single argument:  a username or the wildcard character (''\*''):
 <code bash> <code bash>
-[(it_css:traine)@farber it_css]$ qstat -u traine+[(it_css:traine)@mills it_css]$ qstat -u traine
    :    :
-[(it_css:traine)@farber it_css]$ qstat -u \*+[(it_css:traine)@mills it_css]$ qstat -u \*
    :    :
 </code> </code>
Line 40: Line 40:
 In all forms discussed above the output from ''qstat'' focuses on jobs.  To instead view the status information in a host-centric format, the ''-f'' option should be added to the ''qstat'' command.  The output from ''qstat -f'' is organized by queue instances (thus, also by compute hosts) with jobs running in a particular queue instance summarized therein: In all forms discussed above the output from ''qstat'' focuses on jobs.  To instead view the status information in a host-centric format, the ''-f'' option should be added to the ''qstat'' command.  The output from ''qstat -f'' is organized by queue instances (thus, also by compute hosts) with jobs running in a particular queue instance summarized therein:
 <code bash> <code bash>
-[(it_css:traine)@farber it_css]$ qstat -f -q 'it_css*'+[(it_css:traine)@mills it_css]$ qstat -f -q 'it_css*'
 queuename                      qtype resv/used/tot. load_avg arch          states queuename                      qtype resv/used/tot. load_avg arch          states
 --------------------------------------------------------------------------------- ---------------------------------------------------------------------------------
Line 91: Line 91:
  
 <code bash> <code bash>
-[(it_css:traine)@farber it_css]$ qhost -h n013 -h n014+[(it_css:traine)@mills it_css]$ qhost -h n013 -h n014
 HOSTNAME                ARCH         NCPU  LOAD  MEMTOT  MEMUSE  SWAPTO  SWAPUS HOSTNAME                ARCH         NCPU  LOAD  MEMTOT  MEMUSE  SWAPTO  SWAPUS
 ------------------------------------------------------------------------------- -------------------------------------------------------------------------------
Line 110: Line 110:
  
 <code bash> <code bash>
-[(it_css:traine)@farber it_css]$ qstat -j 82518+[(it_css:traine)@mills it_css]$ qstat -j 82518
 ============================================================== ==============================================================
 job_number:                 82518 job_number:                 82518
Line 124: Line 124:
 sge_o_shell:                /bin/bash sge_o_shell:                /bin/bash
 sge_o_workdir:              /lustre/work/it_css sge_o_workdir:              /lustre/work/it_css
-sge_o_host:                 farber+sge_o_host:                 mills
 account:                    sge account:                    sge
 cwd:                        /lustre/work/it_css cwd:                        /lustre/work/it_css
 merge:                      y merge:                      y
 hard resource_list:         idle_resources=0,dev_resources=0,exclusive=1,standby_resources=1,scratch_free=1000000 hard resource_list:         idle_resources=0,dev_resources=0,exclusive=1,standby_resources=1,scratch_free=1000000
-mail_list:                  traine@farber.hpc.udel.edu+mail_list:                  traine@mills.hpc.udel.edu
 notify:                     FALSE notify:                     FALSE
 job_name:                   mpibounce.qs job_name:                   mpibounce.qs
Line 159: Line 159:
  
 <code bash> <code bash>
-[(it_css:traine)@farber ~]$ qjobs+[(it_css:traine)@mills ~]$ qjobs
 =============================================================================== ===============================================================================
 JobID  Owner              State    Submitted as JobID  Owner              State    Submitted as
Line 172: Line 172:
  
 <code bash> <code bash>
-[(it_css:traine)@farber ~]$ qjobs -g sandler_thermo+[(it_css:traine)@mills ~]$ qjobs -g sandler_thermo
 =============================================================================== ===============================================================================
 JobID  Owner              State    Submitted as JobID  Owner              State    Submitted as
Line 211: Line 211:
 **Details by cluster** **Details by cluster**
  
-   * [[clusters:farber:runapps#resource-management-options|farber]] +   * [[abstract:mills:runjobs:schedule_jobs#resource-management-options-on-mills|Mills]]
-   * [[clusters:farber:runapps#resource-management-options|Farber]]+
  
 ===== Managing Jobs ===== ===== Managing Jobs =====
Line 265: Line 264:
  
 <code base> <code base>
-[(it_css:traine)@farber it_css]$ qstat -u traine+[(it_css:traine)@mills it_css]$ qstat -u traine
 job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
 ----------------------------------------------------------------------------------------------------------------- -----------------------------------------------------------------------------------------------------------------
Line 274: Line 273:
  
 <code base> <code base>
-[(it_css:traine)@farber it_css]$ qalter -w p 99154 | tail -10+[(it_css:traine)@mills it_css]$ qalter -w p 99154 | tail -10
 Job 99154 has no permission for cluster queue "puleo-qrsh.q" Job 99154 has no permission for cluster queue "puleo-qrsh.q"
 Job 99154 has no permission for cluster queue "capsl.q+" Job 99154 has no permission for cluster queue "capsl.q+"
Line 289: Line 288:
  
 <code base> <code base>
-[(it_css:traine)@farber it_css]$ qstatgrp+[(it_css:traine)@mills it_css]$ qstatgrp
 CLUSTER QUEUE                   CQLOAD   USED    RES  AVAIL  TOTAL aoACDPS  cdsuE CLUSTER QUEUE                   CQLOAD   USED    RES  AVAIL  TOTAL aoACDPS  cdsuE
 it_css-dev.q                      0.00      0      0     72     72      0      0 it_css-dev.q                      0.00      0      0     72     72      0      0
Line 302: Line 301:
  
 <code base> <code base>
-[(it_css:traine)@farber it_css]$ qalter -pe openmpi 48 99154+[(it_css:traine)@mills it_css]$ qalter -pe openmpi 48 99154
 modified parallel environment of job 99154 modified parallel environment of job 99154
 modified slot range of job 99154 modified slot range of job 99154
-[(it_css:traine)@farber it_css]$ qstat -u traine+[(it_css:traine)@mills it_css]$ qstat -u traine
 job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
 ----------------------------------------------------------------------------------------------------------------- -----------------------------------------------------------------------------------------------------------------
Line 313: Line 312:
 Another way to get this job running would be to change the resource for the job to run in the standby queue.  To do this you must specify all resources since ''qalter'' completely replaces any parameters previously specified for the job by that option. In this example, we alter the job to run in the standby queue by using Another way to get this job running would be to change the resource for the job to run in the standby queue.  To do this you must specify all resources since ''qalter'' completely replaces any parameters previously specified for the job by that option. In this example, we alter the job to run in the standby queue by using
 <code base> <code base>
-[(it_css:traine)@farber it_css]$ qalter -l idle=0,standby=1 99154+[(it_css:traine)@mills it_css]$ qalter -l idle=0,standby=1 99154
 modified hard resource list of job 99154 modified hard resource list of job 99154
-[(it_css:traine)@farber it_css]$ qstat -u traine+[(it_css:traine)@mills it_css]$ qstat -u traine
 job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
 ----------------------------------------------------------------------------------------------------------------- -----------------------------------------------------------------------------------------------------------------
Line 328: Line 327:
  
 <code base> <code base>
-[(it_css:traine)@farber it_css]$ qstat -u traine+[(it_css:traine)@mills it_css]$ qstat -u traine
 job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID job-ID  prior   name       user         state submit/start at     queue                          slots ja-task-ID
 ----------------------------------------------------------------------------------------------------------------- -----------------------------------------------------------------------------------------------------------------
Line 337: Line 336:
  
 <code base> <code base>
-[traine@farber ~]$ qstat -j 686924 | grep error+[traine@mills ~]$ qstat -j 686924 | grep error
 error reason    1:          08/12/2014 22:08:27 [1208:60529]: error: can't chdir to /archive/it_css/traine/ex-openmpi: No such file or directory error reason    1:          08/12/2014 22:08:27 [1208:60529]: error: can't chdir to /archive/it_css/traine/ex-openmpi: No such file or directory
 </code> </code>
Line 346: Line 345:
  
 <code base> <code base>
-[traine@farber ~]$ qmod -cj 686924+[traine@mills ~]$ qmod -cj 686924
 </code> </code>
  
  • abstract/mills/runjobs/job_status.1531768552.txt.gz
  • Last modified: 2018-07-16 15:15
  • by anita