Batch Jobs

From bwHPC Wiki
Revision as of 00:21, 15 December 2013 by R Barthel (talk | contribs)

Any calculation on the compute nodes of bwUniCluster must be defined by the user as a single command or a sequence of commands, together with the required run time, number of CPU cores, and main memory. This package, the batch job, is then submitted to a resource and workload management system. All bwHPC clusters, including bwUniCluster, run the workload manager MOAB, so every job must be submitted using MOAB commands. MOAB queues and runs user jobs according to fair-share policies.

1 MOAB Queueing System

Batch jobs are submitted using the command msub. The main purpose of the msub command is to specify the resources that are needed to run the job. msub will then queue the job into the input queue. The jobs are organized into different job classes. For each job class there are specific limits for the available resources (number of nodes, number of CPUs, maximum CPU time, maximum memory etc.).
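As a minimal sketch of the submission step described above, the following writes a small job script and hands it to msub. The file name serial_job.sh, the job name, and all resource values are illustrative assumptions; the job classes and limits that actually apply on bwUniCluster are not given here and should be checked before use. The script assumes that msub reads resource requests from "#MSUB" comment lines and submits only if msub is found on the system.

```shell
#!/bin/sh
# Write a minimal job script. Lines starting with "#MSUB" are intended as
# resource requests for msub; every value below is an illustrative assumption.
cat > serial_job.sh <<'EOF'
#!/bin/sh
#MSUB -N serial_test
#MSUB -l nodes=1:ppn=1
#MSUB -l walltime=00:10:00
#MSUB -l pmem=1000mb
echo "Running on $(hostname)"
EOF

# Submit only where the MOAB client is installed, so the sketch is harmless
# on a machine without msub.
if command -v msub >/dev/null 2>&1; then
    msub serial_job.sh
else
    echo "msub not found; serial_job.sh written but not submitted"
fi
```

The same requests can also be given on the msub command line instead of inside the script, which is convenient for quick experiments with different limits.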

MOAB command   Brief explanation
msub           submits a job and queues it in an input queue
checkjob       displays detailed job state information
showq          displays information about active, eligible, blocked, and/or recently completed jobs
showbf         shows what resources are available for immediate use
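The commands in the table above are typically used together after a job has been submitted. The following sketch wraps them in a small shell function; job_overview is a hypothetical helper name, not a MOAB command, and the commands are called without site-specific options.

```shell
#!/bin/sh
# job_overview is a hypothetical helper, not part of MOAB.
# Usage: job_overview <jobid>
job_overview() {
    if [ -z "$1" ]; then
        echo "usage: job_overview <jobid>" >&2
        return 2
    fi
    # Stop early if the MOAB client tools are not installed.
    for tool in checkjob showq showbf; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "MOAB command '$tool' not found" >&2
            return 1
        fi
    done
    checkjob "$1"   # detailed state of the given job
    showq           # active, eligible, and blocked jobs
    showbf          # resources available for immediate use
}
```

Calling job_overview with the job ID printed by msub shows the job state first, then the queue, then the currently free resources.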

1.1 Submitting Batch Jobs

1.1.1 msub Command

The syntax and use of msub can be displayed via:

$ man msub

1.1.2 msub Examples

Serial Programs

Parallel SMP Programs

Parallel MPI Programs

1.2 Display Status of Submitted Jobs

1.3 Environment Variables for Batch Jobs