Hprc banner tamu.png

Difference between revisions of "Terra:Batch Job Submissions"

From TAMU HPRC
Jump to: navigation, search
(Job Submission)
(tamubatch)
(5 intermediate revisions by 2 users not shown)
Line 4: Line 4:
 
  Submitted batch job 3606
 
  Submitted batch job 3606
  
After a job has been submitted, you may want to check on its progress or cancel it. Below is a list of the most used job monitoring and control commands for jobs on Terra.
+
== tamubatch ==
  
{| class="wikitable" style="text-align: center;"
+
'''tamubatch''' is an automatic batch job script that submits jobs for the user without the need of writing a batch script on the Grace and Terra clusters. The user just needs to provide the executable commands in a text file and tamubatch will automatically submit the job to the cluster. There are flags that the user may specify which allows control over the parameters for the job submitted.
|+ Job Monitoring and Control Commands
+
 
!Function
+
For more information, visit [https://hprc.tamu.edu/wiki/SW:tamubatch this page.]
!Command
 
!Example
 
|-
 
|Submit a job
 
|sbatch [script_file]
 
|sbatch FileName.job
 
|-
 
|Cancel/Kill a job
 
|scancel [job_id]
 
|scancel 101204
 
|-
 
|Check status of a single job
 
|squeue --job [job_id]
 
|squeue --job 101204
 
|-
 
|Check status of all <br> jobs for a user
 
|squeue -u [user_name]
 
|squeue -u terraUser1
 
|}
 
  
 
== tamulauncher ==
 
== tamulauncher ==

Revision as of 12:11, 22 October 2021

Job Submission

Once you have your job file ready, it is time to submit your job. You can submit your job to slurm with the following command:

[NetID@terra1 ~]$ sbatch MyJob.slurm 
Submitted batch job 3606

tamubatch

tamubatch is an automatic batch job script that submits jobs for the user without the need of writing a batch script on the Grace and Terra clusters. The user just needs to provide the executable commands in a text file and tamubatch will automatically submit the job to the cluster. There are flags that the user may specify which allows control over the parameters for the job submitted.

For more information, visit this page.

tamulauncher

tamulauncher provides a convenient way to run a large number of serial or multithreaded commands without the need to submit individual jobs or a Job array. User provides a text file containing all commands that need to be executed and tamulauncher will execute the commands concurrently. The number of concurrently executed commands depends on the batch requirements. When tamulauncher is run interactively the number of concurrently executed commands is limited to at most 8. tamulauncher is available on terra, ada, and curie. There is no need to load any module before using tamulauncher. tamulauncher has been successfully tested to execute over 100K commands.

tamulauncher is preferred over Job Arrays to submit a large number of individual jobs, especially when the run times of the commands are relatively short. It allows for better utilization of the nodes, puts less burden on the batch scheduler, and lessens interference with jobs of other users on the same node.

For more information, visit this page.