ACES: Using the Slurm Scheduler on Composable Resources

Overview

Instructor(s): Dr. Michael Dickens

Time: Tuesday, January 20, 2026 — 1:30PM-4:00PM CT

Location: Online using Zoom

Prerequisite(s): Current ACCESS ID, basic Linux/Unix skills

This course introduces the Slurm scheduler on the ACES cluster, a composable accelerator testbed at Texas A&M University. Topics include several job scheduling approaches and job management tools.

A Registration button will appear here when registration has been opened.

Course Materials

  • Slurm Job Scheduling (Spring 2026): PDF

  • Slurm Job Scheduling (Fall 2025): PDF
  • Slurm Job Scheduling (Spring 2025): PDF
  • Slurm Job Scheduling (Fall 2024): PDF   Video
  • Slurm Job Scheduling (Spring 2024): PDF

Participation

During the training, attendees are expected to log in to an HPRC cluster using their own computer and complete the instructor-led examples and exercises.

Learning Objectives and Agenda

In this course, participants will:

  • Learn the basics of HPC architecture
  • Learn the basic components of a job script
  • Learn how to submit a job script
  • Learn how to review a job's HPC resource usage
  • Learn how to debug failed jobs

This short course covers the following topics for scheduling jobs with the Slurm Workload Manager on ACES:

  • HPC Architecture
  • SBATCH Parameters
  • Single-node jobs
    • single-core
    • multi-core
  • Multi-node jobs
    • MPI jobs
    • TAMULauncher
    • array jobs
  • Monitoring job resource usage
    • at runtime
    • after job completion
    • job debugging
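
To give a sense of what the SBATCH parameters and single-node, multi-core items refer to, here is a minimal job script sketch. It is not taken from the course materials: the job name, time limit, core count, memory request, and output file name are placeholder values chosen for illustration, and ACES may require additional site-specific options (for example a partition or account) that are not shown.

```shell
#!/bin/bash
# Placeholder job name (assumption, pick your own)
#SBATCH --job-name=example_job
# Wall-clock limit in HH:MM:SS
#SBATCH --time=00:10:00
# Single-node job
#SBATCH --nodes=1
# One task (process) ...
#SBATCH --ntasks=1
# ... with four cores, i.e. a multi-core job
#SBATCH --cpus-per-task=4
# Memory requested per node (illustrative value)
#SBATCH --mem=8G
# Output file; %j expands to the job ID
#SBATCH --output=example_job.%j.out

# Slurm sets these variables inside a running job. The fallback values
# let the script also run outside the scheduler as a quick sanity check.
job_id="${SLURM_JOB_ID:-none}"
cores="${SLURM_CPUS_PER_TASK:-1}"

# Report what the job was given; a real job would launch its program here.
summary="job=${job_id} cores=${cores}"
echo "${summary}"
```

A script like this would typically be submitted with `sbatch example_job.sh`. While it is queued or running, `squeue -u $USER` lists it, and after it finishes, `sacct -j <jobid>` reports accounting data where job accounting is enabled.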