This document summarizes alterations to the Slurm job scheduler configuration on the Caviness cluster.
Two new racks (r05, r06) have been added to the Caviness cluster. Nodes in the new racks must be integrated into the Slurm configuration for job scheduling. First-time investing workgroups must be added to Slurm accounting, and all workgroups' QOS-based resource limits and fairshare factors must be updated.
- The `nodes.conf` file will be modified to include r05, r06.
- The `partitions.conf` file will be modified to:
- The `topology.conf` file will be modified to include OPA switches/HFIs in r05, r06.
  - The `/opt/shared/slurm/add-ons/bin/opa2slurm` utility (written by IT-RCI staff) will be used to automatically map the OPA network.

No downtime is expected to be required. The version of the configuration will be bumped to v2.4.0.
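
To illustrate the kind of entries the `topology.conf` change involves, the following is a minimal Python sketch that renders Slurm `SwitchName=` lines from a switch-to-node mapping. It is not the `opa2slurm` utility, whose implementation is not shown in this document. The function name `topology_lines`, the switch names, and the node names are all hypothetical, and a simple two-level tree (leaf switches under one spine) is assumed.

```python
def topology_lines(leaf_to_nodes, spine_name="spine-hypothetical"):
    """Render topology.conf lines for an assumed two-level switch tree.

    leaf_to_nodes maps a leaf switch name to the node names attached to it.
    All names passed in here are placeholders, not real Caviness hardware.
    """
    lines = []
    # One line per leaf switch, listing the nodes directly attached to it.
    for switch in sorted(leaf_to_nodes):
        nodes = ",".join(sorted(leaf_to_nodes[switch]))
        lines.append(f"SwitchName={switch} Nodes={nodes}")
    # One line for the spine, listing the leaf switches beneath it.
    leaves = ",".join(sorted(leaf_to_nodes))
    lines.append(f"SwitchName={spine_name} Switches={leaves}")
    return lines


# Hypothetical mapping for the two new racks.
example = {
    "r05-leaf-hypothetical": ["r05n00", "r05n01"],
    "r06-leaf-hypothetical": ["r06n00", "r06n01"],
}

for line in topology_lines(example):
    print(line)
```

The real OPA fabric is discovered automatically rather than typed in by hand, so in practice the mapping would come from the fabric itself instead of a hard-coded dictionary.
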
Date | Time | Goal/Description |
---|---|---|
2023-05-27 | | Authoring of this document |
2023-05-30 | 09:00 | Implementation |