This document summarizes alterations to the Slurm job scheduler configuration on the DARWIN cluster.
See this document discussing swap limits in Slurm jobs.
dynamic_swap_limits SPANK plugin will be compiled and installedplugstack.conf configuration file will be modified to require the dynamic_swap_limits pluginThe following aspects of the system construction will be observed in the configuration of the plugin:
max_swap valuemax_swap per CPUmax_swap per CPUmax_swap per CPUThese details produce the following configuration string:
partition(extended-mem)=none,host(r2v[00-02])=1.5%/cpu,host(r0m01)=0.25%/cpu,default()=1%/cpu
No downtime is expected. The slurmd daemon must be restarted on all compute nodes, but currently-executing jobs/job steps should not be affected (they will reconnect to the new slurmd as necessary to communicate job status, etc.). The slurmctld daemons do not use the SPANK plugin, thus they do not need to be restarted.
| Date | Time | Goal/Description |
|---|---|---|
| 2021-11-19 | Authoring of this document | |
| 2021-11-24 | 09:00 | Implementation |
| 2021-12-01 | 14:54 | Update: appropriate aggregate limits on job cgroup |