Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revisionLast revisionBoth sides next revision | ||
technical:slurm:node-memory-sizes [2019-02-18 12:15] – [Proposed RealMemory sizes] frey | technical:slurm:node-memory-sizes [2019-02-18 12:32] – [Requesting specific node types] frey | ||
---|---|---|---|
Line 23: | Line 23: | ||
The changes did not need to be rolled-back, | The changes did not need to be rolled-back, | ||
- | One additional problem could present itself under the v1.1.3 use of nominal physical memory size for the nodes Consider the following: | + | One additional problem could present itself under the v1.1.3 use of nominal physical memory size for the nodes. |
* A node runs a job requesting 28 cores and 100 GiB of memory, leaving 8 cores and 28 GiB of memory available according to the node configuration. | * A node runs a job requesting 28 cores and 100 GiB of memory, leaving 8 cores and 28 GiB of memory available according to the node configuration. | ||
- | * A second job from a different user, requesting | + | * The same node runs a second job that requests |
- | Since the OS itself occupies some non-trivial amount of the physical memory, the second job eventually extends memory usage above and beyond the amount of physical memory present. | + | Since the OS itself occupies some non-trivial amount of the physical memory, the second job eventually extends memory usage above and beyond the amount of physical memory present |
<WRAP negative round> | <WRAP negative round> | ||
Line 86: | Line 86: | ||
A workgroup QOS which under v1.1.3 had '' | A workgroup QOS which under v1.1.3 had '' | ||
+ | |||
+ | ==== Requesting specific node types ==== | ||
+ | |||
+ | Our Slurm has always defined several // | ||
+ | |||
+ | <code bash> | ||
+ | $ scontrol show node r00n00 | ||
+ | NodeName=r00n00 Arch=x86_64 CoresPerSocket=36 | ||
+ | | ||
+ | | ||
+ | | ||
+ | : | ||
+ | </ | ||
+ | |||
+ | It is possible to constrain a job to execute on a node with a specific nominal memory size using these features: | ||
+ | |||
+ | <code bash> | ||
+ | $ sbatch … --constraint=128GB … | ||
+ | $ sbatch … --constraint=256GB … | ||
+ | $ sbatch … --constraint=512GB … | ||
+ | </ | ||
+ | |||
+ | The other features reflect the model processor present in the node. All Gen1 nodes use the Intel '' | ||
===== Implementation ===== | ===== Implementation ===== | ||