Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revisionLast revisionBoth sides next revision | ||
technical:slurm:node-memory-sizes [2019-02-18 11:34] – frey | technical:slurm:node-memory-sizes [2019-02-18 12:32] – [Requesting specific node types] frey | ||
---|---|---|---|
Line 23: | Line 23: | ||
The changes did not need to be rolled-back, | The changes did not need to be rolled-back, | ||
- | One additional problem could present itself under the v1.1.3 use of nominal physical memory size for the nodes Consider the following: | + | One additional problem could present itself under the v1.1.3 use of nominal physical memory size for the nodes. |
* A node runs a job requesting 28 cores and 100 GiB of memory, leaving 8 cores and 28 GiB of memory available according to the node configuration. | * A node runs a job requesting 28 cores and 100 GiB of memory, leaving 8 cores and 28 GiB of memory available according to the node configuration. | ||
- | * A second job from a different user, requesting | + | * The same node runs a second job that requests |
- | Since the OS itself occupies some non-trivial amount of the physical memory, the second job eventually extends memory usage above and beyond the amount of physical memory present. | + | Since the OS itself occupies some non-trivial amount of the physical memory, the second job eventually extends memory usage above and beyond the amount of physical memory present |
<WRAP negative round> | <WRAP negative round> | ||
Line 61: | Line 61: | ||
<WRAP positive round> | <WRAP positive round> | ||
- | Node configurations will be updated to reflect the chosen sub-nominal RealMemory sizes. | + | Node configurations will be updated to reflect the chosen sub-nominal RealMemory sizes. The // |
</ | </ | ||
- | The // | + | Under mode 1 of // |
<note important> | <note important> | ||
+ | <WRAP positive round> | ||
+ | Workgroup QOS configurations will be updated to reflect the sum over sub-nominal RealMemory sizes rather than nominal sizes used in the v1.1.3 configuration. | ||
+ | </ | ||
+ | |||
+ | In v1.1.3 the node counts in workgroup QOS's were replaced by aggregate memory sizes which summed over the nominal sizes (128 GiB, 256 GiB, 512 GiB). In concert with changing the nodes' RealMemory size, the QOS aggregate must change. | ||
+ | |||
+ | ==== Proposed RealMemory sizes ==== | ||
+ | |||
+ | ^Node type^(PHYS_PAGES*PAGESIZE)/ | ||
+ | |Gen1/128 GiB|128813|126976|124| | ||
+ | |Gen1/256 GiB|257843|256000|250| | ||
+ | |Gen1/512 GiB|515891|514048|502| | ||
+ | |Gen1/ | ||
+ | |Gen1/ | ||
+ | |Gen1/ | ||
+ | |Gen1/ | ||
+ | |||
+ | A workgroup QOS which under v1.1.3 had '' | ||
+ | |||
+ | ==== Requesting specific node types ==== | ||
+ | |||
+ | Our Slurm has always defined several // | ||
+ | |||
+ | <code bash> | ||
+ | $ scontrol show node r00n00 | ||
+ | NodeName=r00n00 Arch=x86_64 CoresPerSocket=36 | ||
+ | | ||
+ | | ||
+ | | ||
+ | : | ||
+ | </ | ||
+ | |||
+ | It is possible to constrain a job to execute on a node with a specific nominal memory size using these features: | ||
+ | |||
+ | <code bash> | ||
+ | $ sbatch … --constraint=128GB … | ||
+ | $ sbatch … --constraint=256GB … | ||
+ | $ sbatch … --constraint=512GB … | ||
+ | </ | ||
+ | |||
+ | The other features reflect the model processor present in the node. All Gen1 nodes use the Intel '' | ||
===== Implementation ===== | ===== Implementation ===== | ||
Line 79: | Line 120: | ||
^Date ^Time ^Goal/ | ^Date ^Time ^Goal/ | ||
- | |2019-02-04| |Authoring of this document| | + | |2019-02-18| |Authoring of this document| |
- | |2019-02-06| |Document shared with Caviness community for feedback| | + | |2019-02-18| |Document shared with Caviness community for feedback| |
- | |2019-02-13| |Add announcement of impending change to login banner| | + | |2019-02-18| |Add announcement of impending change to login banner| |
- | |2019-02-18|09: | + | |2019-02-25|09: |
| |09: | | |09: | ||
- | |2019-02-20| |Remove announcement from login banner| | + | |2019-02-27| |Remove announcement from login banner| |