====== Slurm ====== Slurm is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters. Documentation for the current version of Slurm provided by SchedMD [[https://slurm.schedmd.com/documentation.html|SchedMD Slurm Documentation]]. You may find it helpful when migrating from one scheduler to another such as GridEngine to Slurm to refer to SchedMD's [[https://slurm.schedmd.com/rosetta.pdf|rosetta]] showing equivalent commands across various schedulers and their [[https://slurm.schedmd.com/pdfs/summary.pdf|command/option summary (two pages)]]. * Introduction to Slurm ({{:training:slurm:introduction_to_slurm_october2024.pdf|slides}}) ([[https://capture.udel.edu/media/Introduction+to+Slurm+Workshop+Remote/1_l6bzmv0e|video]]) * [[http://www.hpc.udel.edu/presentations/intro_to_slurm/|Caviness Community Cluster: Intro to Slurm]] It is a good idea to periodically check in ''/opt/shared/templates/slurm/'' for updated or new [[technical:slurm:caviness:templates:start|Caviness templates]] and/or [[technical:slurm:darwin:templates:start|DARWIN templates]] to use as job scripts to run generic or specific applications designed to provide the best performance on Caviness and/or DARWIN.