Update Slurm

2024-06-07 10:42:37 +02:00
parent 75f45b98e7
commit 8378850494

@ -6,3 +6,23 @@ SLURM (Simple Linux Utility for Resource Management) is an open-source workload
General link for documentation: https://slurm.schedmd.com/documentation.html
## Basic commands/concepts
### SLURM Partitions (Queues) / `sinfo`
A partition, sometimes called a "queue", is a named subset of compute nodes that share a set of restrictions, such as a maximum job time limit.
To list all available partitions and their restrictions, run `sinfo` on the cluster.
Example on Merlin:
```
CLUSTER: gmerlin6
PARTITION       AVAIL  TIMELIMIT   NODES  STATE  NODELIST
gwendolen       up     2:00:00     1      idle   merlin-g-100
gwendolen-long  up     8:00:00     1      idle   merlin-g-100
gpu*            up     7-00:00:00  6      mix    merlin-g-[002,004,006,010,013-014]
gpu*            up     7-00:00:00  7      alloc  merlin-g-[003,005,007,009,011-012,015]
gpu*            up     7-00:00:00  1      idle   merlin-g-008
gpu-short       up     2:00:00     6      mix    merlin-g-[002,004,006,010,013-014]
gpu-short       up     2:00:00     7      alloc  merlin-g-[003,005,007,009,011-012,015]
gpu-short       up     2:00:00     2      idle   merlin-g-[001,008]
```
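Each row of the output describes one partition in one node state, so the same partition can appear several times (for example `gpu*` as `mix`, `alloc`, and `idle`). The `*` suffix marks the default partition. To show how the columns can be read programmatically, here is a minimal Python sketch that counts idle nodes per partition. It parses a sample string copied from the Merlin example above rather than calling `sinfo` itself; the helper names `parse_sinfo` and `idle_nodes_per_partition` are illustrative and not part of Slurm.

```python
# Sample text copied from the Merlin example; on a cluster this string would
# instead come from the output of `sinfo`.
SAMPLE = """\
CLUSTER: gmerlin6
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
gwendolen up 2:00:00 1 idle merlin-g-100
gwendolen-long up 8:00:00 1 idle merlin-g-100
gpu* up 7-00:00:00 6 mix merlin-g-[002,004,006,010,013-014]
gpu* up 7-00:00:00 7 alloc merlin-g-[003,005,007,009,011-012,015]
gpu* up 7-00:00:00 1 idle merlin-g-008
gpu-short up 2:00:00 6 mix merlin-g-[002,004,006,010,013-014]
gpu-short up 2:00:00 7 alloc merlin-g-[003,005,007,009,011-012,015]
gpu-short up 2:00:00 2 idle merlin-g-[001,008]
"""


def parse_sinfo(text):
    """Parse default-format `sinfo` output into a list of dicts, one per row."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, the "CLUSTER:" banner, and the column header.
        if len(fields) != 6 or fields[0] == "PARTITION":
            continue
        partition, avail, timelimit, nodes, state, nodelist = fields
        rows.append({
            "partition": partition.rstrip("*"),   # strip the default marker
            "default": partition.endswith("*"),   # "*" marks the default partition
            "avail": avail,
            "timelimit": timelimit,
            "nodes": int(nodes),
            "state": state,
            "nodelist": nodelist,
        })
    return rows


def idle_nodes_per_partition(rows):
    """Sum the NODES column over all rows whose STATE is 'idle'."""
    idle = {}
    for row in rows:
        if row["state"] == "idle":
            idle[row["partition"]] = idle.get(row["partition"], 0) + row["nodes"]
    return idle


rows = parse_sinfo(SAMPLE)
print(idle_nodes_per_partition(rows))
# prints {'gwendolen': 1, 'gwendolen-long': 1, 'gpu': 1, 'gpu-short': 2}
```

Note that nodes can belong to more than one partition (here `merlin-g-008` is idle in both `gpu` and `gpu-short`), so these per-partition counts should not be added together to get a cluster-wide total.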