Add Slurm Commander

caubet_m 2023-01-24 14:03:30 +01:00
parent a667f6414d
commit e4f562e8d6
2 changed files with 37 additions and 4 deletions

images/Slurm/scom.gif: new binary file, not shown (1008 KiB)

@ -75,6 +75,39 @@ gpu up 7-00:00:00 1-infinite no NO all 8 allocate
</pre>
</details>
### Slurm commander
**[Slurm Commander (scom)](https://github.com/CLIP-HPC/SlurmCommander/)** is a simple but very useful open source text-based user interface for
efficient interaction with Slurm. It was originally developed by the CLoud Infrastructure Project (CLIP-HPC). To use it, run
one of the following commands:
```bash
scom # merlin6 cluster
SLURM_CLUSTERS=merlin5 scom # merlin5 cluster
SLURM_CLUSTERS=gmerlin6 scom # gmerlin6 cluster
scom -h # Help and extra options
```
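The cluster selection shown above can also be wrapped in a small shell function. This is only a sketch: `scom_on` is a hypothetical helper name and is not part of scom. It assumes that plain `scom` targets merlin6 when `SLURM_CLUSTERS` is unset, as in the commands above.

```bash
# Hypothetical helper (not part of scom): start scom for a named Merlin cluster.
# Usage: scom_on [merlin5|merlin6|gmerlin6] [extra scom options]
scom_on() {
    local cluster="${1:-merlin6}"
    if [ "$#" -gt 0 ]; then
        shift
    fi
    case "$cluster" in
        merlin6)
            # Assumption: plain `scom` targets merlin6, so clear any
            # inherited SLURM_CLUSTERS override in a subshell first.
            ( unset SLURM_CLUSTERS; scom "$@" )
            ;;
        merlin5|gmerlin6)
            # Same form as the commands above: set SLURM_CLUSTERS for this call only.
            SLURM_CLUSTERS="$cluster" scom "$@"
            ;;
        *)
            echo "scom_on: unknown cluster '$cluster' (expected merlin5, merlin6 or gmerlin6)" >&2
            return 2
            ;;
    esac
}
```

For example, `scom_on gmerlin6` would open the interface for the gmerlin6 cluster, and `scom_on merlin6 -h` would show the help.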
With this interface, users can interact with queued jobs and get information about past and present jobs:
* Filtering jobs by substring is possible with the `/` key.
* Users can perform multiple actions on their jobs, such as cancelling or requeuing a job, opening an SSH session to a node where one of their jobs is already running,
or getting extended details of the job itself.
Users can also check the status of the cluster to get statistics and node usage information, as well as information about node properties.
The interface also provides a few job templates for different use cases (e.g. MPI, OpenMP, Hybrid, single core). Users can modify these templates,
save them locally to the current directory, and submit the job to the cluster.
{{site.data.alerts.note}}Currently, <span style="color:darkblue;">scom</span> does not provide live updated information for the <span style="color:darkorange;">[Job History]</span> tab.
To update Job History information, users have to exit the application with the <span style="color:darkorange;">q</span> key and start it again. Other tabs are updated every 5 seconds.
In addition, the <span style="color:darkorange;">[Job History]</span> tab currently contains information for the <b>merlin6</b> CPU cluster only. Future updates will provide information
for other clusters.
{{site.data.alerts.end}}
For further information about how to use **scom**, please refer to the **[Slurm Commander Project webpage](https://github.com/CLIP-HPC/SlurmCommander/)**.
!['scom' text-based user interface]({{ "/images/Slurm/scom.gif" }})
### Job accounting
Users can check detailed information of jobs (pending, running, completed, failed, etc.) with the `sacct` command.
@ -132,7 +165,7 @@ Memory Efficiency: 0.19% of 31.25 GB
The ``sjstat`` command is used to display statistics of jobs under control of Slurm. To use it:
```bash
jstat
sjstat
```
<details>
@ -216,11 +249,11 @@ JobID User Procs Pool Status Used Limit Starte
### Graphical user interface
When using **ssh** with X11 forwarding (``ssh -XY``) users can use ``sview``. **SView** is a graphical user
interface to view and modify Slurm state. To run **sview**:
When using **ssh** with X11 forwarding (``ssh -XY``), or when using NoMachine, users can use ``sview``.
**SView** is a graphical user interface to view and modify Slurm states. To run **sview**:
```bash
ssh -XY $username@merlin-l-001.psi.ch
ssh -XY $username@merlin-l-001.psi.ch # Not necessary when using NoMachine
sview
```
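Since ``sview`` needs a graphical display, a small guard can catch a session opened without X11 forwarding before the program is started. This is only a sketch: `run_sview` is a hypothetical helper name, and it assumes that a usable display is indicated by the standard `DISPLAY` environment variable.

```bash
# Hypothetical helper: only start sview when an X display is available.
run_sview() {
    if [ -z "${DISPLAY:-}" ]; then
        # No display: the session was probably opened without X11 forwarding.
        echo "run_sview: DISPLAY is not set; log in with 'ssh -XY' or use NoMachine" >&2
        return 1
    fi
    sview "$@"
}
```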