diff --git a/images/Slurm/scom.gif b/images/Slurm/scom.gif
new file mode 100644
index 0000000..8325e26
Binary files /dev/null and b/images/Slurm/scom.gif differ
diff --git a/pages/merlin6/03-Slurm-General-Documentation/monitoring.md b/pages/merlin6/03-Slurm-General-Documentation/monitoring.md
index c15955d..dc47e4c 100644
--- a/pages/merlin6/03-Slurm-General-Documentation/monitoring.md
+++ b/pages/merlin6/03-Slurm-General-Documentation/monitoring.md
@@ -75,6 +75,39 @@ gpu up 7-00:00:00 1-infinite no NO all 8 allocate
+### Slurm commander
+
+The **[Slurm Commander (scom)](https://github.com/CLIP-HPC/SlurmCommander/)** is a simple but very useful open source text-based user interface for
+simple and efficient interaction with Slurm. It was originally developed by the CLoud Infrastructure Project (CLIP-HPC). To use it,
+simply run one of the following commands:
+
+```bash
+scom                          # merlin6 cluster
+SLURM_CLUSTERS=merlin5 scom   # merlin5 cluster
+SLURM_CLUSTERS=gmerlin6 scom  # gmerlin6 cluster
+scom -h                       # Help and extra options
+```
+With this simple interface, users can interact with queued jobs and get information about past and present jobs:
+* Filtering jobs by substring is possible with the `/` key.
+* Users can perform multiple actions on their jobs, such as cancelling or requeuing a job, connecting via SSH to a node with an already running job,
+or getting extended details of the job itself.
+
+Users can also check the status of the cluster to get statistics and node usage information, as well as information about node properties.
+
+The interface also provides a few job templates for different use cases (i.e. MPI, OpenMP, Hybrid, single core). Users can modify these templates,
+save them locally to the current directory, and submit the job to the cluster.
+
+{{site.data.alerts.note}}Currently, scom does not provide live updated information for the [Job History] tab.
+To update Job History information, users have to exit the application with the q key.
Other tabs are updated every 5 seconds.
+
+In addition, the [Job History] tab currently contains information for the merlin6 CPU cluster only. Future updates will provide information
+for other clusters.
+{{site.data.alerts.end}}
+
+For further information about how to use **scom**, please refer to the **[Slurm Commander Project webpage](https://github.com/CLIP-HPC/SlurmCommander/)**.
+
+!['scom' text-based user interface]({{ "/images/Slurm/scom.gif" }})
+
 ### Job accounting

 Users can check detailed information of jobs (pending, running, completed, failed, etc.) with the `sacct` command.
@@ -132,7 +165,7 @@ Memory Efficiency: 0.19% of 31.25 GB
 The ``sjstat`` command is used to display statistics of jobs under control of SLURM. To use it

 ```bash
-jstat
+sjstat
 ```
@@ -216,11 +249,11 @@ JobID User Procs Pool Status Used Limit Starte

 ### Graphical user interface

-When using **ssh** with X11 forwarding (``ssh -XY``) users can use ``sview``. **SView** is a graphical user
-interface to view and modify Slurm state. To run **sview**:
+When using **ssh** with X11 forwarding (``ssh -XY``), or when using NoMachine, users can use ``sview``.
+**SView** is a graphical user interface to view and modify Slurm states. To run **sview**:

 ```bash
-ssh -XY $username@merlin-l-001.psi.ch
+ssh -XY $username@merlin-l-001.psi.ch # Not necessary when using NoMachine
 sview
 ```