Add Slurm Commander
This commit is contained in:
parent
a667f6414d
commit
e4f562e8d6
BIN
images/Slurm/scom.gif
Normal file
BIN
images/Slurm/scom.gif
Normal file
Binary file not shown.
After Width: | Height: | Size: 1008 KiB |
@ -75,6 +75,39 @@ gpu up 7-00:00:00 1-infinite no NO all 8 allocate
|
|||||||
</pre>
|
</pre>
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
|
### Slurm commander
|
||||||
|
|
||||||
|
The **[Slurm Commander (scom)](https://github.com/CLIP-HPC/SlurmCommander/)** it's a simple but very useful open source text-based user interface for
|
||||||
|
simple and efficient interaction with Slurm. It is originally developed by the CLoud Infrastructure Project (CLIP-HPC). To use it, one can
|
||||||
|
simply run the following command:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
scom # merlin6 cluster
|
||||||
|
SLURM_CLUSTERS=merlin5 scom # merlin5 cluster
|
||||||
|
SLURM_CLUSTERS=gmerlin6 scom # gmerlin6 cluster
|
||||||
|
scom -h # Help and extra options
|
||||||
|
```
|
||||||
|
With this simple interface, users can interact with queued jobs, as well as getting information about past and present jobs:
|
||||||
|
* Filtering jobs by substring is possible with the `/` key.
|
||||||
|
* Users can perform multiple actions on their jobs, such like cancelling or requeing a job, SSH to a node with an already running job,
|
||||||
|
or getting extended details of the job itself.
|
||||||
|
|
||||||
|
Also, users can check the status of the cluster, to get statistics and node usage information as well as getting information about node properties.
|
||||||
|
|
||||||
|
The interface also provides a few job templates for different use cases (i.e. MPI, OpenMP, Hybrid, single core). Users can modify these templates,
|
||||||
|
save it locally to the current directory, and submit the job to the cluster.
|
||||||
|
|
||||||
|
{{site.data.alerts.note}}Currently, <span style="color:darkblue;">scom</span> does not provide live updated information for the <span style="color:darkorange;">[Job History]</span> tab.
|
||||||
|
To update Job History information, users have exit the application with the <span style="color:darkorange;">q</span> key. Other tabs are updated every 5 seconds.
|
||||||
|
|
||||||
|
On the other hand, the <span style="color:darkorange;">[Job History]</span> tab contains only information for the <b>merlin6</b> CPU cluster only. Future updates will provide information
|
||||||
|
for other clusters.
|
||||||
|
{{site.data.alerts.end}}
|
||||||
|
|
||||||
|
For further information about how to use **scom**, please refer to the **[Slurm Commander Project webpage](https://github.com/CLIP-HPC/SlurmCommander/)**
|
||||||
|
|
||||||
|

|
||||||
|
|
||||||
### Job accounting
|
### Job accounting
|
||||||
|
|
||||||
Users can check detailed information of jobs (pending, running, completed, failed, etc.) with the `sacct` command.
|
Users can check detailed information of jobs (pending, running, completed, failed, etc.) with the `sacct` command.
|
||||||
@ -132,7 +165,7 @@ Memory Efficiency: 0.19% of 31.25 GB
|
|||||||
The ``sjstat`` command is used to display statistics of jobs under control of SLURM. To use it
|
The ``sjstat`` command is used to display statistics of jobs under control of SLURM. To use it
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
jstat
|
sjstat
|
||||||
```
|
```
|
||||||
|
|
||||||
<details>
|
<details>
|
||||||
@ -216,11 +249,11 @@ JobID User Procs Pool Status Used Limit Starte
|
|||||||
|
|
||||||
### Graphical user interface
|
### Graphical user interface
|
||||||
|
|
||||||
When using **ssh** with X11 forwarding (``ssh -XY``) users can use ``sview``. **SView** is a graphical user
|
When using **ssh** with X11 forwarding (``ssh -XY``), or when using NoMachine, users can use ``sview``.
|
||||||
interface to view and modify Slurm state. To run **sview**:
|
**SView** is a graphical user interface to view and modify Slurm states. To run **sview**:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
ssh -XY $username@merlin-l-001.psi.ch
|
ssh -XY $username@merlin-l-001.psi.ch # Not necessary when using NoMachine
|
||||||
sview
|
sview
|
||||||
```
|
```
|
||||||
|
|
||||||
|
Loading…
x
Reference in New Issue
Block a user