Added monitoring

This commit is contained in:
2019-10-21 17:27:37 +02:00
parent e39402e98c
commit 89280c92e2
3 changed files with 39 additions and 2 deletions

View File

@ -43,6 +43,8 @@ entries:
url: /merlin6/slurm-examples.html url: /merlin6/slurm-examples.html
- title: Slurm Configuration - title: Slurm Configuration
url: /merlin6/slurm-configuration.html url: /merlin6/slurm-configuration.html
- title: Monitoring
url: /merlin6/monitoring.html
- title: Jupyterhub - title: Jupyterhub
folderitems: folderitems:
- title: Jupyterhub service - title: Jupyterhub service

View File

@ -6,6 +6,6 @@ services:
dockerfile: Dockerfile dockerfile: Dockerfile
image: result/latest image: result/latest
ports: ports:
- "4000:4000" - "4001:4001"
volumes: volumes:
- ".:/src" - ".:/src"

View File

@ -0,0 +1,35 @@
---
title: Monitoring
#tags:
#keywords:
last_updated: 20 June 2019
#summary: ""
sidebar: merlin6_sidebar
permalink: /merlin6/monitoring.html
---
## Monitoring
The following pages contain basic monitoring for Slurm and computing nodes.
Currently, monitoring is based on Grafana + InfluxDB. In the future it will
be moved to a different service based on ElasticSearch + LogStash + Kibana.
In the meantime, the following monitoring pages are available in a best effort
support:
### Merlin6 Monitoring Pages
* Slurm monitoring:
* [Merlin6 Slurm Live Status](https://hpc-monitor01.psi.ch/d/vpwNKUhZz/merlin6-slurm-live-status?refresh=10s&orgId=1)
* [Merlin6 Slurm Overview](https://hpc-monitor01.psi.ch/d/QzBI6QoZz/merlin5-slurm-overview?refresh=10s&orgId=1)
* Nodes monitoring:
* [Merlin6 CPU Nodes Overview](https://hpc-monitor01.psi.ch/d/JmvLR8gZz/merlin6-computing-cpu-nodes?refresh=10s&orgId=1)
* [Merlin6 GPU Nodes Overview](https://hpc-monitor01.psi.ch/d/98l409-mk/merlin6-computing-gpu-nodes?refresh=5s&orgId=1)
### Merlin5 Monitoring Pages
* Slurm monitoring:
* [Merlin5 Slurm Live Status](https://hpc-monitor01.psi.ch/d/UbKbewTWz/merlin5-slurm-live-status?refresh=10s&orgId=1)
* [Merlin5 Slurm Overview](https://hpc-monitor01.psi.ch/d/QzBI6QoZz/merlin5-slurm-overview?refresh=10s&orgId=1)
* Nodes monitoring:
* [Merlin5 CPU Nodes Overview](https://hpc-monitor01.psi.ch/d/a-TsfGpZk/merlin5-computing-cpu-nodes?refresh=10s&orgId=1)