Cheat sheet: Pedago Watch (Monitoring)
Quick reference for the monitoring stack.
Quick access
Services, ports, and containers
| Service | Role | Host port | Container port |
|---|---|---|---|
| Uptime Kuma | HTTP/TCP monitoring | 127.0.0.1:6001 | 3001 |
| Grafana | Dashboards + alerts | 127.0.0.1:6050 | 3000 |
| Loki | Log storage | 127.0.0.1:6100 | 3100 |
| Promtail | Docker log collection → Loki | n/a | n/a |
Loki log retention: 30 days
VPS path
/root/projects/pedago-tools-monitoring/
Essential commands
Health checks
# All services
docker compose ps
# Individual health checks
curl -s http://127.0.0.1:6050/api/health # Grafana
curl -s http://127.0.0.1:6001 # Uptime Kuma
curl -s http://127.0.0.1:6100/ready # Loki
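The three probes above can be wrapped into one pass that prints a verdict per service. This is a minimal sketch, not part of the stack: the function names `classify_status`, `check_endpoint` and `check_all` are hypothetical, and treating any 2xx or 3xx response as healthy is an assumption (Uptime Kuma may answer its root URL with a redirect).

```shell
# Hypothetical helpers: probe each endpoint from the port table.

# Map an HTTP status code to a short verdict.
# curl reports 000 when it received no HTTP response at all.
classify_status() {
  case "$1" in
    2??|3??) echo "OK" ;;
    000)     echo "DOWN" ;;
    *)       echo "FAIL" ;;
  esac
}

# Probe one URL and print "name verdict (code)".
check_endpoint() {
  local name="$1" url="$2" code
  code=$(curl -s -o /dev/null -w '%{http_code}' --max-time 5 "$url" 2>/dev/null) || code="000"
  printf '%-12s %s (%s)\n' "$name" "$(classify_status "$code")" "$code"
}

# Probe the three services that publish a host port.
check_all() {
  check_endpoint "Grafana"     "http://127.0.0.1:6050/api/health"
  check_endpoint "Uptime Kuma" "http://127.0.0.1:6001"
  check_endpoint "Loki"        "http://127.0.0.1:6100/ready"
}
```

After sourcing the snippet, run `check_all` on the VPS. The 5-second timeout is an arbitrary example value.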
Logs
cd /root/projects/pedago-tools-monitoring
docker compose logs grafana --tail 20
docker compose logs loki --tail 20
docker compose logs promtail --tail 20
docker compose logs uptime-kuma --tail 20
docker compose logs --tail 10 # all services
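Verify the Promtail → Loki pipeline
# Labels available in Loki
curl -s "http://127.0.0.1:6100/loki/api/v1/labels" | python3 -m json.tool
# Promtail targets (works only if Promtail's port 9080 is reachable from the host; the port table lists no published port)
curl -s http://127.0.0.1:9080/targets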
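Listing label names only shows that Loki has an index. Listing the values of one label shows which containers Promtail is actually shipping. The sketch below uses Loki's label-values endpoint. The `LOKI_URL` variable and both function names are hypothetical, and `compose_service` is assumed to be a label here only because the LogQL queries in this sheet filter on it.

```shell
# Base URL taken from the port table; export LOKI_URL to override (hypothetical variable).
LOKI_URL="${LOKI_URL:-http://127.0.0.1:6100}"

# Build the URL that lists every stored value of one label.
label_values_url() {
  printf '%s/loki/api/v1/label/%s/values\n' "$LOKI_URL" "$1"
}

# Fetch the values Loki holds for a label and print the raw JSON.
loki_label_values() {
  curl -s --max-time 5 "$(label_values_url "$1")"
}
```

For example, `loki_label_values compose_service | python3 -m json.tool` should list each Compose service whose logs have reached Loki. A missing service points at Promtail's discovery or relabeling rather than at Loki.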
Service management
cd /root/projects/pedago-tools-monitoring
docker compose up -d
docker compose down
docker compose restart grafana
docker compose restart uptime-kuma
Disk space and cleanup
# Disk usage
df -h
docker system df
docker system df -v | grep -E "pedago|monitoring"
# Image cleanup
docker image prune -f # dangling images only
docker image prune -a --filter "until=168h" # all unused images older than 7 days; asks for confirmation
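To prune only when the disk is filling up, the `df` check can be turned into a threshold test. This is a sketch under assumptions: the function names are hypothetical, the 80% default is an arbitrary example value, and the parsing relies on the POSIX output format of `df -P`, where the fifth column is the capacity.

```shell
# Print the use percentage (digits only) of the filesystem holding a path.
disk_use_percent() {
  df -P "${1:-/}" | awk 'NR==2 { gsub("%", "", $5); print $5 }'
}

# Return 0 when usage is at or above the threshold.
# $1: path (default /), $2: threshold in percent (default 80).
disk_over_threshold() {
  local used
  used=$(disk_use_percent "${1:-/}")
  [ "$used" -ge "${2:-80}" ]
}
```

A possible use is `disk_over_threshold / 80 && docker image prune -f`, which runs the prune only when the root filesystem is at least 80% full.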
Volume backups
# Confirm the exact volume names with `docker volume ls` first
# Uptime Kuma
docker run --rm -v monitoring_uptime-kuma-data:/data -v "$(pwd)":/backup \
alpine tar czf /backup/uptime-kuma-backup.tar.gz /data
# Grafana
docker run --rm -v monitoring_grafana-data:/data -v "$(pwd)":/backup \
alpine tar czf /backup/grafana-backup.tar.gz /data
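Running the commands above twice overwrites the previous archive. One way to keep several generations is to put the date in the file name, as in the sketch below. The function names and the naming scheme are hypothetical. Unlike the commands above, this variant mounts the volume read-only and archives relative paths with `-C /data .`, so the layout inside the archive differs.

```shell
# Build a dated archive name, e.g. grafana-backup-20250131.tar.gz.
# $1: short service name, $2: optional date stamp (defaults to today).
backup_name() {
  printf '%s-backup-%s.tar.gz\n' "$1" "${2:-$(date +%Y%m%d)}"
}

# Archive one Docker volume into the current directory.
# $1: short service name used in the file name, $2: full volume name.
backup_volume() {
  local archive
  archive=$(backup_name "$1")
  docker run --rm -v "$2":/data:ro -v "$(pwd)":/backup \
    alpine tar czf "/backup/$archive" -C /data .
}
```

For example, `backup_volume grafana monitoring_grafana-data` writes a dated Grafana archive next to the Compose file.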
CI/CD pipelines
gh run list --repo EpitechAfrik/pedago-tools-monitoring
gh run view RUN_ID --repo EpitechAfrik/pedago-tools-monitoring --log-failed
Docker volumes
| Volume | Contents |
|---|---|
| uptime-kuma-data | Uptime Kuma config + history |
| grafana-data | Grafana dashboards + config |
| loki-data | Indexed logs |
Useful LogQL queries (Grafana)
# Alert-Parent backend errors
{compose_service="backend"} |= "ERROR"
# Mail errors
{compose_service="backend"} |= "Mail loop error"
# EPIDOCS Django errors
{compose_service="web"} |= "ERROR"
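The same queries can be run from the VPS shell when Grafana is unavailable, by calling Loki's `query_range` endpoint directly. This is a minimal sketch: `LOKI_URL`, `error_query` and `loki_query` are hypothetical names, and the default limit of 20 lines is an arbitrary example value. With no explicit time range, Loki searches the last hour.

```shell
# Base URL taken from the port table; export LOKI_URL to override (hypothetical variable).
LOKI_URL="${LOKI_URL:-http://127.0.0.1:6100}"

# Build a line-filter query for one Compose service.
# $1: compose_service value, $2: text to search for (default ERROR).
error_query() {
  printf '{compose_service="%s"} |= "%s"\n' "$1" "${2:-ERROR}"
}

# Run a LogQL query against query_range and print the raw JSON.
# $1: LogQL query, $2: maximum number of lines (default 20).
loki_query() {
  curl -s -G --max-time 10 "$LOKI_URL/loki/api/v1/query_range" \
    --data-urlencode "query=$1" \
    --data-urlencode "limit=${2:-20}"
}
```

For example, `loki_query "$(error_query backend 'Mail loop error')" | python3 -m json.tool` returns the most recent mail errors from the backend service.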