Cheat sheet: Pedago Watch (Monitoring)

Quick reference for the monitoring stack.

Quick access

| Service     | URL                                 |
| Uptime Kuma | uptime.epitools.bj                  |
| Grafana     | grafana.epitools.bj                 |
| GitHub repo | EpitechAfrik/pedago-tools-monitoring |

Services, ports, and containers

| Service     | Role                              | Host address:port | Container port |
| Uptime Kuma | HTTP/TCP monitoring               | 127.0.0.1:6001    | 3001           |
| Grafana     | Dashboards + alerts               | 127.0.0.1:6050    | 3000           |
| Loki        | Log storage                       | 127.0.0.1:6100    | 3100           |
| Promtail    | Collects Docker logs, ships to Loki | (none listed)   | (none listed)  |

Loki log retention: 30 days

VPS path

/root/projects/pedago-tools-monitoring/

Essential commands

Health checks

# All services (run from /root/projects/pedago-tools-monitoring)
docker compose ps

# Individual health checks
curl -s http://127.0.0.1:6050/api/health   # Grafana
curl -s http://127.0.0.1:6001              # Uptime Kuma
curl -s http://127.0.0.1:6100/ready        # Loki
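
The three checks above can be wrapped into one pass that prints a verdict per service. This is a minimal sketch, not part of the repository: the function names are hypothetical, and treating 3xx responses as healthy is an assumption made so that a root URL answering with a redirect does not count as a failure.

```shell
# Map an HTTP status code to a verdict; 2xx and 3xx both count as healthy here (assumption).
http_verdict() {
  case "$1" in
    2??|3??) echo "OK" ;;
    *)       echo "FAIL" ;;
  esac
}

# Print "<name> <verdict> (<code>)"; curl reports 000 when it cannot connect.
# $1 = service name, $2 = URL to probe (5 s timeout).
check_service() {
  code="$(curl -s -o /dev/null -m 5 -w '%{http_code}' "$2")"
  echo "$1 $(http_verdict "$code") ($code)"
}

# Probe the three endpoints listed in this cheat sheet.
check_all() {
  check_service grafana     http://127.0.0.1:6050/api/health
  check_service uptime-kuma http://127.0.0.1:6001
  check_service loki        http://127.0.0.1:6100/ready
}
```

After sourcing the file on the VPS, running `check_all` prints one line per service.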

Logs

cd /root/projects/pedago-tools-monitoring

docker compose logs grafana --tail 20
docker compose logs loki --tail 20
docker compose logs promtail --tail 20
docker compose logs uptime-kuma --tail 20
docker compose logs --tail 10   # all services

Check the Promtail → Loki chain

# Labels available in Loki
curl -s "http://127.0.0.1:6100/loki/api/v1/labels" | python3 -m json.tool

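# Promtail targets (needs Promtail's HTTP port 9080 reachable from the host;
# the ports table lists no host port for Promtail)
curl -s http://127.0.0.1:9080/targets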
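
Seeing labels in Loki does not prove that fresh logs are still arriving, so it can help to pull the last few lines for one service. The sketch below assumes the `compose_service` label used in the LogQL queries on this page and relies on Loki's `/loki/api/v1/label/<name>/values` and `/loki/api/v1/query_range` endpoints. The function names and the 5-minute window are illustrative choices.

```shell
# Nanosecond Unix timestamp for "N minutes ago" (Loki accepts nanosecond epochs).
ns_minutes_ago() {
  echo "$(( $(date +%s) - $1 * 60 ))000000000"
}

# List the values Loki knows for the compose_service label.
loki_services() {
  curl -s "http://127.0.0.1:6100/loki/api/v1/label/compose_service/values"
}

# Fetch up to 5 log lines from the last 5 minutes for one service.
# $1 = compose_service value, for example "backend".
loki_recent() {
  curl -s -G "http://127.0.0.1:6100/loki/api/v1/query_range" \
    --data-urlencode "query={compose_service=\"$1\"}" \
    --data-urlencode "start=$(ns_minutes_ago 5)" \
    --data-urlencode "limit=5"
}
```

For example, `loki_recent backend | python3 -m json.tool` shows the raw JSON; an empty `result` array for a service that is actively logging points at a problem on the Promtail side.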

Service management

cd /root/projects/pedago-tools-monitoring

docker compose up -d
docker compose down
docker compose restart grafana
docker compose restart uptime-kuma

Disk space and cleanup

# Disk space
df -h
docker system df
docker system df -v | grep -E "pedago|monitoring"

# Image cleanup
docker image prune -f                         # dangling images only, no prompt
docker image prune -a --filter "until=168h"   # all unused images older than 7 days (asks for confirmation)
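
The disk check and the cleanup can be combined so that pruning only happens when the disk is actually filling up. This is a sketch under stated assumptions: the 80% default threshold and the function names are hypothetical, and only the root filesystem is inspected.

```shell
# Read `df -P` output on stdin and print the use percentage of the first
# filesystem row, without the % sign.
df_use_pct() {
  awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

# Prune dangling images only when / is at or above the threshold.
# $1 = threshold in percent (defaults to 80, an arbitrary choice).
prune_if_full() {
  pct="$(df -P / | df_use_pct)"
  if [ "$pct" -ge "${1:-80}" ]; then
    docker image prune -f
  else
    echo "disk at ${pct}%, nothing to do"
  fi
}
```

Calling `prune_if_full 90` raises the threshold to 90%.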

Volume backups

# The monitoring_ prefix is the Compose project name; confirm the real
# volume names with `docker volume ls` before running these.

# Uptime Kuma
docker run --rm -v monitoring_uptime-kuma-data:/data -v "$(pwd)":/backup \
  alpine tar czf /backup/uptime-kuma-backup.tar.gz /data

# Grafana
docker run --rm -v monitoring_grafana-data:/data -v "$(pwd)":/backup \
  alpine tar czf /backup/grafana-backup.tar.gz /data
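
The commands above write to a fixed file name, so each run overwrites the previous archive. A possible variant, sketched here with hypothetical function names, adds a date stamp and lists the archive afterwards to confirm it is readable. The volume is mounted read-only, which is an extra precaution and not something the original commands do.

```shell
# Build a dated archive name, e.g. grafana-backup-2024-01-02.tar.gz.
# $1 = short service name, $2 = optional date (defaults to today).
backup_name() {
  echo "$1-backup-${2:-$(date +%Y-%m-%d)}.tar.gz"
}

# Archive one named volume into the current directory, then verify the archive.
# $1 = Docker volume name, $2 = short service name used in the file name.
backup_volume() {
  archive="$(backup_name "$2")"
  docker run --rm -v "$1":/data:ro -v "$(pwd)":/backup \
    alpine tar czf "/backup/$archive" -C / data \
    && tar tzf "$archive" > /dev/null \
    && echo "ok: $archive"
}
```

For example: `backup_volume monitoring_grafana-data grafana`.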

CI/CD pipelines

gh run list --repo EpitechAfrik/pedago-tools-monitoring
gh run view RUN_ID --repo EpitechAfrik/pedago-tools-monitoring --log-failed

Docker volumes

| Volume           | Contents                     |
| uptime-kuma-data | Uptime Kuma config + history |
| grafana-data     | Grafana dashboards + config  |
| loki-data        | Indexed logs                 |

Useful LogQL queries (Grafana)

# Alert-Parent backend errors
{compose_service="backend"} |= "ERROR"

# Mail errors
{compose_service="backend"} |= "Mail loop error"

# EPIDOCS Django errors
{compose_service="web"} |= "ERROR"