infrastructure:monitoring

Differences

This shows you the differences between two versions of the page.

infrastructure:monitoring [2026/02/07 09:02] – Created monitoring documentation (admin)
infrastructure:monitoring [2026/02/07 10:39] (current) – Simplified (admin)
Line 1: Line 1:
-====== Monitoring (Prometheus/Grafana) ======
+====== Monitoring ======

-Container 115 on hv-04: central monitoring for the infrastructure.
+Prometheus + Grafana on CT 115 (10.100.161.115)
-
-===== Access =====

   * **Grafana:** [[https://monitoring.home.miskam.xyz]]
-  * **Prometheus:** http://10.100.161.115:9090 (internal)
+  * **Prometheus:** http://10.100.161.115:9090
-  * **Credentials:** In Vaultwarden (Network Automation → Grafana Admin)
-
-===== Components =====
-
-^ Service ^ Port ^ Path ^
-| Prometheus | 9090 | /usr/local/bin/prometheus |
-| Grafana | 3000 | /usr/share/grafana |
-| node_exporter | 9100 | /usr/local/bin/node_exporter |
-
-===== Prometheus Config =====
-
-<code yaml /etc/prometheus/prometheus.yml>
-global:
-  scrape_interval: 15s
-
-scrape_configs:
-  - job_name: 'prometheus'
-    static_configs:
-      - targets: ['localhost:9090']
-
-  - job_name: 'node'
-    static_configs:
-      - targets: ['10.100.161.115:9100']
-        labels: {instance: prometheus}
-      - targets: ['10.100.160.249:9100']
-        labels: {instance: openclaw}
-      # ... further hosts
-</code>
-
-===== Scrape Targets =====
-
-^ Host ^ IP ^ Port ^
-| prometheus | 10.100.161.115 | 9100 |
-| openclaw | 10.100.160.249 | 9100 |
-| n8n | 10.100.161.111 | 9100 |
-| vaultwarden | 10.100.161.112 | 9100 |
-| postgres | 10.100.161.110 | 9100 |
-| dokuwiki | 10.100.161.113 | 9100 |
-| uptime-kuma | 10.100.161.114 | 9100 |
-| traefik | 10.100.161.102 | 9100 |
-| hv-04 | 10.100.160.100 | 9100 |
-
-===== node_exporter Installation =====
-
-Installed on all hosts via Ansible:
-
-<code bash>
-cd /root/.openclaw/workspace/automation-stack/ansible
-ansible-playbook site.yml --limit automation_stack
-</code>
-
-Manual:
-<code bash>
-# Download
-wget https://github.com/prometheus/node_exporter/releases/download/v1.7.0/node_exporter-1.7.0.linux-amd64.tar.gz
-tar xzf node_exporter-1.7.0.linux-amd64.tar.gz
-cp node_exporter-1.7.0.linux-amd64/node_exporter /usr/local/bin/
-
-# Service
-cat > /etc/systemd/system/node_exporter.service << EOF
-[Unit]
-Description=Prometheus Node Exporter
-After=network.target
-
-[Service]
-User=node_exporter
-ExecStart=/usr/local/bin/node_exporter
-Restart=always
-
-[Install]
-WantedBy=multi-user.target
-EOF
-
-useradd -rs /bin/false node_exporter
-systemctl daemon-reload
-systemctl enable --now node_exporter
-</code>
  
-===== Recommended Dashboards =====
+===== Targets =====

-  * **Node Exporter Full** (ID: 1860): comprehensive host metrics
+All hosts run node_exporter on port 9100.
-  * **Node Exporter** (ID: 11074): compact overview

-Import via Grafana: Dashboards → Import → enter ID
+^ Host ^ IP ^
+| prometheus | 10.100.161.115 |
+| openclaw | 10.100.160.249 |
+| n8n | 10.100.161.111 |
+| vaultwarden | 10.100.161.112 |
+| postgres | 10.100.161.110 |
+| dokuwiki | 10.100.161.113 |
+| uptime-kuma | 10.100.161.114 |
+| traefik | 10.100.161.102 |
+| hv-04 | 10.100.160.100 |
  
-----
+===== Dashboards =====

-//Created: 2026-02-07//
+Import via Grafana: **Node Exporter Full** (ID: 1860)
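
The simplified revision keeps the target list but drops the Prometheus configuration. The Targets table still maps directly onto a scrape job. The following is a minimal sketch, assuming the `instance` label scheme from the earlier revision's prometheus.yml is still in use. The `TARGETS` variable and the `generate_node_job` function are illustrative names, not part of the documented setup. The script only prints YAML and does not touch /etc/prometheus/prometheus.yml.

```shell
#!/bin/sh
# Sketch: print a Prometheus 'node' scrape job for the hosts in the Targets table.
set -eu

# Host/IP pairs copied from the Targets table (one "name ip" pair per line).
TARGETS="prometheus 10.100.161.115
openclaw 10.100.160.249
n8n 10.100.161.111
vaultwarden 10.100.161.112
postgres 10.100.161.110
dokuwiki 10.100.161.113
uptime-kuma 10.100.161.114
traefik 10.100.161.102
hv-04 10.100.160.100"

# Hypothetical helper: emits one static_configs entry per host on port 9100.
generate_node_job() {
  printf '%s\n' "  - job_name: 'node'" "    static_configs:"
  printf '%s\n' "$TARGETS" | while read -r name ip; do
    [ -n "$name" ] || continue
    printf "      - targets: ['%s:9100']\n" "$ip"
    printf "        labels: {instance: %s}\n" "$name"
  done
}

generate_node_job
```

The output is indented so that it could be pasted under an existing `scrape_configs:` key; whether the live configuration is actually structured that way is an assumption.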
  
infrastructure/monitoring.1770454958.txt.gz · Last modified: by admin