You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

SRE/Observability

From Wikitech-static
< SRE
Revision as of 13:50, 21 June 2021 by imported>Jobo
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
SRE Observability

SRE Observability - Monitoring and Logging (Prometheus/Grafana and ElasticSearch, plus some Kafka).

The Observability team, or "o11y" for short, works across SRE and Technology to provide teams with tools, platforms and insights into how systems and services are performing. It leverages technologies such as Grafana, Kibana/Logstash, Prometheus, AlertManager and more.

The starting point for observability resources at Wikimedia SRE.

Alerts

Logs

Metrics