You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Difference between revisions of "SRE/Observability/Documentation"

From Wikitech-static
Jump to navigation Jump to search
imported>Jobo
m
imported>LMata
(moved resources from main page)
 
Line 2: Line 2:


== SRE [[:Category:SRE Observability|Observability]] documentation ==
== SRE [[:Category:SRE Observability|Observability]] documentation ==
<categorytree mode=pages>SRE Observability</categorytree>
<categorytree mode=pages>SRE Observability</categorytree>The starting point for observability resources at Wikimedia SRE.
 
===Alerts===
*[https://icinga.wikimedia.org/alerts icinga.w.o/alerts]: central monitoring and alerting platform. See also [[Icinga]].
*[https://upload.wikimedia.org/wikipedia/labs/0/0a/Alerting_Infrastructure_design_document_%26_roadmap.pdf Alerting infrastructure roadmap] PDF
===Logs===
*[https://logstash.wikimedia.org/app/kibana Kibana] (a.k.a. logstash): central logging platform. See also [[Logstash]].
*[https://upload.wikimedia.org/wikipedia/labs/5/58/Logging_infrastructure_design_document.pdf Logging infrastructure design document] PDF
===Metrics===
*[https://grafana.wikimedia.org/ grafana.w.o]: central observability platform. See also [[Grafana.wikimedia.org|Grafana]].
*[[Prometheus]], recommended and supported metrics toolkit
*[[Graphite]], supported but deprecated time series framework
*[[Statsd]], supported but deprecated metrics aggregation
*[[Observability/Dashboard_guidelines]], ideas towards better dashboards

Latest revision as of 15:40, 12 July 2021

SRE Observability documentation

The starting point for observability resources at Wikimedia SRE.

Alerts

Logs

Metrics