
MariaDB/monitoring


Icinga

Example of Icinga checks

TBD

Metrics

Prometheus

The hosts monitored by Prometheus are controlled by the instance and server inventory in Zarcillo (db1115). To update Prometheus, insert, update or delete the hosts in Zarcillo, then run /usr/local/sbin/mysqld_exporter_config.py on the Prometheus hosts (e.g. prometheus1003 and prometheus1004 for eqiad). This script also runs automatically every 30 minutes to check for changes.

The standard mysqld-prometheus-exporter is used for most metrics.
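
The inventory-to-targets step described above can be illustrated with a minimal sketch. This is not the actual mysqld_exporter_config.py, whose internals are not shown on this page. It assumes the inventory has already been read into a list of rows, and that the output is a Prometheus file-based service discovery document (a list of groups with "targets" and "labels"). The function name, the row keys, the label names, the example hostnames and the use of the exporter's default port 9104 are all assumptions made for illustration.

```python
import json
from collections import defaultdict

# Assumption: mysqld_exporter's default listen port; real instances may differ.
EXPORTER_PORT = 9104


def build_file_sd_targets(instances, exporter_port=EXPORTER_PORT):
    """Group inventory rows into Prometheus file_sd target groups.

    `instances` is a hypothetical list of dicts with "hostname",
    "section" and "role" keys; the real inventory schema may differ.
    """
    groups = defaultdict(list)
    for row in instances:
        key = (row["section"], row["role"])
        groups[key].append(f'{row["hostname"]}:{exporter_port}')
    # One target group per (section, role), in a stable order so that
    # regenerating the file without inventory changes yields no diff.
    return [
        {"targets": sorted(targets), "labels": {"shard": section, "role": role}}
        for (section, role), targets in sorted(groups.items())
    ]


# Hypothetical inventory rows, standing in for a query against the inventory.
inventory = [
    {"hostname": "dbexample1001", "section": "s1", "role": "master"},
    {"hostname": "dbexample1002", "section": "s1", "role": "replica"},
    {"hostname": "dbexample1003", "section": "s1", "role": "replica"},
    {"hostname": "dbexample1004", "section": "s2", "role": "master"},
]

file_sd = build_file_sd_targets(inventory)
print(json.dumps(file_sd, indent=2))
```

Sorting both the groups and the targets keeps the generated output deterministic, which makes it cheap for a periodic run to detect whether anything actually changed.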

Grafana

Connection anomaly detected through Prometheus metrics+Grafana

Relevant dashboards:

Tendril

Tendril slow query log

Tendril predates the Prometheus installation. TBD


Logstash/Kibana

MediaWiki database errors

TBD



This page is a part of the SRE Data Persistence technical documentation
(go here for a list of all our pages)