You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

MariaDB/monitoring: Difference between revisions

From Wikitech-static
Jump to navigation Jump to search
imported>LSobanski
No edit summary
imported>Tim Starling
(→‎Grafana: new dashboard)
 
(One intermediate revision by one other user not shown)
Line 16: Line 16:
* MySQL instance: https://grafana.wikimedia.org/d/000000273/mysql
* MySQL instance: https://grafana.wikimedia.org/d/000000273/mysql
* Replication lag: https://grafana.wikimedia.org/d/000000303/mysql-replication-lag
* Replication lag: https://grafana.wikimedia.org/d/000000303/mysql-replication-lag
* Mediawiki MySQL Loadbalancer: https://grafana.wikimedia.org/d/000000363/mediawiki-mysql-loadbalancer
* MediaWiki LoadBalancer: https://grafana.wikimedia.org/d/G9kbQdRVz/mediawiki-loadbalancer


== Tendril ==
== Orchestrator ==
[[File:Screenshot from 2017-03-21 17-57-04.png|thumb|right|Tendril slow query log]]
[[Orchestrator]]
[[Tendril]] predates Prometheus installation.
TBD <br/><br/><br/>


== Logstash/Kibana ==
== Logstash/Kibana ==

Latest revision as of 06:04, 28 July 2022

Icinga

Example of icinga checks

TBD

Metrics

Prometheus

Hosts to be monitored at prometheus are controlled by the instance and server inventory at Zarcillo (db1115). In order to update prometheus, hosts have to be inserted, updated or deleted from Zarcillo and then run /usr/local/sbin/mysqld_exporter_config.py on the prometheus hosts (e.g. prometheus1003 and prometheus1004 for eqiad). This scripts run automatically every 30 minutes to check for changes.

The standard mysqld-prometheus-exporter is used for most metrics.

Grafana

Connection anomaly detected through Prometheus metrics+Grafana

Relevant dashboards:

Orchestrator

Orchestrator

Logstash/Kibana

Mediawiki database errors

TBD



This page is a part of the SRE Data Persistence technical documentation
(go here for a list of all our pages)