You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Analytics/Systems/Druid/Alerts

From Wikitech-static
< Analytics‎ | Systems‎ | Druid
Revision as of 12:20, 1 November 2021 by imported>Btullis (Added a target page for the Druid related alerts.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

We have a number of alerts set up in Icinga and Alertmanager that relate to Druid and its ingestion jobs.

This page exists as a set of instructions or runbooks to help identify what courses of action might be needed if one or more of these alerts is triggered.

Druid Netflow Supervisor

This alert triggers if the realtime netflow ingestion job receives below a certain threshold of events, over a 30 minutes period.

The critical value is 0 and the warning value is 30.

The grafana dashboard showing the trend data is here.