You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org
Portal:Cloud VPS/Admin/Runbooks/CephClusterInWarning
< Portal:Cloud VPS | Admin | Runbooks
Jump to navigation
Jump to search
The procedures in this runbook require admin permissions to complete.
Error / Incident
The ceph cluster is in warning status, this means that it's not highly available anymore or something might be affecting it's performance, but the cluster is still up and running.
Debugging
See Portal:Cloud VPS/Admin/Runbooks/CephClusterInError for debugging and details.
Support contacts
Usually anyone in the WMCS team should be able to help/debug the issue, subject matter experts (SMEs) would be Andrew Bogott and David Caro.
Related information
- Grafana dashboard: https://grafana.wikimedia.org/d/7TjJENEWz/wmcs-ceph-eqiad-cluster-overview?orgId=1
- Internal documentation: Portal:Cloud_VPS/Admin/Ceph
- Upstream documentation: https://docs.ceph.com/docs/master/rados/operations/monitoring/
Example tasks
- https://phabricator.wikimedia.org/T286649 - OSD daemon crash