You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org
When distributing the link to others, prefer including the www. prefix, as that saves an HTTP redirect.
wikimediastatus.net is a public and high-level uptime monitor. It is separated from our production infrastructure and hosted by Atlassian Statuspage.
It was launched in Jan 2022, and is maintained by the SRE team. It is the spiritual successor to status.wikimedia.org, which was hosted by Watchmouse, but no longer under the wikimedia.org domain for security reasons (T293504) and for availability reasons in the event of an outage of Wikimedia DNS and/or our networking infrastructure.
SRE usage instructions
Statograph (automated metrics upload)
|Automatically uploads time-series metrics to the public status page.|
|Puppet classes||Puppet module |
statograph is a tool that uploads timeseries metrics from sources like Prometheus and Graphite to the metrics on your statuspage.io installation.
These metrics are intentionally chosen to be high-level and broad. This means that not only do they show many kinds of possible outages, but also that they are hopefully understandable even to users with limited technical knowledge.
It is executed via a systemd timer that runs once a minute. Runs are idempotent, so this is a simple mechanism to give high availability.
More information on its execution model and on statuspage.io's API can be found in its Uploader class.
Our status page is primarily intended to serve the general public and the news media, although of course we expect community members to also use it as a resource -- although we certainly don't mean to replace, for example, on-wiki technical village pumps. The focus is on very visible/widespread outages.
We selected statuspage.io with the following considerations:
- Because we want the site to be working even in a widespread failure of Wikimedia infrastructure, any solution needs to be hosted externally
- We decided we did not want to take on the engineering effort needed to run scalable external hosting + separate CDN
- There are very few FLOSS status page projects that are more than just "toy" projects, and of those which aren't, even fewer are actively maintained
- statuspage.io had some distinguishing features: not just the basic manually-posted up/down functionality, but also support for automated uploads of timeseries metrics, and SLO-like uptime history on each component
- Launch task: phab:T202061