You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org
New pages
- 19:54, 5 June 2023 Portal:Cloud VPS/Admin/Openstack roles and policy (hist | edit) [7,402 bytes] imported>Andrew Bogott
- 19:32, 5 June 2023 Portal:Cloud VPS/Admin/Openstack cli (hist | edit) [10,454 bytes] imported>Andrew Bogott
- 17:32, 5 June 2023 Tool:Toolviews (hist | edit) [968 bytes] imported>BryanDavis (Stub out docs for this tool)
- 16:23, 5 June 2023 Page Content Service (hist | edit) [41 bytes] imported>BryanDavis (Removed redirect to Mobileapps (service))
- 16:23, 5 June 2023 PCS (hist | edit) [41 bytes] imported>BryanDavis (Removed redirect to Mobileapps (service))
- 09:40, 4 June 2023 Tool:Itwiki (hist | edit) [2,795 bytes] imported>Valerio Bozzolan (→Known Kubernetes jobs: Help:Toolforge/Jobs framework)
- 13:42, 2 June 2023 Portal:Toolforge/Admin/Runbooks/ToolsToolsDBReplicationLagIsTooHigh (hist | edit) [1,873 bytes] imported>David Caro (Created page with "=== Overview === This happens when the secondary toolsdb host is not able to catch up with the primary one. {{Remark|The procedures in this runbook require admin permissions to complete.|reminder}} === Error / Incident === This usually comes in the form of an [https://prometheus-alerts.wmcloud.org/?q=alertname%3DToolsDBReplicationLagIsTooHigh alert in alertmanager]. There you will get which instances are involved (secondary, primary). === Debugging === You can ssh...")
- 12:46, 2 June 2023 Search Platform/Weekly Updates/2023-06-02 (hist | edit) [2,969 bytes] imported>Gehel (Created page with "= Summary = We spent some time this week to deal with the aftermath of our recent WDQS outages. In particular, we are now equipped to diagnose and block problematic queries faster. Our work on SLOs for Search is unlikely to be completed this quarter. We now have a good working definition of what we want to measure, but we will need help to implement metric collection and create the appropriate dashboards. This is unlikely to be done before the end of the quarter. = Wha...")
- 19:49, 1 June 2023 Analytics/Systems/Cluster/Iceberg (hist | edit) [6,661 bytes] imported>Xcollazo (First dump of Iceberg effort at WMF.)
- 09:43, 1 June 2023 Incidents/2023-05-30 Unintentional +2 on a config patch without deployment (hist | edit) [3,799 bytes] imported>Filippo Giunchedi (Created page with "{{irdoc|status=draft}} == Summary == {{Incident scorecard | task = | paged-num = 0 | responders-num = Amir | coordinators = | start = 2023-05-26 | end = 2023-05-30 | impact = A patch to mediawiki-config was accidentally +2'd and not immediately deployed }} … <!-- Reminder: No private information on this page! --><nowiki>https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/923650</nowiki> was accidentally +2'd and not immediately deployed, leading to conf...")
- 00:21, 1 June 2023 Server Admin Log/Archive 66 (hist | edit) [937,106 bytes] imported>Nhatminh01 (→2023-05-01: add)
- 00:27, 31 May 2023 Fundraising/External-facing/CiviProxy (hist | edit) [1,983 bytes] imported>Cstone (Moved from mediawiki)
- 19:09, 30 May 2023 Event Platform/Stream Processing/Flink (hist | edit) [12,469 bytes] imported>Ottomata (Ottomata moved page Wikidata Query Service/Flink On Kubernetes to Event Platform/Stream Processing/Flink: Flink platform management moving to Event Platform)
- 18:17, 30 May 2023 Event Platform/Stream Processing/Use cases (hist | edit) [8,485 bytes] imported>Ottomata (Moved from https://www.mediawiki.org/wiki/Platform_Engineering_Team/Event_Platform_Value_Stream/Event_Driven_Use_Cases)
- 18:15, 30 May 2023 Event Platform/Stream Processing/Framework Evaluation (hist | edit) [14,878 bytes] imported>Ottomata (Copied in from https://www.mediawiki.org/wiki/Platform_Engineering_Team/Event_Platform_Value_Stream/Stream_Processing_Framework_Evaluation)
- 17:50, 30 May 2023 MediaWiki Event Enrichment/SLO/Mediawiki Page Content Change Enrichment (hist | edit) [3,788 bytes] imported>Ottomata (→Operational)
- 09:42, 29 May 2023 Clinic duty (hist | edit) [95 bytes] imported>Jcrespo (clinic duty link for easier search)
- 15:35, 26 May 2023 Incidents/2023-05-25 eqiad/LVS (hist | edit) [4,862 bytes] imported>JMeybohm (Created page with "{{irdoc|status=draft}} == Summary == {{Incident scorecard | id = eqiad/LVS | task = T337497 | paged-num = 2 | responders-num = 7 | coordinators = Janis | start = 2023-05-25 14:04 | end = 2023-05-25 14:25 | metrics = edits per second, rps in general, 5xx responses from CDN, appserver latency | impact = For approximately 15-20 minutes logged-in users connecting to Wikimedia wikis through our Ashburn datacenter and editors in general may have received 503 errors }} …...")
- 10:30, 26 May 2023 Nova Resource:Tools.lydia/SAL (hist | edit) [88 bytes] imported>Stashbot (wm-bot: <xtex> test)
- 09:12, 26 May 2023 Search Platform/Weekly Updates/2023-05-23 (hist | edit) [1,760 bytes] imported>Gehel (Created page with "= Summary = Another WDQS incident this week disrupted our flow of work. Dealing with page redirect in the context of the Search Update Pipeline is more complex than expected, and involves multiple teams (data engineering, mediawiki core, ML). Hopefully that additional work will benefit more teams, in particular ML. = What we've accomplished = == Improve multilingual zero-results rate == * documentation and some implementation of the framework to evaluate impact https:/...")