You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org
New pages
Jump to navigation
Jump to search
- 15:35, 26 May 2023 Incidents/2023-05-25 eqiad/LVS (hist | edit) [4,862 bytes] imported>JMeybohm (Created page with "{{irdoc|status=draft}} == Summary == {{Incident scorecard | id = eqiad/LVS | task = T337497 | paged-num = 2 | responders-num = 7 | coordinators = Janis | start = 2023-05-25 14:04 | end = 2023-05-25 14:25 | metrics = edits per second, rps in general, 5xx responses from CDN, appserver latency | impact = For approximately 15-20 minutes logged in users connecting to to Wikimedia wikis through our Ashburn datacenter and editors in general may have received 503 errors }} …...")
- 10:30, 26 May 2023 Nova Resource:Tools.lydia/SAL (hist | edit) [88 bytes] imported>Stashbot (wm-bot: <xtex> test)
- 09:12, 26 May 2023 Search Platform/Weekly Updates/2023-05-23 (hist | edit) [1,760 bytes] imported>Gehel (Created page with "= Summary = Another WDQS incident this week disrupted our flow of work. Dealing with page redirect in the context of the Search Update Pipeline is more complex than expected, and involves multiple teams (data engineering, mediawiki core, ML). Hopefully that additional work will benefit more teams, in particular ML. = What we've accomplished = == Improve multilingual zero-results rate == * documentation and some implementation of the framework to evaluate impact https:/...")
- 08:03, 26 May 2023 Wikimedia Cloud Services team/EnhancementProposals/Decision record T336376 Decision request - Find a standard place for all the toolforge-related config files (hist | edit) [2,899 bytes] imported>David Caro (Created page with "'''<big>Origin task</big>''': phab:T336376 '''<big>Date of the decision</big>''': 2023-05-26 '''<big>There was no meeting, decision made in the task.</big>''' == Decision taken == Option 1 was chosen. === Rationale === The option of having a config file per-client instead of mixing all the configurations by default on one file makes it easier to manage with debian packages and being able to fallback to the common one keeps the option to reuse open. == Problem...")
- 07:48, 26 May 2023 Wikimedia Cloud Services team/EnhancementProposals/Decision record T335979 Decision request - Toolforge envvars/secrets service (hist | edit) [3,019 bytes] imported>David Caro (Created page with "'''<big>Origin task</big>''': phab:T335979 '''<big>Date of the decision</big>''': 2023-05-26 '''<big>No meeting, decision made in the task</big>''' == Decision taken == Option 2 was chosen. === Rationale === We opted for a simpler cli that tackles secret and potentially non-secret data by the name of <code>envvars</code>, making sure that there's enough documentation to clear any confusion on the user side on the secrecy of the data. == Problem == Currently use...")
- 07:37, 26 May 2023 Wikimedia Cloud Services team/EnhancementProposals/Decision request - How to provide a way to install system dependencies for buildpack-based images (hist | edit) [2,599 bytes] imported>David Caro
- 02:26, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/3 (hist | edit) [130,197 bytes] imported>Komla Sapaty (→fun)
- 02:10, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/12 (hist | edit) [129,682 bytes] imported>Komla Sapaty (added the batch of tools for december)
- 02:10, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/11 (hist | edit) [129,944 bytes] imported>Komla Sapaty (added the batch of tools for november)
- 02:09, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/10 (hist | edit) [130,038 bytes] imported>Komla Sapaty (added the batch of tools for october)
- 02:09, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/9 (hist | edit) [130,214 bytes] imported>Komla Sapaty (added the batch of tools for september)
- 02:08, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/8 (hist | edit) [130,200 bytes] imported>Komla Sapaty (added the batch of tools for august)
- 02:06, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/7 (hist | edit) [129,832 bytes] imported>Komla Sapaty (added the batch of tools for july)
- 02:05, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/6 (hist | edit) [129,996 bytes] imported>Komla Sapaty (added the batch of tools for june)
- 02:04, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/5 (hist | edit) [129,926 bytes] imported>Komla Sapaty (added the batch of tools for may)
- 02:04, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/4 (hist | edit) [130,136 bytes] imported>Komla Sapaty (added the batch of tools for april)
- 01:58, 26 May 2023 Portal:Toolforge/Tool sweep/Lists/2 (hist | edit) [130,586 bytes] imported>Komla Sapaty (added february list of tools)
- 19:56, 24 May 2023 MediaWiki Event Enrichment (hist | edit) [6,636 bytes] imported>Ottomata (Link to Event Platform/Event Utilities)
- 17:51, 24 May 2023 Nova Resource:Lutz/SAL (hist | edit) [349 bytes] imported>Stashbot (wm-bot2: added user chicocvenancio to the project as member - cookbook ran by andrew@bullseye)
- 17:47, 24 May 2023 Nova Resource:Lutz (hist | edit) [109 bytes] imported>Labslogbot (Auto update of instance info.)
- 21:49, 23 May 2023 Incidents/2023-05-23 wdqs CODFW 5xx errors (hist | edit) [4,887 bytes] imported>Bking
- 21:10, 19 May 2023 Incidents/2023-05-19 videoscaler/jobrunner (hist | edit) [5,256 bytes] imported>Dzahn (→Timeline)
- 20:49, 19 May 2023 Incident response/Process improvement (hist | edit) [35,550 bytes] imported>BCornwall (BCornwall moved page Incident Response Process Improvement to Incident response/Process improvement: Moving under the larger incident response topic)
- 20:49, 19 May 2023 Incident response/Process improvement/Definition of an incident (hist | edit) [2,516 bytes] imported>BCornwall (BCornwall moved page Incident Response Process Improvement/Definition of an incident to Incident response/Process improvement/Definition of an incident: Moving under the larger incident response topic)
- 20:15, 19 May 2023 Incident homepage test (hist | edit) [930 bytes] imported>Fabfur
- 18:31, 19 May 2023 Nova Resource:Tools.ldap-beta/SAL (hist | edit) [122 bytes] imported>Stashbot (wm-bot: <legoktm> Move to buildpack webservice image!)
- 13:44, 19 May 2023 Nova Resource:Tools.lucaswerkmeister-wmde-test/SAL (hist | edit) [1,661 bytes] imported>Stashbot (wm-bot: <lucaswerkmeister-wmde> deployed 4d46d0cd73 (Python 3.11.3 :o ))
- 11:45, 19 May 2023 WMDE/Wikidata/Convert Properties (hist | edit) [1,130 bytes] imported>Itamar Givon (Add preparation section)
- 20:50, 18 May 2023 Search Platform/Weekly Updates/2023-05-18 (hist | edit) [972 bytes] imported>Bking (Created page with "= Summary = Working on the post-mortem of the WDQS outage, Search Update pipeline, and optimizing Wikibase index settings. = What we've accomplished = == Search - Analysis == * Continuing data analysis for apostrohpe-like characters (T315118). There are 22 candidate characters, and they get treated differently by different tokenizers (the Hebrew tokenizer straight up converts 5 of them to apostrophes—including Hebrew geresh—which I never noticed before!) and by ICU...")
- 08:40, 18 May 2023 WMDE/Wikidata/Enable Client (hist | edit) [4,186 bytes] imported>Itamar Givon (→Checklist: Formatting fixes)
- 07:50, 18 May 2023 WMDE/Wikidata/Maintenance (hist | edit) [278 bytes] imported>Itamar Givon (Created page with "This page lists various common maintenance and content infrastructure tasks to be performed on Wikidata: * Enable a Wikidata Client * Update Property Suggester")
- 16:07, 17 May 2023 Miscweb/Kubernetes migration steps (hist | edit) [5,151 bytes] imported>Dzahn (Dzahn moved page Miscweb/Kubernetes migation steps to Miscweb/Kubernetes migration steps: typo)
- 18:15, 16 May 2023 Nova Resource:Tools.wikiconcursos/SAL (hist | edit) [186 bytes] imported>Stashbot (wm-bot: <root> Hard stop/start after reports on IRC that maintainer was having trouble getting the webservice running)
- 09:15, 12 May 2023 Search Platform/Weekly Updates/2023-05-12 (hist | edit) [2,538 bytes] imported>Gehel (Created page with "= Summary = We had a major lag issue on Wikidata Query Service codfw cluster not being updated (see below for details). This took significant time and focus to resolve. Work on the Search Update pipeline continues, with conversations with other consumers of the event stream to implement changes that are needed for search. = What we've accomplished = == Search - Analysis == * Starting work on putting in place the required infrastructure to measure the planned improvement...")
- 21:31, 11 May 2023 Deployments/Archive/2023/05 (hist | edit) [79,944 bytes] imported>Thcipriani (Created page with " ==Week of May 01== ==={{Deployment_day|date=2023-04-30}}=== {{Deployment calendar event card |when=2023-04-30 00:00 SF |length=24 |window=No deploys all day! See Deployments/Emergencies if things are broken. |who= |what=No Deploys }} ==={{Deployment_day|date=2023-05-01}}=== {{Deployment calendar event card |when=2023-05-01 00:00 SF |length=1 |window=UTC morning backport window<br/><small>'''Your patch may or may...")
- 21:28, 11 May 2023 Incidents/2023-05-05 wdqs not updating in codfw (hist | edit) [5,737 bytes] imported>Bking (timeline)
- 18:36, 10 May 2023 Fundraising/Team processes/How we use Phabricator (hist | edit) [9,034 bytes] imported>Dwisehaupt (Add the fr-tech 'how we use phabricator page')
- 16:45, 10 May 2023 Data Engineering/Systems/Airflow/Upgrading (hist | edit) [3,670 bytes] imported>Btullis
- 16:43, 10 May 2023 Portal:Toolforge/Admin/Runbooks/TektonUpMetricUnknown (hist | edit) [1,911 bytes] imported>Raymond Ndibe (Created page with "=== Overview === This happens when prometheus has no data from k8s on the tekton-pipelines-controller pod. {{Remark|The procedures in this runbook require admin permissions to complete.|reminder}} === Error / Incident === This usually comes in the form of an [https://prometheus-alerts.wmcloud.org/?q=alertname%3DTektonUpMetricUnknown alert in alertmanager]. There you will get which project (tools, toolsbeta, ...) is the one it's failing for. === Debugging === This i...")
- 23:35, 9 May 2023 Portal:Toolforge/Admin/Runbooks/TektonDown (hist | edit) [2,036 bytes] imported>Raymond Ndibe (Created page with "=== Overview === This is when the tekton-pipelines-controller pod in the tekton-pipelines namespace of tools/toolsbeta k8s cluster is down or can't be reached. {{Remark|The procedures in this runbook require admin permissions to complete.|reminder}} === Error / Incident === This usually comes in the form of an [https://prometheus-alerts.wmcloud.org/?q=alertname%3DTektonDown alert in alertmanager]. There you will get which project (tools, toolsbeta, ...) is the one it...")
- 19:56, 9 May 2023 Nova Resource:Tools.wp-trending/SAL (hist | edit) [328 bytes] imported>Stashbot (wm-bot: <bd808> Updated to 105800e)
- 19:09, 9 May 2023 Incidents/2023-05-05 prometheus down in ulsfo and eqsin (hist | edit) [5,810 bytes] imported>Andrea Denisse (→Timeline)
- 14:03, 9 May 2023 IPoid (hist | edit) [89 bytes] imported>Effie Mouzeli
- 00:35, 9 May 2023 Nova Resource:Tools.ifttt-bd808/SAL (hist | edit) [171 bytes] imported>Stashbot (wm-bot: <bd808> Switched deployment to the new https://gitlab.wikimedia.org/toolforge-repos/ifttt repo)
- 12:57, 8 May 2023 Puppet/Runbooks (hist | edit) [932 bytes] imported>Jbond
- 10:22, 7 May 2023 Tool:Masto-collab (hist | edit) [1,987 bytes] imported>Peachey88 (Update license key to what Module:Tool expects (although its not a valid spdx code))
- 20:40, 6 May 2023 Nova Resource:Imagebulk (hist | edit) [114 bytes] imported>Labslogbot (Auto update of instance info.)
- 22:20, 5 May 2023 Incidents/2023-05-05 prometheus ulsfo and eqsin (hist | edit) [5,661 bytes] imported>Andrea Denisse (→Timeline)
- 15:57, 5 May 2023 Sandbox-oncallregions (hist | edit) [8,614 bytes] imported>Onfirebot (updating roster table)
- 14:04, 5 May 2023 Portal:Toolforge/Admin/Runbooks/HarborProbeUnknown (hist | edit) [1,566 bytes] imported>David Caro (Created page with "=== Overview === This happens when prometheus has no data from the blackbox exported on the harbor instance for the project. {{Remark|The procedures in this runbook require admin permissions to complete.|reminder}} === Error / Incident === This usually comes in the form of an [https://prometheus-alerts.wmcloud.org/?q=alertname%3DHarborProbeUnknown alert in alertmanager]. There you will get which project (tools, toolsbeta, ...) is the one it's failing for. === Debugg...")