You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Release Engineering/SAL: Difference between revisions

From Wikitech-static
Jump to navigation Jump to search
imported>Labslogbot
(Cleanup jobrunner01 logs via -- sudo logrotate --force /etc/logrotate.d/mediawiki_jobrunner (bd808))
imported>Stashbot
(brennen: gitlab repos/releng/scap: added direct membership for some non-releng maintainers who show up frequently/recently in commit log)
 
Line 1: Line 1:
== 2016-07-23 ==
== 2022-11-29 ==
* 20:06 bd808: Cleanup jobrunner01 logs via -- sudo logrotate --force /etc/logrotate.d/mediawiki_jobrunner
* 21:35 brennen: gitlab repos/releng/scap: added direct membership for some non-releng maintainers who show up frequently/recently in commit log
* 20:03 bd808: Deleted jobqueues in redis with no matching wikis: ptwikibooks, labswiki
* 20:33 James_F: Zuul: [wikimedia/wikimania-scholarships] Set as archived for [[phab:T243037|T243037]]
* 19:20 bd808: jobrunner01 spamming /var/log/mediawiki with attempts to process jobs for wiki=labswiki


== 2016-07-22 ==
== 2022-11-27 ==
* 20:26 hashar: T141114 upgraded jenkins-debian-glue from v0.13.0 to v0.17.0  on integration-slave-jessie-1001 and integration-slave-jessie-1002
* 21:27 James_F: Docker: Publishing new php82 images with rc.7 for [[phab:T314093|T314093]]
* 19:07 thcipriani: beta-cluster has successfully used a canary for mediawiki deployments
* 16:53 thcipriani: bumping scap to v.3.2.1 on deployment-tin to test canary deploys, again
* 16:46 thcipriani: rolling back scap version to v.3.2.0
* 16:38 thcipriani: bumping scap to v.3.2.1 on deployment-tin to test canary deploys
* 13:02 hashar: zuul rebased patch queue on tip of upstream branch and force pushed branch. c3d2810...4ddad4e HEAD -> patch-queue/debian/precise-wikimedia (forced update)
* 10:32 hashar: Jenkins restarted and it pooled both integration-slave-jessie-1002  and  integration-slave-trusty-1018
* 10:23 hashar: Jenkins has some random deadlock. Will probably reboot it
* 10:17 hashar: Jenkins can't ssh / add slaves integration-slave-jessie-1002 or  integration-slave-trusty-1018 . Apparently due to some Jenkins deadlock in the ssh slave plugin :-/  Lame way to solve it: restart Jenkins
* 10:10 hashar: rebooting integration-slave-jessie-1002 and integration-slave-trusty-1018 . Hang somehow
* 10:06 hashar: T141083 salt -v '*slave-trusty*' cmd.run 'service mysql start'
* 09:55 hashar: integration-slave-trusty-1001 service mysql start


== 2016-07-21 ==
== 2022-11-25 ==
* 16:11 hashar: Updated our JJB fork cherry picking f74501e781f by madhuvishy.  Was made to support the maven release plugin. Branch bump is 10f2bcd..6fcaf39
* 12:45 hashar: Reloaded Zuul for {{Gerrit|I717ad1fe4ef7b151808b242cdf16f0268c58fbd7}} "add pipelinename to autogenerated:ci tags" # [[phab:T214068|T214068]]
* 16:04 hashar: integration/zuul.git .Updated upstream branch:bc58ea34125f11eb353abc3e5b96ac1efad06141  finally caught up with upstream \O/
* 15:13 hashar: integration/zuul.git .Updated upstream branch:  06770a85fcff810fc3e1673120710100fc7b0601:upstream
* 14:03 hashar: integration/zuul.git bumping upstream branch:  git push d34e0b4:upstream
* 03:18 greg-g: had to do https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code.2Fdb_update twice, seems to be back
* 00:13 bd808: Cherry-picked https://gerrit.wikimedia.org/r/#/c/299825/ to deployment-puppetmaster so wdqs nginx log parsing can be tested


== 2016-07-20 ==
== 2022-11-23 ==
* 13:55 hashar: beta: switching job beta-scap-eqiad to use 'scap sync' per https://gerrit.wikimedia.org/r/#/c/287951/  (poke thcipriani )
* 22:41 urandom: accidentally deleted deployment-sessionstore04
* 12:47 hashar: integration: enabled unattended upgrade on all instances by adding contint::packages::apt to https://wikitech.wikimedia.org/wiki/Hiera:Integration
* 15:07 James_F: Zuul: configure CI for operations/debs/varnish-modules for [[phab:T321309|T321309]]
* 10:28 hashar: beta dropped salt-key on deployment-salt02 for the three instances: deployment-upload.deployment-prep.eqiad.wmflabs , deployment-logstash3.deployment-prep.eqiad.wmflabs and deployment-ores-web.deployment-prep.eqiad.wmflabs
* 10:26 hashar: beta: rebased puppetmaster git repo. "Parsoid: Move to service::node"  has weird conflict https://gerrit.wikimedia.org/r/#/c/298436/
* 10:15 hashar: beta: removing puppet cherry pick of https://gerrit.wikimedia.org/r/#/c/258979/ "mediawiki: add conftool-specifc credentials and scripts"  abandonned/superseeded and caused a conflict
* 08:17 hashar: deployment-fluorine : deleting a puppet lock file /var/lib/puppet/state/agent_catalog_run.lock  (created at 2016-07-18 19:58:46 UTC)
* 01:53 legoktm: deploying https://gerrit.wikimedia.org/r/299930


== 2016-07-18 ==
== 2022-11-22 ==
* 20:56 thcipriani: Deleted deployment-fluorine:/srv/mw-log/archive/*-201605* freed 30 GB
* 21:51 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/859615
* 15:00 hashar: Upgraded Zuul on the Precise slaves to zuul_2.1.0-151-g30a433b-wmf4precise1
* 21:06 TheresNoTime: samtar@deployment-mwmaint02:~$ mwscript extensions/WikimediaMaintenance/createExtensionTables.php zhwiki pagetriage [[phab:T323378|T323378]]
* 12:10 hashar: (restarted qa-morebots)
* 14:11 TheresNoTime: [samtar@deployment-deploy03 ~]$ sudo keyholder arm
* 12:10 hashar: Enabling puppet again on integration-slave-precise-1002 , removing Zuul-server config and adding the slave back in Jenkins pool


== 2016-07-16 ==
== 2022-11-21 ==
* 23:19 paladox: testing morebots
* 14:54 James_F: Zuul: [mediawiki/extensions/WikiLambda] Disable selenium tests for [[phab:T294388|T294388]]
* 14:41 vgutierrez: move deployment-cache-(text{{!}}upload)07 from role::cache::(text{{!}}upload)_haproxy to role::cache::(text{{!}}upload) - [[phab:T323365|T323365]]


== 2016-07-15 ==
== 2022-11-18 ==
* 08:34 hashar: Unpooling integration-slave-precise-1002  will use it as a zuul-server test instance temporarily
* 10:05 hashar: gerrit: change HEAD branch to point to `deploy/wmf/stable-3.5` # [[phab:T307334|T307334]]


== 2016-07-14 ==
== 2022-11-17 ==
* 18:54 ebernhardson: deployment-prep manually edited elasticsearch.yml on deployment-elastic05 and restarted to get it listening on eth0. Still looking into why puppet wrote out wrong config file
* 17:44 taavi: reloading zuul to deploy https://gerrit.wikimedia.org/r/858391
* 09:05 Amir1: rebooting deployment-ores-redis
* 08:29 Amir1: deploying 0e9555f to ores-beta (sca03)


== 2016-07-13 ==
== 2022-11-16 ==
* 16:05 urandom: Installing Cassandra 2.2.6-wmf1 on deployment-restbase0[1-2].deployment-prep.eqiad.wmflabs : T126629
* 20:53 thcipriani: restarting jenkins for update
* 13:58 hashar: T137525 reverted Zuul back to zuul_2.1.0-95-g66c8e52-wmf1precise1_amd64.deb  . It could not connect to Gerrit reliably
* 08:46 hashar: gerrit: reindexed accounts `ssh -p 29418 gerrit.wikimedia.org -- gerrit index start accounts --force` # [[phab:T323135|T323135]]
* 13:46 hashar: T137525 Stopped zuul that ran in a terminal (with -d). Started it with the init script.
* 08:45 hashar: gerrit: deleted 192 LDAP accounts (scheme `gerrit:`) containing upper case characters which had an exact equivalent in an all lower case form. `All-Users.git` commit is {{Gerrit|5e5800ecc8fd5da591567e616898dd6df988c0c8}} # [[phab:T323135|T323135]]
* 11:37 hashar: apt-get upgrade on deployment-mediawiki02
* 08:45 hashar: gerrit: deleted 192 LDAP accounts (scheme `gerrit:`) containing upper case characters which had an exact equivalent in an all lower case form #
* 08:33 hashar: removing deployment-parsoid05 from the Jenkins slaves T140218


== 2016-07-12 ==
== 2022-11-15 ==
* 20:29 hashar: integration: force running unattended upgrade on all instances: salt --batch 4 -v '*' cmd.run 'unattended-upgrade'  . That upgrades diamond and hhvm among others.  imagemagick-common has a prompt though
* 20:21 hashar: gerrit: removed legacy mixed case accounts and moved the extra secondary email to a mailto id for `gerrit:krinkle`, `gerrit:revi`, `gerrit:daniel kinzler`, `gerrit:harej` and `gerrit:samanthanguyen` [[phab:T323135|T323135]]#8397539
* 20:22 hashar: CI force running puppet on all instances: salt --batch 5 -v '*' puppet.run
* 20:20 hashar: gerrit: removed legacy mixed case accounts for `gerrit:Fomafix` and `gerrit:Ricordisamoa` [[phab:T323135|T323135]]#8397539
* 20:04 hashar: Maybe fix unattended upgrade on the CI slaves via https://gerrit.wikimedia.org/r/298568
* 16:25 James_F: Zuul: [mediawiki/services/parsoid] Make MW jobs voting in test
* 16:43 Amir1: deploying f472f65 to ores-beta
* 15:57 James_F: Zuul: [mediawiki/extensions/CampaignEvents] Add Echo as phan dependency for [[phab:T317231|T317231]]
* 10:11 hashar: Github created repos operations-debs-contenttranslation-apertium-mk-en and operations-docker-images-toollabs-images        for Gerrit replication
* 15:24 hashar: gerrit: converted, to all lower case, the Gerrit accounts `username:Kaldari`, `username:Fran McCrory` and `username:SamanthaNguyen`  # [[phab:T323097|T323097]]


== 2016-07-11 ==
== 2022-11-14 ==
* 14:24 hashar: Removing ZeroMQ config from the Jenkins jobs. It is now enabled globally. T139923
* 17:36 hashar: Nuking unused Castor cached files in `/srv/jenkins-workspace/caches` # [[phab:T323051|T323051]]
* 10:16 hashar: T136188: on Trusty slaves, upgrading Chromium from v49 to v51: salt -v '*slave-trusty-*' cmd.run 'apt-get -y install chromium-browser chromium-chromedriver chromium-codecs-ffmpeg-extra'
* 17:35 hashar: Changing Castor cache saving from `/srv/jenkins-workspace/caches/` to `/srv/cache/caches/` which is the one served by rsync [[phab:T323051|T323051]]
* 10:13 hashar: T136188: salt -v '*slave-trusty*' cmd.run 'rm /etc/apt/preferences.d/chromium-*'
* 17:34 hashar: Changing Castor cache saving from `/srv/jenkins-workspace/caches/` to `/srv/cache/caches/` which is the one served by rsync.
* 10:09 hashar: Unpinning Chromium v49 from the Trusty slaves and upgrading to v51 for T136188
* 14:19 James_F: Zuul: [mediawiki/services/function-schemata] Move from node 12 to 16
* 09:34 zeljkof: Enabled ZMQ Event Publisher on all Jobs in Jenkins


== 2016-07-09 ==
== 2022-11-10 ==
* 18:57 legoktm: deploying https://gerrit.wikimedia.org/r/297731 and https://gerrit.wikimedia.org/r/298142
* 21:33 James_F: Docker: Upgrading quibble-buster-php74-coverage with a new vesion of phpunit-patch-coverage for [[phab:T322864|T322864]]
* 14:07 bd808: Testing logstash change https://gerrit.wikimedia.org/r/#/c/298115/ via cherry-pick
* 08:37 hashar: Rebuilding https://integration.wikimedia.org/ci/job/beta-code-update-eqiad/ , it probably failed due to Gerrit being restarted
* 01:09 James_F: Zuul: Make PHP 8.1 voting for all quibble items for [[phab:T316078|T316078]]
* 01:05 James_F: Zuul: Drop mwext-php74-phan-docker from experimental for gate


== 2016-07-08 ==
== 2022-11-09 ==
* 16:08 hashar: scandium: git -C /srv/ssd/zuul/git/mediawiki/services/graphoid remote set-head origin --auto
* 23:02 James_F: Zuul: [mediawiki/core] Add PHP 8.1 phan job for [[phab:T322278|T322278]]
* 16:06 hashar: scandium: git -C /srv/ssd/zuul/git/mediawiki/services/graphoid init &&  git -C /srv/ssd/zuul/git/mediawiki/services/graphoid remote add origin ssh://jenkins-bot@ytterbium.wikimedia.org:29418/mediawiki/services/graphoid
* 14:56 andrewbogott: fixed puppet breakage on several instances
* 14:59 hashar: nodepool: rebuild Trusty image from scratch Image ci-trusty-wikimedia-1467989709 in wmflabs-eqiad is ready
* 12:35 hashar: beta:  find /data/project/upload7/*/*/thumb -type f -atime +30 -delete
* 10:31 hashar: beta: mass delete http://commons.wikimedia.beta.wmflabs.org/wiki/Category:GWToolset_Batch_Upload files T64835
* 10:26 hashar: beta: mass delete http://commons.wikimedia.beta.wmflabs.org/wiki/Category:GWToolset_Batch_Upload files


== 2016-07-07 ==
== 2022-11-08 ==
* 21:41 MaxSem: Chowned php-master/vendor back to jenkins-deploy
* 20:17 dduvall: puppet re-enabled on gitlab-runner hosts ([[phab:T322453|T322453]]) normal log level will be restored on next puppet run
* 13:10 hashar: deleting integration-slave-trusty-1024 and integration-slave-trusty-1025  to free up some RAM. We have enough permanent Trusty slaves. T139535
* 20:01 dduvall: temporarily enabling buildkitd debug logging on gitlab-runner hosts ([[phab:T322453|T322453]])
* 02:43 MaxSem: started redis-server on deployment-stream
* 15:58 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/854535
* 01:14 bd808: Restarted logstash on deployment-logstash2
* 15:26 vgutierrez: delete deployment-ms-be06 - [[phab:T322231|T322231]]
* 01:13 MaxSem: Leaving my hacks for the night to collect data, if needed revert with cd /srv/mediawiki-staging/php-master/vendor && sudo git reset --hard HEAD && sudo chown -hR jenkins-deploy:wikidev .
* 15:21 vgutierrez: shutdown deployment-ms-be06 - [[phab:T322231|T322231]]
* 00:50 bd808: Rebooting deployment-logstash3.eqiad.wmflabs; console full of hung process messages from kernel
* 06:39 vgutierrez: delete deployment-ms-be05 - [[phab:T322231|T322231]]
* 00:27 MaxSem: Initialized ORES on all wikis where it's enabled, was causing job failures
* 06:36 vgutierrez: delete deployment-ms-fe03 - [[phab:T322554|T322554]]
* 00:13 MaxSem: Debugging a fatal in betalabs, might cause syncs to fail
* 06:30 vgutierrez: downgrade to firejail 0.9.44.8-2 on deployment-imagescaler03
* 05:51 vgutierrez: shutdown deployment-ms-fe03 - [[phab:T322554|T322554]]


== 2016-07-06 ==
== 2022-11-07 ==
* 20:30 hashar: beta: restarted mysql on both db1 and db2 so it takes in account the --syslog setting  T119370
* 18:19 vgutierrez: let deployment-cache-upload07 use deployment-ms-fe04 - [[phab:T322554|T322554]]
* 20:08 hashar: beta:  on db1 and db2  move the MariaDB 'syslog' setting under [mysqld_safe] section. Cherry picked https://gerrit.wikimedia.org/r/#/c/296713/3 and reloaded mysql on both instances. T119370
* 15:57 vgutierrez: shutting down deployment-ms-be05 - [[phab:T322231|T322231]]
* 14:54 hashar: Image ci-jessie-wikimedia-1467816381 in wmflabs-eqiad is ready  T133779
* 14:47 hashar_: attempting to refresh ci-jessie-wikimedia image to get librdkafka-dev included for T133779


== 2016-07-05 ==
== 2022-11-03 ==
* 21:54 hasharAway: CI has drained the gate-and-submit queue
* 20:31 hashar: Reloaded Zuul for {{Gerrit|Ic473bd57059d4eccad0f52c1d11d61f6ba1a4ad1}}
* 21:37 hasharAway: Nodepool: nodepool delete  a few instances that would never spawn / have been stuck for ~ 40 minutes
* 19:19 brennen: attempting initial phab1004 phabricator deploy
* 17:45 James_F: Zuul: Add CI for CategoryExplorer and EmailDeletedPages extensions and Cavendish and Pivot skins
* 17:15 James_F: Zuul: Add experimental PHP 8.2 jobs for PHP extensions for [[phab:T314093|T314093]]
* 16:53 James_F: Docker: Publishing initial PHP 8.2 CI test images for [[phab:T314093|T314093]]
* 13:44 TheresNoTime: add `cxserver-beta` (port 8080) proxy for deployment-prep, [[phab:T322323|T322323]]


== 2016-07-04 ==
== 2022-11-02 ==
* 18:58 hashar: Upgrading arcanist on permanent CI slaves since xhpast was broken T137770 
* 22:44 James_F: Zuul: [mediawiki/tools/scap] Mark as archived for [[phab:T322269|T322269]]
* 12:50 yuvipanda: migrating deployment-tin to labvirt1011
* 09:56 vgutierrez: update to HAProxy 2.6.6 in deployment-cache-(text{{!}}upload)07 - [[phab:T321775|T321775]]


== 2016-07-03 ==
== 2022-10-31 ==
* 13:10 paladox: phabricator Update phab-01 and phab-05 (phab-02) and phab-03 to fix a security bug in phabricator (Did the update last night but forgot to log it)
* 15:56 andrewbogott: shutting down  deployment-echostore01, deployment-ms-be0[56], deployment-mdb01, deployment-prometheus02, deployment-wikifeeds01 as per  https://phabricator.wikimedia.org/T306068
* 12:04 jzerebecki: reloading zuul for 7e6a2e2..13ea50f
* 15:50 James_F: Zuul: [mediawiki/libs/RemexHtml] Re-enable PHP 8.1 CI for [[phab:T311450|T311450]]


== 2016-07-02 ==
== 2022-10-28 ==
* 13:38 jzerebecki: reloading zuul for 15127b2..7e6a2e2
* 14:14 zabe: delete deployment-db07 and deployment-db08
* 06:24 hashar: devtools: phabricator-prod-1001: `rmdir /etc/envoy/clusters.d /etc/envoy/listeners.d`
* 06:24 hashar: devtools: `rmdir /etc/envoy/clusters.d /etc/envoy/listeners.d`
* 06:23 hashar: devtools: set `profile::phabricator::main::dumps_rsync_clients: []` project wide to fix up Puppet. Settings got moved to a `role` ( https://gerrit.wikimedia.org/r/c/operations/puppet/+/842875 {{!}} [[phab:T313360|T313360]] )


== 2016-06-30 ==
== 2022-10-27 ==
* 10:31 hashar: Deleting integration-slave-trusty-1015 . Can not bring up mysql T138074  and the ssh slave connection would not hold anyway. Must be broken somehow
* 21:38 James_F: Zuul: [mediawiki/core] Run standalone jobs [[phab:T203694|T203694]]
* 10:04 hashar: Attempting to refresh Nodepool image for Jessie ( ci-jessie-wikimedia ). Been stall for 284 hours (12 days)
* 20:58 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/850198 # [[phab:T321769|T321769]]
* 09:36 hashar: Trusty is missing the package arcanist ... :(
* 09:35 hashar: Attempting to refresh Nodepool image for Trusty ( ci-trusty-wikimedia ). Been stall for 283 hours (12 days)


== 2016-06-28 ==
== 2022-10-26 ==
* 21:33 halfak: deploying ores beec291
* 23:12 dancy: Restarted Zuul CI server due to stall ssh connections which went against the max per user connection limit in Gerrit #[[phab:T308943|T308943]]
* 21:15 halfak: deploying ores 6979a98
* 18:28 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/849638 # [[phab:T321668|T321668]]
* 08:58 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/849480 # [[phab:T321594|T321594]]


== 2016-06-27 ==
== 2022-10-25 ==
* 22:32 eberhardson: deployment-prep deployed gerrit.wikimedia.org/r/296279 to puppetmaster to test kibana4 role
* 16:33 hashar: Updating Jenkins jobs for Quibble 1.4.7 # [[phab:T320935|T320935]] [[phab:T318029|T318029]]
* 19:41 bd808: Rebooting deployment-logstash3.eqiad.wmflabs via wikitech. Console log full of blocked kworker messages, ssh non-responsive, and blocking logstash records being recorded.
* 15:36 hashar: Tag Quibble 1.4.7 @ {{Gerrit|f838a24cc2}} # [[phab:T320935|T320935]] [[phab:T318029|T318029]]
* 18:20 thcipriani: deployment-puppetmaster.deployment-prep:/var/lib/git/labs/private modules/secret/secrets/keyholder keys conflicts resolved
* 14:30 hashar: Manually cleaned /srv/jenkins/workspace on integration-agent-docker-1024
* 18:09 bd808: Git repo at deployment-puppetmaster.deployment-prep:/var/lib/git/labs/private is behind upstream due to multiple modules/secret/secrets/keyholder local files that would be overwritten by upstream changes.
* 07:24 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/848514 # [[phab:T317378|T317378]]


== 2016-06-24 ==
== 2022-10-24 ==
* 15:04 hashar: switch apps-android-wikimedia-* jobs to Jessie T138506
* 17:42 James_F: Zuul: Add new e-mail for Hoo man to allow list
* 14:07 James_F: Killed https://integration.wikimedia.org/ci/job/pywikibot-core-tox-nose-jessie/556/console (stuck for 90 minutes)
* 09:54 hashar: T138506 Adding a JDK installation "Debian - OpenJdk 8" in Jenkins global configuration with JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64


== 2016-06-23 ==
== 2022-10-21 ==
* 13:58 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/295691
* 08:46 hashar: Created https://gerrit.wikimedia.org/r/admin/repos/phabricator/translations # [[phab:T321350|T321350]]
* 12:13 hashar: Deleting integration-saltmaster and recreating it with Jessie T136410
* 10:14 hashar: T137807 Upgrading Jenkins TAP Plugin
* 08:55 hashar: integration: rebased puppet master by dropping a conflicting/obsolete patch
* 08:28 hashar: fixing puppet cert on deployment-cache-text04


== 2016-06-17 ==
== 2022-10-20 ==
* 10:35 jzerebecki: offlined integration-slave-trusty-1015 T138074
* 13:04 hashar: Updating Jenkins jobs to add `AllowEncodedSlashes On` to Apache config https://gerrit.wikimedia.org/r/c/integration/config/+/844974 [[phab:T321278|T321278]]
* 10:06 hashar: Refreshed Nodepool Trusty image
* 12:40 hashar: Building Quibble Docker images to add `AllowEncodedSlashes On` to Apache configuration {{!}} https://gerrit.wikimedia.org/r/c/integration/config/+/844937 {{!}} [[phab:T321278|T321278]]
* 10:02 hashar: Refreshed Nodepool Jessie image
* 07:23 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/844540


== 2016-06-14 ==
== 2022-10-19 ==
* 14:22 hashar: T136971 on tin MediaWiki 1.28.0-wmf.6, from 1.28.0-wmf.6, successfully checked out.  Applying security patches
* 22:27 dduvall: deleted 'trigger-blubber-pipeline-*' 'blubber-pipeline-*' jobs to deploy https://gerrit.wikimedia.org/r/844529
* 11:21 hashar: T137797 Created Gerrit repository operations/debs/geckodriver  to package https://github.com/mozilla/geckodriver
* 22:22 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/844528
* 09:56 hashar: Reloaded Zuul for noop change https://gerrit.wikimedia.org/r/c/integration/config/+/830584 Zuul: [mediawiki/extensions/SearchVue] Mark as in production


== 2016-06-13 ==
== 2022-10-18 ==
* 21:11 hashar: https://integration.wikimedia.org/ci/computer/integration-slave-trusty-1015/ put offline. Jenkins cant ssh / pool it for some reason
* 19:13 hashar: devtools: unbreak puppet on `deploy-1004.devtools.eqiad1.wikimedia.cloud` by applying `profile::mediawiki::scap_client::is_master: true` # [[phab:T319681|T319681]]
* 20:07 hashar: beta: update.php / database update finally pass!
* 17:51 James_F: Zuul: [wikimedia/fundraising/SmashPig] Use composer-test-php74-only template
* 19:55 hashar: T137615 deployment-db2, **eswiki** > CREATE INDEX echo_notification_event ON echo_notification (notification_event);
* 08:03 vgutierrez: wipe deployment-cache-(text{{!}}upload)06 - [[phab:T320930|T320930]]
* 19:22 hashar: T137615 deployment-db2, enwiki > CREATE INDEX echo_notification_event ON echo_notification (notification_event);
* 10:37 hashar: Restarted puppetmaster on integration-puppetmaster (memory leak / can not fork: no memory)
* 10:35 hashar: T137561  salt -v '*trusty*' cmd.run "cd /root/ && dpkg -i firefox_46.0.1+build1-0ubuntu0.14.04.3_amd64.deb"
* 10:23 hashar: Hard reboot integration-slave-trusty-1015
* 08:30 hashar: Beta: `mwscript extensions/Echo/maintenance/removeInvalidTargetPage.php --wiki=enwiki` for T137615


== 2016-06-10 ==
== 2022-10-17 ==
* 15:49 jzerebecki: reloading zuul for 8c048fb..272d1ec
* 16:21 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/843499 [[phab:T309600|T309600]]
* 15:29 jzerebecki: T137561 integration-puppetmaster:/var/lib/git/operations/puppet# git reset --hard 1e1ff12b13b73b5c5e2015a72f51561f10b305d0
* 14:01 vgutierrez: shutdown deployment-cache-(text{{!}}upload)06 - [[phab:T320930|T320930]]
* 15:19 jzerebecki: T137561 integration-saltmaster:~# salt -v '*trusty*' cmd.run "cd /root/ && dpkg -i firefox_46.0.1+build1-0ubuntu0.14.04.3_amd64.deb"
* 13:56 vgutierrez: switch 185.15.56.36 from deployment-cache-text06 to deployment-cache-text07 - [[phab:T320930|T320930]]
* 15:18 jzerebecki: T137561 integration-saltmaster:~# salt -v '*trusty*' cmd.run "cd /root/ && wget 'https://ubuntu.wikimedia.org/ubuntu/pool/main/f/firefox/firefox_46.0.1%2bbuild1-0ubuntu0.14.04.3_amd64.deb'"
* 13:54 vgutierrez: switch 185.15.56.35 from deployment-cache-upload06 to deployment-cache-upload07 - [[phab:T320930|T320930]]
* 15:15 jzerebecki: T137561 integration-puppetmaster:/var/lib/git/operations/puppet# git fetch https://gerrit.wikimedia.org/r/operations/puppet refs/changes/39/293739/1 && git cherry-pick FETCH_HEAD
* 11:02 urbanecm: deployment-prep: wikiadmin@172.16.0.238(wikishared)> source /srv/mediawiki-staging/php-master/extensions/ContentTranslation/sql/significant-edits.sql; # cswiki beta was failing with cx_significant_edits table not found
* 09:41 wm-bot2: Increased quotas by 4 cores ([[phab:T320932|T320932]]) - cookbook ran by arturo@nostromo


== 2016-06-09 ==
== 2022-10-14 ==
* 18:49 hashar: restarting nutcracker on deployment-mediawiki02
* 20:57 James_F: Zuul: Fix dependencies for BlueSpice extensions that depend on VisualEditor
* 16:53 hashar: rebuild Nodepool trusty image ci-trusty-wikimedia-1465490962
* 20:49 James_F: Docker: Publishing helm-linter without deprecated kubeyaml for [[phab:T316348|T316348]]
* 16:37 hashar: Manually deleting old zuul references on scandium.eqiad.wmnet . Running in a screen
* 20:06 James_F: Docker: Publish images with php-ast upgraded from v1.0.14 to v1.1.0
* 16:32 hashar: rebuild Nodepool jessie image ci-jessie-wikimedia-1465489579
* 18:22 dduvall: upgrade of docker on contint hosts aborted due to missing buster package. agents are back online
* 16:03 hashar: Restarting Nodepool
* 18:01 dduvall: upgrading docker on contint servers. agents will be available for a short time
* 16:07 James_F: Zuul: [mediawiki/libs/Zest] Re-enable PHP 8.1 tests for [[phab:T311463|T311463]]
* 15:54 James_F: Zuul: [mediawiki/vendor] Add experimental job to check composer.lock for [[phab:T74952|T74952]]
* 13:48 James_F: Zuul: [css-sanitizer] Re-enable PHP 8.1 jobs for [[phab:T311451|T311451]]


== 2016-06-08 ==
== 2022-10-13 ==
* 02:56 legoktm: / on gallium is read-only
* 21:16 dduvall: all integration-agent-docker-* hosts have been upgraded to docker 20.10.18
* 02:47 legoktm: disabling/enabling gearman in jenkins because everything is stuck
* 20:37 dduvall: starting rolling upgrade of docker on integration-agent-docker-* hosts to deploy https://gerrit.wikimedia.org/r/c/operations/puppet/+/834399


== 2016-06-07 ==
== 2022-10-12 ==
* 19:28 hashar: Nodepool has troubles spawning instances probably due to on going (?) labs maintenance
* 20:09 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/841996
* 14:56 hashar: Restarting Jenkins to upgrade Rebuilder plugin with https://github.com/jenkinsci/rebuild-plugin/pull/34  (sort out parameters not being reinjected)
* 17:07 dduvall: deployed blubberoid using docker-registry.discovery.wmnet/wikimedia/blubber:2022-10-12-162839-production
* 09:02 hashar: Upgrading Jenkins IRC plugin 2.25..2.27 and instant messaging plugin 1.34..1.35  . The former should fix a deadlock on shutdowning Jenkins | T96183


== 2016-06-06 ==
== 2022-10-11 ==
* 19:26 hasharAway: Regenerating Nodepool snapshots for Trusty and Jessie
* 15:49 dduvall: manually (re-)re-running `sudo -u mwpresync /usr/bin/scap stage-train --yes auto` after patch cleanup
* 13:04 hashar: Migrated all qunit jobs to Nodepool T136301 has the related Gerrit changes
* 15:29 dduvall: correction ^ full command is `sudo -u mwpresync /usr/bin/scap stage-train --yes auto`
* 10:05 hashar: migrating mediawiki-core-qunit job to Nodepool instances https://gerrit.wikimedia.org/r/#/c/291322/  T136301
* 15:28 dduvall: manually (re)running `stage-train --yes auto` following cron job failure
* 10:53 TheresNoTime: add MVernon to deployment-prep, [[phab:T316845|T316845]]#8307183


== 2016-06-04 ==
== 2022-10-10 ==
* 00:09 Krinkle: krinkle@integration-slave-trusty-1017:~$ sudo rm -rf /mnt/jenkins-workspace/workspace/mediawiki-extensions-hhvm/src/extensions/Babel (T86730)
* 12:04 TheresNoTime: cherry 836953 picking for [[phab:T316845|T316845]] to deployment-prep/Swift


== 2016-06-03 ==
== 2022-10-08 ==
* 19:18 hashar: Image ci-jessie-wikimedia-1464981111 in wmflabs-eqiad is ready  Zend 5.x for qunit | T136301
* 21:00 Reedy: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/840337
* 15:17 hashar: refreshed Nodepool Trusty image due to some imagemagick upgrade issue. Image ci-trusty-wikimedia-1464966671 in wmflabs-eqiad is ready
* 10:40 hashar: scandium (zuul merger):  rm -fR /srv/ssd/zuul/git/mediawiki/extensions/Collection  T136930


== 2016-06-02 ==
== 2022-10-07 ==
* 12:10 hashar: Upgraded Zuul upstream code being 66c8e52..30a433b package is  2.1.0-151-g30a433b-wmf1precise1
* 13:27 James_F: Zuul: Add two former contractors to the CI allowlist


== 2016-06-01 ==
== 2022-10-06 ==
* 17:49 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/292186
* 13:17 hashar: Mass updating Jenkins jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/839520
* 16:45 tgr: enabling AuthManager on beta cluster
* 13:12 taavi: reloading zuul for https://gerrit.wikimedia.org/r/839472
* 15:20 legoktm: deploying https://gerrit.wikimedia.org/r/292153
* 13:02 jelto: update gitlab-settings to enable admin_mode on gitlab production instances - [[phab:T316419|T316419]]
* 14:44 twentyafterfour: jenkins restart completed
* 13:00 James_F: Docker: Building and publishing php74:0.3.2 and cascade for [[phab:T318918|T318918]]
* 14:36 twentyafterfour: restarting jenkins to install "single use slave" plugin (jenkins will restart when all builds are finished)
* 12:59 jelto: update gitlab-settings to enable admin_mode on gitlab replica instances - [[phab:T316419|T316419]]
* 13:49 hashar: Beta : clearing temporary files under /data/project/upload7  (mainly wikimedia/commons/temp )
* 12:55 jelto: update gitlab-settings to enable admin_mode on gitlab test instance - [[phab:T316419|T316419]]
* 10:29 hashar: Upgraded Linux kernel on deployment-salt02  T136411
* 10:14 hashar: beta: salt-key -d deployment-salt.deployment-prep.eqiad.wmflabs  T136411
* 09:16 hashar: Enabling puppet again on Trusty slaves. Chromium is now properly pinned to version 49 ( https://gerrit.wikimedia.org/r/#/c/291116/3 | T136188 )
* 08:55 hashar: integration slaves : salt -v '*' pkg.upgrade


== 2016-05-31 ==
== 2022-10-05 ==
* 20:24 bd808: Reloading zuul to pick up I58f878f3fd19dfa21a46a52464575cb06aacbb22
* 22:03 James_F: layout: [mediawiki/tools/phan/SecurityCheckPlugin] Publish PHP coverage for [[phab:T279423|T279423]]
* 17:29 hashar: Building docker images for https://gerrit.wikimedia.org/r/814154


== 2016-05-30 ==
== 2022-10-04 ==
* 18:39 hashar: Upgraded our Jenkins Job Builder fork to 1.5.0 + a couple of cherry picks: cd63874...10f2bcd
* 19:46 dduvall: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/838249
* 12:53 hashar: Upgrading Zuul 1cc37f7..66c8e52 T128569
* 08:04 ori: zuul is back up but jobs which were enqueued are gone
* 07:50 ori: restarting jenkins on gallium, too
* 07:49 ori: restarted zuul-merger service on gallium
* 07:44 ori: Disconnecting and then reconnecting Gearman from Jenkins did not appear to do anything; going to depool / repool nodes.
* 07:42 ori: Temporarily disconnecting Gearman from Jenkins, per <https://www.mediawiki.org/wiki/Continuous_integration/Zuul#Known_issues>


== 2016-05-28 ==
== 2022-10-03 ==
* 04:43 ori: depooling integration-slave-trusty-1015 to profile phpunit runs
* 14:22 TheresNoTime: set `ring_manager` host to `deployment-ms-fe03` in deployment-prep's _.yaml. [[phab:T316845|T316845]]
* 13:22 hashar: Triggering CI for design/codex@v0.2.1 using `zuul enqueue-ref --trigger gerrit --pipeline publish --project design/codex --ref refs/tags/v0.2.1 --newrev 4abb7677b3ea076bbd6778977d9a9374cf45015c`  # [[phab:T313767|T313767]]
* 13:15 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/822144 # [[phab:T313767|T313767]]
* 12:47 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/833061
* 09:02 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/837494/


== 2016-05-27 ==
== 2022-09-30 ==
* 19:29 hasharAway: Refreshed Nodepool images
* 18:43 James_F: Triggering graceful restart of zuul to see if that fixes on-going merge/gerrit connection issues.
* 18:13 thcipriani: restarting zuul for deadlock
* 17:07 James_F: Zuul: Make PHP 8.1 non-voting for all skins and extensions [[phab:T316078|T316078]]
* 18:00 thcipriani: Reloading Zuul to deploy I0c3aeacf92d430ad1272f5f00e7fb7182b8a05bf
* 16:46 James_F: Zuul: Make PHP 8.0 and PHP 8.1 voting for all skins and extensions in master for [[phab:T300463|T300463]] and [[phab:T316078|T316078]]
* 02:55 bd808: Deleted deployment-fluorine:/srv/mw-log/archive/*-20160[34]* logs; freed 26G
* 15:12 James_F: Docker: Building and publishing PHP 8.0.24 images for [[phab:T315167|T315167]]
* 02:33 James_F: Zuul: [mediawiki/core] Clean up REL1_35 and REL1_37 PHP 8 jobs
* 02:30 James_F: Zuul: [mediawiki/core] Upgrade PHP 8.0 and 8.1 jobs to full vendor jobs for [[phab:T300463|T300463]] and [[phab:T316078|T316078]]
* 02:27 James_F: Zuul: Drop FIXME messages for [[phab:T318093|T318093]], being Declined


== 2016-05-26 ==
== 2022-09-29 ==
* 22:23 hashar: salt -v '*trusty*' cmd.run 'puppet agent --disable "Chromium needs to be v49. See T136188"'
* 23:54 TheresNoTime: samtar@deployment-jobrunner04:~$ sudo systemctl stop php7.2-fpm.service && sudo systemctl start php7.4-fpm.service
* 21:47 hashar: integration-slave-trusty-1015 still on Chromium 50 .. T136188
* 23:47 TheresNoTime: cherry pick 836953 to deployment-prep
* 21:42 hashar: downgrading chromium-browser on integration-slave-1015  T136188
* 23:09 TheresNoTime: [samtar@deployment-deploy03 ~]$ sudo puppet agent -tv
* 09:24 jzerebecki: reloading zuul for d38ad0a..6798539
* 23:08 TheresNoTime: deployment-deploy03, `sudo systemctl stop php7.2-fpm.service`, `sudo systemctl start php7.4-fpm.service`
* 07:48 gehel: deployment-prep upgrading elasticsearch to 2.3.3 and restarting (T133124)
* 23:03 TheresNoTime: ran `sudo puppet agent -tv` on deployment -deploy03, -mediawiki11, -mediawiki12
* 07:36 dcausse: deployment-prep elastic: updating cirrussearch warmers (T133124)
* 13:56 James_F: Zuul: Drop PHP72 jobs everywhere, and PHP73 everywhere except old branches
* 07:31 gehel: deployment-prep deploying new elasticsearch plugins (T133124)
* 13:41 James_F: Zuul: [mediawiki/core] Drop PHP 7.2 and PHP 7.3 testing for master and wmf for [[phab:T261872|T261872]]
* 13:34 James_F: Zuul: [mediawiki/vendor] Drop PHP72 jobs, use only PHP74 ones
* 12:30 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/836696


== 2016-05-25 ==
== 2022-09-28 ==
* 22:38 Amir1: running puppet agent manually on sca01
* 17:32 brennen: trying a re-publish of dev-images in case https://gitlab.wikimedia.org/repos/releng/dev-images/-/commit/eb82162b4bf443df20998a53bfb06460bfc6a365 didn't get picked up
* 16:26 hashar: 2016-05-25 16:24:35,491 INFO nodepool.image.build.wmflabs-eqiad.ci-trusty-wikimedia: Notice: /Stage[main]/Main/Package[ruby-jsduck]/ensure: ensure changed 'purged' to 'present'  T109005
* 00:30 James_F: Zuul: [mediawiki/services/function-orchestrator] Use direct coverage job here too
* 15:07 hashar: g++ added to Jessie and Trusty Nodepool instances | T119143
* 00:20 James_F: Zuul: [mediawiki/services/function-evaluator] Use direct coverage job, for [[phab:T302608|T302608]]
* 14:12 hashar: Regenerating Nodepool snapshot to include g++ which is required by some NodeJS native modules T119143
* 10:58 hashar: Updating Nodepool ci-jessie-wikimedia snapshot image to get netpbm package installed into it. T126992  https://gerrit.wikimedia.org/r/290651
* 09:30 hashar: Clearing git-sync-upstream script on integration-slave-trusty1013 and integration-slave-trusty-1017. That is only supposed to be on the puppetmaster
* 09:15 hashar: Fixed resolv.conf on integration-slave-trusty-1013 and force running puppet to catch up with change since May 16 19:52
* 09:11 hashar: restarting puppetmaster on integration-puppetmaster  ( memory leak / can not fork)


== 2016-05-24 ==
== 2022-09-27 ==
* 07:03 mobrovac: rebooting deployment-tin, can't log in
* 08:11 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/835182


== 2016-05-23 ==
== 2022-09-26 ==
* 19:35 hashar: killed all mysqld process on Trusty CI slaves
* 21:46 Daimona: Applying schema changes to the wikishared DB on beta for the CampaignEvents extension # [[phab:T318379|T318379]] [[phab:T318120|T318120]]
* 15:49 thcipriani: beta code update not running, disconnect-reconnect dance resulted in: [05/23/16 15:48:39] [SSH] Authentication failed.
* 21:31 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # [[phab:T318120|T318120]]
* 14:32 jzerebecki: offlined integration-slave-trusty-1004 because it can't connect to mysql T135997
* 20:00 dduvall: regenerating 314 jobs for deployment of https://gerrit.wikimedia.org/r/835262
* 13:32 hashar: Upgrading Jenkins git plugins and restarting Jenkins
* 11:40 James_F: Docker: Building and publishing quibble-buster-php74-bundle
* 11:01 hashar: Upgrading hhvm on Trusty slaves. Bring him hhvm compiled against libicu52 instead of libicu48
* 11:40 James_F: Docker
* 09:12 _joe_: deployment-prep: all hhvm hosts in beta upgraded to run on the newer libicu; now running updateCollation.php (T86096)
* 10:52 hashar: Rolling quibble/ruby jobs from php 7.4 to 7.2: `mediawiki-selenium-integration-docker` `legacy-quibble-rubyselenium-docker` # [[phab:T318525|T318525]]
* 09:11 hashar: Image ci-jessie-wikimedia-1463994307 in wmflabs-eqiad is ready
* 09:35 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/834717/
* 09:01 hashar: Image ci-trusty-wikimedia-1463993508 in wmflabs-eqiad is ready
* 08:56 _joe_: deployment-prep: starting upgrade of HHVM to a version linked to libicu52, T86096
* 08:54 hashar: Regenerating Nodepool image manually. Broke over the week-end due to a hhvm/libicu transition. Should get pip 8.1.x now


== 2016-05-20 ==
== 2022-09-23 ==
* 20:30 bd808: Killing https://integration.wikimedia.org/ci/job/mediawiki-extensions-qunit/43608/ which has been running for 5 hours
* 18:09 James_F: Zuul: [wikimedia-cz/web-*] Migrate tests from php73+ to php74+
* 18:06 James_F: Zuul: [labs/tools/guc] Migrate tests from php73+ to php74+
* 18:04 James_F: Zuul: [labs/tools/coverme] Migrate tests from php73+ to php74+
* 15:55 James_F: Docker: Building and publishing php74 versions of composer-security-check, mediawiki-phan, mediawiki-phan-testrun, and phpmetrics
* 13:26 James_F: Zuul: Run php 7.4 phan for extensions and skins


== 2016-05-19 ==
== 2022-09-22 ==
* 16:47 thcipriani: deployment-tin jenkins worker seems to be back online after [https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code.2Fdb_update some prodding]
* 20:40 zabe: shutoff deployment-db07 # [[phab:T318126|T318126]]
* 16:41 thcipriani: beta-code-update eqiad hung for past few hours
* 20:36 zabe: take deployment-prep out of read-only # [[phab:T318126|T318126]]
* 15:16 hashar: Restarted zuul-merger daemons on both gallium and scandium : file descriptors leaked
* 20:32 zabe: failover deployment-prep master from deployment-db07 to deployment-db09 # [[phab:T318126|T318126]]
* 11:59 hashar: CI: salt -v '*' cmd.run 'pip install --upgrade pip==8.1.2'
* 20:25 zabe: set deployment-prep as read-only # [[phab:T318126|T318126]]
* 11:54 hashar: Upgrading pip on CI slaves from 7.0.1 to 8.1.2  https://gerrit.wikimedia.org/r/#/c/289639/
* 16:26 dancy: Upgrading scap to latest code revision in beta cluster
* 10:15 hashar: puppet broken on deployment-tin :     ?[1;31mError: Could not retrieve catalog from remote server: Error 400 on SERVER: Invalid parameter trusted_group on node deployment-tin.deployment-prep.eqiad.wmflabs?[0m
* 10:38 zabe: deployment-db10: start replication # [[phab:T318126|T318126]]


== 2016-05-18 ==
== 2022-09-21 ==
* 13:16 Amir1: deploying a05e830 to ores nodes (sca01 and ores-web)
* 23:34 zabe: shutoff deployment-db08 # [[phab:T318126|T318126]]
* 12:46 urandom: (re)cherry-picking c/284078 to deployment-prep
* 23:00 jeena: restarting zuul to try and fix CI issues
* 11:36 hashar: Restarted qa-morebots
* 20:46 zabe: clone deployment-db10 from deployment-db08 # [[phab:T318126|T318126]]
* 11:36 hashar: Marked mediawiki/core/vendor repository has hidden in Gerrit. It got moved to mediawiki/vendor including the whole history Settings page: https://gerrit.wikimedia.org/r/#/admin/projects/mediawiki/core/vendor
* 18:49 TheresNoTime: cherry-picked [[gerrit:833839]] to deployment-puppetmaster04, testing [[phab:T317417|T317417]]
* 18:19 zabe: install mariadb 10.6 via role::mariadb::beta on deployment-db10 # [[phab:T318126|T318126]]
* 17:55 zabe: create volume db10 and attach to deployment-db10 # [[phab:T318126|T318126]]
* 17:54 zabe: create deployment-db10 as g3.cores8.ram16.disk20 # [[phab:T318126|T318126]]
* 14:21 zabe: deployment-db09: restart mariadb # [[phab:T318126|T318126]]
* 13:55 TheresNoTime: modified deployment-prep "prometheus" security group - port 80, [[phab:T315699|T315699]]
* 13:18 James_F: Jenkins: Dropped 16 more old node jobs left on the server.
* 13:11 James_F: Jenkins: Dropped four old node10 jobs left on the server (oojs-core-node10-browser-docker, ooui-special-node10-plus-php80-composer-docker, wikipeg-special-node10-plus-php72-composer-docker, wikipeg-special-node10-plus-php80-composer-docker)
* 13:05 James_F: Jenkins: Dropped scap-pipeline-stretch and trigger-scap-pipeline-stretch following {{Gerrit|26c74a1}}
* 12:36 hashar: Reloaded Zuul for Remove Stretch from mediawiki/tools/scap - https://gerrit.wikimedia.org/r/833705
* 09:46 andrewbogott: removed some stray whitespace in /var/lib/git/operations/puppet that was preventing rebase on deployment-puppetmaster04.deployment-prep.eqiad.wmflabs


== 2016-05-13 ==
== 2022-09-20 ==
* 14:39 thcipriani: remove shadow l10nupdate user from deployment-tin and mira in beta
* 22:00 zabe: deployment-db09: start replication # [[phab:T318126|T318126]]
* 10:20 hashar: Put integration-slave-trusty-1004 offline.  Ssh/passwd is borked  T135217
* 20:06 zabe: deployment-db09: import dump into mariadb # [[phab:T318126|T318126]]
* 09:59 hashar: Deleting non nodepool mediawiki PHPUnit jobs for T135001 (mediawiki-phpunit-hhvm mediawiki-phpunit-parsertests-hhvm mediawiki-phpunit-parsertests-php55 mediawiki-phpunit-php55)
* 20:04 zabe: rsynced dump from deployment-db08 to deployment-db09 # [[phab:T318126|T318126]]
* 04:06 thcipriani|afk: changed ownership of mwdeploy public keys post shadow mwdeploy user removal is important
* 08:08 hashar: Upgrading CI and releases Jenkins plugins notably to update the git client [[phab:T315897|T315897]]
* 03:47 thcipriani|afk: ldap failure has created a shadow mwdeploy user on beta, deleted using vipw
* 02:06 zabe: created backup of all databases on deployment-db08 # [[phab:T318126|T318126]]


== 2016-05-12 ==
== 2022-09-19 ==
* 22:53 bd808: Started dead mysql on integration-slave-precise-1011
* 23:58 zabe: install mariadb 10.6 via role::mariadb::beta on deployment-db09 # [[phab:T318126|T318126]]
* 23:57 zabe: create volume db09 and attach to deployment-db09 # [[phab:T318126|T318126]]
* 23:57 zabe: create deployment-db09 as g3.cores8.ram16.disk20 # [[phab:T318126|T318126]]
* 20:24 dduvall: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/833066
* 16:54 James_F: Zuul: [operations/mediawiki-config] Switch to PHP 7.4 jobs
* 16:24 James_F: Zuul: [mediawiki/core] Add php80 and php81 to `check php` command
* 15:36 James_F: Zuul: [mediawiki/core] run phan on PHP 7.4 for [[phab:T316518|T316518]]
* 13:50 James_F: Zuul: [mediawiki/core] Add a non-vendor php81 job for main branch for [[phab:T316078|T316078]]
* 12:06 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # [[phab:T316128|T316128]]
* 11:57 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # [[phab:T316128|T316128]]


== 2016-05-11 ==
== 2022-09-16 ==
* 21:05 hashar: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/288128  #T134946
* 15:47 dancy: Upgrading scap to latest code revision in beta cluster
* 20:26 hashar: rebooting integration-slave-trusty-1016  is back up
* 20:15 hashar: rebooting integration-slave-trusty-1016  unreachable somehow
* 16:43 hashar: Reduced number of executors on Trusty instances from 3 to 2. Memory get exhausted causing the tmpfs to drop files and thus MW jobs to fail randomly.
* 13:33 hashar: Added contint::packages::php to Nodepool images  T119139
* 12:59 hashar: Dropping texlive and its dependencies from gallium.
* 12:52 hashar: deleted integration-dev
* 12:51 hashar: creating  integration-dev instance to hopefully have Shinken clean itself
* 11:42 hashar: rebooting deployment-aqs01 via wikitech  T134981
* 10:46 hashar: beta/ci puppetmaster : deleting old tags in /var/lib/git/operations/puppet  and repacking the repos
* 08:49 hashar: Deleting instances deployment-memc02 and deployment-memc03 (Precise instances, migrated to Jessie)  #T134974
* 08:43 hashar: Beta: switching memcached to new Jessie servers by cherry picking https://gerrit.wikimedia.org/r/#/c/288156/ and running puppet on mw app servers  #T134974
* 08:20 hashar: Creating deployment-memc04 and deployment-memc05 to switch beta cluster memcached to Jessie.  m1.medium with security policy "cache" T13497
* 01:44 matt_flaschen: Created Flow-specific External Store tables (blobs_flow1) on all wiki databases on Beta Cluster: T128417


== 2016-05-10 ==
== 2022-09-15 ==
* 19:17 hashar: beta / CI  purging old Linux kernels:  salt -v '*' cmd.run 'dpkg -l|grep ^rc|awk "{ print \$2 }"|grep linux-image|xargs dpkg --purge'
* 19:56 thcipriani: Updating development images on contint primary
* 17:34 cscott: updated OCG to version b0c57a1c6890e9fa1f2c3743fc14cb6a7f244fc3
* 17:24 dduvall: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/832374
* 16:44 bd808: Cleaned up 8.5G of pbuilder tmp output on integration-slave-jessie-1001 with `sudo find /mnt/pbuilder/build -maxdepth 1 -type d -mtime +1 -exec rm -r {} \+`
* 11:03 TheresNoTime: soft reboot deployment-parsoid12, unresponsive
* 16:35 bd808: https://integration.wikimedia.org/ci/job/debian-glue failure on integration-slave-jessie-1001 due to /mnt being 100$ full
* 14:20 hashar: deployment-puppetmaster mass cleaned packages/service/users etc  T134881
* 13:54 moritzm: restarted zuul-merger on scandium for openssl update
* 13:52 moritzm: restarting zuul on gallium for openssl update
* 13:51 moritzm: restarted apache and zuul-merger on gallium for openssl update
* 13:48 hashar: deployment-puppetmaster : dropping role::ci::jenkins_access role::ci::slave::labs and role::ci::slave::labs::common  T134881
* 13:46 hashar: Deleting Jenkins slave deployment-puppetmaster T134881
* 13:45 hashar: Change https://integration.wikimedia.org/ci/job/beta-build-deb/ job to use label selector "DebianGlue && DebianJessie" instead of "BetaDebianRepo"  T134881
* 13:33 hashar: Migrating all debian glue jobs to Jessie permanent slaves T95545
* 13:30 hashar: Adding  integration-slave-jessie-1002 in Jenkins.  it is all puppet compliant
* 12:59 thcipriani|afk: triggering puppet run on scap targets in beta for https://gerrit.wikimedia.org/r/#/c/287918/ cherry pick
* 09:07 hashar: fixed puppet.conf on deployment-cache-text04


== 2016-05-09 ==
== 2022-09-13 ==
* 20:58 hashar: Unbroke puppet on integration-raita.integration.eqiad.wmflabs . Puppet was blocked because role::ci::raita was no more. Fixed by rebasing https://gerrit.wikimedia.org/r/#/c/208024 T115330 
* 22:14 zabe: delete deployment-urldownloader02
* 20:13 hashar: beta: salt -v '*' cmd.run 'dpkg --purge libganglia1 ganglia-monitor; rm -fR /etc/ganglia'  # T134808
* 20:06 hashar: CI, removing ganglia configuration entirely via:  salt -v '*' cmd.run 'rm -fRv /etc/ganglia'  # T134808
* 20:04 hashar: CI, removing ganglia configuration entirely via:  salt -v '*' cmd.run 'dpkg --purge ganglia-monitor'  # T134808
* 16:32 jzerebecki: reloading zuul for 3e2ab56..d663fd0
* 15:39 andrewbogott: migrating deployment-flourine to labvirt1009
* 15:39 hashar: Adding label contintLabsSlave  to integration-slave-jessie1001 and  integration-slave-jessie1002
* 15:26 hashar: Creating integration-slave-jessie-1001 T95545


== 2016-05-06 ==
== 2022-09-12 ==
* 19:45 urandom: Restart cassandra-metrics-collector on deployment-restbase0[1-2]
* 22:15 brennen: Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/merge_requests/19
* 19:41 urandom: Rebasing 02ae1757 on deployment-puppetmaster : T126629
* 20:38 RhinosF1: (for lack of a better place) added Cyberpower678 to acl*userdisable. has enough clue, fairly active and trusted.
* 08:14 James_F: Zuul: [mediawiki/extensions/Realnames] Enable quibble composer jobs


== 2016-05-05 ==
== 2022-09-09 ==
* 22:09 MaxSem: Promoted Yurik and Jgirault to sysops on beta enwiki. Through shell because logging in is broken for me.
* 17:46 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # [[phab:T311126|T311126]]
* 17:25 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # [[phab:T311126|T311126]]
* 17:08 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # [[phab:T316409|T316409]]
* 16:36 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # [[phab:T316409|T316409]]
* 10:49 hashar: devtools: fixed fqdn of instances puppetmaster-1001 and gerrit-prod-1001 by manually editing `/etc/hosts` # [[phab:T317404|T317404]]


== 2016-05-04 ==
== 2022-09-08 ==
* 21:28 cscott: deployed puppet FQDN domain patch for OCG: https://gerrit.wikimedia.org/r/286068 and restarted ocg on deployment-pdf0[12]
* 20:50 dduvall: running `./fab deploy_docker` to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/830919
* 15:03 hashar: beta-scap: deployment-tin.deployment-prep.eqiad.wmflabs Name or service not known
* 15:47 dancy: Upgrading scap to latest code revision in beta cluster
* 15:03 hashar: beta-scap: deployment-tin.deployment-prep.eqiad.wmflabs
* 12:24 hashar: deleting Jenkins job mediawiki-core-phpcs  , replaced by Nodepool version mediawiki-core-phpcs-trusty  T133976
* 12:11 hashar: beta: restarted nginx on varnish caches ( systemctl restart nginx.service ) since they were not listening on port 443 #T134362
* 11:07 hashar: restarted CI puppetmaster  (out of memory leak)
* 10:57 hashar: CI: mass upgrading deb packages
* 10:53 hashar: beta: clearing out leftover apt conf that points to unreachable web proxy : salt -v '*' cmd.run "find /etc/apt -name '*-proxy' -delete"
* 10:48 hashar: Manually fixing nginx upgrade on deployment-cache-text04 and deployment-cache-upload04  see T134362 for details
* 09:27 hashar: deployment-cache-text04 systemctl stop varnish-frontend.service  . To clear out all the stuck CLOSE_WAIT connections  T134346
* 08:33 hashar: fixed puppet on deployment-cache-text04 (race condition generating puppet.conf )


== 2016-05-03 ==
== 2022-09-07 ==
* 23:21 bd808: Changed "Maximum Number of Retries" for ssh agent launch in jenkins for deployment-tin from "0" to "10"
* 15:05 TheresNoTime: making hack changes to beta to test [[phab:T317195|T317195]] resolution
* 23:01 twentyafterfour: rebooting deployment-tin
* 23:00 bd808: Jenkins agent on deployment-tin not spawning; investigating
* 20:02 hashar: Restarting Jenkins
* 16:49 hashar: Notice: /Stage[main]/Contint::Packages::Python/Package[pypy]/ensure: ensure changed 'purged' to 'present'  | T134235
* 16:46 hashar: Refreshing Nodepool Jessie image to have it include pypy | T134235  poke @jayvdb
* 14:49 mobrovac: deployment-tin rebooting it
* 14:25 hashar: beta  salt -v '*' pkg.upgrade
* 14:19 hashar: beta: added unattended upgrade to Hiera::deployment-prep
* 13:30 hashar: Restarted nslcd on deployment-tin ,  pam was refusing authentication for some reason
* 13:29 hashar: beta: got rid of a leftover Wikidata/Wikibase patch that broke scap  salt -v 'deployment-tin*' cmd.run 'sudo -u jenkins-deploy git -C /srv/mediawiki-staging/php-master/extensions/Wikidata/ checkout -- extensions/Wikibase/lib/maintenance/populateSitesTable.php'
* 13:23 hashar: deployment-tin force upgraded HHVM from 3.6 to 3.12
* 09:42 hashar: adding puppet class contint::slave_scripts to deployment-sca01 and deployment-sca02 . Ships multigit.sh  T134239
* 09:31 hashar: Deleting CI slave deployment-cxserver03 , added deployment-sca01 and deployment-sca02 in Jenkins.  T134239
* 09:28 hashar: deployment-sca01 removing puppet lock /var/lib/puppet/state/agent_catalog_run.lock  and running puppet again
* 09:26 hashar: Applying puppet class role::ci::slave::labs::common  on deployment-sca01 and deployment-sca02 (cxserver and parsoid being migrated T134239 )
* 03:33 kart_: Deleted deployment-cxserver03, replaced by deployment-sca0x


== 2016-05-02 ==
== 2022-09-06 ==
* 21:27 cscott: updated OCG to version b775e612520f9cd4acaea42226bcf34df07439f7
* 15:42 bd808: Promoted user 'StrikerBot' to admin on gitlab.wikimedia.org so that Striker can use the account to attach Developer accounts to gitlab via API.
* 21:26 hashar: Nodepool is acting just fine: Demand from gearman: ci-trusty-wikimedia: 457  | <AllocationRequest for 455.0 of ci-trusty-wikimedia>
* 02:05 James_F: Running REL1_39 branch commands for [[phab:T313920|T313920]]
* 21:25 hashar: restarted qa-morebots "2016-05-02 21:22:23,599 ERROR: Died in main event loop"
* 00:20 Krinkle: Prune various old mediawiki/core wmf branches for Gerrit usability, ref [[phab:T303828|T303828]]
* 21:23 hashar: gallium: enqueued 488 jobs directly in Gearman. That is to test https://gerrit.wikimedia.org/r/#/c/286462/ ( mediawiki/extensions to hhvm/zend5.5 on Nodepool). Progress /home/hashar/gerrit-286462.log
* 20:14 hashar: MediaWiki phpunit jobs to run on Nodepool instances \O/
* 16:41 urandom: Forcing puppet run and restarting Cassandra on deployment-restbase0[1-2] : T126629
* 16:40 urandom: Cherry-picking https://gerrit.wikimedia.org/r/operations/puppet refs/changes/78/284078/12 to deployment-puppetmaster : T126629
* 16:24 urandom: Restarat Cassandra on deployment-restbase0[1-2] : T126629
* 16:21 urandom: forcing puppet run on deployment-restbase0[1-2] : T126629
* 16:21 urandom: cherry-picking latest refs/changes/78/284078/11 onto deployment-puppetmaster : T126629
* 09:44 hashar: On zuul-merger instances (gallium / scandium), cleared out pywikibot/core working copy ( rm -fR /srv/ssd/zuul/git/pywikibot/core/ ) T134062


== 2016-04-30 ==
== 2022-09-02 ==
* 18:31 Amir1: deploying d4f63a3 from github.com/wiki-ai/ores-wikimedia-config into targets in beta cluster via scap3
* 15:59 zabe: added vwalters as member of the deployment-prep project [[phab:T316943|T316943]]
* 13:40 Krinkle: " ENOENT: no such file or directory, lstat " failing quibble jobs on integration-agent-docker-1024


== 2016-04-29 ==
== 2022-09-01 ==
* 16:37 jzerebecki: restarting zuul for 4e9d180..ebb191f
* 09:36 zabe: shutoff deployment-urldownloader02
* 15:45 hashar: integration: deleting integration-trusty-1026 and cache-rsync . Maybe that will clear them up from Shinken
* 07:43 hashar: Updating Jenkins jobs for Quibble 1.4.5 > 1.4.6  +  php 7.4 update {{!}} [[phab:T305525|T305525]] {{!}} [[phab:T314586|T314586]] {{!}} [[phab:T316601|T316601]] {{!}} https://gerrit.wikimedia.org/r/c/integration/config/+/828611
* 15:14 hashar: integration: created 'cache-rsync' and 'integration-trusty-1026' , attempting to have Shinken to deprovision them


== 2016-04-28 ==
== 2022-08-31 ==
* 22:03 urandom: deployment-restbase01 upgrade to 2.2.6 complete : T126629
* 23:41 zabe: deleted shutoff deployment-restbase03
* 21:56 urandom: Stopping Cassandra on deployment-restbase01, upgrading package to 2.2.6, and forcing puppet run : T126629
* 16:39 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension # [[phab:T308738|T308738]]
* 21:55 urandom: Snapshotting Cassandra tables on deployment-restbase01 (name = 1461880519833) : T126629
* 16:37 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # [[phab:T312870|T312870]]
* 21:55 urandom: Snapshotting Cassandra tables on deployment-restbase01 : T126629
* 16:21 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # [[phab:T312870|T312870]]
* 21:52 urandom: Forcing puppet run on deployment-restbase02 : T126629
* 16:18 hashar: Tag Quibble 1.4.6 @ {{Gerrit|8828487d0}} # [[phab:T305525|T305525]] [[phab:T314586|T314586]]
* 21:51 urandom: Cherry picking operations/puppet refs/changes/78/284078/10 to puppmaster : T126629
* 15:27 James_F: Docker: Building and publishing quibble-fresnel image based on php74 for [[phab:T316525|T316525]]
* 20:46 urandom: Starting Cassandra on deployment-restbase02 (now v2.2.6) : T126629
* 20:41 urandom: Re-enable puppet and force run on deployment-restbase02 : T126629
* 20:38 urandom: Halting Cassandra on deployment-restbase02, masking systemd unit, and upgrading package(s) to 2.2.6 : T126629
* 20:37 urandom: Snapshotting Cassandra tables on deployment-restbase02 (snapshot name = 1461875833996) : T126629
* 20:37 urandom: Snapshotting Cassandra tables on deployment-restbase02 : T126629
* 20:33 urandom: Cassandra on deployment-restbase01.deployment-prep started : T126629
* 20:25 urandom: Restarting Cassandra on deployment-restbase01.deployment-prep : T126629
* 20:14 urandom: Re-enable puppet on deployment-restbase01.deployment-prep, and force a run : T126629
* 20:12 urandom: cherry-picking https://gerrit.wikimedia.org/r/#/c/284078/ to deployment-puppetmaster : T126629
* 20:06 urandom: Disabling puppet on deployment-restbase0[1-2].deployment-prep : T126629
* 14:43 hashar: Rebuild Nodepool Jessie image. Comes with hhvm
* 12:52 hashar: Puppet is happy on deployment-changeprop
* 12:47 hashar: apt-get upgrade deployment-changeprop  (outdated exim package)
* 12:42 hashar: Rebuild Nodepool Trusty instance to include the PHP wrapper script T126211


== 2016-04-27 ==
== 2022-08-30 ==
* 23:57 thcipriani: nodepool instances running again after an openstack rabbitmq restart by andrewbogott
* 16:00 James_F: Zuul: [labs/tools/heritage] Switch postmerge job to tox-py37-coverage-publish for [[phab:T316627|T316627]]
* 22:51 duploktm: also ran openstack server delete ci-jessie-wikimedia-85342
* 09:32 hashar: doc: on doc1002: `sudo -u doc-uploader rm -fR /srv/doc/mw-tools-scap/` That got moved to `/srv/doc` and a redirect has been set. # [[phab:T315541|T315541]]
* 22:42 legoktm: nodepool delete 85342
* 22:41 matt_flaschen: Deployed https://gerrit.wikimedia.org/r/#/c/285765/ to enable External Store everywhere on Beta Cluster
* 22:38 legoktm: stop/started nodepool
* 22:36 thcipriani: I don't have permission to restart nodepool
* 22:35 thcipriani: restarting nodepool
* 22:18 matt_flaschen: Deployed https://gerrit.wikimedia.org/r/#/c/282440/ to switch Beta Cluster to use External Store for new testwiki writes
* 21:00 hashar: thcipriani downgraded git plugins successfully (we wanted to rule out their upgrade  for some weird issue)
* 20:13 cscott: updated OCG to version e39e06570083877d5498da577758cf8d162c1af4
* 14:10 hashar: restarting Jenkins
* 14:09 hashar: Jenkins upgrading credential plugin 1.24 > 1.27  And Credentials binding plugin 1.6 > 1.7
* 14:07 hashar: Jenkins upgrading git plugin 2.4.1 > 2.4.4
* 14:01 hashar: Jenkins upgrading git client plugin 1.19.1. > 1.19.6
* 13:13 jzerebecki: reloading zuul for 81a1f1a..0993349
* 11:43 hashar: fixed puppet on deployment-cache-text04  T132689
* 10:38 hashar: Rebuild Image ci-trusty-wikimedia-1461753210 in wmflabs-eqiad is ready
* 09:43 hashar: tmh01.deployment-prep.eqiad.wmflabs denies mwdeploy user breaking https://integration.wikimedia.org/ci/job/beta-scap-eqiad/


== 2016-04-26 ==
== 2022-08-29 ==
* 20:45 hashar: Regenerating Nodepool Jessie snapshot to include composer and HHVM | T128092
* 21:25 inflatador: ES6->7 upgrade in beta-cluster [[phab:T315604|T315604]]
* 20:23 jzerebecki: reloading zuul for eb480d8..81a1f1a
* 13:39 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/826985
* 19:25 jzerebecki: reload zuul for 4675213..eb480d8
* 12:34 James_F: Zuul: [mediawiki/core] Don't run phan on PHP 7.4, it doesn't pass; for [[phab:T316518|T316518]]
* 19:25 jzerebecki: 4675213..eb480d8
* 12:08 James_F: Zuul: Enable PHP74 jobs on gate-and-submit-wmf pipeline [Re-re-try] for [[phab:T293924|T293924]]
* 14:18 hashar: Applied security patches to 1.27.0-wmf.22 | T131556
* 11:28 James_F: Zuul: [mediawiki/core] Make PHP 8.1 voting on REL1_38 and REL1_39 for [[phab:T316080|T316080]]
* 12:39 hashar: starting cut of 1.27.0-wmf.22 branch ( poke ostriches )
* 09:00 hashar: Updated Jenkins job mediawiki-quibble-composer-mysql-php80-docker to capture core dumps using https://gerrit.wikimedia.org/r/c/integration/config/+/496392 # [[phab:T315167|T315167]]
* 10:29 hashar: restored integration/phpunit on CI slaves due to https://integration.wikimedia.org/ci/job/operations-mw-config-phpunit/ failling
* 00:28 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/827015
* 09:11 hashar: CI is back up!
* 08:20 hashar: shutoff instance castor, does not seem to be able to start again :( | T133652
* 08:12 hashar: hard rebooting castor instance | T133652
* 08:10 hashar: soft rebooting castor instance | T133652
* 08:06 hashar: CI jobs deadlocked due to castor being unavailable | https://phabricator.wikimedia.org/T133652
* 00:46 thcipriani: temporary keyholder fix in place in beta
* 00:18 thcipriani: beta-scap-eqiad failure due to bad keyholder-auth.d fingerprints


== 2016-04-25 ==
== 2022-08-25 ==
* 20:58 cscott: updated OCG to version 58a720508deb368abfb7652e6a8c7225f95402d2
* 15:53 dancy: Reloading Zuul to deploy  https://gerrit.wikimedia.org/r/c/integration/config/+/826593
* 19:46 hashar: Nodepool now has a couple trusty instances intended to experiment with Zend 5.5 / HHVM migration . https://phabricator.wikimedia.org/T133203#2236625
* 07:37 James_F: Zuul: [mediawiki/extensions/UnlinkedWikibase] Add quibble for [[phab:T316183|T316183]]
* 13:34 hashar: Nodepool is attempting to create a Trusty snapshot with name ci-trusty-wikimedia-1461591203 | T133203
* 13:15 hashar: openstack image create --file /home/hashar/image-trusty-20160425T124552Z.qcow2 ci-trusty-wikimedia --disk-format qcow2 --property show=true  # T133203
* 10:38 hashar: Refreshing Nodepool Jessie snapshot based on new image
* 10:35 hashar: Refreshed Nodepool Jessie image ( image-jessie-20160425T100035Z )
* 09:24 hashar: beta / scap failure filled as T133521
* 09:20 hashar: Keyholder / mwdeploy ssh keys have been messed up on beta cluster somehow :-(
* 08:47 hashar: mwdeploy@deployment-tin has lost ssh host keys file :(


== 2016-04-24 ==
== 2022-08-23 ==
* 17:14 jzerebecki: reloading e06f1fe..672fc84
* 17:40 hashar: Stopping Gerrit
* 11:54 hashar: Manually applied a `docker-pkg` fix on contint2001 to prevent it from downloading unrelated images [[phab:T310458|T310458]]


== 2016-04-22 ==
== 2022-08-22 ==
* 18:13 legoktm: deploying https://gerrit.wikimedia.org/r/284841
* 07:17 taavi: trying to disconnect jenkins from gearman and then re-connect to see if it helps with [[phab:T315818|T315818]]
* 08:13 legoktm: deploying https://gerrit.wikimedia.org/r/284860
* 07:12 taavi: restart zuul-merger on contint2001 [[phab:T315818|T315818]]


== 2016-04-21 ==
== 2022-08-21 ==
* 19:07 thcipriani: scap version testing should be done, puppet should no longer be disabled on hosts
* 13:07 Reedy: looks live various CI jobs (coverage etc) have been stuck for about 8.5 hours
* 18:02 thcipriani: disabling puppet on scap targets to test scap_3.1.0-1+0~20160421173204.70~1.gbp6706e0_all.deb
* 13:00 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/824862


== 2016-04-20 ==
== 2022-08-19 ==
* 22:28 thcipriani: rolling back scap version in beta, legit failure :(
* 23:04 TheresNoTime: resized deployment-mwlog01's /srv volume, restarted
* 21:52 thcipriani: testing new scap version in beta on deployment-tin
* 22:57 TheresNoTime: shutting down deployment-mwlog01 for [[phab:T315707|T315707]]
* 17:54 thcipriani: Reloading Zuul to deploy [[gerrit:284494]]
* 15:50 James_F: Docker: Build stalled out for 30 minutes; terminated and re-started.
* 13:58 hashar: Stopping HHVM on CI slaves by cherry picking a couple puppet patches | T126594
* 15:15 dancy: Upgrading scap to latest code revision in beta cluster
* 13:33 hashar: salt -v '*trusty*' cmd.run 'rm /usr/lib/x86_64-linux-gnu/hhvm/extensions/current'  # Cleanup on CI slaves for T126658
* 15:11 James_F: Docker: Building and publishing images with PHP 8.0.22 for [[phab:T315167|T315167]]
* 13:27 hashar: Restarted integration puppet master service (out of memory / mem leak)


== 2016-04-17 ==
== 2022-08-18 ==
* 01:01 legoktm: deploying https://gerrit.wikimedia.org/r/283837
* 17:01 hashar: Restarted zuul-merger on contint1001 # [[phab:T315586|T315586]]
* 16:42 hashar: Reloaded Zuul for {{Gerrit|Ie83b19699a8526bf67f5610a0aa89dcedc0e3979}}
* 13:14 awight: [beta] Deploying new kartotherian version


== 2016-04-16 ==
== 2022-08-17 ==
* 14:21 Krenair: restarted qa-morebots per request
* 14:18 zabe: fix merge conflicts in deployment-prep private repo # [[phab:T315394|T315394]]
* 14:18 Krenair: <jzerebecki> !log reloading zuul for 3f64dbd..c6411a1
* 10:27 hashar: Built image docker-registry.discovery.wmnet/releng/commit-message-validator:1.0.0  # [[phab:T315159|T315159]]


== 2016-04-13 ==
== 2022-08-16 ==
* 01:48 legoktm: deploying https://gerrit.wikimedia.org/r/282952
* 20:51 RhinosF1: beta: is down see wikitech-l and https://phabricator.wikimedia.org/T315350
* 20:30 hashar: Repooled integration-agent-docker-1028 , it was mysteriously unreachable [[phab:T315372|T315372]]
* 19:18 Krinkle: mediawiki/extensions/EventLogging$ git remote-wildcard-br-d 'wmf/1.35*' 'wmf/1.36*'  'wmf/1.37*' 'wmf/1.38*'
* 19:17 Krinkle: mediawiki/extensions/Scribunto$ git remote-wildcard-br-d 'wmf/1.35*' # ref [[phab:T303828|T303828]]
* 19:16 TheresNoTime: manually running `/usr/local/bin/wmf-beta-update-databases.py` on `deployment-deploy03`
* 17:16 TheresNoTime: soft-rebooting deployment-mediawiki12


== 2016-04-12 ==
== 2022-08-12 ==
* 19:47 bd808: Cleaned up large hhbc cache file on deployment-medaiwiki03 via `sudo service hhvm stop; sudo rm /var/cache/hhvm/fcgi.hhbc.sq3; sudo service hhvm start`
* 17:47 dancy: Restarting zuul
* 19:47 bd808: Cleaned up large hhbc cache file on deployment-medaiwiki02 via `sudo service hhvm stop; sudo rm /var/cache/hhvm/fcgi.hhbc.sq3; sudo service hhvm start`
* 17:42 dancy: Restarting Jenkins in an attempt to get CI jobs running again
* 19:46 bd808: Cleaned up large hhbc cache file on deployment-medaiwiki01 via `sudo service hhvm stop; sudo rm /var/cache/hhvm/fcgi.hhbc.sq3; sudo service hhvm start`
* 00:54 ori: On deployment-cache-<nowiki>{</nowiki>text,upload<nowiki>}</nowiki>06, ran: touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service . Certificate was close to expiry
* 19:10 Amir1: manually rebooted deployment-ores-web
* 19:08 Amir1: manually cherry-picked 282992/2 into to puppetmaster
* 17:05 Amir1: ran puppet agen in sca01 manually in /srv directory
* 11:34 hashar: Jenkins upgrading "Script Security Plugin" from 1.17 to 1.18.1 https://wiki.jenkins-ci.org/display/SECURITY/Jenkins+Security+Advisory+2016-04-11


== 2016-04-11 ==
== 2022-08-11 ==
* 21:23 csteipp: deployed and reverted oath
* 21:11 mutante: restarted phd service on phab2001
* 20:30 thcipriani: relaunched slave-agent on integration-slave-trusty-1025, back online
* 19:12 brennen: Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/merge_requests/16
* 20:19 thcipriani: integration-slave-trusty-1025 horizon console filled with INFO: task jbd2/vda1-8:170 blocked for more than 120 seconds. rebooting
* 12:26 jnuche: Reenabled CI beta sync jobs after cluster incident
* 20:13 thcipriani: killing stuck jobs, marking integration-slave-trusty-1025 as offline temporarily
* 11:48 jnuche: Temporarily disabled CI beta sync jobs until issue in cluster is resolved
* 14:42 thcipriani: deployment-mediawiki01 disk full :(
* 10:25 zabe: take deployment-prep out of read-only mode


== 2016-04-08 ==
== 2022-08-10 ==
* 22:46 matt_flaschen: Created blobs1 table for all wiki DBs on Beta Cluster
* 11:36 jnuche: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/822052
* 14:34 hashar: Image ci-jessie-wikimedia-1460125717 in wmflabs-eqiad is ready  adds package 'unzip' | T132144
* 12:49 hashar: Image ci-jessie-wikimedia-1460119481 in wmflabs-eqiad is ready , adds package 'zip' | T132144
* 09:30 hashar: Removed label hasAndroidSdk from gallium . That prevent that slave from sometime running the job apps-android-commons-build 
* 08:42 hashar: Rebased puppet master and fixed conflict with https://gerrit.wikimedia.org/r/#/c/249490/


== 2016-04-07 ==
== 2022-08-09 ==
* 20:16 hashar: deployment-mediawiki02.deployment-prep.eqiad.wmflabs , cleared up random left over stuff / big logs etc
* 22:11 James_F: Docker: Building and publishing quibble-buster-php74-coverage for PHP7.4+ coverage
* 20:08 hashar: deployment-mediawiki02.deployment-prep.eqiad.wmflabs / is full
* 21:56 James_F: Two failures in devimage build: releng/eventlogging and releng/buster-swift53 – nothing new from me, looks like they've been broken for a bit?
* 21:17 James_F: Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/merge_requests/17
* 21:07 James_F: Zuul: Enable PHP74 jobs on gate-and-submit-wmf pipeline [Re-try] for [[phab:T293924|T293924]]
* 19:42 James_F: Docker: Re-build and publish quibble-buster-php74 based on Wikimedia PHP not sury-php for [[phab:T293851|T293851]]


== 2016-04-05 ==
== 2022-08-08 ==
* 23:56 marxarelli: Removed cherry-pick and rebased /var/lib/git/operations/puppet on integration-puppetmaster after merge of https://gerrit.wikimedia.org/r/#/c/281706/
* 15:56 taavi: gerrit: used `ssh gerrit.wikimedia.org -p 29418 gerrit close-connection` to disconnect four of sgimeno's stuck sessions
* 21:58 marxarelli: Restarting puppetmaster on integration-puppetmaster
* 14:43 James_F: jforrester@doc1002:~$ sudo -u doc-uploader rm -rf /srv/doc/wikibase-vuejs-components/ for [[phab:T309872|T309872]]
* 21:53 marxarelli: Cherry picked https://gerrit.wikimedia.org/r/#/c/281706/ on integration-puppetmaster and applying on integration-slave-trusty-1014
* 13:23 James_F: Zuul: [mediawiki/libs/metrics-platform] Run Java jobs on maven file paths for [[phab:T314630|T314630]]
* 10:32 hashar: gallium removing texlive
* 10:28 jnuche: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/821166
* 10:29 hashar: gallium removing libav / ffmpeg. No more needed since jobs are no more running on that server


== 2016-04-04 ==
== 2022-08-05 ==
* 17:30 greg-g: Phabricator going down in about 10 minutes to hopefully address the overheating issue: T131742
* 16:02 James_F: Docker: Building and publishing composer-security-check:1.1.1 for [[phab:T296967|T296967]]
* 10:06 hashar: integration: salt -v '*-slave*' cmd.run 'rm /usr/local/bin/grunt; rm -fR /usr/local/lib/node_modules/grunt-cli'  | T124474
* 15:40 James_F: Zuul: [mediawiki/services/function-*] Switch coverage to node16
* 10:04 hashar: integration: salt -v '*-slave*' cmd.run 'npm -g uninstall  grunt-cli' | T124474
* 15:33 James_F: Zuul: [mediawiki/libs/metrics-platform] Add experimental regular java jobs for [[phab:T314630|T314630]]
* 03:15 greg-g: Phabricator is down
* 14:48 James_F: Zuul: Add WelpThatWorked to allow list
* 14:48 James_F: Zuul: [mediawiki/extensions/MenuEditor] BlueSpiceDiscovery dependency is a skin


== 2016-04-03 ==
== 2022-08-04 ==
* 07:02 legoktm: deploying https://gerrit.wikimedia.org/r/281079
* 15:21 dancy: Deleting beta-mediawiki-config-update-eqiad job
* 03:16 Amir1: manually rebooted deployment-ores-web and deployment-sca01
* 15:16 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/820405
* 10:01 TheresNoTime: clearing out stuck beta deployment jobs [[phab:T314378|T314378]] [[phab:T72597|T72597]]


== 2016-04-02 ==
== 2022-08-03 ==
* 22:58 Amir1: added local hack to pupetmaster to make scap3 provider more verbose
* 21:05 James_F: Zuul: Doing a graceful restart to see if this clears the fork-bombed CI jobs.
* 19:46 hashar: Upgrading Jenkins Gearman plugin to v2.0 , bring in diff registration for faster updates of Gearman server
* 20:13 taavi: reloading zuul for https://gerrit.wikimedia.org/r/820212
* 14:39 Amir1: manually added 281170/5 to beta puppetmaster
* 17:44 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/820171
* 14:22 Amir1: manually added 281161/1 to beta puppetmaster
* 14:57 brennen: gitlab: flipping admin bit for bd808 for API testing purposes
* 11:31 Reedy: deleted archived logs older than 30 days from deployment-fluorine
* 14:11 James_F: Zuul: [wikimedia/vuejs-components] Mark as archived for [[phab:T309872|T309872]]
* 12:00 James_F: Ran `zuul-test-repo design/codex postmerge` on contint2001 to finally run coverage for Codex
* 11:58 James_F: Zuul: Run publish jobs on branches called 'main' too


== 2016-04-01 ==
== 2022-08-02 ==
* 22:16 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/281046
* 19:26 James_F: Zuul: [design/codex] Switch coverage job back to -direct
* 21:13 hashar: Image ci-jessie-wikimedia-1459544873 in wmflabs-eqiad is ready
* 15:23 dancy: Deleted beta-build-scap-deb and beta-publish-deb Jenkins jobs. (https://gerrit.wikimedia.org/r/c/integration/config/+/819028)
* 20:57 hashar: Refreshing Nodepool snapshot to hopefully get npm 2.x installed T124474
* 15:22 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/819028
* 20:37 hashar: Added Luke081515 as a member of deployment-prep (beta cluster) labs project
* 07:55 TheresNoTime: cleared stuck beta deployment jobs [[phab:T72597|T72597]]
* 20:31 hashar: Dropping grunt-cli from the permanent slaves.  People can have it installed by listing it in their package.json devDependencies https://gerrit.wikimedia.org/r/#/c/280974/
* 14:06 hashar: integration: removed sudo policy permitting sudo as any member of the project for any member of the project, which included jenkins-deploy user
* 14:05 hashar: integration: removed sudo policy permitting sudo as root for any member of the project, which included jenkins-deploy user
* 11:23 bd808: Freed 4.5G on deployment-fluorine:/srv/mw-log by deleting wfDebug.log
* 04:00 Amir1: manually rebooted deployment-sca01
* 00:16 csteipp: created oathauth_users table on centralauth db in beta


== 2016-03-31 ==
== 2022-08-01 ==
* 21:19 legoktm: deploying https://gerrit.wikimedia.org/r/280756
* 23:16 James_F: Zuul: [design/codex] Switch to node16
* 13:52 hashar: rebasing integration puppetmaster (it had some merge commit )
* 23:16 James_F: 16:15:59 <+wikibugs> (Merged) jenkins-bot: Zuul: [design/codex] Switch to node16 [integration/config] - https://gerrit.wikimedia.org/r/819185 (owner: Jforrester)
* 01:40 Krinkle: Purge npm cache in integration-slave-trusty-1015:/mnt/home/jenkins-deploy/.npm was corrupted around March 23 19:00 for unknown reasons (T130895)
* 22:53 TheresNoTime: remove stuck beta deployment jobs
* 22:51 dduvall: re-armed keyholder on deploy-1004.devtools following reboot
* 22:50 James_F: Zuul: Don't use browser-direct-coverage where browser-coverage will do
* 22:49 dduvall: modified `deployment_hosts` puppet config for devtools project to allow deployments from `deploy-1004`
* 22:24 dduvall: armed keyholder with phabricator key on deploy-1004.devtools
* 22:11 dduvall: setting puppetmaster to project standalone for deploy-1004.devtools
* 21:01 James_F: Zuul: [mediawiki/extensions/Phonos] Add comment about deployment timing for [[phab:T314306|T314306]]
* 21:00 James_F: Zuul: [mediawiki/extensions/BlueSpiceCustomMenu] Add MenuEditor dependency
* 15:53 taavi: reloading zuul for https://gerrit.wikimedia.org/r/819097
* 09:14 TheresNoTime: clearing stuck beta CI jobs


== 2016-03-30 ==
== 2022-07-29 ==
* 19:32 twentyafterfour: deleted some nutcracker and hhvm log files on deployment-mediawiki01 to free space
* 22:16 James_F: Zuul: Configure CI for the forthcoming REL1_39 branches for [[phab:T313919|T313919]]
* 15:37 hashar: Gerrit has trouble sending emails T131189
* 18:00 brennen: using standalone puppetmaster in devtools to test phabricator scap3 changes
* 13:48 Reedy: deployment-prep Make that deployment-tmh01
* 13:48 Reedy: deployment-prep upgrade hhvm on deployment-mediawiki01 and reboot
* 13:35 Reedy: deployment-prep upgrade hhvm on deployment-mediawiki03 and reboot
* 12:16 gehel: deployment-prep restarting varnish on deployment-cache-text04
* 11:04 Amir1: cherry-picked 280413/1 in beta puppetmaster, manually running puppet agent in deployment-ores-web
* 10:22 Amir1: cherry-picking 280403 to beta puppetmaster and manually running puppet agent in deployment-ores-web


== 2016-03-29 ==
== 2022-07-28 ==
* 23:22 marxarelli: running jenkins-jobs update config/ 'mwext-donationinterfacecore125-testextension-zend53' to deploy https://gerrit.wikimedia.org/r/#/c/280261/  
* 17:54 brennen: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/818189/
* 19:52 Amir1: manually updated puppetmaster, deleted SSL cert key in deployment-ores-web in VM, running puppet agent manually
* 02:20 jzerebecki: reloading zuul fo 46923c8..c0937ee


== 2016-03-26 ==
== 2022-07-27 ==
* 22:38 jzerebecki: reloading zuul for 2d7e050..46923c8
* 13:55 James_F: Zuul: [mediawiki/core] Add a non-vendor php80 job for main branch [[phab:T300463|T300463]]
* 13:08 James_F: Zuul: [mediawiki/core] Make php80 voting on REL1_38 for [[phab:T274965|T274965]]
* 13:04 James_F: Zuul: Add php81 experimental job everywhere we have php80
* 12:39 James_F: Zuul: [mediawiki/extensions/WikibaseLexeme] Add WikibaseLexemeCirrusSearch dep
* 03:48 Krinkle: Click "Disable publishing" for a dozen repos created recently, including OAuthRateLimiter, ref [[phab:T143162|T143162]], [[phab:T193565|T193565]]


== 2016-03-25 ==
== 2022-07-25 ==
* 23:55 marxarelli: deleting instances integration-slave-trusty-1002 and integration-slave-trusty-1005
* 22:16 dduvall: re-enabled puppet on untrusted runners following testing of https://gerrit.wikimedia.org/r/c/operations/puppet/+/815769
* 23:54 marxarelli: deleting jenkins nodes integration-slave-trusty-1002 and integration-slave-trusty-1005
* 21:25 dduvall: disabling puppet on untrusted gitlab-runners to test deployment of https://gerrit.wikimedia.org/r/c/operations/puppet/+/815769
* 23:41 marxarelli: completed rolling manual deploy of https://gerrit.wikimedia.org/r/#/c/279640/ to trusty slaves
* 23:27 marxarelli: starting rolling offline/remount/online of trusty slaves to increase tmpfs size
* 23:22 marxarelli: pooled new trusty slaves integration-slave-trusty-1024 and integration-slave-trusty-1025
* 23:13 jzerebecki: reloading zuul fro 0aec21d..2d7e050
* 22:14 marxarelli: creating new jenkins node for integration-slave-trusty-1024
* 22:11 marxarelli: rebooting integration-slave-trusty-{1024,1025} before pooling as replacements for trusty-1002 and trusty-1005
* 21:06 marxarelli: repooling integration-slave-trusty-{1005,1002} to help with load while replacement instances are provisioning
* 16:59 marxarelli: depooling integration-slave-trusty-1002 until DNS resolution can be resolved. still investigating disk space issue


== 2016-03-24 ==
== 2022-07-23 ==
* 16:39 thcipriani: restarted rsync service on deployment-tin
* 17:43 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/816251
* 13:45 thcipriani|afk: rearmed keyholder on deployment-tin
* 04:41 Krinkle: beta-update-databases-eqiad and beta-scap-eqiad stuck for over 8 hours (IRC notifier plugin deadlock)
* 03:28 Krinkle: beta-mediawiki-config-update-eqiadqueued has been stuck for over 5 hours.


== 2016-03-23 ==
== 2022-07-21 ==
* 23:00 Krinkle: rm-rf integration-slave-trusty-1013:/mnt/home/jenkins-deploy/tmpfs/jenkins-2/karma-54925082/ (bad permissions, caused Karma issues)
* 21:55 dancy: Upgrading scap to 4.11.2-1+0~20220720160115.349~1.gbpd4a6cb in beta cluster
* 19:02 legoktm: restarted zuul


== 2016-03-22 ==
== 2022-07-20 ==
* 17:40 legoktm: deploying https://gerrit.wikimedia.org/r/278926
* 15:43 dancy: Upgrading scap to 4.11.1-1+0~20220720154238.348~1.gbp94de82 in beta cluster
* 13:19 James_F: Zuul: [mediawiki/extensions/VueTest] Add extension-codehealth pipeline


== 2016-03-21 ==
== 2022-07-19 ==
* 21:55 hashar: zuul: almost all MediaWiki extensions migrated to run the npm job on Nodepool (with Node.js 4.3)  T119143 . All tested. Will monitor the build results that ran overnight tomorrow
* 17:40 dancy: Upgrading scap to 4.11.0-1+0~20220719173732.346~1.gbpe07bc9 in beta cluster
* 20:28 hashar: Mass running npm-node-4.3 jobs against MediaWiki extensions to make sure they all pass ( https://gerrit.wikimedia.org/r/#/c/278004/  |  T119143 )
* 17:00 urbanecm: deployment-prep: urbanecm@deployment-mwmaint02:~$ mwscript extensions/GrowthExperiments/maintenance/migrateWikitextMentorList.php --wiki=arwiki # [[phab:T310905|T310905]]
* 17:40 elukey: executed git rebase --interactive on deployment-puppetmaster.deployment-prep.eqiad.wmflabs to remove https://gerrit.wikimedia.org/r/#/c/278713/
* 15:46 elukey: hacked manually the cdh puppet submodule on deployment-puppetmaster.deployment-prep.eqiad.wmflabs - please let me know if interfere with anybody's tests
* 14:24 elukey: executed git submodule update --init on deployment-puppetmaster.deployment-prep.eqiad.wmflabs
* 11:25 elukey: beta: cherry picked https://gerrit.wikimedia.org/r/#/c/278713/ to test an updated to the cdh module (analytics)
* 11:13 hashar: beta: rebased puppet master which had a conflict on https://gerrit.wikimedia.org/r/#/c/274711/  which got merged meanwhile (saves Elukey )
* 11:02 hashar: beta: added Elukey (wikimedia ops) to the project as member and admin


== 2016-03-19 ==
== 2022-07-18 ==
* 13:04 hashar: Jenkins: added ldap-labs-codfw.wikimedia.org as a fallback LDAP server  T130446
* 19:43 dancy: Upgrading scap to 4.10.0-1+0~20220718175214.344~1.gbpe518a1 in beta cluster
* 13:40 Lucas_WMDE: lucaswerkmeister-wmde@deployment-deploy03:~$ sql wikishared --write < /srv/mediawiki-staging/php-master/extensions/CampaignEvents/db_patches/mysql/tables-generated.sql # [[phab:T311752|T311752]]
* 10:40 hashar: Refreshing Jenkins jobs for https://gerrit.wikimedia.org/r/814745
* 09:58 hashar: Refreshing Jenkins jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/814730 jjb: update php jobs to have php-pcov included
* 09:46 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/814728


== 2016-03-18 ==
== 2022-07-17 ==
* 17:16 jzerebecki: reloading zuul for e33494f..89a9659
* 13:00 taavi: reloading zuul for https://gerrit.wikimedia.org/r/814356


== 2016-03-17 ==
== 2022-07-16 ==
* 21:10 thcipriani: updating scap on deployment-tin to test D133
* 00:10 mutante: doc1002 - sudo systemctl start rsync-doc-doc2001.codfw.wmnet - Icinga alerted after an 'rsync warning: some files vanished before they could be transferred (code 24)' - but all is ok on next attempt
* 18:31 cscott: updated OCG to version c1a8232594fe846bd2374efd8f7c20d7e97ac449
* 09:34 hashar: deployment-jobrunner01 deleted /var/log/apache/*.gz  T130179
* 09:04 hashar: Upgrading hhvm and related extensions on jobrunner01  T130179


== 2016-03-16 ==
== 2022-07-15 ==
* 14:28 hashar: Updated jobs having the package manager cache system (castor) via https://gerrit.wikimedia.org/r/#/c/277774/
* 15:59 hashar: Built pcov php docker images [[phab:T280170|T280170]]
* 15:46 hashar: contint2001: `docker-system-prune-dangling.service`  it failed overnight cause Docker was not running. That should clear Icinga state # [[phab:T313119|T313119]]
* 14:05 James_F: Zuul: [mediawiki/tools/wikilambda-cli] Switch to node16 jobs
* 13:05 James_F: Docker: Building node16 images for CI for [[phab:T313075|T313075]], this time actually.
* 12:30 hashar: Starting docker on contint2001.wikimedia.org # [[phab:T313119|T313119]]
* 12:20 hashar: rebuilding `php??` images for pcov https://gerrit.wikimedia.org/r/c/integration/config/+/694621 # [[phab:T280170|T280170]]
* 10:55 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/813967
* 10:49 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/813932


== 2016-03-15 ==
== 2022-07-14 ==
* 15:17 jzerebecki: added wikidata.beta.wmflabs.org in https://wikitech.wikimedia.org/wiki/Special:NovaAddress to deployment-cache-text04.deployment-prep.eqiad.wmflabs
* 18:50 James_F: Docker: Building node16 images for CI for [[phab:T313075|T313075]]
* 14:19 hashar: Image ci-jessie-wikimedia-1458051246 in wmflabs-eqiad is ready  T124447
* 14:52 James_F: Zuul: [mediawiki/skins/BlueSpiceSkin] Archive for [[phab:T203215|T203215]]
* 14:14 hashar: Refreshing Nodepool snapshot images so it get a fresh copy of slave-scripts  T124447
* 14:48 James_F: Zuul: [mediawiki/extensions/BlueSpiceExtensions] Archive
* 14:08 hashar: Deploying slave script change https://gerrit.wikimedia.org/r/#/c/277508/ "npm-install-dev.py: Use config.dev.yaml instead of config.yaml" for T124447
* 14:42 James_F: Zuul: [mediawiki/extensions/BlueSpiceBookshelfUI] Archive for [[phab:T268085|T268085]]
* 14:38 James_F: Zuul: [mediawiki/tools/wikilambda-cli] Install node14 CI


== 2016-03-14 ==
== 2022-07-13 ==
* 22:18 greg-g: new jobs weren't processing in Zuul, lego fixed it and blamed Reedy
* 23:23 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/813720
* 20:13 hashar: Updating Jenkins jobs mwext-Wikibase-* so they no more rely on --with-phpunit ( ping @hoo https://gerrit.wikimedia.org/r/#/c/277330/ )
* 20:31 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/813707
* 17:03 Krinkle: Doing full Zuul restart due to deadlock (T128569)
* 10:18 moritzm: re-enabled systemd unit for logstash on deployment-logstash2


== 2016-03-11 ==
== 2022-07-12 ==
* 22:42 legoktm: deploying https://gerrit.wikimedia.org/r/276901
* 17:29 Amir1: dropping tl_namespace and tl_title from templatelinks in fawiki ([[phab:T312865|T312865]])
* 19:41 legoktm: legoktm@integration-slave-trusty-1001:/mnt/jenkins-workspace/workspace$ sudo rm -rf mwext-Echo-testextension-* # because it was broken


== 2016-03-10 ==
== 2022-07-11 ==
* 20:22 hashar: Nodepool Image ci-jessie-wikimedia-1457641052 in wmflabs-eqiad is ready
* 22:55 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/812934
* 20:19 hashar: Refreshing Nodepool to include the 'varnish' package T128188 
* 19:46 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/812467
* 20:05 hashar: apt-get upgrade integration-slave-jessie1001  (bring in ffmpeg update and nodejs among other things)
* 12:22 hashar: Nodeppol Image ci-jessie-wikimedia-1457612269 in wmflabs-eqiad is ready
* 12:18 hashar: Nodepool: rebuilding image to get mathoid/graphoid packages included (hopefully) T119693 T128280


== 2016-03-09 ==
== 2022-07-10 ==
* 17:56 bd808: Cleaned up git clone state in deployment-tin.deployment-prep:/srv/mediawiki-staging/php-master and queued beta-code-update-eqiad to try again (T129371)
* 00:07 Krinkle: krinkle@mediawiki12$ sudo enable-puppet
* 17:48 bd808: Git clone at deployment-tin.deployment-prep:/srv/mediawiki-staging/php-master in completely horrible state. Investigating
* 17:22 bd808: Fixed https://integration.wikimedia.org/ci/job/beta-mediawiki-config-update-eqiad/4452/
* 17:19 bd808: Manually cleaning up broken rebase in deployment-tin.deployment-prep:/srv/mediawiki-staging
* 16:27 bd808: Removed cherry-pick of https://gerrit.wikimedia.org/r/#/c/274696 ; manually cleaned up systemd unit and restarted logstash on deployment-logstash2
* 14:59 hashar: Image ci-jessie-wikimedia-1457535250 in wmflabs-eqiad is ready T129345
* 14:57 hashar: Rebuilding snapshot image to get Xvfb enabled at boot time T129345
* 13:04 moritzm: cherrypicked patch to deployment-prep which provides a systemd unit for logstash
* 10:52 hashar: Image ci-jessie-wikimedia-1457520493 in wmflabs-eqiad is ready
* 10:29 hashar: Nodepool: created new image and refreshing snapshot in attempt to get Xvfb running T129320 T128090


== 2016-03-08 ==
== 2022-07-09 ==
* 23:42 legoktm: running CentralAuth's checkLocalUser.php --verbose=1 --delete=1 on deployment-tin for T115198
* 20:39 ori: ori@deployment-mediawiki12:~$ sudo apt install php-tideways-xhprof-dbgsym
* 21:33 hashar: Nodepool  Image ci-jessie-wikimedia-1457472606 in wmflabs-eqiad is ready
* 17:25 ori: Cherry-picked {{Gerrit|Ief73cc553}} (varnish: use libvmod-querysort on Beta Cluster) on deployment-prep Puppetmaster. Can be reverted if there are any issues.
* 19:23 hashar: Zuul inject DISPLAY https://gerrit.wikimedia.org/r/#/c/273269/
* 06:16 Krinkle: krinkle@mediawiki12$ sudo disable-puppet
* 16:03 hashar: Image ci-jessie-wikimedia-1457452766 is ready T128090
* 06:08 ori: ori@deployment-mediawiki12: userdel systemd-coredump, followed by apt install systemd-coredump
* 15:59 hashar: Nodepool: refreshing snapshot image to ship browsers+Xvfb for T128090
* 05:50 Krinkle: krinkle@deployment-mediawiki-12$ sudo apt-get install systemd-coredump  # ref [[phab:T312689|T312689]]
* 14:27 hashar: Mass refreshed CI slave-scripts 1d2c60d..e27c292
* 13:38 hashar: Rebased integration puppet master. Dropped a make-wmf-branch patch and the one for raita role
* 11:26 hashar: Nodepool: created new snapshot to set puppet $::labsproject : ci-jessie-wikimedia-1457436175 hoping to fix hiera lookup T129092
* 02:51 ori: deployment-prep Updating HHVM on deployment-mediawiki01
* 02:27 ori: deployment-prep Updating HHVM on deployment-mediawiki02
* 01:50 Krinkle: integration-saltmater: salt -v '*slave-trusty*' cmd.run 'rm -rf /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm/src/skins/BlueSky' (T117710)
* 01:50 Krinkle: integration-saltmater: salt -v '*slave-trusty*' cmd.run 'rm -rf /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm-composer/src/skins/BlueSky'


== 2016-03-07 ==
== 2022-07-07 ==
* 21:03 hashar: Nodepool upgraded to 0.1.1-wmf.4 , it no more waits 1 minute before deleted a used node | T118573
* 22:42 TheresNoTime: clear stuck beta deployment jobs (again), [[phab:T72597|T72597]]
* 20:05 hashar: Upgrading Nodepool from 0.1.1-wmf3 to 0.1.1-wmf.4 with andrewbogott | T118573
* 21:10 TheresNoTime: clear stuck beta deployment jobs, [[phab:T72597|T72597]]
* 16:47 urbanecm: deployment-prep: wikiadmin@172.16.3.206(enwiki)> delete from growthexperiments_mentor_mentee where gemm_mentor_id=93651; # testing a specific workflow in Special:MentorDashboard
* 12:22 hashar: integration: rebooting `integration-agent-docker-1039` [[phab:T312534|T312534]]


== 2016-03-06 ==
== 2022-07-05 ==
* 10:20 legoktm: deploying https://gerrit.wikimedia.org/r/274911
* 14:17 dwalden: restarted mathoid service on deployment-docker-mathoid01
* 11:39 hashar: Reloaded Zuul for `skip selenium for Wikibase repo/rest-api` https://gerrit.wikimedia.org/r/c/integration/config/+/811258
* 08:49 hauskatze: Diffusion rORES repository. Changed URI settings: enabled SSH push for mirroring; disabled HTTP {{!}} [[phab:T311390|T311390]]


== 2016-03-04 ==
== 2022-06-30 ==
* 19:31 hashar: Nodepool Image ci-jessie-wikimedia-1457119603 in wmflabs-eqiad is ready - T128846
* 22:02 TheresNoTime: unstuck beta-mediawiki-config-update-eqiad jobs, will comment at [[phab:T72597|T72597]]
* 13:29 hashar: Nodepool Image ci-jessie-wikimedia-1457097785 in wmflabs-eqiad is ready
* 21:05 TheresNoTime: cancelled beta-code-update-eqiad#398138 to make way for pending beta-scap-sync-world#57641, queued another beta-code-update-eqiad
* 08:42 hashar: CI deleting integration-slave-precise-1001 (2 executors). It is not in labs DNS which causes bunch of issues, no need for the capacity anymore. T128802
* 16:47 taavi: reloading zuul to deploy https://gerrit.wikimedia.org/r/810053
* 02:49 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/274889
* 00:11 Krinkle: salt -v --show-timeout '*slave*' cmd.run "bash -c 'cd /srv/deployment/integration/slave-scripts; git pull'"


== 2016-03-03 ==
== 2022-06-29 ==
* 23:37 legoktm: salt -v --show-timeout '*slave*' cmd.run "bash -c 'cd /srv/deployment/integration/slave-scripts; git pull'"
* 14:48 ori: Clearing data from incomplete migration on Wikifunctionswiki via sql.php
* 22:34 legoktm: mysql not running on integration-slave-precise-1002, manually starting (T109704)
* 13:39 TheresNoTime: clearing stuck beta deployment jobs, watching to ensure they catch up :')
* 22:30 legoktm: mysql not running on integration-slave-precise-1011, manually starting (T109704)
* 22:19 legoktm: mysql not running on integration-slave-precise-1012, manually starting (T109704)
* 22:07 legoktm: deploying https://gerrit.wikimedia.org/r/274821
* 21:58 Krinkle: Reloading Zuul to deploy (EventLogging and AdminLinks)  https://gerrit.wikimedia.org/r/274821  /
* 18:49 thcipriani: killing deployment-bastion since it is no longer used
* 14:23 hashar: https://integration.wikimedia.org/ci/computer/integration-slave-trusty-1011/ is out of disk space


== 2016-03-02 ==
== 2022-06-28 ==
* 16:22 jzerebecki: reloading zuul for 9398fa1..943f17b
* 14:45 TheresNoTime: clear stuck beta deployment jobs, now running & will keep an eye
* 10:38 hashar: Zuul should no more be caught in death loop due to Depends-On on an  event-schemas change. Hole filled with https://gerrit.wikimedia.org/r/#/c/274356/ T128569
* 13:39 hashar: gerrit: added `Cindy-the-browser-test-bot` to the `Service Users` group https://gerrit.wikimedia.org/r/admin/groups/d39fe9cefd40ca1a07e372c0d7bd7e72ce2e4a2f,members {{!}} [[phab:T311370|T311370]]
* 08:53 hashar: gerrit set-account Jsahleen --inactive    T108854
* 09:37 hashar: phabricator: changed username of rORES Phab>Gerrit replication from `phab` to `phabricator` # [[phab:T311390|T311390]]
* 01:19 thcipriani: force restarting zuul because the queue is very stuck https://www.mediawiki.org/wiki/Continuous_integration/Zuul#Restart
* 01:13 thcipriani: following steps for gearman deadlock: https://www.mediawiki.org/wiki/Continuous_integration/Zuul#Known_issues


== 2016-03-01 ==
== 2022-06-27 ==
* 23:10 Krinkle: Updated Jenkins configuration to also support php5 and hhvm for Console Sections detection of "PHPUnit"
* 21:19 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/809022
* 17:05 hashar: gerrit: set accounts inactive for Eloquence and Mgrover. Former employees of wmf and mail bounceback
* 19:28 Reedy: Reloading Zuul to deploy https://phabricator.wikimedia.org/T308406
* 16:41 hashar: Restarted Jenkins
* 16:32 hashar: Bunch of Jenkins job got stall because I have killed threads in Jenkins to unblock  integration-slave-trusty-1003 :-(
* 12:14 hashar:  integration-slave-trusty-1003 is back online
* 12:13 hashar: Might have killed the proper Jenkins thread to unlock integration-slave-trusty-1003
* 12:03 hashar: Jenkins can not pool back integration-slave-trusty-1003  Jenkins master has a bunch of blocking threads pilling up with hudson.plugins.sshslaves.SSHLauncher.afterDisconnect() locked somehow
* 11:41 hashar: Rebooting integration-slave-trusty-1003 (does not reply to salt / ssh)
* 10:34 hashar: Image ci-jessie-wikimedia-1456827861 in wmflabs-eqiad is ready
* 10:24 hashar: Refreshing Nodepool snapshot instances
* 10:22 hashar: Refreshing Nodepool base image to speed instances boot time (dropping open-iscsi package https://gerrit.wikimedia.org/r/#/c/273973/ )


== 2016-02-29 ==
== 2022-06-24 ==
* 16:23 hashar: salt -v '*slave*' cmd.run 'rm -fR /mnt/jenkins-workspace/workspace/mwext*jslint' T127362
* 20:52 taavi: added `denisse` as a member
* 16:17 hashar: Deleting all mwext-.*-jslint jobs from Jenkins. Paladox has migrated all of them to jshint/jsonlint generic jobs T127362
* 16:16 hashar: Deleting all mwext-.*-jslint jobs from Jenkins. Paladox has migrated all of them to jshint/jsonlint generic jobs
* 09:46 hashar: Jenkins installing Yaml Axis Plugin 0.2.0


== 2016-02-28 ==
== 2022-06-23 ==
* 01:30 Krinkle: Rebooting integration-slave-precise-1012 – Might help T109704 (MySQL not running)
* 15:59 taavi: reload zuul for https://gerrit.wikimedia.org/r/808021


== 2016-02-26 ==
== 2022-06-22 ==
* 15:14 jzerebecki: salt -v --show-timeout '*slave*' cmd.run "bash -c 'cd /srv/deployment/integration/slave-scripts; git pull'" T128191
* 17:36 taavi: gerrit: add tfellows to the extension-OpenBadges group per request in [[phab:T308278|T308278]]
* 15:14 jzerebecki: salt -v --show-timeout '*slave*' cmd.run "bash -c 'cd /srv/deployment/integration/slave-scripts; git pull'"
* 17:35 taavi: gerrit: create group extension-JsonData with robla in it, make it an owner of mediawiki/extensions/JsonData per request in [[phab:T303147|T303147]]
* 14:44 hashar: (since it started, dont be that scared!)
* 16:19 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/807586
* 14:44 hashar: Nodepool has triggered 40 000 instances
* 09:35 hashar: Switched `gitlab-prod-1001.devtools.eqiad1.wikimedia.cloud` instance to use the project Puppet master `puppetmaster-1001.devtools.eqiad1.wikimedia.cloud`
* 11:53 hashar: Restarted memcached on deployment-memc02  T128177
* 09:08 hashar: contint1001 , contint2002: deleting `.git/logs` from all zuul-merger repositories. We do not need the reflog `sudo -u zuul find /srv/zuul/git -type d -name .git -print -execdir rm -fR .git/logs \;` # [[phab:T307620|T307620]]
* 11:53 hashar: memcached process on deployment-memc02 seems to have a nice leak of socket usages (from lost) and plainly refuse connections (bunch of CLOSE_WAIT)  T128177
* 09:00 hashar: contint1001 , contint2002: setting `core.logallrefupdates=false` on all Zuul merger git repositories: `sudo -u zuul find /srv/zuul/git -type d -name .git -print -execdir git config core.logallrefupdates false \;` # [[phab:T307620|T307620]]
* 11:53 hashar: memcached process on deployment-memc02 seems to have a nice leak of socket usages (from lost) and plainly refuse connections (bunch of CLOSE_WAIT)
* 07:46 hashar: Building operations-puppet docker image for https://gerrit.wikimedia.org/r/c/integration/config/+/807180
* 11:40 hashar: deployment-memc04 find /etc/apt -name '*proxy' -delete  (prevented apt-get update)
* 11:26 hashar: beta: salt -v '*' cmd.run 'apt-get -y install ruby-msgpack'  . I am tired of seeing puppet debug messages: "Debug: Failed to load library 'msgpack' for feature 'msgpack'"
* 11:24 hashar: puppet keep restarting nutcracker apparently T128177
* 11:20 hashar: Memcached error for key "enwiki:flow_workflow%3Av2%3Apk:63dc3cf6a7184c32477496d63c173f9c:4.8" on server "127.0.0.1:11212": SERVER HAS FAILED AND IS DISABLED UNTIL TIMED RETRY


== 2016-02-25 ==
== 2022-06-21 ==
* 22:38 hashar: beta: maybe deployment-jobunner01 is processing jobs a bit faster now.  Seems like hhvm went wild
* 22:01 brennen: gitlab-runners: re-registering all shared runners
* 22:23 hashar: beta: jobrunner01  had apache/hhvm killed somehow .... Blame me
* 17:55 dancy: Upgrading scap to 4.9.4-1+0~20220621174226.320~1.gbp56e4d4 in beta cluster
* 21:56 hashar: beta: stopped jobchron / jobrunner on deployment-jobrunner01  and restarting them by running puppet
* 21:49 hashar: beta did a git-deploy of jobrunner/jobrunner hoping to fix puppet run on deployment-jobrunner01 and apparently it did! T126846
* 11:21 hashar: deleting workspace /mnt/jenkins-workspace/workspace/browsertests-Wikidata-WikidataTests-linux-firefox-sauce on slave-trusty-1015
* 10:08 hashar: Jenkins upgraded T128006
* 01:44 legoktm: deploying https://gerrit.wikimedia.org/r/273170
* 01:39 legoktm: deploying https://gerrit.wikimedia.org/r/272955 (undeployed) and https://gerrit.wikimedia.org/r/273136
* 01:37 legoktm: deploying https://gerrit.wikimedia.org/r/273136
* 00:31 thcipriani: running puppet on beta to update scap to latest packaged version: sudo salt -b '10%' -G 'deployment_target:scap/scap' cmd.run 'puppet agent -t'
* 00:20 thcipriani: deployment-tin not accepting jobs for some time, ran through https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code.2Fdb_update, is back now


== 2016-02-24 ==
== 2022-06-20 ==
* 19:55 legoktm: legoktm@deployment-tin:~$ mwscript extensions/ORES/maintenance/PopulateDatabase.php --wiki=enwiki
* 16:30 urbanecm: add sgimeno as a project member (Growth engineer with need for access)
* 18:30 bd808: "configuration file '/etc/nutcracker/nutcracker.yml' syntax is invalid"
* 15:50 ori: On deployment-cache-<nowiki>{</nowiki>text,upload<nowiki>}</nowiki>06, ran: touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service ([[phab:T310957|T310957]])
* 18:27 bd808: nutcracker dead on mediawiki01; investigating
* 14:07 ori: restarted acme-chief on deployment-acme-chief03
* 17:20 hashar: Deleted Nodepool instances so new ones get to use the new snapshot ci-jessie-wikimedia-1456333979
* 17:12 hashar: Refreshing nodepool snapshot. Been stall since Feb 15th T127755
* 17:01 bd808: https://wmflabs.org/sal/releng missing SAL data since 2016-02-20T20:19 due to bot crash; needs to be backfilled from wikitech data (T127981)
* 16:43 hashar: sal on elastic search is stall https://phabricator.wikimedia.org/T127981
* 15:07 hasharAW: beta app servers have lost access to memcached due to bad nutcracker conf | T127966
* 14:41 hashar: beta: we have a lost a memcached server 11:51am UTC


== 2016-02-23 ==
== 2022-06-17 ==
* 22:45 thcipriani: deployment-puppetmaster is in a weird rebase state
* 17:15 ori: provisioned deployment-cache-text07 in deployment-prep to test query normalization via VCL
* 22:25 legoktm: running sync-common manually on deployment-mediawiki02
* 01:08 TimStarling: on deployment-docker-cpjobqueue01 and deployment-docker-changeprop01 I redeployed the changeprop configuration, reverting the PHP 7.4 hack
* 09:59 hashar: Deleted a bunch of mwext-.*-jslint jobs that are no more in used (migrated to either 'npm' or  'jshint' / 'jsonlint' )


== 2016-02-22 ==
== 2022-06-16 ==
* 22:06 bd808: Restarted puppetmaster service on deployment-puppetmaster to "fix" error "invalid byte sequence in US-ASCII"
* 12:24 hashar: gitlab: runner-1030: `docker volume prune -f`
* 17:46 jzerebecki: ssh integration-slave-trusty-1017.eqiad.wmflabs 'sudo -u jenkins-deploy rm -rf /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm/src/.git/config.lock
* 12:24 hashar: gitlab: runner-1026: `docker volume prune -f`
* 16:47 gehel: deployment-prep upgrading deployment-logstash2 to elasticsearch 1.7.5
* 10:02 elukey: ran `scap install-world --batch` to allow scap/puppet to work on ml-cache100[2,3]
* 10:26 gehel: deployment-prep upgrading elastic-search to 1.7.5 on deployment-elastic0[5-8]


== 2016-02-20 ==
== 2022-06-15 ==
* 20:19 Krinkle: beta-code-update-eqiad job repeatedly stuck at "IRC notifier plugin"
* 22:39 brennen: phabricator: tagged release/2022-06-15/1 ([[phab:T310742|T310742]])
* 19:29 Krinkle: beta-code-update-eqiad broken because deployment-tin:/srv/mediawiki-staging/php-master/extensions/MobileFrontend/includes/MobileFrontend.hooks.php was modified on the server without commit
* 16:31 hashar: integration-agent-docker-1035: docker image prune
* 19:22 Krinkle: Various beta-mediawiki-config-update-eqiad jobs have been stuck 'queued' for > 24 hours
* 15:26 dancy: Upgrading scap to 4.9.4-1+0~20220615151557.315~1.gbped3b8d in beta cluster


== 2016-02-19 ==
== 2022-06-14 ==
* 12:09 hashar: killed https://integration.wikimedia.org/ci/job/beta-code-update-eqiad/  been running for 13 hours. Blocked because slave went offline due to labs reboots yesterday
* 21:30 TheresNoTime: clear out stuck `beta-scap-sync-world` jobs (repeatedly per each queued `beta-mediawiki-config-update-eqiad` job), queued jobs now running. monitored for until each job had run successfully. jobs up to date
* 10:15 hashar: Creating a bunch of repository in GitHub to fix Gerrit replication errors
* 17:18 brennen: starting 1.39.0-wmf.16 ([[phab:T308069|T308069]]) transcript in deploy1002:~brennen/1.39.0-wmf.16.log
* 13:35 TheresNoTime: clear stuck `beta-scap-sync-world` job, other queued jobs now running. Cancel running `beta-update-databases-eqiad` job, will ensure it runs on the next timer
* 00:42 TimStarling: on deployment-deploy03 removed helm2, as was done in production


== 2016-02-18 ==
== 2022-06-13 ==
* 19:20 legoktm: deploying https://gerrit.wikimedia.org/r/271583 and https://gerrit.wikimedia.org/r/271581, both no-ops
* 22:04 TheresNoTime: cleared out stalled Jenkins beta jobs on `deployment-deploy03`, manually started `beta-code-update-eqiad` job & watched to completion. all caught up
* 18:14 legoktm: deploying https://gerrit.wikimedia.org/r/271012
* 04:33 hashar: Restarting Docker on contint1001.wikimedia.org , apparently can't build images anymore
* 17:36 legoktm: deploying https://gerrit.wikimedia.org/r/271555
* 16:01 hashar: deleting instance  integration-slave-precise-1003  think we have enough precise slaves
* 10:44 hashar: Nodepool: JenkinsException: Could not parse JSON info for server[https://integration.wikimedia.org/ci/]


== 2016-02-17 ==
== 2022-06-12 ==
* 07:36 legoktm: deploying https://gerrit.wikimedia.org/r/271201
* 21:13 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/804777
* 01:01 yuvipanda: attempting to turn off NFS on 52 instances on deployment-prep project


== 2016-02-16 ==
== 2022-06-10 ==
* 23:22 yuvipanda: new instances on deployment-prep no longer get NFS because of https://wikitech.wikimedia.org/w/index.php?title=Hiera%3ADeployment-prep&type=revision&diff=311783&oldid=311781
* 15:20 James_F: Zuul: [mediawiki/extensions/SearchVue] Add initial CI jobs for [[phab:T309932|T309932]]
* 23:18 hashar: jenkins@gallium find /var/lib/jenkins/config-history/nodes -maxdepth 1 -type d -name 'ci-jessie*' -exec rm -vfR {} \;
* 08:28 hashar: Reloaded Zuul to remove mediawiki/services/parsoid from CI dependencies # https://gerrit.wikimedia.org/r/c/integration/config/+/803990
* 23:17 hashar: Jenkins accepting slave creations again. Root cause is /var/lib/jenkins/config-history/nodes/ has reached the 32k inode limit.
* 04:27 TimStarling: on deployment-deploy03 running scap sync-world -v with PHP 7.4 for [[phab:T295578|T295578]]
* 23:14 hashar: Jenkins: Could not create rootDir /var/lib/jenkins/config-history/nodes/ci-jessie-wikimedia-34969/2016-02-16_22-40-23
* 04:03 TimStarling: on deployment-deploy03 running scap sync-world -v with PHP 7.2 for [[phab:T295578|T295578]] sanity check
* 23:02 hashar: Nodepool can not authenticate with Jenkins anymore. Thus it can not add slaves it spawned.
* 22:56 hashar: contint: Nodepool instances pool exhausted
* 21:14 andrewbogott: deployment-logstash2 migration finished
* 20:49 jzerebecki: reloading zuul for 3bf7584..67fec7b
* 19:58 andrewbogott: migrating deployment-logstash2 to labvirt1010
* 19:00 hashar: tin: checking out mw 1.27.0-wmf.14
* 15:23 hashar: integration-make-wmfbranch : /mnt/make-wmf-branch  mount now has gid=wikidev and group setuid (i.e. mode 2775)
* 15:20 hashar: integration-make-wmfbranch : change tmpfs to /mnt/make-wmf-branch  (from /var/make-wmf-branch )
* 11:30 jzerebecki: T117710 integration-saltmaster:~# salt -v '*slave-trusty*' cmd.run 'rm -rf /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm-composer/src/skins/BlueSky'
* 09:52 hashar: will cut the wmf branches this afternoon starting around 14:00 CET


== 2016-02-15 ==
== 2022-06-09 ==
* 16:28 jzerebecki: reloading zuul for 2d16ad3..3bb0afa
* 22:49 dancy: Upgrading scap to 4.9.1-1+0~20220609211227.304~1.gbpe48c42 in beta cluster
* 16:10 hashar: Image ci-jessie-wikimedia-1455552377 in wmflabs-eqiad is ready
* 16:39 brennen: gitlab shared runners: re-registering to apply image allowlist configuration
* 15:25 jzerebecki: reloading zuul for e174335..2d16ad3
* 15:23 hashar: Image ci-jessie-wikimedia-1455549539 in wmflabs-eqiad is ready
* 15:19 hashar: Regenerating Nodepool snapshot. Slave scripts have 0 bytes...
* 15:04 hashar: Slave scripts added to Nodepool instances! Image ci-jessie-wikimedia-1455548346 in wmflabs-eqiad is ready
* 11:05 hashar: Image ci-jessie-wikimedia-1455534001 in wmflabs-eqiad is ready
* 07:52 legoktm: deploying https://gerrit.wikimedia.org/r/270686
* 06:52 legoktm: legoktm@gallium:/srv/org/wikimedia/doc$ sudo -u jenkins-slave rm -rf EventLogging/ GuidedTour/ MultimediaViewer/ TemplateData/
* 06:22 legoktm: deploying https://gerrit.wikimedia.org/r/270677
* 06:12 legoktm: deploying https://gerrit.wikimedia.org/r/270675
* 06:02 legoktm: deploying https://gerrit.wikimedia.org/r/270674
* 05:56 legoktm: deploying https://gerrit.wikimedia.org/r/270673
* 05:32 legoktm: deploying https://gerrit.wikimedia.org/r/270670
* 04:05 legoktm: deploying https://gerrit.wikimedia.org/r/270667
* 03:26 legoktm: deploying https://gerrit.wikimedia.org/r/270665
* 02:56 legoktm: deploying https://gerrit.wikimedia.org/r/270657


== 2016-02-14 ==
== 2022-06-08 ==
* 23:54 legoktm: deploying https://gerrit.wikimedia.org/r/270656
* 17:14 hashar: Reloaded Zuul for {{Gerrit|I39342265033e82ae13998f53defe6612dc6819b4}}
* 23:25 legoktm: deploying https://gerrit.wikimedia.org/r/270654
* 15:57 dancy: Set `profile::mediawiki::php::restarts::ensure: present` in deployment-prep hiera config for [[phab:T237033|T237033]]
* 23:13 legoktm: also deploying https://gerrit.wikimedia.org/r/#/c/265098/
* 09:28 hashar: Reloaded Zuul for "Add doc publish for Translate" https://gerrit.wikimedia.org/r/792134
* 23:11 legoktm: deploying https://gerrit.wikimedia.org/r/270651
* 05:18 bd808: tools.stashbot Testing after restart (T126419)


== 2016-02-13 ==
== 2022-06-06 ==
* 06:42 bd808: restarted nutcracker on deployment-mediawiki01
* 14:37 James_F: Zuul: [mediawiki/extensions/ImageSuggestions] Mark as in production for [[phab:T302711|T302711]]
* 06:32 bd808: jobrunner on deployment-jobrunner01 enabled after reverting changes from T87928 that caused T126830
* 05:51 bd808: disabled jobrunner process on jobrunner01; queue full of jobs broken by T126830
* 05:31 bd808: trebuchet clone of /srv/jobrunner/jobrunner broken on jobrunner01; failing puppet runs
* 05:25 bd808: jobrunner process on deployment-jobrunner01 badly broken; investigating
* 05:20 bd808: Ran https://phabricator.wikimedia.org/P2273 on deployment-jobrunner01.deployment-prep.eqiad.wmflabs; freed ~500M; disk utilization still at 94%


== 2016-02-12 ==
== 2022-06-02 ==
* 23:54 hashar: beta cluster broken since 20:30 UTC  https://logstash-beta.wmflabs.org/#/dashboard/elasticsearch/fatalmonitor  havent looked
* 15:33 dancy: Upgrading scap to 4.8.1-1+0~20220602153109.295~1.gbp318d9c in beta cluster
* 17:36 hashar: salt -v '*slave-trusty*' cmd.run 'apt-get -y install texlive-generic-extra'    # T126422
* 11:26 hashar: Restarting Jenkins on contint2001
* 17:32 hashar: adding texlive-generic-extra on CI slaves by cherry picking https://gerrit.wikimedia.org/r/#/c/270322/ - T126422
* 11:19 hashar: Restarting Jenkins on releases1002
* 17:19 hashar: get rid of integration-dev   it is broken somehow
* 17:10 hashar: Nodepool back at spawning instances.  contintcloud has been migrated in wmflabs
* 16:51 thcipriani: running  sudo salt '*' -b '10%' deploy.fixurl to fix deployment-prep trebuchet urls
* 16:31 hashar: bd808 added support for saltbot to update tasks automagically!!!! T108720
* 03:10 yurik: attempted to sync graphoid from gerrit 270166 from deployment-tin, but it wouldn't sync.  Tried to git pull sca02, submodules wouldn't pull


== 2016-02-11 ==
== 2022-05-31 ==
* 22:53 thcipriani: shutting down deployment-bastion
* 21:16 dancy: Upgrading scap to 4.8.0-1+0~20220531211114.292~1.gbp8dbbcf in beta cluster
* 21:28 hashar: pooling back slaves 1001 to 1006
* 17:40 dancy: Upgrading scap to 4.8.0-1+0~20220531173912.291~1.gbp21a7ef in beta cluster
* 21:18 hashar: re enabling hhvm service on slaves ( https://phabricator.wikimedia.org/T126594 ) Some symlink is missing and only provided by the upstart script grrrrrrr https://phabricator.wikimedia.org/T126658
* 17:33 dancy: Reverted to scap 4.8.0-1+0~20220524160924.288~1.gbp794a08 in beta cluster
* 20:52 legoktm: deploying https://gerrit.wikimedia.org/r/270098
* 17:07 dancy: Upgrading scap to 4.8.0-1+0~20220531170512.289~1.gbp143729 in beta cluster
* 20:35 hashar: depooling the six recent slaves: /usr/lib/x86_64-linux-gnu/hhvm/extensions/current/luasandbox.so cannot open shared object file
* 20:29 hashar: pooling integration-slave-trusty-1004 integration-slave-trusty-1005 integration-slave-trusty-1006
* 20:14 hashar: pooling integration-slave-trusty-1001 integration-slave-trusty-1002 integration-slave-trusty-1003
* 19:35 marxarelli: modifying deployment server node in jenkins to point to deployment-tin
* 19:27 thcipriani: running sudo salt -b '10%' '*' cmd.run 'puppet agent -t' from deployment-salt
* 19:27 twentyafterfour: Keeping notes on the ticket: https://phabricator.wikimedia.org/T126537
* 19:24 thcipriani: moving deployment-bastion to deployment-tin
* 17:59 hashar: recreated instances with proper names:  integration-slave-trusty-{1001-1006}
* 17:52 hashar: Created integration-slave-trusty-{1019-1026} as m1.large  (note 1023 is an exception it is for Android).  Applied role::ci::slave , lets wait for puppet to finish
* 17:42 Krinkle: Currently testing https://gerrit.wikimedia.org/r/#/c/268802/ in Beta Labs
* 17:27 hashar: Depooling all the ci.medium slaves and deleting them.
* 17:27 hashar: I tried. The ci.medium instances are too small and MediaWiki tests really need 1.5GBytes of memory :-(
* 16:00 hashar: rebuilding integration-dev https://phabricator.wikimedia.org/T126613
* 15:27 Krinkle: Deploy Zuul config change https://gerrit.wikimedia.org/r/269976
* 11:46 hashar: salt -v '*' cmd.run '/etc/init.d/apache2 restart'  might help for Wikidata browser tests failling
* 11:32 hashar: disabling hhvm service on CI slaves ( https://phabricator.wikimedia.org/T126594 , cherry picked both patches )
* 10:50 hashar: reenabled puppet on CI. All transitioned to a 128MB tmpfs (was 512MB)
* 10:16 hashar: pooling back integration-slave-trusty-1009 and integration-slave-trusty-1010  (tmpfs shrunken)
* 10:06 hashar: disabling puppet on all CI slaves. Trying to lower tmpfs 512MB to 128MB  ( https://gerrit.wikimedia.org/r/#/c/269880/ )
* 02:45 legoktm: deploying https://gerrit.wikimedia.org/r/269853 https://gerrit.wikimedia.org/r/269893


== 2016-02-10 ==
== 2022-05-30 ==
* 23:54 hashar_: depooling Trusty slaves that only have 2GB of ram that is not enough.  https://phabricator.wikimedia.org/T126545
* 11:47 jelto: apply gitlab-settings to gitlab1004 - [[phab:T307142|T307142]]
* 22:55 hashar_: gallium: find /var/lib/jenkins/config-history/config -type f -wholename '*/2015*' -delete  (  https://phabricator.wikimedia.org/T126552 )
* 11:46 jelto: apply gitlab-settings to gitlab1003 - [[phab:T307142|T307142]]
* 22:34 Krinkle: Zuul is back up and procesing Gerrit events, but jobs are still queued indefinitely. Jenkins is not accepting new jobs
* 22:31 Krinkle: Full restart of Zuul. Seems Gearman/Zuul got stuck. All executors were idling. No new Gerrit events processed either.
* 21:22 legoktm: cherry-picking https://gerrit.wikimedia.org/r/#/c/269370/ on integration-puppetmaster again
* 21:17 hashar: CI dust have settled.  Krinkle and I have pooled a lot more Trusty slaves to accommodate for the overload caused by switching to php55 (jobs run on Trusty)
* 21:08 hashar: pooling trusty slaves 1009, 1010, 1021, 1022  with 2 executors  (they are ci.medium)
* 20:38 hashar: cancelling mediawiki-core-jsduck-publish  and mediawiki-core-doxygen-publish jobs manually.  They will catch up on next merge
* 20:34 Krinkle: Pooled integration-slave-trusty-1019 (new)
* 20:28 Krinkle: Pooled integration-slave-trusty-1020 (new)
* 20:24 Krinkle: created integration-slave-trusty-1019 and integration-slave-trusty-1020 (ci1.medium)
* 20:18 hashar: created integration-slave-trusty-1009 and 1010 (trusty ci.medium)
* 20:06 hashar: creating integration-slave-trusty-1021 and integration-slave-trusty-1022 (ci.medium)
* 19:48 greg-g: that cleanup was done by apergos
* 19:48 greg-g: did cleanup across all integration slaves, some were very close to out of room. results:  https://phabricator.wikimedia.org/P2587
* 19:43 hashar: Dropping slaves Precise m1.large  integration-slave-precise-1014 and  integration-slave-precise-1013 , most load shifted to Trusty (php53 -> php55 transition)
* 18:20 Krinkle: Creating a Trusty slave to support increased demand following MediaWIki php53(precise)>php55(trusty) bump
* 16:06 jzerebecki: reloading zuul for 41a92d5..5b971d1
* 15:42 jzerebecki: reloading zuul for 639dd40..41a92d5
* 14:12 jzerebecki: recover a bit of disk space: integration-saltmaster:~# salt --show-timeout '*slave*' cmd.run 'rm -rf /mnt/jenkins-workspace/workspace/*WikibaseQuality*'
* 13:46 jzerebecki: reloading zuul for 639dd40
* 13:15 jzerebecki: reloading zuul for 3be81c1..e8e0615
* 08:07 legoktm: deploying https://gerrit.wikimedia.org/r/269619
* 08:03 legoktm: deploying https://gerrit.wikimedia.org/r/269613 and https://gerrit.wikimedia.org/r/269618
* 06:41 legoktm: deploying https://gerrit.wikimedia.org/r/269607
* 06:34 legoktm: deploying https://gerrit.wikimedia.org/r/269605
* 02:59 legoktm: deleting 14GB broken workspace of  mediawiki-core-php53lint from  integration-slave-precise-1004
* 02:37 legoktm: deleting /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm-composer on trusty-1017, it had a skin cloned into it
* 02:26 legoktm: queuing mwext jobs server-side to identify failing ones
* 02:21 legoktm: deploying https://gerrit.wikimedia.org/r/269582
* 01:03 legoktm: deploying https://gerrit.wikimedia.org/r/269576


== 2016-02-09 ==
== 2022-05-28 ==
* 23:17 legoktm: deploying https://gerrit.wikimedia.org/r/269551
* 19:09 TheresNoTime: deployment-deploy04 live, not referenced by anything [[phab:T309437|T309437]]
* 23:02 legoktm: gracefully restarting zuul
* 22:57 legoktm: deploying https://gerrit.wikimedia.org/r/269547
* 22:29 legoktm: deploying https://gerrit.wikimedia.org/r/269540
* 22:18 legoktm: re-enabling puppet on all CI slaves
* 22:02 legoktm: reloading zuul to see if it'll pickup the new composer-php53 job
* 21:53 legoktm: enabling puppet on just integration-slave-trusty-1012
* 21:52 legoktm: cherry-picked https://gerrit.wikimedia.org/r/#/c/269370/ onto integration-puppetmaster
* 21:50 legoktm: disabling puppet on all trusty/precise CI slaves
* 21:40 legoktm: deploying https://gerrit.wikimedia.org/r/269533
* 17:49 marxarelli: disabled/enabled gearman in jenkins, connection works this time
* 17:49 marxarelli: performed stop/start of zuul on gallium to restore zuul and gearman
* 17:45 marxarelli: "Failed: Unable to Connect" in jenkins when testing gearman connection
* 17:40 marxarelli: killed old zull process manually and restarted service
* 17:39 marxarelli: restart of zuul fails as well. old process cannot be killed
* 17:38 marxarelli: reloading zuul fails with "failed to kill 13660: Operation not permitted"
* 16:06 bd808: Deleted corrupt integration-slave-precise-1003:/mnt/jenkins-workspace/workspace/mediawiki-core-php53lint/.git
* 15:11 hashar: mira: /srv/mediawiki-staging/multiversion/checkoutMediaWiki 1.27.0-wmf.13 php-1.27.0-wmf.13
* 14:51 hashar: ./make-wmf-branch -n 1.27.0-wmf.13 -o master
* 14:50 hashar: pooling back integration-slave-precise1001 - 1004.  Manually fetched git repos in workspace for  mediawiki core php53
* 14:49 hashar: make-wmf-branch instance: created a local ssh key pair and set the config to use User: hashar
* 14:13 hashar: pooling  https://integration.wikimedia.org/ci/computer/integration-slave-precise-1012/  Mysql is back .. Blame puppet
* 14:12 hashar: de pooling  https://integration.wikimedia.org/ci/computer/integration-slave-precise-1012/  Mysql is gone somehow
* 14:04 hashar: Manually git fetching  mediawiki-core in /mnt/jenkins-workspace/workspace/mediawiki-core-php53lint of slaves precise 1001 to 1004  (git on Precise is remarkably too slow)
* 13:28 hashar: salt '*trusty*' cmd.run 'update-alternatives --set php /usr/bin/hhvm'
* 13:28 hashar: salt '*precise*' cmd.run 'update-alternatives --set php /usr/bin/php5'
* 13:18 hashar: salt -v --batch=3 '*slave*' cmd.run 'puppet agent -tv'
* 13:15 hashar: removing https://gerrit.wikimedia.org/r/#/c/269370/ from CI puppet master
* 13:14 hashar: slave recurse infinitely doing /bin/bash -eu /srv/deployment/integration/slave-scripts/bin/mw-install-mysql.sh  then loop over /bin/bash /usr/bin/php maintenance/install.php --confpath /mnt/jenkins-workspace/workspace/mediawiki-core-qunit/src --dbtype=mysql --dbserver=127.0.0.1:3306 --dbuser=jenkins_u2 --dbpass=pw_jenkins_u2 --dbname=jenkins_u2_mw --pass testpass TestWiki WikiAdmin  https://phabricator.wikimedia.org/T126327
* 12:46 hashar: Mass testing php loop of death:  salt -v '*slave*' cmd.run 'timeout 2s /srv/deployment/integration/slave-scripts/bin/php --version'
* 12:40 hashar: mass rebooting CI slaves from wikitech
* 12:39 hashar: salt -v '*' cmd.run "bash -c 'cd /srv/deployment/integration/slave-scripts; git pull'"
* 12:33 hashar: all slaves dieing due to PHP looping
* 12:02 legoktm: re-enabling puppet on all trusty/precise slaves
* 11:20 legoktm: cherry-picked https://gerrit.wikimedia.org/r/#/c/269370/ on integration-puppetmaster
* 11:20 legoktm: enabling puppet just on integration-slave-trusty-1012
* 11:13 legoktm: disabling puppet on all *(trusty|precise)* slaves
* 10:26 hashar: pooling in  integration-slave-trusty-1018
* 03:19 legoktm: deploying https://gerrit.wikimedia.org/r/269359
* 02:53 legoktm: deploying https://gerrit.wikimedia.org/r/238988
* 00:39 hashar: gallium edited /usr/share/python/zuul/local/lib/python2.7/site-packages/zuul/trigger/gerrit.py  and modified:  replication_timeout = 300 -> replication_timeout = 10
* 00:37 hashar: live hacking Zuul code to have it stop sleeping() on force merge
* 00:36 hashar: killing zuul


== 2016-02-08 ==
== 2022-05-27 ==
* 23:48 legoktm: finally deploying https://gerrit.wikimedia.org/r/269327
* 22:55 zabe: zabe@deployment-mwmaint02:~$ mwscript extensions/WikiLambda/maintenance/updateTypedLists.php --wiki=wikifunctionswiki --db # started ~20 min ago
* 23:14 hashar: zuul promote --pipeline gate-and-submit --changes 269065,2 https://gerrit.wikimedia.org/r/#/c/269065/
* 22:49 TheresNoTime: manually running database update script: samtar@deployment-deploy03:~$ /usr/local/bin/wmf-beta-update-databases.py
* 23:10 hashar: pooling integration-slave-precise-1001 1002 1004
* 22:09 TheresNoTime: samtar@deployment-deploy03:~$ sudo keyholder arm
* 22:47 hashar: Err need to reboot newly provisioned instances before adding them to Jenkins (kernel upgrade,apache restart etc)
* 21:44 TheresNoTime: hard rebooted deployment-deploy03 as soft reboot unresponsive
* 22:45 hashar: Pooled https://integration.wikimedia.org/ci/computer/integration-slave-precise-1003/
* 21:44 bd808: `sudo wmcs-openstack role add --user zabe --project deployment-prep projectadmin` ([[phab:T309419|T309419]])
* 22:25 hashar: integration-slave-precise-{1001-1004} applied role::ci::slave::labs, running puppet in slaves.  I have added the instances as Jenkins slaves and put them offline.  Whenever puppet is done, we can mark them online in Jenkins then monitor the jobs running on them are working properly
* 21:10 zabe: zabe@deployment-deploy03:~$ sudo keyholder arm
* 22:15 hashar: Provisioning integration-slave-precise-{1001-1004} https://phabricator.wikimedia.org/T126274 (need more php53 slots)
* 20:53 bd808: `sudo wmcs-openstack role add --user samtar --project deployment-prep projectadmin` ([[phab:T309415|T309415]])
* 22:13 hashar: Deleted cache-rsync instance superseded by castor instance
* 20:49 dancy: Initiated hard reboot of deployment-deploy03.deployment-prep
* 22:10 hashar: Deleting pmcache.integration.eqiad.wmflabs (was to investigate various kind of central caches).
* 20:14 marxarelli: aborting pending mediawiki-extensions-php53 job for CheckUser
* 20:08 bd808: toggled "Enable Gearman" off and on in Jenkins to wake up deployment-bastion workers
* 14:54 hashar: nodepool: refreshed snapshot image , Image ci-jessie-wikimedia-1454942958 in wmflabs-eqiad is ready
* 14:47 hashar: regenerated nodepool reference image (got rid of grunt-cli https://gerrit.wikimedia.org/r/269126 )
* 09:41 legoktm: deploying https://gerrit.wikimedia.org/r/269093 https://gerrit.wikimedia.org/r/269094
* 09:36 hashar: restarting integration puppetmaster (out of memory / cannot fork)
* 06:11 bd808: tgr set $wgAuthenticationTokenVersion on beta cluster (test run for T124440)
* 02:09 legoktm[NE]: deploying https://gerrit.wikimedia.org/r/268047
* 00:57 legoktm[NE]: deploying https://gerrit.wikimedia.org/r/268031


== 2016-02-06 ==
== 2022-05-26 ==
* 18:34 jzerebecki: reloading zuul for bdb2ed4..46ccca9
* 18:33 dancy: Updated Jenkins beta-* job configs
* 16:51 TheresNoTime: manually triggered beta-update-databases-eqiad post-merge of {{Gerrit|2c7b5825}}
* 16:51 brennen: puppetmaster-1001.devtools: resetting ops/puppet checkout to production branch


== 2016-02-05 ==
== 2022-05-25 ==
* 13:30 hashar: beta cleaning out /data/project/logs/archive  was from pre logstash area.  We no more log this way since May 2015 apparently
* 18:38 TheresNoTime: (@ ~18:20UTC) samtar@deployment-mwmaint02:~$ mwscript resetUserEmail.php --wiki=wikidatawiki Mahir256 [snip] [[phab:T309230{{!}}T309230]]
* 13:29 hashar: beta deleting /data/project/swift-disk  created in august 2014 , unused since june 2015.  Was a fail attempt at bringing swift to beta
* 15:46 dancy: Restarted apache2 on gerrit1001
* 13:27 hashar: beta: reclaiming disk space from extensions.git. On bastion: find /srv/mediawiki-staging/php-master/extensions/.git/modules -maxdepth 1 -type d -print -execdir git gc \;
* 13:03 hashar: integration-slave-trusty-1011 went out of disk space. Did some brute clean up and git gc.
* 05:21 Tim: configured mediawiki-extensions-qunit to only run on integration-slave-trusty-1017, did a rebuild and then switched it back


== 2016-02-04 ==
== 2022-05-24 ==
* 22:08 jzerebecki: reloading zuul for bed7be1..f57b7e2
* 15:15 dancy: Upgrading scap to 4.7.1-1+0~20220524151055.286~1.gbpe809e8 in beta cluster
* 21:51 hashar: salt-key -d integration-slave-jessie-1001.eqiad.wmflabs
* 13:35 James_F: Zuul: [mediawiki/tools/code-utils] Add composer test CI for [[phab:T309099|T309099]]
* 21:50 hashar: salt-key -d integration-slave-precise-1011.eqiad.wmflabs
* 11:36 TheresNoTime: cleared stuck beta deployment jobs per https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code/db_update
* 00:57 bd808: Got deployment-bastion processing Jenkins jobs again via instructions left by my past self at https://phabricator.wikimedia.org/T72597#747925
* 00:43 bd808: Jenkins agent on deployment-bastion.eqiad doing the trick where it doesn't pick up jobs again


== 2016-02-03 ==
== 2022-05-23 ==
* 22:24 bd808: Manually ran sync-common on deployment-jobrunner01.eqiad.wmflabs to pickup wmf-config changes that were missing (InitializeSettings, Wikibase, mobile)
* 19:21 inflatador: Deleted deployment-elastic0[5-7] in favor of newer bullseye hosts [[phab:T299797|T299797]]
* 17:43 marxarelli: Reloading Zuul to deploy previously undeployed Icd349069ec53980ece2ce2d8df5ee481ff44d5d0 and Ib18fe48fe771a3fe381ff4b8c7ee2afb9ebb59e4
* 18:37 dancy: Reverted to scap 4.7.1-1+0~20220505181519.270~1.gbpeb47ae in beta cluster
* 15:12 hashar: apt-get upgrade deployment-sentry2
* 18:35 dancy: Upgrading beta cluster scap to 4.7.1-1+0~20220523183110.280~1.gbpaa0826
* 15:03 hashar: redeployed rcstream/rcstream on deployment-stream by using git-deploy on deployment-bastion
* 14:49 James_F: Zuul: Enforce Postgres and SQLite support via in-mediawiki-tarball
* 14:55 hashar: upgrading deployment-stream
* 08:37 elukey: move kafka jumbo in deployment-prep to fixed uid/gid - [[phab:T296982|T296982]]
* 14:42 hashar: pooled back integration-slave-trusty-1015  Seems ok
* 08:29 elukey: move kafka main in deployment-prep to fixed uid/gid - [[phab:T296982|T296982]]
* 14:35 hashar: manually triggered a bunch of browser tests jobs
* 08:06 elukey: move kafka logging in deployment-prep to fixed uid/gid - [[phab:T296982|T296982]]
* 11:40 hashar: apt-get upgrade deployment-ms-be01 and deployment-ms-be02
* 11:32 hashar: fixing puppet.conf on deployment-memc04
* 11:09 hashar: restarting beta cluster puppetmaster just in case
* 11:07 hashar: beta: apt-get upgrade on delpoyment-cache* hosts  and checking puppet
* 10:59 hashar: integration/beta:  deleting /etc/apt/apt.conf.d/*proxy  files. There is no need for them, in fact web proxy is not reachable from labs
* 10:53 hashar: integration: switched puppet repo back to 'production' branch, rebased.
* 10:49 hashar: various beta cluster have puppet errors ..
* 10:46 hashar: integration-slave-trusty-1013 heading to out of disk space on /mnt ...
* 10:42 hashar: integration-slave-trusty-1016 out of disk space on /mnt ...
* 03:45 bd808: Puppet failing on deployment-fluorine with "Error: Could not set uid on user[datasets]: Execution of '/usr/sbin/usermod -u 10003 datasets' returned 4: usermod: UID '10003' already exists"
* 03:44 bd808: Freed 28G by deleting deployment-fluorine:/srv/mw-log/archive/*2015*
* 03:42 bd808: Ran deployment-bastion.deployment-prep:/home/bd808/cleanup-var-crap.sh and freed 565M


== 2016-02-02 ==
== 2022-05-22 ==
* 18:32 marxarelli: Reloading Zuul to deploy If1f3cb60f4ccb2c1bca112900dbada03a8588370
* 18:39 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/795818/
* 17:42 marxarelli: cleaning mwext-donationinterfacecore125-testextension-php53 workspace on integration-slave-precise-1013
* 17:06 ostriches: running sync-common on mw2051 and mw1119
* 09:38 hashar: Jenkins is fully up and operational
* 09:33 hashar: restarting Jenkins
* 08:47 hashar: pooling back integration-slave-precise1011 , puppet run got fixed ( https://phabricator.wikimedia.org/T125474 )
* 03:48 legoktm: deploying https://gerrit.wikimedia.org/r/267828
* 03:29 legoktm: deploying https://gerrit.wikimedia.org/r/266941
* 00:42 legoktm: due to T125474
* 00:42 legoktm: marked integration-slave-precise-1011 as offline
* 00:39 legoktm: precise-1011 slave hasn't had a puppet run in 6 days


== 2016-02-01 ==
== 2022-05-21 ==
* 23:53 bd808: Logstash working again; I applied a change to the default mapping template for Elasticsearch that ensures that fields named "timestamp" are indexed as plain strings
* 23:05 legoktm: deployed https://gerrit.wikimedia.org/r/c/integration/config/+/794756/
* 23:46 bd808: Elasticsearch index template for beta logstash cluster making crappy guesses about syslog events; dropped 2016-02-01 index; trying to fix default mappings
* 14:11 hashar: Icinga reports `Gerrit Health Check SSL Expiry` errors filed as [[phab:T308908|T308908]]
* 23:09 bd808: HHVM logs causing rejections during document parse when inserting in Elasticsearch from logstash. They contain a "timestamp" field that looks like "Feb  1 22:56:39" which is making the mapper in Elasticsearch sad.
* 23:04 bd808: Elasticsearch on deployment-logstash2 rejecting all documents with 400 status. Investigating
* 22:50 bd808: Copying deployment-logstash2.deployment-prep:/var/log/logstash/logstash.log to /srv for debugging later
* 22:48 bd808: deployment-logstash2.deployment-prep:/var/log/logstash/logstash.log is 11G of fail!
* 22:46 bd808: root partition on deployment-logstash2 full
* 22:43 bd808: No data in logstash since 2016-01-30T06:55:37.838Z; investigating
* 15:33 hashar: Image ci-jessie-wikimedia-1454339883 in wmflabs-eqiad is ready
* 15:01 hashar: Refreshing Nodepool image. Might have npm/grunt properly set up
* 03:15 legoktm: deploying https://gerrit.wikimedia.org/r/267630


== 2016-01-31 ==
== 2022-05-20 ==
* 13:35 hashar: Jenkins IRC bot started falling at Jan 30 01:04:00 2016  for whatever reason....  Should be fine now
* 16:21 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/793809
* 13:33 hashar: cancelling/aborting jobs that are stuck while reporting to IRC (mostly browser tests and beta cluster jobs)
* 13:32 hashar: Jenkins jobs are being blocked because they can no more report back to IRC :-(((
* 13:28 hashar: Jenkins jobs are being blocked because they can no more report back to IRC :-(((


== 2016-01-30 ==
== 2022-05-19 ==
* 12:46 hashar: integration-slave-jessie-1001 : fixed puppet.con server name and ran puppet
* 19:34 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/793527
* 14:31 hashar: Reloaded zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/793458 {{!}} Don't re-trigger the test pipeline on patches with C+2 already


== 2016-01-29 ==
== 2022-05-18 ==
* 18:43 thcipriani: updated scap on beta
* 19:31 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/793028
* 16:44 thcipriani: deployed scap updates on beta
* 18:45 brennen: gitlab: created placeholder /repos/mediawiki group for squatting purposes
* 11:58 _joe_: upgraded hhvm to 3.6 wm8 in deployment-prep
* 08:29 hashar: Updating SSH Build agent from 1.31.5 to 1.32.0 on CI Jenkins to prevent an issue when uploading `remoting.jar`  # [[phab:T307339|T307339]]#7937268
* 07:32 hashar: Deleting Jenkins agent configuration for `integration-castor03` # [[phab:T252071|T252071]]


== 2016-01-28 ==
== 2022-05-17 ==
* 23:22 MaxSem: Updated portals on betalabs to master
* 23:26 James_F: Zuul: [mediawiki/extensions/Phonos] Install basic quibble CI for [[phab:T308558|T308558]]
* 22:23 hashar: salt '*slave-precise*' cmd.run 'apt-get install php5-ldap'  ( https://phabricator.wikimedia.org/T124613 )  will need to be puppetized
* 18:17 thcipriani: cleaning npm cache on slave machines: salt -v '*slave*' cmd.run 'sudo -i -u jenkins-deploy -- npm cache clean'
* 18:12 thcipriani: running npm cache clean on integration-slave-precise-1011 sudo -i -u jenkins-deploy -- npm cache clean
* 15:25 hashar: apt-get upgrade deployment-sca01 and deployment-sca02
* 15:09 hashar: fixing puppet.conf hostname on deployment-upload  deployment-conftool  deployment-tmh01 deployment-zookeeper01 and deployment-urldownloader
* 15:06 hashar: fixing puppet.con hostname on deployment-upload.deployment-prep.eqiad.wmflabs and running puppet
* 15:00 hashar: Running puppet on deployment-memc02 and deployment-elastic07 . It is catching up with lot of changes
* 14:59 hashar: fixing puppet hostnames on deployment-elastic07
* 14:59 hashar: fixing puppet hostnames on deployment-memc02
* 14:55 hashar: Deleted salt keys deployment-pdf01.eqiad.wmflabs and deployment-memc04.eqiad.wmflabs  (obsolete,  entries with '.deployment-prep.' are already there)
* 07:38 jzerebecki: reload zuul for 4951444..43a030b
* 05:55 jzerebecki: doing https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code.2Fdb_update
* 03:49 mobrovac: deployment-prep re-enabled puppet on deployment-restbase0x
* 02:49 mobrovac: deployment-prep deployment-restbase01 disabled puppet to set up cassandra for  
* 02:27 mobrovac: deployment-prep recreating deployment-restbase01 for T125003
* 02:23 mobrovac: deployment-prep deployment-restbase02 disabled puppet to recreate deployment-restbase01 for T125003
* 01:42 mobrovac: deployment-prep recreating deployment-sca02 for T125003
* 01:28 mobrovac: deployment-prep recreating deployment-sca01 for T125003
* 00:36 mobrovac: deployment-prep re-imaging deployment-mathoid for T125003
* 00:02 jzerebecki: integration-slave-trusty-1016:~$ sudo -i rm -rf /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm/src/skins/Donate


== 2016-01-27 ==
== 2022-05-16 ==
* 23:49 jzerebecki: integration-slave-precise-1011:~$ sudo -i /etc/init.d/salt-minion restart
* 19:31 inflatador: bking@deployment-elastic07 halted deployment-elastic07 in beta ES cluster; will decom on Friday [[phab:T299797|T299797]]
* 23:46 jzerebecki: work around https://phabricator.wikimedia.org/T117710 : salt --show-timeout '*slave*' cmd.run 'rm -rf /mnt/jenkins-workspace/workspace/mwext-testextension-hhvm/src/skins/BlueSky'
* 19:02 inflatador: bking@deployment-elastic06 halted deployment-elastic06 in beta ES cluster; will decom on Friday [[phab:T299797|T299797]]
* 21:19 cscott: updated OCG to version 64050af0456a43344b32e3e93561a79207565eaf (should be no-op after yesterday's deploy)
* 08:33 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/791809
* 10:29 hashar: triggered bunch of browser tests, deployment-redis01 was dead/faulty
* 10:08 hashar: mass restarting redis-server process on deployment-redis01 (for https://phabricator.wikimedia.org/T124677 )
* 10:07 hashar: mass restarting redis-server process on deployment-redis01
* 09:00 hashar: beta:  commenting out "latency-monitor-threshold 100" parameter from any /etc/redis/redis.conf we have ( https://phabricator.wikimedia.org/T124677 ). Puppet will not reapply it unless distribution is Jessie


== 2016-01-26 ==
== 2022-05-14 ==
* 16:51 cscott: updated OCG to version 64050af0456a43344b32e3e93561a79207565eaf
* 23:19 James_F: Zuul: Add Dreamy_Jazz to CI allow list
* 12:14 hashar: Added Jenkins IRC bot (wmf-insecte) to #wikimedia-perf for https://gerrit.wikimedia.org/r/#/c/265631/
* 23:17 James_F: Zuul: [mediawiki/extensions/LocalisationUpdate] Move out of production section
* 09:30 hashar: restarting Jenkins to upgrade the gearman plugin with https://review.openstack.org/#/c/271543/
* 20:25 urbanecm: add TheresNoTime (samtar) as a project member per request
* 04:18 bd808: integration-slave-jessie-1001:/mnt full; cleaned up 15G of files in /mnt/pbuilder/build (27 hours after the last time I did that)


== 2016-01-25 ==
== 2022-05-13 ==
* 18:59 twentyafterfour: started redis-server on deployment-redis01 by commenting out latency-monitor-threshold from the redis.conf
* 22:59 James_F: Zuul: [mediawiki/extensions/SocialProfile] Add WikiEditor as a CI dependency
* 15:22 hashar: CI: fixing kernels not upgrading via:  rm /boot/grub/menu.lst ; update-grub -y  (i.e.: regenerate the Grub menu from scratch)
* 22:52 James_F: Zuul: Add Tranve to CI allow list
* 14:21 hashar: integration-slave-trusty-1015.integration.eqiad.wmflabs  is gone. I have failed the kernel upgrade / grub update
* 22:01 hashar: reloaded zuul for https://gerrit.wikimedia.org/r/791688
* 01:35 bd808: integration-slave-jessie-1001:/mnt full; cleaned up 15G of files in /mnt/pbuilder/build
* 18:58 inflatador: bking@deployment-elastic05 halted deployment-elastic05 in beta ES cluster; will decom in 1 wk [[phab:T299797|T299797]]
* 17:18 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/791644/
* 13:16 taavi: added user Zoranzoki21 to extension-HidePrefix gerrit group [[phab:T305317|T305317]]


== 2016-01-24 ==
== 2022-05-12 ==
* 06:45 legoktm: deploying https://gerrit.wikimedia.org/r/266039
* 22:09 inflatador: bking@deployment-elastic05 banned deployment-elastic05 from beta ES cluster in preparation for decom [[phab:T299797|T299797]]
* 06:13 legoktm: deploying https://gerrit.wikimedia.org/r/266041
* 19:53 hashar: gerrit: triggering full replication to gerrit2001 to test [[phab:T307137|T307137]]
* 16:00 hashar: contint2001 and contint1001 now automatically run `docker system prune --force` every day  and `docker system prune --force` on Sunday {{!}} https://gerrit.wikimedia.org/r/c/operations/puppet/+/773784/
* 15:05 brennen: gitlab-prod-1001.devtools: soft reboot
* 00:46 brennen: gitlab: disabling container registries on all existing projects ([[phab:T307537|T307537]])


== 2016-01-22 ==
== 2022-05-11 ==
* 23:58 legoktm: removed skins from mwext-qunit workspace on trusty-1013 slave
* 23:20 brennen: gitlab-prod-1001.devtools: container registry currently enabled
* 23:34 legoktm: rm -rf /mnt/jenkins-workspace/workspace/mediawiki-phpunit-php53 on slave precise 1012
* 18:58 brennen: gitlab-prod-1001.devtools: setting to use devtools standalone puppetmaster
* 22:45 legoktm: deploying https://gerrit.wikimedia.org/r/265864
* 22:27 hashar: rebooted all CI slaves using OpenStackManager
* 22:09 hashar: rebooting deployment-redis01 (kernel upgrade)
* 21:22 hashar: Image ci-jessie-wikimedia-1453497269 in wmflabs-eqiad is ready (with node 4.2 for https://phabricator.wikimedia.org/T119143 )
* 21:14 hashar: updating nodepool snapshot based on new image
* 21:12 hashar: rebuilding nodepool reference image
* 20:04 hashar: Image ci-jessie-wikimedia-1453492820 in wmflabs-eqiad is ready
* 20:00 hashar: Refreshing nodepool image to hopefully get Nodejs 4.2.4 https://phabricator.wikimedia.org/T124447  https://gerrit.wikimedia.org/r/#/c/265802/
* 16:32 hashar: Nuked corrupted git repo on integration-slave-precise-1012 /mnt/jenkins-workspace/workspace/mediawiki-extensions-php53
* 12:23 hashar: beta: reinitialized keyholder on deployment-bastion. The proxy apparently  had no identity
* 09:32 hashar: beta cluster Jenkins job have been stalled for 9hours and 25 minutes. Disabling/reenabling the Gearman plugin to remove the deadlock


== 2016-01-21 ==
== 2022-05-10 ==
* 21:41 hashar: restored role::mail::mx on deployment-mx
* 12:06 hashar: Updating Quibble jobs to image 1.4.5 with Memcached enabled {{!}} https://gerrit.wikimedia.org/r/c/integration/config/+/790641 {{!}} [[phab:T300340|T300340]]
* 21:36 hashar: dropping role::mail::mx from deployment-mx  to let  puppet  run
* 10:55 hashar: Updating `wmf-quibble-*` jobs to Quibble 1.4.5 # https://gerrit.wikimedia.org/r/c/integration/config/+/790638/
* 21:33 hashar: rebooting deployment-jobrunner01  / kernel upgrade / /tmp is only 1MBytes
* 08:36 hashar: Updating wikibase-client-docker and wikibase-repo-docker to Quibble 1.4.5 + supervisord https://gerrit.wikimedia.org/r/c/integration/config/+/790621
* 21:19 hashar: fixing up deployment-jobrunner01  /tmp and / disks are full
* 08:30 hashar: Updating MediaWiki coverage jobs to Quibble image 1.4.5 + supervisord https://gerrit.wikimedia.org/r/c/integration/config/+/790381
* 19:57 thcipriani: ran REPAIR TABLE globalnames; on centralauth db
* 08:24 hashar: Updating codehealth jobs to Quibble 1.4.5 + supervisord https://gerrit.wikimedia.org/r/c/integration/config/+/790380/
* 19:48 legoktm: deploying https://gerrit.wikimedia.org/r/265552
* 08:23 hashar: Updating MediaWiki Phan jobs to Quibble 1.4.5 https://gerrit.wikimedia.org/r/c/integration/config/+/790377
* 19:39 legoktm: deploying jjb changes for https://gerrit.wikimedia.org/r/264990
* 19:25 legoktm: deploying https://gerrit.wikimedia.org/r/265546
* 01:59 jzerebecki: jenkins-deploy@deployment-bastion:/srv/mediawiki-staging/php-master/extensions/SpellingDictionary$ rm -r modules/jquery.uls && git rm modules/jquery.uls
* 01:00 jzerebecki: jenkins-deploy@deployment-bastion:/srv/mediawiki-staging/php-master/extensions$ git pull && git submodule update --init --recursive
* 00:57 jzerebecki: jenkins-deploy@deployment-bastion:/srv/mediawiki-staging/php-master/extensions$ git reset HEAD SpellingDictionary


== 2016-01-20 ==
== 2022-05-09 ==
* 20:05 hashar: beta sudo find /data/project/upload7/math -type f -delete  (probably some old left over)
* 21:43 James_F: Beta Cluster: Shutting down old deployment-restbase03 instance for [[phab:T295375|T295375]]
* 19:50 hashar: beta: on commons ran deleteArchivedFile.php : Nuked 7130 files
* 20:33 hashar: Manually cancelling deadlock build jobs for beta https://integration.wikimedia.org/ci/view/Beta/ # [[phab:T307963|T307963]]
* 19:49 hashar: beta : foreachwiki deleteArchivedRevisions.php -delete
* 19:26 hasharAway: Nuked all files from http://commons.wikimedia.beta.wmflabs.org/wiki/Category:GWToolset_Batch_Upload
* 19:19 hasharAway: beta: sudo find /data/project/upload7/*/*/temp -type f -delete
* 19:14 hasharAway: beta: sudo rm /data/project/upload7/*/*/lockdir/*
* 18:57 hasharAway: beta cluster code has been stalled for roughly 2h30
* 18:55 hasharAway: disconnecting Gearman plugin to remove deadlock for beta cluster rjobs
* 17:06 hashar: clearing files from beta-cluster to prepare for Swift migration.  python pwb.py delete.py -family:betacommons -lang:en -cat:'GWToolset Batch Upload' -verbose -putthrottle:0 -summary:'Clearing out old batched upload to save up disk space for Swift migration'


== 2016-01-19 ==
== 2022-05-08 ==
* 22:25 legoktm: deleting *zend* workspaces on precise slaves
* 12:33 urbanecm: deployment-prep: urbanecm@deployment-mwmaint02:~$ foreachwikiindblist growthexperiments extensions/GrowthExperiments/maintenance/migrateMenteeOverviewFiltersToPresets.php --update # [[phab:T304057|T304057]]
* 21:58 thcipriani: trying https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code.2Fdb_update again
* 21:57 thcipriani: beta-scap-eqiad still can't find executor on deployment-bastion.eqiad
* 21:52 thcipriani: following steps at https://www.mediawiki.org/wiki/Continuous_integration/Jenkins#Hung_beta_code.2Fdb_update for deployment-bastion
* 19:34 legoktm: deleting all *zend* jobs from jenkins
* 09:40 hashar: Created github repo https://github.com/wikimedia/operations-debs-varnish4
* 03:59 legoktm: deploying https://gerrit.wikimedia.org/r/264912 and https://gerrit.wikimedia.org/r/264922


== 2016-01-17 ==
== 2022-05-06 ==
* 18:02 legoktm: deploying https://gerrit.wikimedia.org/r/264605
* 12:55 hashar: Migrated Castor service from integration-castor03 to integration-castor05 # [[phab:T252071|T252071]]


== 2016-01-16 ==
== 2022-05-05 ==
* 21:47 legoktm: deploying https://gerrit.wikimedia.org/r/264489
* 22:57 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789723
* 21:36 legoktm: deploying https://gerrit.wikimedia.org/r/264488
* 22:31 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789721
* 21:29 legoktm: deploying https://gerrit.wikimedia.org/r/264487
* 22:28 dduvall: created 2 new jobs to deploy https://gerrit.wikimedia.org/r/789720
* 21:21 legoktm: deploying https://gerrit.wikimedia.org/r/264483 https://gerrit.wikimedia.org/r/264485
* 22:24 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789718
* 20:58 legoktm: deploying https://gerrit.wikimedia.org/r/264492
* 22:21 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/789717
* 18:55 jzerebecki: reloadin zuul for 996c558..5f8eb50
* 22:15 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789714
* 09:12 legoktm: deploying https://gerrit.wikimedia.org/r/264448
* 22:13 dduvall: created 2 new jobs to deploy https://gerrit.wikimedia.org/r/789713
* 09:01 legoktm: deploying https://gerrit.wikimedia.org/r/264446 and https://gerrit.wikimedia.org/r/264447
* 22:09 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789711
* 07:46 legoktm: sudo -u jenkins-deploy mv /mnt/jenkins-workspace/workspace/mediawiki-core-phplint /mnt/jenkins-workspace/workspace/mediawiki-core-php53lint on all precise slaves
* 22:07 dduvall: created 2 new jobs to deploy https://gerrit.wikimedia.org/r/789710
* 07:17 legoktm: deploying https://gerrit.wikimedia.org/r/264444
* 21:57 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789707/1
* 06:31 legoktm: deploying https://gerrit.wikimedia.org/r/264441
* 21:51 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789706
* 06:10 legoktm: added phpflavor-php53 label to all phpflavor-zend slaves
* 21:48 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789704
* 21:44 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789703
* 21:38 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789698
* 21:35 dduvall: created 4 jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789697
* 21:26 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789694
* 21:22 dduvall: creating 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789693
* 18:27 dduvall: reenabled puppet on integration-agent-docker-1023.integration.eqiad1.wikimedia.cloud
* 18:25 dancy: Update to scap 4.7.1-1+0~20220505181519.270~1.gbpeb47ae in beta cluster
* 18:16 dduvall: disabled puppet on integration-agent-docker-1023.integration.eqiad1.wikimedia.cloud for deployment of https://gerrit.wikimedia.org/r/c/operations/puppet/+/768774
* 16:29 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789650
* 16:26 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789649
* 14:25 hashar: Created integration-castor05
* 12:28 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/789179 and https://gerrit.wikimedia.org/r/789232
* 07:45 hashar: deployment-prep: removed a few queued Jenkins  builds from https://integration.wikimedia.org/ci/view/Beta/


== 2016-01-15 ==
== 2022-05-04 ==
* 12:17 hashar: restarting Jenkins for plugins updates
* 21:29 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789285
* 02:49 bd808: Trying to fix submodules in deployment-bastion:/srv/mediawiki-staging/php-master/extensions for T123701
* 21:16 dduvall: created 1 new job to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789284
* 21:07 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789278
* 21:00 dduvall: created 2 jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789277
* 20:48 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/789274
* 20:44 dduvall: creating 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789273
* 20:31 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789265
* 20:25 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789264
* 20:22 urbanecm: urbanecm@deployment-mwmaint02:~$ mwscript extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=commonswiki --logwiki=metawiki "There'sNoTime" "TheresNoTime" # [[phab:T307590|T307590]]
* 20:14 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789259/1
* 20:11 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789258
* 18:54 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789245
* 18:47 dduvall: creating 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789244
* 18:31 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789238
* 18:24 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789237
* 17:51 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789225
* 17:22 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789218
* 17:12 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789217
* 16:11 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789204
* 16:01 dduvall: created 2 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789203
* 16:01 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789195
* 15:42 dduvall: created 2 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/789194
* 13:44 James_F: Zuul: [mediawiki/services/function-evaluator] Use bespoke pipeline jobs only [[phab:T307507|T307507]]


== 2016-01-14 ==
== 2022-05-03 ==
* 20:06 legoktm: deploying https://gerrit.wikimedia.org/r/264122
* 23:35 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/788871
* 19:32 legoktm: deploying https://gerrit.wikimedia.org/r/264114
* 23:23 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/788868
* 19:18 legoktm: deploying https://gerrit.wikimedia.org/r/264108
* 22:03 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/788806
* 22:01 dduvall: created 4 new jobs to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/788806
* 21:40 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/788798
* 21:27 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/788799
* 21:25 dduvall: created trigger-pipelinelib-pipeline-test and pipelinelib-pipeline-test jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/788799
* 11:50 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/788682


== 2016-01-13 ==
== 2022-05-02 ==
* 21:06 hashar: beta cluster code is up to date again. Got delayed by roughly 4 hours.
* 15:09 dancy: Updating beta cluster scap to 4.7.1-1+0~20220502085300.264~1.gbp367de7?
* 20:55 hashar: unlocked Jenkins jobs for beta cluster by disabling/reenabling  Jenkins Gearman client
* 10:06 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/786934 # [[phab:T301766|T301766]]
* 10:15 hashar: beta: fixed puppet on deployment-elastic06 . Was still using cert/hostname without .deployment-prep. .... Mass update occurring.


== 2016-01-12 ==
== 2022-04-29 ==
* 23:30 legoktm: deploying https://gerrit.wikimedia.org/r/263757 https://gerrit.wikimedia.org/r/263756
* 21:49 brennen: created https://gitlab.wikimedia.org/toolforge-repos and https://gitlab.wikimedia.org/cloudvps-repos for cloud tenants ([[phab:T305301|T305301]])
* 13:32 hashar: beta cluster: running /usr/local/sbin/cleanup-pam-config
* 18:37 James_F: Zuul: Add SimilarEditors dependency on QuickSurveys extension for [[phab:T297687|T297687]]
* 13:29 hashar: integration running /usr/local/sbin/cleanup-pam-config  on slaves


== 2016-01-11 ==
== 2022-04-28 ==
* 22:24 hashar: Deleting old references on Zuul-merger for mediawiki/core : <tt>/usr/share/python/zuul/bin/python /home/hashar/zuul-clear-refs.py --until 15 /srv/ssd/zuul/git/mediawiki/core </tt>
* 20:31 James_F: Zuul: Add PHP81 as voting for libraries, PHP extensions etc. for [[phab:T293509|T293509]]
* 22:21 hashar: gallium in /srv/ssd/zuul/git/mediawiki/core$  git gc --prune=all && git remote update --prune
* 18:57 brennen: finished editing mediawiki-new-errors
* 22:21 hashar: scandium  in /srv/ssd/zuul/git/mediawiki/core$  git gc --prune=all && git remote update --prune
* 18:50 brennen: adding some filters to mediawiki-new-errors, including one based on https://wikitech.wikimedia.org/wiki/Performance/Runbook/Kibana_monitoring#Filtering_by_query_string
* 07:35 legoktm: deploying https://gerrit.wikimedia.org/r/263319
* 09:03 hashar: Gerrit upgraded to 3.4.4  at roughly 8:00 UTC


== 2016-01-07 ==
== 2022-04-27 ==
* 23:16 legoktm: deleted /mnt/jenkins-workspace/workspace/mediawiki-extensions-qunit/src/extensions/PdfHandler/.git/refs/heads/wmf/1.26wmf16.lock on slave 1013
* 19:06 hashar: Updating operations/software/gerrit branches and tags from upstream # [[phab:T292759|T292759]]
* 06:32 legoktm: deploying https://gerrit.wikimedia.org/r/262868
* 15:20 hashar: Updating non-quibble jobs to composer 2.3.3 {{!}} [[phab:T303867|T303867]] {{!}} https://gerrit.wikimedia.org/r/c/integration/config/+/777029
* 02:24 legoktm: deploying https://gerrit.wikimedia.org/r/262855
* 01:25 jzerebecki: reloading zuul for b0a5335..c16368a


== 2016-01-06 ==
== 2022-04-26 ==
* 21:13 thcipriani: kicking integration puppetmaster, weird node unable to find definition.
* 15:40 brennen: train 1.39.0-wmf.9 ([[phab:T305215|T305215]]): no current blockers - expect to start train ops after the toolhub deployment window wraps, so some time after 17:00 UTC; taking a pre-train stroll-around-the-block break before that.
* 21:11 jzerebecki: on scandium: sudo -u zuul rm -rf /srv/ssd/zuul/git/mediawiki/services/mathoid
* 13:46 James_F: Deleting deployment-mx02.deployment-prep.eqiad1.wikimedia.cloud for [[phab:T306068|T306068]]
* 21:04 legoktm: ^ on gallium
* 13:38 James_F: Zuul: [mediawiki/extensions/SimilarEditors] Install basic prod CI for [[phab:T306897|T306897]]
* 21:04 legoktm: manually deleted /srv/ssd/zuul/git/mediawiki/services/mathoid to force zuul to re-clone it
* 12:33 hashar: Manually pruned dangling docker images on contint1001 and contint2001
* 20:17 hashar: beta: dropped a few more /etc/apt/apt.conf.d/*-proxy files. webproxy is no more reachable from labs
* 08:30 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/780824
* 09:44 hashar: CI/beta: deleting all git tags from /var/lib/git/operations/puppet and doing git repack
* 08:09 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/785204
* 09:39 hashar: restoring puppet hacks on beta cluster puppetmaster.
* 09:35 hashar: beta/CI: salt -v '*' cmd.run 'rm -v /etc/apt/apt.conf.d/*-proxy'  https://phabricator.wikimedia.org/T122953


== 2016-01-05 ==
== 2022-04-25 ==
* 16:54 hashar_: Removed elastic search from CI slaves https://phabricator.wikimedia.org/T89083 https://gerrit.wikimedia.org/r/#/c/259301/
* 17:29 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/779450
* 03:45 Krinkle: integration-slave-trusty-1015: rm -rf /mnt/home/jenkins-deploy/.npm per https://integration.wikimedia.org/ci/job/mediawiki-core-qunit/56577/console
* 15:31 James_F: Zuul: [mediawiki/extensions/RegularTooltips] Add basic quibble CI


== 2016-01-04 ==
== 2022-04-20 ==
* 21:06 hashar: gallium has puppet enabled again
* 16:25 zabe: root@deployment-cache-upload06:~# touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service
* 20:53 hashar: stopping puppet on gallium and live hacking Zuul configuration for https://phabricator.wikimedia.org/T122656


== 2016-01-02 ==
== 2022-04-18 ==
* 03:17 yurik: purged varnishs on deployment-cache-text04
* 19:27 brennen: gitlab runners: deleting a number of stale runners with no contacts in > 2 months which are most likely no longer extant
* 16:49 brennen: phabricator: created phame blog https://phabricator.wikimedia.org/phame/blog/view/22/ for [[phab:T306329|T306329]]
* 16:48 brennen: phabricator: adding self to acl*blog-admins
* 15:33 James_F: Shutting off deployment-wdqs01 from the Beta Cluster project per [[phab:T306054|T306054]]; it's apparently unused, so this shouldn't break anything.


== 2016-01-01 ==
== 2022-04-14 ==
* 22:17 bd808: No nodepool ci-jessie-* hosts seen in Jenkins interface and rake-jessie jobs backing up
* 22:30 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/779969
* 16:09 brennen: removed or renamed 4 filters from mediawiki-new-errors per check-new-error-tasks/check.sh


== Archive ==
== 2022-04-12 ==
* [[/Archive 1|Archive 1]] (September 2014 - December 2015)
* 21:49 brennen: Updating dev-images docker-pkg files on primary contint for elastic 7.10.2
* 21:46 brennen: Updating dev-images docker-pkg files on primary contint for elastic 6.8.23
* 21:37 brennen: Updating dev-images docker-pkg files on primary contint for apache & elasticsearch changes ([[phab:T304290|T304290]], [[phab:T305143|T305143]])
* 16:05 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/779500
* 15:55 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/779498 https://gerrit.wikimedia.org/r/779141
 
== 2022-04-08 ==
* 11:08 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/778287
 
== 2022-04-07 ==
* 06:07 urbanecm: deployment-prep: foreachwiki extensions/GrowthExperiments/maintenance/T304461.php --delete # [[phab:T304461|T304461]], output is at P24204
* 05:54 urbanecm: deployment-prep: mwscript extensions/GrowthExperiments/maintenance/T304461.php --wiki=<nowiki>{</nowiki>enwiki,cswiki<nowiki>}</nowiki> --delete # [[phab:T304461|T304461]]
 
== 2022-04-06 ==
* 20:03 thcipriani: rebooting phabricator
* 11:44 James_F: Zuul: [mediawiki/extensions/WikiEditor] Add BetaFeatures to phan deps for [[phab:T304596|T304596]]
 
== 2022-04-04 ==
* 22:43 James_F: dockerfiles: [composer-scratch] Upgrade composer to 2.3.3 and cascade for [[phab:T294260|T294260]]
* 18:49 hashar: Reloading Zuul to revert https://gerrit.wikimedia.org/r/776179
* 18:23 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/776179
* 17:50 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/775796
* 12:12 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/776723
* 10:28 James_F: Zuul: [mediawiki/extensions/WikiLambda] Publish PHP and JS documentation
* 08:54 jnuche: redeploying Zuul
 
== 2022-04-02 ==
* 12:00 zabe: apply https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CentralAuth/+/773903 on deployment-prep centralauth databases
 
== 2022-03-31 ==
* 20:58 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/775957
 
== 2022-03-29 ==
* 14:20 James_F: Zuul: [mediawiki/extensions/IPInfo] Add EventLogging phan dependency for [[phab:T304948|T304948]]
* 12:32 hashar: integration-agent-docker-1039: clearing leftover pipelinelib builds: `sudo rm -fR /srv/jenkins/workspace/workspace/*`  [[phab:T304932|T304932]] [[phab:T302477|T302477]]
* 05:35 hashar: Relocate castor directory on integration-castor03 from `/srv/jenkins-workspace/caches` to `/srv/castor` https://gerrit.wikimedia.org/r/c/operations/puppet/+/774771
 
== 2022-03-28 ==
* 16:55 hashar: integration: created instance integration-castor04 with flavor `g3.cores8.ram32.disk20` (twice more ram than integration-castor03) # [[phab:T252071|T252071]]
* 16:49 hashar: integration: created 320G volume https://horizon.wikimedia.org/project/volumes/3f90c3f2-158d-4e45-a919-0f048f47c3b6/ . Intended to migrate integration-castor03 [[phab:T252071|T252071]]
* 10:34 hashar: contint2001 and contint1001: pruning obsolete branches from the zuul-merger: `sudo -H -u zuul find /srv/zuul/git -type d -name .git -print -execdir git -c url."https://gerrit.wikimedia.org/r/".insteadOf="ssh://jenkins-bot@gerrit.wikimedia.org:29418/" remote prune origin \;` [[phab:T220606|T220606]]
* 10:25 hashar: Changed `Trainsperiment Survey Questions` surveys permissions to be open outside of WMF and limited to 1 answer (forcing signin) https://docs.google.com/forms/u/0/d/e/1FAIpQLSd0Nc2jGkAGW-5rTiKN2EHWzfw2HeHm13N-ZCw1xUdE3z6woQ/formrestricted
* 10:18 hashar: contint2001 and contint1001: pruning all git reflog entries from the zuul-merger: `sudo -u zuul find /srv/zuul/git -name .git -type d -execdir git reflog expire --expire=all --all`.  They are useless and no more generated since https://gerrit.wikimedia.org/r/c/operations/puppet/+/757943
* 09:53 hashar: Tag Quibble 1.4.5 @ {{Gerrit|abe16d574}} {{!}} [[phab:T291549|T291549]]
 
== 2022-03-27 ==
* 13:23 James_F: Zuul: [releng/phatality] Make the node14 CI job voting [[phab:T304736|T304736]]
 
== 2022-03-26 ==
* 02:37 Reedy: beta-update-databases-eqiad is back to @hourly
 
== 2022-03-25 ==
* 23:51 Reedy: temporarily turning off period building of beta-update-databases-eqiad until it's run to completion
* 23:21 Reedy: running /usr/local/bin/wmf-beta-update-databases.py manually
* 20:22 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/773866
* 20:02 brennen: mediawiki-new-errors: ran check-new-error-tasks/check.sh and cleared "resolved" filters
* 09:43 hashar: Building Quibble Docker images to rename quibble-with-apache to quibble-with-supervisord
 
== 2022-03-24 ==
* 20:00 hashar: reloading Zuul for {{Gerrit|Id844e1723a38eed627af03397cf0ad90c7b09a32}} # [[phab:T299320|T299320]]
* 20:00 James_F: Clearing integration-castor03:/srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/mwgate-node14-docker/_cacache/content-v2/sha512/22/ for [[phab:T304652|T304652]]
* 15:00 James_F: Zuul: [design/codex] Publish code coverage reports for [[phab:T303899|T303899]]
* 09:37 Lucas_WMDE: killed a beta-scap-sync-world job manually, let’s see if that helps getting beta updates unstuck
 
== 2022-03-23 ==
* 17:35 brennen: restarting phabricator for [[phab:T304540|T304540]], brief downtime expected
* 14:56 dancy: Updating scap to 4.5.0-1+0~20220321191814.216~1.gbp24bc64 in beta cluster
 
== 2022-03-22 ==
* 14:44 hashar: gerrit: `./deploy_artifacts.py --version=3.3.10 gerrit.war` [[phab:T304226|T304226]]
* 13:50 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/771945
 
== 2022-03-21 ==
* 08:35 hashar: The castor cache for mediawiki/core wmf/1.39-wmf.1 is actually empty!
* 08:32 hashar: Nuking npm castor cache /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/wmf-quibble-selenium-php72-docker/npm/ # [[phab:T300203|T300203]]
 
== 2022-03-18 ==
* 14:18 elukey: restart testing of kafka logging TLS certificates (may affect logstash in beta, ping me in case it is a problem)
* 13:22 hashar: Rolling back Quibble jobs from 1.4.4 [[phab:T304147|T304147]]
* 07:41 elukey: experimenting with PKI and kafka logging on deployment-prep, logstash dashboard/traffic may be down (please ping me in case it is a problem)
 
== 2022-03-17 ==
* 19:11 hashar: Building Docker images for Quibble 1.4.4
* 19:06 hashar: Tag Quibble 1.4.4 @ {{Gerrit|56b2c9ba52c}} # [[phab:T300340|T300340]]
* 16:25 hashar: Switching Quibble jobs to use memcached rather than APCu {{!}} https://gerrit.wikimedia.org/r/c/integration/config/+/770468 {{!}} [[phab:T300340|T300340]]
* 14:11 hashar: Update all jobs to support `CASTOR_HOST` env variable {{!}} https://gerrit.wikimedia.org/r/770921 {{!}} [[phab:T216244|T216244]] {{!}} [[phab:T252071|T252071]]
* 14:07 hashar: Building Docker image to support `CASTOR_HOST` {{!}} https://gerrit.wikimedia.org/r/770921 {{!}} [[phab:T216244|T216244]]
 
== 2022-03-16 ==
* 22:00 James_F: Docker: Publishing sonar-scanner:4.6.0.2311-3 for [[phab:T303958|T303958]]
* 20:13 James_F: Zuul: [mediawiki/services/function-evaluator and …/function-orchestrator] Switch to npm coverage job for [[phab:T302607|T302607]] and [[phab:T302608|T302608]]
* 19:48 zabe: apply https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CentralAuth/+/769424/ on deployment-prep
* 19:43 taavi: apply https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CentralAuth/+/771347/ on deployment-prep
 
== 2022-03-15 ==
* 18:26 brennen: gitlab: removed most existing /people groups
* 18:10 brennen: gitlab: finished migrating access for all existing people groups to direct project membership ([[phab:T274461|T274461]], [[phab:T300935|T300935]])
* 16:49 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/770963
* 14:30 hashar: CI Jenkins: globally defined CASTOR_HOST=integration-castor03.integration.eqiad.wmflabs via https://integration.wikimedia.org/ci/configure # [[phab:T216244|T216244]]
* 14:17 hashar: Apply label `castor` to node https://integration.wikimedia.org/ci/computer/integration-castor03/ # [[phab:T216244|T216244]]
* 01:37 James_F: Zuul: Switch services/function* publish job from node12 to node14
* 01:14 James_F: Zuul: [wikidata/query-builder] Switch branchdeploy from node12 to node14
* 00:08 James_F: Zuul: [wikipeg] Switch from node12 to node14 special job
 
== 2022-03-14 ==
* 23:57 James_F: Zuul: [ooui] Switch from node12 to node14
* 23:46 James_F: Docker: Publishing node14-test-browser-php80-composer:0.1.0
* 23:27 James_F: Zuul: Drop legacy node12 templates except the one for Services
* 23:10 James_F: Zuul: [oojs/router] Drop custom job and just use the generic node14 one
* 23:08 James_F: Zuul: [oojs/core] Switch from node12 to node14 jobs
* 22:46 James_F: Zuul: [unicodejs] Switch from node12 to node14
* 22:25 James_F: Zuul: [VisualEditor/VisualEditor] Switch from node12 to node14
* 19:51 James_F: Zuul: Migrate almost all libraries and tools from node12 to node14 for [[phab:T267890|T267890]]
* 15:36 James_F: Zuul: Switch extension-javascript-documentation from node12 to node14 for [[phab:T267890|T267890]]
* 15:21 James_F: Zuul: Switch all mwgate jobs from node12 to node14 for [[phab:T267890|T267890]]
* 09:52 hashar: Building Quibble Docker images for https://gerrit.wikimedia.org/r/757867 {{!}} [[phab:T300340|T300340]]
* 08:54 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/770079
 
== 2022-03-11 ==
* 04:02 zabe: zabe@deployment-mwmaint02:~$ mwscript extensions/CentralAuth/maintenance/populateGlobalEditCount.php --wiki=metawiki
 
== 2022-03-10 ==
* 20:45 zabe: apply https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CentralAuth/+/769416 on deployment-prep centralauth databases
* 20:25 James_F: Zuul: [mediawiki/extensions/VueTest] Add basic quibble CI
* 20:03 Krinkle: Updating docker-pkg files on contint primary for  https://gerrit.wikimedia.org/r/768843
* 15:12 hashar: updating Quibble jenkins jobs
* 14:26 James_F: Docker: Publishing new versions of quibble-buster and cascade adding unzip for [[phab:T250496|T250496]] / [[phab:T303417|T303417]].
* 11:43 Amir1: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/769668
* 09:59 dwalden: restarted apache on deployment-mediawiki11 # [[phab:T302699|T302699]]
 
== 2022-03-09 ==
* 17:08 hashar: Updating Gerrit Comment.soy to get rid of a literal `null` string being inserted in notification emails {{!}} https://gerrit.wikimedia.org/r/c/operations/puppet/+/768005 {{!}} https://phabricator.wikimedia.org/T288312
 
== 2022-03-08 ==
* 20:31 brennen: requiring 2fa for all users under /repos
 
== 2022-03-07 ==
* 10:53 zabe: restarted apache on deployment-mediawiki11 # [[phab:T302699|T302699]]
 
== 2022-03-04 ==
* 20:29 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/768146
* 19:13 Krinkle: Reloading Zuul to deploy  https://gerrit.wikimedia.org/r/768068
 
== 2022-03-03 ==
* 19:13 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/767864
* 15:37 James_F: Docker: Publishing sury-php images based on bullseye not stretch and cascade for [[phab:T278203|T278203]]
* 14:43 hashar: Reloading Zuul for {{Gerrit|Iae45cae8ec209a3e795fe4fd7dd92290565277db}}
* 12:47 hashar: Upgrading Quibble on CI Jenkins jobs from 1.3.0 to 1.4.3 https://gerrit.wikimedia.org/r/c/integration/config/+/767749/
* 10:30 hashar: Building Docker images for Quibble 1.4.3
* 10:22 hashar: Tagged Quibble 1.4.3 @ {{Gerrit|cf5cd1a0a07}}
* 09:24 hashar: Building Docker images for Quibble 1.4.2
* 09:20 hashar: Tag Quibble 1.4.2 @ {{Gerrit|63d2855a1e}} # [[phab:T302226|T302226]] [[phab:T302707|T302707]]
 
== 2022-03-02 ==
* 19:53 James_F: Zuul: Configure CI for the forthcoming REL1_38 branches for [[phab:T302908|T302908]]
* 15:56 dancy: Updating scap to 4.4.1-1+0~20220302155149.192~1.gbpe351d6 in beta
* 15:27 Krinkle: Reloading Zuul to deploy  https://gerrit.wikimedia.org/r/767493
* 15:04 taavi: resolve merge conflicts on deployment-puppetmaster04
 
== 2022-02-28 ==
* 19:29 brennen: removing mutante (dzahn) as application-level gitlab admin; adding as owner of /repos for the time being to facilitate some migrations
* 19:22 dancy: Update scap to 4.4.0-1+0~20220228192031.189~1.gbp0a8436 in beta
* 19:17 brennen: adding mutante (dzahn) as application-level gitlab admin
 
== 2022-02-26 ==
* 20:05 zabe: apply [[phab:T302658|T302658]] on deployment-prep centralauth databases
* 13:24 zabe: apply [[phab:T302660|T302660]] on deployment-prep centralauth databases
* 13:19 zabe: apply [[phab:T302659|T302659]] on deployment-prep centralauth databases
 
== 2022-02-24 ==
* 16:02 dancy: Updating beta cluster scap to 4.4.0-1+0~20220224155429.187~1.gbp66c5c2
* 13:44 hashar: integration/config now fully enforces shellcheck https://gerrit.wikimedia.org/r/756088
* 13:13 hashar: Built image docker-registry.discovery.wmnet/releng/castor:0.2.5
* 13:10 hashar: Updating castor-save-workspace-cache job https://gerrit.wikimedia.org/r/764817
* 11:54 hashar: Built image docker-registry.discovery.wmnet/releng/shellcheck:0.1.1
* 11:41 hashar: Built image docker-registry.discovery.wmnet/releng/sonar-scanner:4.6.0.2311-2
* 11:04 hashar: Built image docker-registry.discovery.wmnet/releng/operations-puppet:0.8.6
* 08:58 hashar: Built image docker-registry.discovery.wmnet/releng/mediawiki-phan-testrun:0.2.1
 
== 2022-02-23 ==
* 23:21 dancy: Update beta cluster scap to 4.3.1-1+0~20220223231645.183~1.gbp8ddb60
* 20:10 dancy: Updating scap in beta
* 19:23 hashar: Built docker-registry.discovery.wmnet/releng/logstash-filter-verifier:0.0.3
* 12:41 hashar: Depooling integration-agent-puppet-docker-1002 , pooling integration-agent-puppet-docker-1003 # [[phab:T252071|T252071]]
* 10:21 hashar: Created Bullseye instance integration-agent-puppet-docker-1003 https://horizon.wikimedia.org/project/instances/96cf9ddc-daa3-4c9f-8c21-cdd58e95973e/  # [[phab:T252071|T252071]]
* 08:37 hashar: Removing Stretch based integration-agent-qemu-1001 # [[phab:T284774|T284774]]
 
== 2022-02-22 ==
* 16:41 zabe: zabe@deployment-mwmaint02:~$ foreachwiki migrateUserGroup.php oversight suppress # [[phab:T112147|T112147]]
* 13:28 urbanecm: deployment-prep: Create database for incubatorwiki ([[phab:T210492|T210492]])
 
== 2022-02-21 ==
* 14:58 hashar: Reverting Quibble jobs from 1.4.0 to 1.3.0 # [[phab:T302226|T302226]]
* 07:31 hashar: Switching Quibble jobs from Quibble 1.3.0 to 1.4.0 # [[phab:T300340|T300340]] [[phab:T291549|T291549]] [[phab:T225730|T225730]]
* 07:27 hashar: Refreshing all Jenkins jobs
 
== 2022-02-20 ==
* 10:32 qchris: Manually triggering replication run of Gerrit's analytics/datahub to populate newly created analytics-datahub GitHub repo
 
== 2022-02-19 ==
* 12:19 taavi: restart trafficserver-tls on deployment-cache-text06
* 02:15 James_F: Zuul: [design/codex] Publish the Netlify preview on every patch for [[phab:T293705|T293705]]
* 00:35 James_F: Manually re-triggered a build of the docs of Codex (via `zuul-test-repo design/codex postmerge`) now that we actually set the environment vars for [[phab:T293705|T293705]]
 
== 2022-02-18 ==
* 22:54 James_F: Zuul: [branchdeploy-codex-node14-npm-docker] Create as experimental for [[phab:T293705|T293705]]
* 22:14 James_F: Jenkins: Defined BRANCHDEPLOY_AUTH_TOKEN_codex and BRANCHDEPLOY_SITE_ID_codex secrets for [[phab:T293705|T293705]]
* 13:44 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/763724 [[phab:T301453|T301453]]
* 09:21 hashar: Reloading Zuul for {{Gerrit|I1494abb5e9e28da951ffb72154a074a16a0f8381}}
 
== 2022-02-17 ==
* 21:48 brennen: added Dzahn (mutante) to acl*repository-admins on phabricator
* 15:58 zabe: root@deployment-cache-upload06:~# touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service # [[phab:T301995|T301995]]
* 13:35 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/763207
* 13:20 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/763458
* 11:12 hashar: Bringing deployment-deploy03 back
* 11:07 hashar: Disabled deployment-deploy03 Jenkins agent in order to revert some mediawiki/core patch and test the outcome
 
== 2022-02-16 ==
* 18:20 hashar: Tag Quibble 1.4.1 @ {{Gerrit|d4bd2801de}} # [[phab:T300301|T300301]]
* 16:42 dancy: Updating to scap 4.3.1-1+0~20220216163646.173~1.gbp823710?in beta
* 12:55 jelto: apply gitlab-settings to gitlab-prod-1001.devtools.eqiad1.wikimedia.cloud
* 10:09 hashar: Reloading Zuul for {{Gerrit|I997fee0f160ca3049b8085879831bfe175096ced}}
* 09:59 hashar: Reloading Zuul for {{Gerrit|I2ffa016563ad37f1e7c13dcce81deb8ab411c9e2}}
 
== 2022-02-15 ==
* 21:12 dancy: rebooting deployment-mediawiki12.deployment-prep.eqiad1.wikimedia.cloud to try to revive beta wikis
* 20:59 dancy: Killed runaway puppet agent on deployment-mediawiki11.deployment-prep.eqiad1.wikimedia.cloud
* 16:24 hashar: Restarting CI Jenkins for plugins updates
* 16:21 hashar: Upgrading Jenkins plugins on releases Jenkins
* 16:06 hashar: Rollback fresh-test Jenkins job to the version intended to run on integration-agent-qemu-1001
* 15:26 hashar: Reloading Zuul for {{Gerrit|If80b4b4cfa5c1a869ceb220f5b11c272b384a721}}
 
== 2022-02-14 ==
* 16:28 dancy: Updating scap in beta cluster to 4.3.1-1+0~20220211225318.167~1.gbp315b2c
* 16:16 Amir1: Reloading Zuul to deploy  https://gerrit.wikimedia.org/r/c/integration/config/+/762471
* 15:41 hashar: Messing up with fresh-test Jenkns job to polish up Qemu / qcow2 integration
* 14:26 jnuche: Jenkins upgrade complete [[phab:T301361|T301361]]
* 13:54 jnuche: Jenkins contint instances are going to be restarted soon
 
== 2022-02-12 ==
* 18:22 urbanecm: deployment-prep: reboot deployment-eventgate-3 ([[phab:T289029|T289029]])
 
== 2022-02-10 ==
* 17:29 jeena: reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/761602
 
== 2022-02-09 ==
* 15:22 taavi: deleted shutoff deployment-mx02
 
== 2022-02-08 ==
* 17:34 taavi: remove scap from deployment-kafka-main/jumbo
* 16:23 taavi: hard reboot misbehaving deployment-echostore01
* 13:39 taavi: delete /srv/mediawiki-staging.save on deployment-deploy03
 
== 2022-02-07 ==
* 20:55 taavi: added Zabe as member of the deployment-prep project [[phab:T301179|T301179]]
* 18:19 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/760550
 
== 2022-02-04 ==
* 00:21 Krinkle: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/759622
 
== 2022-02-03 ==
* 18:41 taavi: deployment-prep: route /w/api.php to deployment-mediawiki11, trying to reduce load on a single server
* 14:53 hashar: Building Docker images for Quibble 1.4.0  (prepared by kostajh)
* 13:51 kostajh: Tag Quibble 1.4.0 @ {{Gerrit|4231bc2832395d94e29a332fe8d863301a0cd441}} # [[phab:T300340|T300340]] [[phab:T291549|T291549]] [[phab:T225730|T225730]]
 
== 2022-02-02 ==
* 16:50 dancy: Upgrading scap to 4.2.2-1+0~20220202164708.157~1.gbp376a16 in beta.
* 16:12 dancy: Upgrading scap to 4.2.2-1+0~20220201161808.156~1.gbp1c1c64 in beta
 
== 2022-02-01 ==
* 17:27 addshore: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/734654
* 00:34 tgr: deployment-pre un-cherry-picked gerrit 758584 from beta puppetmaster, patch is now merged [[phab:T300591|T300591]]
* 00:12 tgr: deployment-prep cherry-picked gerrit 758584 to beta puppetmaster [[phab:T300591|T300591]]
 
== 2022-01-31 ==
* 19:01 James_F: Re-configured Jenkins job mediawiki-i18n-check-docker to {{Gerrit|9e3ea96c548d7a84be763d38c2d118bc861cf189}} for [[phab:T222216|T222216]]
* 10:49 hashar: Added integration-agent-qemu-1003 with label `Qemu` # [[phab:T284774|T284774]]
 
== 2022-01-28 ==
* 21:45 taavi: running recountCategories.php on all beta wikis per [[phab:T299823|T299823]]#7652496
* 14:27 hashar: taking heapdump  of CI Jenkins `sudo -u jenkins /usr/lib/jvm/java-11-openjdk-amd64/bin/jmap -dump:live,format=b,file=/var/lib/jenkins/202201281527.hprof xxxx`
 
== 2022-01-27 ==
* 20:26 hashar: Successfully published image docker-registry.discovery.wmnet/releng/logstash-filter-verifier:0.0.2  # [[phab:T299431|T299431]]
* 19:34 Amir1: Reloading Zuul to deploy 757464
* 16:00 hashar: Pooling back agents 1035 1036 1037 1038 , they could not connect due to ssh host mismatch since yesterday they all got attached to instance 1033 and accepted that host key # [[phab:T300214|T300214]]
* 09:16 hashar: integration: cumin --force 'name:docker' 'apt install rsync'  # [[phab:T300236|T300236]]
* 09:05 hashar: integration: cumin --force 'name:docker' 'apt install rsync'  # [[phab:T300214|T300214]]
* 00:24 thcipriani: restarting jenkins
 
== 2022-01-26 ==
* 20:29 hashar: Completed migration of integration-agent-docker-XXXX instances from Stretch to Bullseye - [[phab:T252071|T252071]]
* 19:55 hashar: deleting integration-agent-docker-1014 which only has the `codehealth` label. A short live experiment no more used since October 2nd 2019 - https://gerrit.wikimedia.org/r/c/integration/config/+/540362 - [[phab:T234259|T234259]]
* 18:56 hashar: integration: pooled in Jenkins a few more Bullseye docker agents for [[phab:T252071|T252071]]
* 18:17 hashar: integration: pooled in Jenkins a few Bullseye docker agent for [[phab:T252071|T252071]]
* 16:45 hashar: integration: creating  integration-agent-docker-1023  based on buster with new flavor `g3.cores8.ram24.disk20.ephemeral60.4xiops` # [[phab:T290783|T290783]]
 
== 2022-01-25 ==
* 20:17 James_F: Zuul: [mediawiki/extensions/CentralAuth] Drop UserMerge dependency
* 16:39 James_F: Zuul: Mark Math extension as now tarballed in parameter_functions for [[phab:T232948|T232948]]
* 15:57 James_F: Zuul: [mediawiki/extensions/Math] Add Math to the main gate for [[phab:T232948|T232948]]
* 13:44 hashar: Jenkins CI: added Logger https://integration.wikimedia.org/ci/log/ProcessTree%20-%20T299995/ to watch `hudson.util.ProcessTree` for [[phab:T299995|T299995]]
* 10:02 hashar: integration: removing usage of `role::ci::slave::labs::docker::docker_lvm_volume` in Horizon following https://gerrit.wikimedia.org/r/c/operations/puppet/+/755948  . Docker role instances now always have a 24G partition for Docker
* 09:59 hashar: integration-agent-qemu-1001: resized /srv to 100% disk free: `lvextend -r -l +100%FREE /dev/mapper/vd-second--local--disk` # [[phab:T299996|T299996]]
* 09:59 hashar: integration-agent-qemu-1001: resizing /dev/mapper/vd-second--local--disk (/srv) to 20G : `resize2fs -p /dev/mapper/vd-second--local--disk 20G` # [[phab:T299996|T299996]]
* 09:51 hashar: integration-agent-qemu-1001: resizing /dev/mapper/vd-second--local--disk (/srv) to 20G : `resize2fs -p /dev/mapper/vd-second--local--disk 20G`
* 09:51 hashar: integration-agent-qemu-1003: nuked /dev/vd/second-local-disk and /srv to make room for a docker logical volume. That has fixed puppet  [[phab:T299996|T299996]]
* 09:22 Reedy: unblocked beta again
* 07:32 Krinkle: integration-castor03:/srv/jenkins-workspace/caches$ sudo rm -rf castor-mw-ext-and-skins/
 
== 2022-01-24 ==
* 21:44 Reedy: unstick beta ci jobs
* 21:19 jeena: reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/756523
* 20:36 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/756139
* 17:28 hashar: Nuke castor caches on integration-castor03 : sudo rm -fR /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/<nowiki>{</nowiki>quibble-vendor-mysql-php72-selenium-docker,wmf-quibble-selenium-php72-docker<nowiki>}</nowiki>  # [[phab:T299933|T299933]]
* 17:28 hashar: Nuke castor caches on integration-castor03 : sudo rm -fR /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/<nowiki>{</nowiki>quibble-vendor-mysql-php72-selenium-docker,wmf-quibble-selenium-php72-docker<nowiki>}</nowiki>
 
== 2022-01-22 ==
* 13:40 taavi: apply [[phab:T299827|T299827]] on deployment-prep centralauth database
* 11:44 taavi: restart varnish-frontend.service on deployment-cache-upload06 to clear puppet agent failure alerts
 
== 2022-01-21 ==
* 18:12 taavi: resolved merge conflicts on deployment-puppetmaster04
* 15:50 hashar: integration-puppetmaster-02: deleted 2021 snapshot tags in puppet repo and ran `git gc --prune=now`
 
== 2022-01-20 ==
* 20:24 James_F: Zuul: [Kartographer] Add parsoid as dependency for CI jobs
* 20:22 James_F: Zuul: [DiscussionTools] Add Gadgets as dependency for Phan jobs
* 20:04 dancy: Jenkins beta jobs are back online, using scap prep auto now.
* 19:19 dancy: Pausing beta Jenkins jobs to make a copy of /srv/mediawiki-staging in preparation for testing
* 19:10 dancy: Unpacking scap (4.1.1-1+0~20220120175448.144~1.gbp517f9d) over (4.1.1-1+0~20220113154148.133~1.gbp6e3a17) on deploy03
* 18:07 hashar: Updating Quibble jobs to have MediaWiki files written on the hosts /srv partition (38G) instead of inside the container which ends in /var/lib/docker (24G) https://gerrit.wikimedia.org/r/755743  # [[phab:T292729|T292729]]
* 16:31 hashar: Rebalancing /var/lib/docker and /srv partitions on CI agents {{!}} https://gerrit.wikimedia.org/r/755713
* 12:12 hashar: contint2001 deleting all the Docker images (they will be pulled as needed)
* 12:10 hashar: contint2001 : docker container prune && docker image prune
* 12:07 hashar: contint1001 deleting all the Docker images (they will be pulled as needed)
* 12:04 hashar: contint1001 `docker image prune`
* 11:51 hashar: Cleaning very old Docker images on contint1001.wikimedia.Org
 
== 2022-01-19 ==
* 18:20 hashar: Adding  https://integration.wikimedia.org/ci/computer/contint1001/ back to the pool again
* 17:31 hashar: Adding  https://integration.wikimedia.org/ci/computer/contint1001/ back to the pool after the machine got powercycled # [[phab:T299542|T299542]]
* 10:38 Reedy: kill some stuck jobs [[phab:T299485|T299485]]
 
== 2022-01-18 ==
* 19:56 hashar: building Docker images for https://gerrit.wikimedia.org/r/754951
* 18:01 taavi: added ryankemper as a member of the deployment-prep project
* 15:00 hashar: Updating Jenkins jobs for Quibble 1.3.0  with proper PHP version in the images # [[phab:T299389|T299389]]
* 11:39 hashar: Rolling back Quibble 1.3.0 jobs due to php configuration files with at least releng/quibble-buster73:1.3.0  # [[phab:T299389|T299389]]
* 08:07 hashar: Updating Jenkins jobs for Quibble to pass `--parallel-npm-install` https://gerrit.wikimedia.org/r/c/integration/config/+/754569
* 08:02 hashar: Updating Jenkins jobs for Quibble 1.3.0
 
== 2022-01-17 ==
* 16:28 hashar: Building Quibble 1.3.0 Docker images
* 16:16 hashar: Tagged Quibble 1.3.0 @ {{Gerrit|2b2c7f9a45}} # [[phab:T297480|T297480]] [[phab:T226869|T226869]] [[phab:T294931|T294931]]
* 08:32 hashar: Refreshing all Jenkins jobs with jjb to take in account recent changes related to the Jinja2 docker macro
 
== 2022-01-14 ==
* 15:56 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/753981
* 14:59 hashar: Starting VM integration-agent-docker-1022 which was in shutdown state since December and is Bullseye based # [[phab:T290783|T290783]]
* 13:49 hashar: Restarting all CI Docker agents via Horizon to apply new flavor settings [[phab:T265615|T265615]] [[phab:T299211|T299211]]
* 01:47 dancy: revert to scap 4.1.1-1+0~20220113154148.133~1.gbp6e3a17 in beta
 
== 2022-01-13 ==
* 18:02 dancy: Updating scap to 4.1.1-1+0~20220113154506.135~1.gbp523480 on all beta hosts
* 17:54 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/753792
* 16:27 dancy: testing scap prep auto on deployment-deploy03
* 15:52 dancy: Update scap to 4.1.1-1+0~20220113154506.135~1.gbp523480 on deployment-deploy03
* 11:27 hashar: Updating Jenkins job to normalize usage of `docker run --workdir` https://gerrit.wikimedia.org/r/c/integration/config/+/753457
* 10:52 hashar: Restarting Jenkins CI for plugins update
* 10:42 hashar: Applied Jenkins built-in node migration to CI Jenkins (`master` > `built-in` renaming) # [[phab:T298691|T298691]]
* 10:14 taavi: cancelled stuck deployment-prep jobs on jenkins
 
== 2022-01-12 ==
* 18:58 hashar: Applied plugins update to https://releases-jenkins.wikimedia.org/
 
== 2022-01-11 ==
* 09:18 hashar: Updating all Jenkins jobs following recent "noop" refactorings
 
== 2022-01-10 ==
* 17:13 dancy: Update beta scap to 4.1.0-1+0~20220107203309.130~1.gbpcd0ace
* 14:01 James_F: Zuul: Add gate-and-submit-l10n to Isa for [[phab:T222291|T222291]]
 
== 2022-01-05 ==
* 19:15 taavi: run `sudo chown -R jenkins-deploy:wikidev public/dists/bullseye-deployment-prep/` on deployment-deploy03
* 17:31 hashar: Deploying Zuul change https://gerrit.wikimedia.org/r/c/integration/config/+/751697  to get rid of the wmf-quibble-apache jobs # [[phab:T285649|T285649]]
* 10:48 hashar: CI: switching MediaWiki selenium from php built-in server to Apache # https://gerrit.wikimedia.org/r/751697
* 09:24 hashar: Updating Quibble jobs to use latest image (provides `quibble-with-apache` entrypoint) https://gerrit.wikimedia.org/r/c/integration/config/+/751685/
 
== 2022-01-04 ==
* 12:49 hashar: Reloading Zuul for "api-testing: rename jobs to shorter forms"  https://gerrit.wikimedia.org/r/751422
* 09:48 hashar: Builder Quibble Docker images with Apache included https://gerrit.wikimedia.org/r/c/integration/config/+/748104
* 09:47 hashar: Reloading Zuul for "Add CentralAuth to phan dependency list for GrowthExperiments" https://gerrit.wikimedia.org/r/751383
 
== 2022-01-03 ==
* 14:37 hashar: Upgraded Java 11 on contint2001 && contint1001.  Restarted CI Jenkins.
* 14:35 hashar: Upgraded Java 11 on releases1002 && releases2002
 
 
{{SAL-archives/Release Engineering}}


__NOTOC__
<noinclude>[[Category:SAL]]</noinclude>
<noinclude>[[Category:SAL]]</noinclude>

Latest revision as of 21:35, 29 November 2022

2022-11-29

  • 21:35 brennen: gitlab repos/releng/scap: added direct membership for some non-releng maintainers who show up frequently/recently in commit log
  • 20:33 James_F: Zuul: [wikimedia/wikimania-scholarships] Set as archived for T243037

2022-11-27

  • 21:27 James_F: Docker: Publishing new php82 images with rc.7 for T314093

2022-11-25

  • 12:45 hashar: Reloaded Zuul for I717ad1 "add pipelinename to autogenerated:ci tags" # T214068

2022-11-23

  • 22:41 urandom: accidentally deleted deployment-sessionstore04
  • 15:07 James_F: Zuul: configure CI for operations/debs/varnish-modules for T321309

2022-11-22

  • 21:51 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/859615
  • 21:06 TheresNoTime: samtar@deployment-mwmaint02:~$ mwscript extensions/WikimediaMaintenance/createExtensionTables.php zhwiki pagetriage T323378
  • 14:11 TheresNoTime: [samtar@deployment-deploy03 ~]$ sudo keyholder arm

2022-11-21

  • 14:54 James_F: Zuul: [mediawiki/extensions/WikiLambda] Disable selenium tests for T294388
  • 14:41 vgutierrez: move deployment-cache-(text|upload)07 from role::cache::(text|upload)_haproxy to role::cache::(text|upload) - T323365

2022-11-18

  • 10:05 hashar: gerrit: change HEAD branch to point to `deploy/wmf/stable-3.5` # T307334

2022-11-17

2022-11-16

  • 20:53 thcipriani: restarting jenkins for update
  • 08:46 hashar: gerrit: reindexed accounts `ssh -p 29418 gerrit.wikimedia.org -- gerrit index start accounts --force` # T323135
  • 08:45 hashar: gerrit: deleted 192 LDAP accounts (scheme `gerrit:`) containing upper case characters which had an exact equivalent in an all lower case form. `All-Users.git` commit is 5e5800e # T323135
  • 08:45 hashar: gerrit: deleted 192 LDAP accounts (scheme `gerrit:`) containing upper case characters which had an exact equivalent in an all lower case form #

2022-11-15

  • 20:21 hashar: gerrit: removed legacy mixed case accounts and moved the extra secondary email to a mailto id for `gerrit:krinkle`, `gerrit:revi`, `gerrit:daniel kinzler`, `gerrit:harej` and `gerrit:samanthanguyen` T323135#8397539
  • 20:20 hashar: gerrit: removed legacy mixed case accounts for `gerrit:Fomafix` and `gerrit:Ricordisamoa` T323135#8397539
  • 16:25 James_F: Zuul: [mediawiki/services/parsoid] Make MW jobs voting in test
  • 15:57 James_F: Zuul: [mediawiki/extensions/CampaignEvents] Add Echo as phan dependency for T317231
  • 15:24 hashar: gerrit: converted, to all lower case, the Gerrit accounts `username:Kaldari`, `username:Fran McCrory` and `username:SamanthaNguyen` # T323097

2022-11-14

  • 17:36 hashar: Nuking unused Castor cached files in `/srv/jenkins-workspace/caches` # T323051
  • 17:35 hashar: Changing Castor cache saving from `/srv/jenkins-workspace/caches/` to `/srv/cache/caches/` which is the one served by rsync T323051
  • 17:34 hashar: Changing Castor cache saving from `/srv/jenkins-workspace/caches/` to `/srv/cache/caches/` which is the one served by rsync.
  • 14:19 James_F: Zuul: [mediawiki/services/function-schemata] Move from node 12 to 16

2022-11-10

  • 21:33 James_F: Docker: Upgrading quibble-buster-php74-coverage with a new vesion of phpunit-patch-coverage for T322864
  • 08:37 hashar: Rebuilding https://integration.wikimedia.org/ci/job/beta-code-update-eqiad/ , it probably failed due to Gerrit being restarted
  • 01:09 James_F: Zuul: Make PHP 8.1 voting for all quibble items for T316078
  • 01:05 James_F: Zuul: Drop mwext-php74-phan-docker from experimental for gate

2022-11-09

  • 23:02 James_F: Zuul: [mediawiki/core] Add PHP 8.1 phan job for T322278
  • 14:56 andrewbogott: fixed puppet breakage on several instances

2022-11-08

  • 20:17 dduvall: puppet re-enabled on gitlab-runner hosts (T322453) normal log level will be restored on next puppet run
  • 20:01 dduvall: temporarily enabling buildkitd debug logging on gitlab-runner hosts (T322453)
  • 15:58 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/854535
  • 15:26 vgutierrez: delete deployment-ms-be06 - T322231
  • 15:21 vgutierrez: shutdown deployment-ms-be06 - T322231
  • 06:39 vgutierrez: delete deployment-ms-be05 - T322231
  • 06:36 vgutierrez: delete deployment-ms-fe03 - T322554
  • 06:30 vgutierrez: downgrade to firejail 0.9.44.8-2 on deployment-imagescaler03
  • 05:51 vgutierrez: shutdown deployment-ms-fe03 - T322554

2022-11-07

  • 18:19 vgutierrez: let deployment-cache-upload07 use deployment-ms-fe04 - T322554
  • 15:57 vgutierrez: shutting down deployment-ms-be05 - T322231

2022-11-03

  • 20:31 hashar: Reloaded Zuul for Ic473bd
  • 19:19 brennen: attempting initial phab1004 phabricator deploy
  • 17:45 James_F: Zuul: Add CI for CategoryExplorer and EmailDeletedPages extensions and Cavendish and Pivot skins
  • 17:15 James_F: Zuul: Add experimental PHP 8.2 jobs for PHP extensions for T314093
  • 16:53 James_F: Docker: Publishing initial PHP 8.2 CI test images for T314093
  • 13:44 TheresNoTime: add `cxserver-beta` (port 8080) proxy for deployment-prep, T322323

2022-11-02

  • 22:44 James_F: Zuul: [mediawiki/tools/scap] Mark as archived for T322269
  • 09:56 vgutierrez: update to HAProxy 2.6.6 in deployment-cache-(text|upload)07 - T321775

2022-10-31

  • 15:56 andrewbogott: shutting down deployment-echostore01, deployment-ms-be0[56], deployment-mdb01, deployment-prometheus02, deployment-wikifeeds01 as per https://phabricator.wikimedia.org/T306068
  • 15:50 James_F: Zuul: [mediawiki/libs/RemexHtml] Re-enable PHP 8.1 CI for T311450

2022-10-28

  • 14:14 zabe: delete deployment-db07 and deployment-db08
  • 06:24 hashar: devtools: phabricator-prod-1001: `rmdir /etc/envoy/clusters.d /etc/envoy/listeners.d`
  • 06:24 hashar: devtools: `rmdir /etc/envoy/clusters.d /etc/envoy/listeners.d`
  • 06:23 hashar: devtools: set `profile::phabricator::main::dumps_rsync_clients: []` project wide to fix up Puppet. Settings got moved to a `role` ( https://gerrit.wikimedia.org/r/c/operations/puppet/+/842875 | T313360 )

2022-10-27

2022-10-26

2022-10-25

2022-10-24

  • 17:42 James_F: Zuul: Add new e-mail for Hoo man to allow list

2022-10-21

2022-10-20

2022-10-19

2022-10-18

  • 19:13 hashar: devtools: unbreak puppet on `deploy-1004.devtools.eqiad1.wikimedia.cloud` by applying `profile::mediawiki::scap_client::is_master: true` # T319681
  • 17:51 James_F: Zuul: [wikimedia/fundraising/SmashPig] Use composer-test-php74-only template
  • 08:03 vgutierrez: wipe deployment-cache-(text|upload)06 - T320930

2022-10-17

  • 16:21 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/843499 T309600
  • 14:01 vgutierrez: shutdown deployment-cache-(text|upload)06 - T320930
  • 13:56 vgutierrez: switch 185.15.56.36 from deployment-cache-text06 to deployment-cache-text07 - T320930
  • 13:54 vgutierrez: switch 185.15.56.35 from deployment-cache-upload06 to deployment-cache-upload07 - T320930
  • 11:02 urbanecm: deployment-prep: wikiadmin@172.16.0.238(wikishared)> source /srv/mediawiki-staging/php-master/extensions/ContentTranslation/sql/significant-edits.sql; # cswiki beta was failing with cx_significant_edits table not found
  • 09:41 wm-bot2: Increased quotas by 4 cores (T320932) - cookbook ran by arturo@nostromo

2022-10-14

  • 20:57 James_F: Zuul: Fix dependencies for BlueSpice extensions that depend on VisualEditor
  • 20:49 James_F: Docker: Publishing helm-linter without deprecated kubeyaml for T316348
  • 20:06 James_F: Docker: Publish images with php-ast upgraded from v1.0.14 to v1.1.0
  • 18:22 dduvall: upgrade of docker on contint hosts aborted due to missing buster package. agents are back online
  • 18:01 dduvall: upgrading docker on contint servers. agents will be available for a short time
  • 16:07 James_F: Zuul: [mediawiki/libs/Zest] Re-enable PHP 8.1 tests for T311463
  • 15:54 James_F: Zuul: [mediawiki/vendor] Add experimental job to check composer.lock for T74952
  • 13:48 James_F: Zuul: [css-sanitizer] Re-enable PHP 8.1 jobs for T311451

2022-10-13

2022-10-12

  • 20:09 dduvall: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/841996
  • 17:07 dduvall: deployed blubberoid using docker-registry.discovery.wmnet/wikimedia/blubber:2022-10-12-162839-production

2022-10-11

  • 15:49 dduvall: manually (re-)re-running `sudo -u mwpresync /usr/bin/scap stage-train --yes auto` after patch cleanup
  • 15:29 dduvall: correction ^ full command is `sudo -u mwpresync /usr/bin/scap stage-train --yes auto`
  • 15:28 dduvall: manually (re)running `stage-train --yes auto` following cron job failure
  • 10:53 TheresNoTime: add MVernon to deployment-prep, T316845#8307183

2022-10-10

  • 12:04 TheresNoTime: cherry 836953 picking for T316845 to deployment-prep/Swift

2022-10-08

2022-10-07

  • 13:27 James_F: Zuul: Add two former contractors to the CI allowlist

2022-10-06

2022-10-05

2022-10-04

2022-10-03

2022-09-30

  • 18:43 James_F: Triggering graceful restart of zuul to see if that fixes on-going merge/gerrit connection issues.
  • 17:07 James_F: Zuul: Make PHP 8.1 non-voting for all skins and extensions T316078
  • 16:46 James_F: Zuul: Make PHP 8.0 and PHP 8.1 voting for all skins and extensions in master for T300463 and T316078
  • 15:12 James_F: Docker: Building and publishing PHP 8.0.24 images for T315167
  • 02:33 James_F: Zuul: [mediawiki/core] Clean up REL1_35 and REL1_37 PHP 8 jobs
  • 02:30 James_F: Zuul: [mediawiki/core] Upgrade PHP 8.0 and 8.1 jobs to full vendor jobs for T300463 and T316078
  • 02:27 James_F: Zuul: Drop FIXME messages for T318093, being Declined

2022-09-29

  • 23:54 TheresNoTime: samtar@deployment-jobrunner04:~$ sudo systemctl stop php7.2-fpm.service && sudo systemctl start php7.4-fpm.service
  • 23:47 TheresNoTime: cherry pick 836953 to deployment-prep
  • 23:09 TheresNoTime: [samtar@deployment-deploy03 ~]$ sudo puppet agent -tv
  • 23:08 TheresNoTime: deployment-deploy03, `sudo systemctl stop php7.2-fpm.service`, `sudo systemctl start php7.4-fpm.service`
  • 23:03 TheresNoTime: ran `sudo puppet agent -tv` on deployment -deploy03, -mediawiki11, -mediawiki12
  • 13:56 James_F: Zuul: Drop PHP72 jobs everywhere, and PHP73 everywhere except old branches
  • 13:41 James_F: Zuul: [mediawiki/core] Drop PHP 7.2 and PHP 7.3 testing for master and wmf for T261872
  • 13:34 James_F: Zuul: [mediawiki/vendor] Drop PHP72 jobs, use only PHP74 ones
  • 12:30 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/836696

2022-09-28

2022-09-27

2022-09-26

  • 21:46 Daimona: Applying schema changes to the wikishared DB on beta for the CampaignEvents extension # T318379 T318120
  • 21:31 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # T318120
  • 20:00 dduvall: regenerating 314 jobs for deployment of https://gerrit.wikimedia.org/r/835262
  • 11:40 James_F: Docker: Building and publishing quibble-buster-php74-bundle
  • 11:40 James_F: Docker
  • 10:52 hashar: Rolling quibble/ruby jobs from php 7.4 to 7.2: `mediawiki-selenium-integration-docker` `legacy-quibble-rubyselenium-docker` # T318525
  • 09:35 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/834717/

2022-09-23

  • 18:09 James_F: Zuul: [wikimedia-cz/web-*] Migrate tests from php73+ to php74+
  • 18:06 James_F: Zuul: [labs/tools/guc] Migrate tests from php73+ to php74+
  • 18:04 James_F: Zuul: [labs/tools/coverme] Migrate tests from php73+ to php74+
  • 15:55 James_F: Docker: Building and publishing php74 versions of composer-security-check, mediawiki-phan, mediawiki-phan-testrun, and phpmetrics
  • 13:26 James_F: Zuul: Run php 7.4 phan for extensions and skins

2022-09-22

  • 20:40 zabe: shutoff deployment-db07 # T318126
  • 20:36 zabe: take deployment-prep out of read-only # T318126
  • 20:32 zabe: failover deployment-prep master from deployment-db07 to deployment-db09 # T318126
  • 20:25 zabe: set deployment-prep as read-only # T318126
  • 16:26 dancy: Upgrading scap to latest code revision in beta cluster
  • 10:38 zabe: deployment-db10: start replication # T318126

2022-09-21

  • 23:34 zabe: shutoff deployment-db08 # T318126
  • 23:00 jeena: restarting zuul to try and fix CI issues
  • 20:46 zabe: clone deployment-db10 from deployment-db08 # T318126
  • 18:49 TheresNoTime: cherry-picked gerrit:833839 to deployment-puppetmaster04, testing T317417
  • 18:19 zabe: install mariadb 10.6 via role::mariadb::beta on deployment-db10 # T318126
  • 17:55 zabe: create volume db10 and attach to deployment-db10 # T318126
  • 17:54 zabe: create deployment-db10 as g3.cores8.ram16.disk20 # T318126
  • 14:21 zabe: deployment-db09: restart mariadb # T318126
  • 13:55 TheresNoTime: modified deployment-prep "prometheus" security group - port 80, T315699
  • 13:18 James_F: Jenkins: Dropped 16 more old node jobs left on the server.
  • 13:11 James_F: Jenkins: Dropped four old node10 jobs left on the server (oojs-core-node10-browser-docker, ooui-special-node10-plus-php80-composer-docker, wikipeg-special-node10-plus-php72-composer-docker, wikipeg-special-node10-plus-php80-composer-docker)
  • 13:05 James_F: Jenkins: Dropped scap-pipeline-stretch and trigger-scap-pipeline-stretch following 26c74a1
  • 12:36 hashar: Reloaded Zuul for Remove Stretch from mediawiki/tools/scap - https://gerrit.wikimedia.org/r/833705
  • 09:46 andrewbogott: removed some stray whitespace in /var/lib/git/operations/puppet that was preventing rebase on deployment-puppetmaster04.deployment-prep.eqiad.wmflabs

2022-09-20

  • 22:00 zabe: deployment-db09: start replication # T318126
  • 20:06 zabe: deployment-db09: import dump into mariadb # T318126
  • 20:04 zabe: rsynced dump from deployment-db08 to deployment-db09 # T318126
  • 08:08 hashar: Upgrading CI and releases Jenkins plugins notably to update the git client T315897
  • 02:06 zabe: created backup of all databases on deployment-db08 # T318126

2022-09-19

  • 23:58 zabe: install mariadb 10.6 via role::mariadb::beta on deployment-db09 # T318126
  • 23:57 zabe: create volume db09 and attach to deployment-db09 # T318126
  • 23:57 zabe: create deployment-db09 as g3.cores8.ram16.disk20 # T318126
  • 20:24 dduvall: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/833066
  • 16:54 James_F: Zuul: [operations/mediawiki-config] Switch to PHP 7.4 jobs
  • 16:24 James_F: Zuul: [mediawiki/core] Add php80 and php81 to `check php` command
  • 15:36 James_F: Zuul: [mediawiki/core] run phan on PHP 7.4 for T316518
  • 13:50 James_F: Zuul: [mediawiki/core] Add a non-vendor php81 job for main branch for T316078
  • 12:06 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # T316128
  • 11:57 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # T316128

2022-09-16

  • 15:47 dancy: Upgrading scap to latest code revision in beta cluster

2022-09-15

  • 19:56 thcipriani: Updating development images on contint primary
  • 17:24 dduvall: Updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/832374
  • 11:03 TheresNoTime: soft reboot deployment-parsoid12, unresponsive

2022-09-13

  • 22:14 zabe: delete deployment-urldownloader02

2022-09-12

2022-09-09

  • 17:46 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # T311126
  • 17:25 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # T311126
  • 17:08 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # T316409
  • 16:36 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # T316409
  • 10:49 hashar: devtools: fixed fqdn of instances puppetmaster-1001 and gerrit-prod-1001 by manually editing `/etc/hosts` # T317404

2022-09-08

2022-09-07

  • 15:05 TheresNoTime: making hack changes to beta to test T317195 resolution

2022-09-06

  • 15:42 bd808: Promoted user 'StrikerBot' to admin on gitlab.wikimedia.org so that Striker can use the account to attach Developer accounts to gitlab via API.
  • 02:05 James_F: Running REL1_39 branch commands for T313920
  • 00:20 Krinkle: Prune various old mediawiki/core wmf branches for Gerrit usability, ref T303828

2022-09-02

  • 15:59 zabe: added vwalters as member of the deployment-prep project T316943
  • 13:40 Krinkle: " ENOENT: no such file or directory, lstat " failing quibble jobs on integration-agent-docker-1024

2022-09-01

2022-08-31

  • 23:41 zabe: deleted shutoff deployment-restbase03
  • 16:39 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension # T308738
  • 16:37 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (2/2) # T312870
  • 16:21 Daimona: Applying schema change to the wikishared DB on beta for the CampaignEvents extension (1/2) # T312870
  • 16:18 hashar: Tag Quibble 1.4.6 @ 8828487d0 # T305525 T314586
  • 15:27 James_F: Docker: Building and publishing quibble-fresnel image based on php74 for T316525

2022-08-30

  • 16:00 James_F: Zuul: [labs/tools/heritage] Switch postmerge job to tox-py37-coverage-publish for T316627
  • 09:32 hashar: doc: on doc1002: `sudo -u doc-uploader rm -fR /srv/doc/mw-tools-scap/` That got moved to `/srv/doc` and a redirect has been set. # T315541

2022-08-29

2022-08-25

2022-08-23

  • 17:40 hashar: Stopping Gerrit
  • 11:54 hashar: Manually applied a `docker-pkg` fix on contint2001 to prevent it from downloading unrelated images T310458

2022-08-22

  • 07:17 taavi: trying to disconnect jenkins from gearman and then re-connect to see if it helps with T315818
  • 07:12 taavi: restart zuul-merger on contint2001 T315818

2022-08-21

2022-08-19

  • 23:04 TheresNoTime: resized deployment-mwlog01's /srv volume, restarted
  • 22:57 TheresNoTime: shutting down deployment-mwlog01 for T315707
  • 15:50 James_F: Docker: Build stalled out for 30 minutes; terminated and re-started.
  • 15:15 dancy: Upgrading scap to latest code revision in beta cluster
  • 15:11 James_F: Docker: Building and publishing images with PHP 8.0.22 for T315167

2022-08-18

  • 17:01 hashar: Restarted zuul-merger on contint1001 # T315586
  • 16:42 hashar: Reloaded Zuul for Ie83b19
  • 13:14 awight: [beta] Deploying new kartotherian version

2022-08-17

  • 14:18 zabe: fix merge conflicts in deployment-prep private repo # T315394
  • 10:27 hashar: Built image docker-registry.discovery.wmnet/releng/commit-message-validator:1.0.0 # T315159

2022-08-16

  • 20:51 RhinosF1: beta: is down see wikitech-l and https://phabricator.wikimedia.org/T315350
  • 20:30 hashar: Repooled integration-agent-docker-1028 , it was mysteriously unreachable T315372
  • 19:18 Krinkle: mediawiki/extensions/EventLogging$ git remote-wildcard-br-d 'wmf/1.35*' 'wmf/1.36*' 'wmf/1.37*' 'wmf/1.38*'
  • 19:17 Krinkle: mediawiki/extensions/Scribunto$ git remote-wildcard-br-d 'wmf/1.35*' # ref T303828
  • 19:16 TheresNoTime: manually running `/usr/local/bin/wmf-beta-update-databases.py` on `deployment-deploy03`
  • 17:16 TheresNoTime: soft-rebooting deployment-mediawiki12

2022-08-12

  • 17:47 dancy: Restarting zuul
  • 17:42 dancy: Restarting Jenkins in an attempt to get CI jobs running again
  • 00:54 ori: On deployment-cache-{text,upload}06, ran: touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service . Certificate was close to expiry

2022-08-11

  • 21:11 mutante: restarted phd service on phab2001
  • 19:12 brennen: Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/merge_requests/16
  • 12:26 jnuche: Reenabled CI beta sync jobs after cluster incident
  • 11:48 jnuche: Temporarily disabled CI beta sync jobs until issue in cluster is resolved
  • 10:25 zabe: take deployment-prep out of read-only mode

2022-08-10

2022-08-09

  • 22:11 James_F: Docker: Building and publishing quibble-buster-php74-coverage for PHP7.4+ coverage
  • 21:56 James_F: Two failures in devimage build: releng/eventlogging and releng/buster-swift53 – nothing new from me, looks like they've been broken for a bit?
  • 21:17 James_F: Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/merge_requests/17
  • 21:07 James_F: Zuul: Enable PHP74 jobs on gate-and-submit-wmf pipeline [Re-try] for T293924
  • 19:42 James_F: Docker: Re-build and publish quibble-buster-php74 based on Wikimedia PHP not sury-php for T293851

2022-08-08

  • 15:56 taavi: gerrit: used `ssh gerrit.wikimedia.org -p 29418 gerrit close-connection` to disconnect four of sgimeno's stuck sessions
  • 14:43 James_F: jforrester@doc1002:~$ sudo -u doc-uploader rm -rf /srv/doc/wikibase-vuejs-components/ for T309872
  • 13:23 James_F: Zuul: [mediawiki/libs/metrics-platform] Run Java jobs on maven file paths for T314630
  • 10:28 jnuche: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/821166

2022-08-05

  • 16:02 James_F: Docker: Building and publishing composer-security-check:1.1.1 for T296967
  • 15:40 James_F: Zuul: [mediawiki/services/function-*] Switch coverage to node16
  • 15:33 James_F: Zuul: [mediawiki/libs/metrics-platform] Add experimental regular java jobs for T314630
  • 14:48 James_F: Zuul: Add WelpThatWorked to allow list
  • 14:48 James_F: Zuul: [mediawiki/extensions/MenuEditor] BlueSpiceDiscovery dependency is a skin

2022-08-04

2022-08-03

  • 21:05 James_F: Zuul: Doing a graceful restart to see if this clears the fork-bombed CI jobs.
  • 20:13 taavi: reloading zuul for https://gerrit.wikimedia.org/r/820212
  • 17:44 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/820171
  • 14:57 brennen: gitlab: flipping admin bit for bd808 for API testing purposes
  • 14:11 James_F: Zuul: [wikimedia/vuejs-components] Mark as archived for T309872
  • 12:00 James_F: Ran `zuul-test-repo design/codex postmerge` on contint2001 to finally run coverage for Codex
  • 11:58 James_F: Zuul: Run publish jobs on branches called 'main' too

2022-08-02

2022-08-01

  • 23:16 James_F: Zuul: [design/codex] Switch to node16
  • 23:16 James_F: 16:15:59 <+wikibugs> (Merged) jenkins-bot: Zuul: [design/codex] Switch to node16 [integration/config] - https://gerrit.wikimedia.org/r/819185 (owner: Jforrester)
  • 22:53 TheresNoTime: remove stuck beta deployment jobs
  • 22:51 dduvall: re-armed keyholder on deploy-1004.devtools following reboot
  • 22:50 James_F: Zuul: Don't use browser-direct-coverage where browser-coverage will do
  • 22:49 dduvall: modified `deployment_hosts` puppet config for devtools project to allow deployments from `deploy-1004`
  • 22:24 dduvall: armed keyholder with phabricator key on deploy-1004.devtools
  • 22:11 dduvall: setting puppetmaster to project standalone for deploy-1004.devtools
  • 21:01 James_F: Zuul: [mediawiki/extensions/Phonos] Add comment about deployment timing for T314306
  • 21:00 James_F: Zuul: [mediawiki/extensions/BlueSpiceCustomMenu] Add MenuEditor dependency
  • 15:53 taavi: reloading zuul for https://gerrit.wikimedia.org/r/819097
  • 09:14 TheresNoTime: clearing stuck beta CI jobs

2022-07-29

  • 22:16 James_F: Zuul: Configure CI for the forthcoming REL1_39 branches for T313919
  • 18:00 brennen: using standalone puppetmaster in devtools to test phabricator scap3 changes

2022-07-28

2022-07-27

  • 13:55 James_F: Zuul: [mediawiki/core] Add a non-vendor php80 job for main branch T300463
  • 13:08 James_F: Zuul: [mediawiki/core] Make php80 voting on REL1_38 for T274965
  • 13:04 James_F: Zuul: Add php81 experimental job everywhere we have php80
  • 12:39 James_F: Zuul: [mediawiki/extensions/WikibaseLexeme] Add WikibaseLexemeCirrusSearch dep
  • 03:48 Krinkle: Click "Disable publishing" for a dozen repos created recently, including OAuthRateLimiter, ref T143162, T193565

2022-07-25

2022-07-23

2022-07-21

  • 21:55 dancy: Upgrading scap to 4.11.2-1+0~20220720160115.349~1.gbpd4a6cb in beta cluster

2022-07-20

  • 15:43 dancy: Upgrading scap to 4.11.1-1+0~20220720154238.348~1.gbp94de82 in beta cluster
  • 13:19 James_F: Zuul: [mediawiki/extensions/VueTest] Add extension-codehealth pipeline

2022-07-19

  • 17:40 dancy: Upgrading scap to 4.11.0-1+0~20220719173732.346~1.gbpe07bc9 in beta cluster
  • 17:00 urbanecm: deployment-prep: urbanecm@deployment-mwmaint02:~$ mwscript extensions/GrowthExperiments/maintenance/migrateWikitextMentorList.php --wiki=arwiki # T310905

2022-07-18

2022-07-17

2022-07-16

  • 00:10 mutante: doc1002 - sudo systemctl start rsync-doc-doc2001.codfw.wmnet - Icinga alerted after an 'rsync warning: some files vanished before they could be transferred (code 24)' - but all is ok on next attempt

2022-07-15

2022-07-14

  • 18:50 James_F: Docker: Building node16 images for CI for T313075
  • 14:52 James_F: Zuul: [mediawiki/skins/BlueSpiceSkin] Archive for T203215
  • 14:48 James_F: Zuul: [mediawiki/extensions/BlueSpiceExtensions] Archive
  • 14:42 James_F: Zuul: [mediawiki/extensions/BlueSpiceBookshelfUI] Archive for T268085
  • 14:38 James_F: Zuul: [mediawiki/tools/wikilambda-cli] Install node14 CI

2022-07-13

2022-07-12

  • 17:29 Amir1: dropping tl_namespace and tl_title from templatelinks in fawiki (T312865)

2022-07-11

2022-07-10

  • 00:07 Krinkle: krinkle@mediawiki12$ sudo enable-puppet

2022-07-09

  • 20:39 ori: ori@deployment-mediawiki12:~$ sudo apt install php-tideways-xhprof-dbgsym
  • 17:25 ori: Cherry-picked Ief73cc553 (varnish: use libvmod-querysort on Beta Cluster) on deployment-prep Puppetmaster. Can be reverted if there are any issues.
  • 06:16 Krinkle: krinkle@mediawiki12$ sudo disable-puppet
  • 06:08 ori: ori@deployment-mediawiki12: userdel systemd-coredump, followed by apt install systemd-coredump
  • 05:50 Krinkle: krinkle@deployment-mediawiki-12$ sudo apt-get install systemd-coredump # ref T312689

2022-07-07

  • 22:42 TheresNoTime: clear stuck beta deployment jobs (again), T72597
  • 21:10 TheresNoTime: clear stuck beta deployment jobs, T72597
  • 16:47 urbanecm: deployment-prep: wikiadmin@172.16.3.206(enwiki)> delete from growthexperiments_mentor_mentee where gemm_mentor_id=93651; # testing a specific workflow in Special:MentorDashboard
  • 12:22 hashar: integration: rebooting `integration-agent-docker-1039` T312534

2022-07-05

2022-06-30

  • 22:02 TheresNoTime: unstuck beta-mediawiki-config-update-eqiad jobs, will comment at T72597
  • 21:05 TheresNoTime: cancelled beta-code-update-eqiad#398138 to make way for pending beta-scap-sync-world#57641, queued another beta-code-update-eqiad
  • 16:47 taavi: reloading zuul to deploy https://gerrit.wikimedia.org/r/810053

2022-06-29

  • 14:48 ori: Clearing data from incomplete migration on Wikifunctionswiki via sql.php
  • 13:39 TheresNoTime: clearing stuck beta deployment jobs, watching to ensure they catch up :')

2022-06-28

2022-06-27

2022-06-24

  • 20:52 taavi: added `denisse` as a member

2022-06-23

2022-06-22

  • 17:36 taavi: gerrit: add tfellows to the extension-OpenBadges group per request in T308278
  • 17:35 taavi: gerrit: create group extension-JsonData with robla in it, make it an owner of mediawiki/extensions/JsonData per request in T303147
  • 16:19 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/807586
  • 09:35 hashar: Switched `gitlab-prod-1001.devtools.eqiad1.wikimedia.cloud` instance to use the project Puppet master `puppetmaster-1001.devtools.eqiad1.wikimedia.cloud`
  • 09:08 hashar: contint1001 , contint2002: deleting `.git/logs` from all zuul-merger repositories. We do not need the reflog `sudo -u zuul find /srv/zuul/git -type d -name .git -print -execdir rm -fR .git/logs \;` # T307620
  • 09:00 hashar: contint1001 , contint2002: setting `core.logallrefupdates=false` on all Zuul merger git repositories: `sudo -u zuul find /srv/zuul/git -type d -name .git -print -execdir git config core.logallrefupdates false \;` # T307620
  • 07:46 hashar: Building operations-puppet docker image for https://gerrit.wikimedia.org/r/c/integration/config/+/807180

2022-06-21

  • 22:01 brennen: gitlab-runners: re-registering all shared runners
  • 17:55 dancy: Upgrading scap to 4.9.4-1+0~20220621174226.320~1.gbp56e4d4 in beta cluster

2022-06-20

  • 16:30 urbanecm: add sgimeno as a project member (Growth engineer with need for access)
  • 15:50 ori: On deployment-cache-{text,upload}06, ran: touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service (T310957)
  • 14:07 ori: restarted acme-chief on deployment-acme-chief03

2022-06-17

  • 17:15 ori: provisioned deployment-cache-text07 in deployment-prep to test query normalization via VCL
  • 01:08 TimStarling: on deployment-docker-cpjobqueue01 and deployment-docker-changeprop01 I redeployed the changeprop configuration, reverting the PHP 7.4 hack

2022-06-16

  • 12:24 hashar: gitlab: runner-1030: `docker volume prune -f`
  • 12:24 hashar: gitlab: runner-1026: `docker volume prune -f`
  • 10:02 elukey: ran `scap install-world --batch` to allow scap/puppet to work on ml-cache100[2,3]

2022-06-15

  • 22:39 brennen: phabricator: tagged release/2022-06-15/1 (T310742)
  • 16:31 hashar: integration-agent-docker-1035: docker image prune
  • 15:26 dancy: Upgrading scap to 4.9.4-1+0~20220615151557.315~1.gbped3b8d in beta cluster

2022-06-14

  • 21:30 TheresNoTime: clear out stuck `beta-scap-sync-world` jobs (repeatedly per each queued `beta-mediawiki-config-update-eqiad` job), queued jobs now running. monitored for until each job had run successfully. jobs up to date
  • 17:18 brennen: starting 1.39.0-wmf.16 (T308069) transcript in deploy1002:~brennen/1.39.0-wmf.16.log
  • 13:35 TheresNoTime: clear stuck `beta-scap-sync-world` job, other queued jobs now running. Cancel running `beta-update-databases-eqiad` job, will ensure it runs on the next timer
  • 00:42 TimStarling: on deployment-deploy03 removed helm2, as was done in production

2022-06-13

  • 22:04 TheresNoTime: cleared out stalled Jenkins beta jobs on `deployment-deploy03`, manually started `beta-code-update-eqiad` job & watched to completion. all caught up
  • 04:33 hashar: Restarting Docker on contint1001.wikimedia.org , apparently can't build images anymore

2022-06-12

2022-06-10

  • 15:20 James_F: Zuul: [mediawiki/extensions/SearchVue] Add initial CI jobs for T309932
  • 08:28 hashar: Reloaded Zuul to remove mediawiki/services/parsoid from CI dependencies # https://gerrit.wikimedia.org/r/c/integration/config/+/803990
  • 04:27 TimStarling: on deployment-deploy03 running scap sync-world -v with PHP 7.4 for T295578
  • 04:03 TimStarling: on deployment-deploy03 running scap sync-world -v with PHP 7.2 for T295578 sanity check

2022-06-09

  • 22:49 dancy: Upgrading scap to 4.9.1-1+0~20220609211227.304~1.gbpe48c42 in beta cluster
  • 16:39 brennen: gitlab shared runners: re-registering to apply image allowlist configuration

2022-06-08

  • 17:14 hashar: Reloaded Zuul for I393422
  • 15:57 dancy: Set `profile::mediawiki::php::restarts::ensure: present` in deployment-prep hiera config for T237033
  • 09:28 hashar: Reloaded Zuul for "Add doc publish for Translate" https://gerrit.wikimedia.org/r/792134

2022-06-06

  • 14:37 James_F: Zuul: [mediawiki/extensions/ImageSuggestions] Mark as in production for T302711

2022-06-02

  • 15:33 dancy: Upgrading scap to 4.8.1-1+0~20220602153109.295~1.gbp318d9c in beta cluster
  • 11:26 hashar: Restarting Jenkins on contint2001
  • 11:19 hashar: Restarting Jenkins on releases1002

2022-05-31

  • 21:16 dancy: Upgrading scap to 4.8.0-1+0~20220531211114.292~1.gbp8dbbcf in beta cluster
  • 17:40 dancy: Upgrading scap to 4.8.0-1+0~20220531173912.291~1.gbp21a7ef in beta cluster
  • 17:33 dancy: Reverted to scap 4.8.0-1+0~20220524160924.288~1.gbp794a08 in beta cluster
  • 17:07 dancy: Upgrading scap to 4.8.0-1+0~20220531170512.289~1.gbp143729 in beta cluster

2022-05-30

  • 11:47 jelto: apply gitlab-settings to gitlab1004 - T307142
  • 11:46 jelto: apply gitlab-settings to gitlab1003 - T307142

2022-05-28

  • 19:09 TheresNoTime: deployment-deploy04 live, not referenced by anything T309437

2022-05-27

  • 22:55 zabe: zabe@deployment-mwmaint02:~$ mwscript extensions/WikiLambda/maintenance/updateTypedLists.php --wiki=wikifunctionswiki --db # started ~20 min ago
  • 22:49 TheresNoTime: manually running database update script: samtar@deployment-deploy03:~$ /usr/local/bin/wmf-beta-update-databases.py
  • 22:09 TheresNoTime: samtar@deployment-deploy03:~$ sudo keyholder arm
  • 21:44 TheresNoTime: hard rebooted deployment-deploy03 as soft reboot unresponsive
  • 21:44 bd808: `sudo wmcs-openstack role add --user zabe --project deployment-prep projectadmin` (T309419)
  • 21:10 zabe: zabe@deployment-deploy03:~$ sudo keyholder arm
  • 20:53 bd808: `sudo wmcs-openstack role add --user samtar --project deployment-prep projectadmin` (T309415)
  • 20:49 dancy: Initiated hard reboot of deployment-deploy03.deployment-prep

2022-05-26

  • 18:33 dancy: Updated Jenkins beta-* job configs
  • 16:51 TheresNoTime: manually triggered beta-update-databases-eqiad post-merge of 2c7b5825
  • 16:51 brennen: puppetmaster-1001.devtools: resetting ops/puppet checkout to production branch

2022-05-25

  • 18:38 TheresNoTime: (@ ~18:20UTC) samtar@deployment-mwmaint02:~$ mwscript resetUserEmail.php --wiki=wikidatawiki Mahir256 [snip] T309230
  • 15:46 dancy: Restarted apache2 on gerrit1001

2022-05-24

2022-05-23

  • 19:21 inflatador: Deleted deployment-elastic0[5-7] in favor of newer bullseye hosts T299797
  • 18:37 dancy: Reverted to scap 4.7.1-1+0~20220505181519.270~1.gbpeb47ae in beta cluster
  • 18:35 dancy: Upgrading beta cluster scap to 4.7.1-1+0~20220523183110.280~1.gbpaa0826
  • 14:49 James_F: Zuul: Enforce Postgres and SQLite support via in-mediawiki-tarball
  • 08:37 elukey: move kafka jumbo in deployment-prep to fixed uid/gid - T296982
  • 08:29 elukey: move kafka main in deployment-prep to fixed uid/gid - T296982
  • 08:06 elukey: move kafka logging in deployment-prep to fixed uid/gid - T296982

2022-05-22

2022-05-21

2022-05-20

2022-05-19

2022-05-18

  • 19:31 hashar: Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/793028
  • 18:45 brennen: gitlab: created placeholder /repos/mediawiki group for squatting purposes
  • 08:29 hashar: Updating SSH Build agent from 1.31.5 to 1.32.0 on CI Jenkins to prevent an issue when uploading `remoting.jar` # T307339#7937268
  • 07:32 hashar: Deleting Jenkins agent configuration for `integration-castor03` # T252071

2022-05-17

  • 23:26 James_F: Zuul: [mediawiki/extensions/Phonos] Install basic quibble CI for T308558

2022-05-16

2022-05-14

  • 23:19 James_F: Zuul: Add Dreamy_Jazz to CI allow list
  • 23:17 James_F: Zuul: [mediawiki/extensions/LocalisationUpdate] Move out of production section
  • 20:25 urbanecm: add TheresNoTime (samtar) as a project member per request

2022-05-13

2022-05-12

  • 22:09 inflatador: bking@deployment-elastic05 banned deployment-elastic05 from beta ES cluster in preparation for decom T299797
  • 19:53 hashar: gerrit: triggering full replication to gerrit2001 to test T307137
  • 16:00 hashar: contint2001 and contint1001 now automatically run `docker system prune --force` every day and `docker system prune --force` on Sunday | https://gerrit.wikimedia.org/r/c/operations/puppet/+/773784/
  • 15:05 brennen: gitlab-prod-1001.devtools: soft reboot
  • 00:46 brennen: gitlab: disabling container registries on all existing projects (T307537)

2022-05-11

  • 23:20 brennen: gitlab-prod-1001.devtools: container registry currently enabled
  • 18:58 brennen: gitlab-prod-1001.devtools: setting to use devtools standalone puppetmaster

2022-05-10

2022-05-09

2022-05-08

  • 12:33 urbanecm: deployment-prep: urbanecm@deployment-mwmaint02:~$ foreachwikiindblist growthexperiments extensions/GrowthExperiments/maintenance/migrateMenteeOverviewFiltersToPresets.php --update # T304057

2022-05-06

  • 12:55 hashar: Migrated Castor service from integration-castor03 to integration-castor05 # T252071

2022-05-05

2022-05-04

2022-05-03

2022-05-02

2022-04-29

2022-04-28

2022-04-27

2022-04-26

  • 15:40 brennen: train 1.39.0-wmf.9 (T305215): no current blockers - expect to start train ops after the toolhub deployment window wraps, so some time after 17:00 UTC; taking a pre-train stroll-around-the-block break before that.
  • 13:46 James_F: Deleting deployment-mx02.deployment-prep.eqiad1.wikimedia.cloud for T306068
  • 13:38 James_F: Zuul: [mediawiki/extensions/SimilarEditors] Install basic prod CI for T306897
  • 12:33 hashar: Manually pruned dangling docker images on contint1001 and contint2001
  • 08:30 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/780824
  • 08:09 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/785204

2022-04-25

2022-04-20

  • 16:25 zabe: root@deployment-cache-upload06:~# touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service

2022-04-18

  • 19:27 brennen: gitlab runners: deleting a number of stale runners with no contacts in > 2 months which are most likely no longer extant
  • 16:49 brennen: phabricator: created phame blog https://phabricator.wikimedia.org/phame/blog/view/22/ for T306329
  • 16:48 brennen: phabricator: adding self to acl*blog-admins
  • 15:33 James_F: Shutting off deployment-wdqs01 from the Beta Cluster project per T306054; it's apparently unused, so this shouldn't break anything.

2022-04-14

2022-04-12

2022-04-08

2022-04-07

  • 06:07 urbanecm: deployment-prep: foreachwiki extensions/GrowthExperiments/maintenance/T304461.php --delete # T304461, output is at P24204
  • 05:54 urbanecm: deployment-prep: mwscript extensions/GrowthExperiments/maintenance/T304461.php --wiki={enwiki,cswiki} --delete # T304461

2022-04-06

  • 20:03 thcipriani: rebooting phabricator
  • 11:44 James_F: Zuul: [mediawiki/extensions/WikiEditor] Add BetaFeatures to phan deps for T304596

2022-04-04

2022-04-02

2022-03-31

2022-03-29

  • 14:20 James_F: Zuul: [mediawiki/extensions/IPInfo] Add EventLogging phan dependency for T304948
  • 12:32 hashar: integration-agent-docker-1039: clearing leftover pipelinelib builds: `sudo rm -fR /srv/jenkins/workspace/workspace/*` T304932 T302477
  • 05:35 hashar: Relocate castor directory on integration-castor03 from `/srv/jenkins-workspace/caches` to `/srv/castor` https://gerrit.wikimedia.org/r/c/operations/puppet/+/774771

2022-03-28

2022-03-27

  • 13:23 James_F: Zuul: [releng/phatality] Make the node14 CI job voting T304736

2022-03-26

  • 02:37 Reedy: beta-update-databases-eqiad is back to @hourly

2022-03-25

  • 23:51 Reedy: temporarily turning off period building of beta-update-databases-eqiad until it's run to completion
  • 23:21 Reedy: running /usr/local/bin/wmf-beta-update-databases.py manually
  • 20:22 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/773866
  • 20:02 brennen: mediawiki-new-errors: ran check-new-error-tasks/check.sh and cleared "resolved" filters
  • 09:43 hashar: Building Quibble Docker images to rename quibble-with-apache to quibble-with-supervisord

2022-03-24

  • 20:00 hashar: reloading Zuul for Id844e1 # T299320
  • 20:00 James_F: Clearing integration-castor03:/srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/mwgate-node14-docker/_cacache/content-v2/sha512/22/ for T304652
  • 15:00 James_F: Zuul: [design/codex] Publish code coverage reports for T303899
  • 09:37 Lucas_WMDE: killed a beta-scap-sync-world job manually, let’s see if that helps getting beta updates unstuck

2022-03-23

  • 17:35 brennen: restarting phabricator for T304540, brief downtime expected
  • 14:56 dancy: Updating scap to 4.5.0-1+0~20220321191814.216~1.gbp24bc64 in beta cluster

2022-03-22

2022-03-21

  • 08:35 hashar: The castor cache for mediawiki/core wmf/1.39-wmf.1 is actually empty!
  • 08:32 hashar: Nuking npm castor cache /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/wmf-quibble-selenium-php72-docker/npm/ # T300203

2022-03-18

  • 14:18 elukey: restart testing of kafka logging TLS certificates (may affect logstash in beta, ping me in case it is a problem)
  • 13:22 hashar: Rolling back Quibble jobs from 1.4.4 T304147
  • 07:41 elukey: experimenting with PKI and kafka logging on deployment-prep, logstash dashboard/traffic may be down (please ping me in case it is a problem)

2022-03-17

2022-03-16

2022-03-15

2022-03-14

  • 23:57 James_F: Zuul: [ooui] Switch from node12 to node14
  • 23:46 James_F: Docker: Publishing node14-test-browser-php80-composer:0.1.0
  • 23:27 James_F: Zuul: Drop legacy node12 templates except the one for Services
  • 23:10 James_F: Zuul: [oojs/router] Drop custom job and just use the generic node14 one
  • 23:08 James_F: Zuul: [oojs/core] Switch from node12 to node14 jobs
  • 22:46 James_F: Zuul: [unicodejs] Switch from node12 to node14
  • 22:25 James_F: Zuul: [VisualEditor/VisualEditor] Switch from node12 to node14
  • 19:51 James_F: Zuul: Migrate almost all libraries and tools from node12 to node14 for T267890
  • 15:36 James_F: Zuul: Switch extension-javascript-documentation from node12 to node14 for T267890
  • 15:21 James_F: Zuul: Switch all mwgate jobs from node12 to node14 for T267890
  • 09:52 hashar: Building Quibble Docker images for https://gerrit.wikimedia.org/r/757867 | T300340
  • 08:54 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/770079

2022-03-11

  • 04:02 zabe: zabe@deployment-mwmaint02:~$ mwscript extensions/CentralAuth/maintenance/populateGlobalEditCount.php --wiki=metawiki

2022-03-10

2022-03-09

2022-03-08

  • 20:31 brennen: requiring 2fa for all users under /repos

2022-03-07

  • 10:53 zabe: restarted apache on deployment-mediawiki11 # T302699

2022-03-04

2022-03-03

2022-03-02

  • 19:53 James_F: Zuul: Configure CI for the forthcoming REL1_38 branches for T302908
  • 15:56 dancy: Updating scap to 4.4.1-1+0~20220302155149.192~1.gbpe351d6 in beta
  • 15:27 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/767493
  • 15:04 taavi: resolve merge conflicts on deployment-puppetmaster04

2022-02-28

  • 19:29 brennen: removing mutante (dzahn) as application-level gitlab admin; adding as owner of /repos for the time being to facilitate some migrations
  • 19:22 dancy: Update scap to 4.4.0-1+0~20220228192031.189~1.gbp0a8436 in beta
  • 19:17 brennen: adding mutante (dzahn) as application-level gitlab admin

2022-02-26

  • 20:05 zabe: apply T302658 on deployment-prep centralauth databases
  • 13:24 zabe: apply T302660 on deployment-prep centralauth databases
  • 13:19 zabe: apply T302659 on deployment-prep centralauth databases

2022-02-24

  • 16:02 dancy: Updating beta cluster scap to 4.4.0-1+0~20220224155429.187~1.gbp66c5c2
  • 13:44 hashar: integration/config now fully enforces shellcheck https://gerrit.wikimedia.org/r/756088
  • 13:13 hashar: Built image docker-registry.discovery.wmnet/releng/castor:0.2.5
  • 13:10 hashar: Updating castor-save-workspace-cache job https://gerrit.wikimedia.org/r/764817
  • 11:54 hashar: Built image docker-registry.discovery.wmnet/releng/shellcheck:0.1.1
  • 11:41 hashar: Built image docker-registry.discovery.wmnet/releng/sonar-scanner:4.6.0.2311-2
  • 11:04 hashar: Built image docker-registry.discovery.wmnet/releng/operations-puppet:0.8.6
  • 08:58 hashar: Built image docker-registry.discovery.wmnet/releng/mediawiki-phan-testrun:0.2.1

2022-02-23

  • 23:21 dancy: Update beta cluster scap to 4.3.1-1+0~20220223231645.183~1.gbp8ddb60
  • 20:10 dancy: Updating scap in beta
  • 19:23 hashar: Built docker-registry.discovery.wmnet/releng/logstash-filter-verifier:0.0.3
  • 12:41 hashar: Depooling integration-agent-puppet-docker-1002 , pooling integration-agent-puppet-docker-1003 # T252071
  • 10:21 hashar: Created Bullseye instance integration-agent-puppet-docker-1003 https://horizon.wikimedia.org/project/instances/96cf9ddc-daa3-4c9f-8c21-cdd58e95973e/ # T252071
  • 08:37 hashar: Removing Stretch based integration-agent-qemu-1001 # T284774

2022-02-22

  • 16:41 zabe: zabe@deployment-mwmaint02:~$ foreachwiki migrateUserGroup.php oversight suppress # T112147
  • 13:28 urbanecm: deployment-prep: Create database for incubatorwiki (T210492)

2022-02-21

  • 14:58 hashar: Reverting Quibble jobs from 1.4.0 to 1.3.0 # T302226
  • 07:31 hashar: Switching Quibble jobs from Quibble 1.3.0 to 1.4.0 # T300340 T291549 T225730
  • 07:27 hashar: Refreshing all Jenkins jobs

2022-02-20

  • 10:32 qchris: Manually triggering replication run of Gerrit's analytics/datahub to populate newly created analytics-datahub GitHub repo

2022-02-19

  • 12:19 taavi: restart trafficserver-tls on deployment-cache-text06
  • 02:15 James_F: Zuul: [design/codex] Publish the Netlify preview on every patch for T293705
  • 00:35 James_F: Manually re-triggered a build of the docs of Codex (via `zuul-test-repo design/codex postmerge`) now that we actually set the environment vars for T293705

2022-02-18

2022-02-17

  • 21:48 brennen: added Dzahn (mutante) to acl*repository-admins on phabricator
  • 15:58 zabe: root@deployment-cache-upload06:~# touch /srv/trafficserver/tls/etc/ssl_multicert.config && systemctl reload trafficserver-tls.service # T301995
  • 13:35 hashar: Reloading Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/763207
  • 13:20 Reedy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/763458
  • 11:12 hashar: Bringing deployment-deploy03 back
  • 11:07 hashar: Disabled deployment-deploy03 Jenkins agent in order to revert some mediawiki/core patch and test the outcome

2022-02-16

  • 18:20 hashar: Tag Quibble 1.4.1 @ d4bd2801de # T300301
  • 16:42 dancy: Updating to scap 4.3.1-1+0~20220216163646.173~1.gbp823710?in beta
  • 12:55 jelto: apply gitlab-settings to gitlab-prod-1001.devtools.eqiad1.wikimedia.cloud
  • 10:09 hashar: Reloading Zuul for I997fee
  • 09:59 hashar: Reloading Zuul for I2ffa01

2022-02-15

  • 21:12 dancy: rebooting deployment-mediawiki12.deployment-prep.eqiad1.wikimedia.cloud to try to revive beta wikis
  • 20:59 dancy: Killed runaway puppet agent on deployment-mediawiki11.deployment-prep.eqiad1.wikimedia.cloud
  • 16:24 hashar: Restarting CI Jenkins for plugins updates
  • 16:21 hashar: Upgrading Jenkins plugins on releases Jenkins
  • 16:06 hashar: Rollback fresh-test Jenkins job to the version intended to run on integration-agent-qemu-1001
  • 15:26 hashar: Reloading Zuul for If80b4b

2022-02-14

  • 16:28 dancy: Updating scap in beta cluster to 4.3.1-1+0~20220211225318.167~1.gbp315b2c
  • 16:16 Amir1: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/762471
  • 15:41 hashar: Messing up with fresh-test Jenkns job to polish up Qemu / qcow2 integration
  • 14:26 jnuche: Jenkins upgrade complete T301361
  • 13:54 jnuche: Jenkins contint instances are going to be restarted soon

2022-02-12

  • 18:22 urbanecm: deployment-prep: reboot deployment-eventgate-3 (T289029)

2022-02-10

2022-02-09

  • 15:22 taavi: deleted shutoff deployment-mx02

2022-02-08

  • 17:34 taavi: remove scap from deployment-kafka-main/jumbo
  • 16:23 taavi: hard reboot misbehaving deployment-echostore01
  • 13:39 taavi: delete /srv/mediawiki-staging.save on deployment-deploy03

2022-02-07

2022-02-04

2022-02-03

  • 18:41 taavi: deployment-prep: route /w/api.php to deployment-mediawiki11, trying to reduce load on a single server
  • 14:53 hashar: Building Docker images for Quibble 1.4.0 (prepared by kostajh)
  • 13:51 kostajh: Tag Quibble 1.4.0 @ 4231bc2 # T300340 T291549 T225730

2022-02-02

  • 16:50 dancy: Upgrading scap to 4.2.2-1+0~20220202164708.157~1.gbp376a16 in beta.
  • 16:12 dancy: Upgrading scap to 4.2.2-1+0~20220201161808.156~1.gbp1c1c64 in beta

2022-02-01

2022-01-31

  • 19:01 James_F: Re-configured Jenkins job mediawiki-i18n-check-docker to 9e3ea96 for T222216
  • 10:49 hashar: Added integration-agent-qemu-1003 with label `Qemu` # T284774

2022-01-28

  • 21:45 taavi: running recountCategories.php on all beta wikis per T299823#7652496
  • 14:27 hashar: taking heapdump of CI Jenkins `sudo -u jenkins /usr/lib/jvm/java-11-openjdk-amd64/bin/jmap -dump:live,format=b,file=/var/lib/jenkins/202201281527.hprof xxxx`

2022-01-27

  • 20:26 hashar: Successfully published image docker-registry.discovery.wmnet/releng/logstash-filter-verifier:0.0.2 # T299431
  • 19:34 Amir1: Reloading Zuul to deploy 757464
  • 16:00 hashar: Pooling back agents 1035 1036 1037 1038 , they could not connect due to ssh host mismatch since yesterday they all got attached to instance 1033 and accepted that host key # T300214
  • 09:16 hashar: integration: cumin --force 'name:docker' 'apt install rsync' # T300236
  • 09:05 hashar: integration: cumin --force 'name:docker' 'apt install rsync' # T300214
  • 00:24 thcipriani: restarting jenkins

2022-01-26

  • 20:29 hashar: Completed migration of integration-agent-docker-XXXX instances from Stretch to Bullseye - T252071
  • 19:55 hashar: deleting integration-agent-docker-1014 which only has the `codehealth` label. A short live experiment no more used since October 2nd 2019 - https://gerrit.wikimedia.org/r/c/integration/config/+/540362 - T234259
  • 18:56 hashar: integration: pooled in Jenkins a few more Bullseye docker agents for T252071
  • 18:17 hashar: integration: pooled in Jenkins a few Bullseye docker agent for T252071
  • 16:45 hashar: integration: creating integration-agent-docker-1023 based on buster with new flavor `g3.cores8.ram24.disk20.ephemeral60.4xiops` # T290783

2022-01-25

  • 20:17 James_F: Zuul: [mediawiki/extensions/CentralAuth] Drop UserMerge dependency
  • 16:39 James_F: Zuul: Mark Math extension as now tarballed in parameter_functions for T232948
  • 15:57 James_F: Zuul: [mediawiki/extensions/Math] Add Math to the main gate for T232948
  • 13:44 hashar: Jenkins CI: added Logger https://integration.wikimedia.org/ci/log/ProcessTree%20-%20T299995/ to watch `hudson.util.ProcessTree` for T299995
  • 10:02 hashar: integration: removing usage of `role::ci::slave::labs::docker::docker_lvm_volume` in Horizon following https://gerrit.wikimedia.org/r/c/operations/puppet/+/755948 . Docker role instances now always have a 24G partition for Docker
  • 09:59 hashar: integration-agent-qemu-1001: resized /srv to 100% disk free: `lvextend -r -l +100%FREE /dev/mapper/vd-second--local--disk` # T299996
  • 09:59 hashar: integration-agent-qemu-1001: resizing /dev/mapper/vd-second--local--disk (/srv) to 20G : `resize2fs -p /dev/mapper/vd-second--local--disk 20G` # T299996
  • 09:51 hashar: integration-agent-qemu-1001: resizing /dev/mapper/vd-second--local--disk (/srv) to 20G : `resize2fs -p /dev/mapper/vd-second--local--disk 20G`
  • 09:51 hashar: integration-agent-qemu-1003: nuked /dev/vd/second-local-disk and /srv to make room for a docker logical volume. That has fixed puppet T299996
  • 09:22 Reedy: unblocked beta again
  • 07:32 Krinkle: integration-castor03:/srv/jenkins-workspace/caches$ sudo rm -rf castor-mw-ext-and-skins/

2022-01-24

  • 21:44 Reedy: unstick beta ci jobs
  • 21:19 jeena: reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/756523
  • 20:36 Krinkle: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/756139
  • 17:28 hashar: Nuke castor caches on integration-castor03 : sudo rm -fR /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/{quibble-vendor-mysql-php72-selenium-docker,wmf-quibble-selenium-php72-docker} # T299933
  • 17:28 hashar: Nuke castor caches on integration-castor03 : sudo rm -fR /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/{quibble-vendor-mysql-php72-selenium-docker,wmf-quibble-selenium-php72-docker}

2022-01-22

  • 13:40 taavi: apply T299827 on deployment-prep centralauth database
  • 11:44 taavi: restart varnish-frontend.service on deployment-cache-upload06 to clear puppet agent failure alerts

2022-01-21

  • 18:12 taavi: resolved merge conflicts on deployment-puppetmaster04
  • 15:50 hashar: integration-puppetmaster-02: deleted 2021 snapshot tags in puppet repo and ran `git gc --prune=now`

2022-01-20

  • 20:24 James_F: Zuul: [Kartographer] Add parsoid as dependency for CI jobs
  • 20:22 James_F: Zuul: [DiscussionTools] Add Gadgets as dependency for Phan jobs
  • 20:04 dancy: Jenkins beta jobs are back online, using scap prep auto now.
  • 19:19 dancy: Pausing beta Jenkins jobs to make a copy of /srv/mediawiki-staging in preparation for testing
  • 19:10 dancy: Unpacking scap (4.1.1-1+0~20220120175448.144~1.gbp517f9d) over (4.1.1-1+0~20220113154148.133~1.gbp6e3a17) on deploy03
  • 18:07 hashar: Updating Quibble jobs to have MediaWiki files written on the hosts /srv partition (38G) instead of inside the container which ends in /var/lib/docker (24G) https://gerrit.wikimedia.org/r/755743 # T292729
  • 16:31 hashar: Rebalancing /var/lib/docker and /srv partitions on CI agents | https://gerrit.wikimedia.org/r/755713
  • 12:12 hashar: contint2001 deleting all the Docker images (they will be pulled as needed)
  • 12:10 hashar: contint2001 : docker container prune && docker image prune
  • 12:07 hashar: contint1001 deleting all the Docker images (they will be pulled as needed)
  • 12:04 hashar: contint1001 `docker image prune`
  • 11:51 hashar: Cleaning very old Docker images on contint1001.wikimedia.Org

2022-01-19

2022-01-18

  • 19:56 hashar: building Docker images for https://gerrit.wikimedia.org/r/754951
  • 18:01 taavi: added ryankemper as a member of the deployment-prep project
  • 15:00 hashar: Updating Jenkins jobs for Quibble 1.3.0 with proper PHP version in the images # T299389
  • 11:39 hashar: Rolling back Quibble 1.3.0 jobs due to php configuration files with at least releng/quibble-buster73:1.3.0 # T299389
  • 08:07 hashar: Updating Jenkins jobs for Quibble to pass `--parallel-npm-install` https://gerrit.wikimedia.org/r/c/integration/config/+/754569
  • 08:02 hashar: Updating Jenkins jobs for Quibble 1.3.0

2022-01-17

  • 16:28 hashar: Building Quibble 1.3.0 Docker images
  • 16:16 hashar: Tagged Quibble 1.3.0 @ 2b2c7f9a45 # T297480 T226869 T294931
  • 08:32 hashar: Refreshing all Jenkins jobs with jjb to take in account recent changes related to the Jinja2 docker macro

2022-01-14

  • 15:56 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/753981
  • 14:59 hashar: Starting VM integration-agent-docker-1022 which was in shutdown state since December and is Bullseye based # T290783
  • 13:49 hashar: Restarting all CI Docker agents via Horizon to apply new flavor settings T265615 T299211
  • 01:47 dancy: revert to scap 4.1.1-1+0~20220113154148.133~1.gbp6e3a17 in beta

2022-01-13

  • 18:02 dancy: Updating scap to 4.1.1-1+0~20220113154506.135~1.gbp523480 on all beta hosts
  • 17:54 dancy: Reloading Zuul to deploy https://gerrit.wikimedia.org/r/753792
  • 16:27 dancy: testing scap prep auto on deployment-deploy03
  • 15:52 dancy: Update scap to 4.1.1-1+0~20220113154506.135~1.gbp523480 on deployment-deploy03
  • 11:27 hashar: Updating Jenkins job to normalize usage of `docker run --workdir` https://gerrit.wikimedia.org/r/c/integration/config/+/753457
  • 10:52 hashar: Restarting Jenkins CI for plugins update
  • 10:42 hashar: Applied Jenkins built-in node migration to CI Jenkins (`master` > `built-in` renaming) # T298691
  • 10:14 taavi: cancelled stuck deployment-prep jobs on jenkins

2022-01-12

2022-01-11

  • 09:18 hashar: Updating all Jenkins jobs following recent "noop" refactorings

2022-01-10

  • 17:13 dancy: Update beta scap to 4.1.0-1+0~20220107203309.130~1.gbpcd0ace
  • 14:01 James_F: Zuul: Add gate-and-submit-l10n to Isa for T222291

2022-01-05

2022-01-04

2022-01-03

  • 14:37 hashar: Upgraded Java 11 on contint2001 && contint1001. Restarted CI Jenkins.
  • 14:35 hashar: Upgraded Java 11 on releases1002 && releases2002


Archives