You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Server Admin Log: Difference between revisions

From Wikitech-static
Jump to navigation Jump to search
imported>Labslogbot
(uranium - deleted apache logs older than 90 days (mutante))
imported>Stashbot
(brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1080.eqiad.wmnet with OS bullseye)
 
Line 1: Line 1:
== July 9 ==
== 2023-02-03 ==
* 01:07 mutante: uranium - deleted apache logs older than 90 days
* 00:35 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1080.eqiad.wmnet with OS bullseye
* 00:45 RoanKattouw: Running populateContentModel.php --wiki=cawiki --table=revision --ns=5
* 00:20 RoanKattouw: Ran populateContentModel.php --table=revision for odd-numbered namespaces on officewiki for T105245


== July 8 ==
== 2023-02-02 ==
* 23:07 logmsgbot: catrope Synchronized php-1.26wmf13/extensions/Flow: SWAT (duration: 00m 14s)
* 22:58 brett@cumin2002: END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp1080.eqiad.wmnet with OS bullseye
* 23:06 bd808: Restarted logstash on logstash1001; no hhvm input seen for last hour
* 22:15 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp1079.eqiad.wmnet
* 22:56 gwicke: finished rolling restart of cassandra cluster to apply https://gerrit.wikimedia.org/r/#/c/223495/
* 22:12 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1079.eqiad.wmnet with OS bullseye
* 22:45 mutante: zirconium - stop puppet for role switch
* 22:01 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1080.eqiad.wmnet with OS bullseye
* 22:33 logmsgbot: legoktm Synchronized php-1.26wmf13/includes/changes/EnhancedChangesList.php: Unbreak missing flags in enhanced RC (duration: 00m 12s)
* 22:00 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp1078.eqiad.wmnet
* 22:08 logmsgbot: hoo Synchronized php-1.26wmf13/extensions/Wikidata/: Update Wikibase: Fix JavaScript ULS usage (duration: 00m 20s)
* 21:58 zabe@deploy1002: Finished scap: Backport for [[gerrit:886149{{!}}Stop writing to cuc_comment everywhere (T233004)]] (duration: 07m 58s)
* 21:51 logmsgbot: manybubbles Synchronized php-1.26wmf12/extensions/CirrusSearch/: Stop some fatals in cirrus (duration: 00m 13s)
* 21:52 zabe@deploy1002: zabe: Backport for [[gerrit:886149{{!}}Stop writing to cuc_comment everywhere (T233004)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet
* 21:41 logmsgbot: bd808 Synchronized php-1.26wmf13/includes/api/ApiMain.php: Revert Count API module instantiations and Hook runs (2/2) (duration: 00m 12s)
* 21:50 zabe@deploy1002: Started scap: Backport for [[gerrit:886149{{!}}Stop writing to cuc_comment everywhere (T233004)]]
* 21:40 logmsgbot: bd808 Synchronized php-1.26wmf13/includes/Hooks.php: Revert Count API module instantiations and Hook runs (1/2) (duration: 00m 12s)
* 21:47 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1078.eqiad.wmnet with OS bullseye
* 21:39 logmsgbot: bd808 Synchronized php-1.26wmf13/extensions/CirrusSearch/includes/CirrusSearch.php: Suppress interwiki results when they would break (duration: 00m 12s)
* 21:47 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1079.eqiad.wmnet with reason: host reimage
* 21:08 bblack: graphite: wiped /var/log/upstart/statsite* logs, restarted statsite processes
* 21:44 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp1079.eqiad.wmnet with reason: host reimage
* 20:56 csteipp: deployed patches for T103022 & T103023
* 21:30 brennen: end of utc late backport & config window
* 20:53 csteipp: deployed patch for T94116 for wmf12/wmf13
* 21:30 brennen@deploy1002: Finished scap: Backport for [[gerrit:886118{{!}}Enable client preferences everywhere (T327979)]] (duration: 11m 14s)
* 20:30 gwicke: added explicit exit 1 in /etc/init.d/cassandra on restbase1008 to prevent cassandra from starting up there; is puppet restarting it?
* 21:23 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1078.eqiad.wmnet with reason: host reimage
* 20:29 subbu: deployed parsoid sha c4cfc527
* 21:22 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1079.eqiad.wmnet with OS bullseye
* 20:15 gwicke: bounced cassandra on restbase1001
* 21:22 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp1077.eqiad.wmnet
* 20:05 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Jul  8 20:05:09 UTC 2015 (duration 5m 8s)
* 21:21 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1077.eqiad.wmnet with OS bullseye
* 19:32 gwicke: stopped cassandra on restbase1008
* 21:21 brennen@deploy1002: brennen and nray: Backport for [[gerrit:886118{{!}}Enable client preferences everywhere (T327979)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet
* 19:27 logmsgbot: twentyafterfour Synchronized php-1.26wmf13: deploying UniversalLanguageSelector commit 2e0990ac9879 (duration: 01m 58s)
* 21:20 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp1078.eqiad.wmnet with reason: host reimage
* 19:26 urandom: restbase rolling restart
* 21:19 brennen@deploy1002: Started scap: Backport for [[gerrit:886118{{!}}Enable client preferences everywhere (T327979)]]
* 18:21 jgage: ran 'kafka preferred-replica-election' to promote analytics1021 back to Leader
* 21:18 brennen@deploy1002: Finished scap: Backport for [[gerrit:885359{{!}}Disable write old for CheckUserLog reason everywhere (T233004)]] (duration: 12m 02s)
* 18:05 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group1 wikis to 1.26wmf13
* 21:07 brennen@deploy1002: brennen and dreamyjazz: Backport for [[gerrit:885359{{!}}Disable write old for CheckUserLog reason everywhere (T233004)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet
* 17:16 moritzm: installed libwmf security updates on various systems
* 21:06 brennen@deploy1002: Started scap: Backport for [[gerrit:885359{{!}}Disable write old for CheckUserLog reason everywhere (T233004)]]
* 17:09 gwicke: bounced cassandra on restbase1004
* 20:59 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1078.eqiad.wmnet with OS bullseye
* 15:25 mutante: handing over adminship of the "test" mailman list to John F. Lewis (was: Thehelpfulone) due to inactivity
* 20:59 brett@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1078.eqiad.wmnet with OS bullseye
* 13:36 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: raise db1041 load (duration: 00m 13s)
* 20:52 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1077.eqiad.wmnet with reason: host reimage
* 12:58 paravoid: manually dpkg -P ferm on potassium
* 20:49 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp1077.eqiad.wmnet with reason: host reimage
* 12:52 paravoid: rmmod all iptables/netfilter-related modules from potassium
* 20:28 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1078.eqiad.wmnet with OS bullseye
* 11:23 godog: bounce cassandra on restbase1004, heap space
* 20:28 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp1077.eqiad.wmnet with OS bullseye
* 11:12 _joe_: mw1153 passed the smoke tests, repooling
* 20:23 rzl: rzl@apt1001:~$ sudo -i reprepro -C main include bullseye-wikimedia /home/rzl/httpbb/bullseye/httpbb_0.0.3-1+deb11u1_amd64.changes  # [[phab:T328280|T328280]]
* 11:08 godog: bounce cassandra on restbase1004 and restbase1005 'cannot achieve consistency level quorum'
* 20:21 rzl: rzl@apt1001:~$ sudo -i reprepro -C main include buster-wikimedia /home/rzl/httpbb/buster/httpbb_0.0.3-1_amd64.changes  # [[phab:T328280|T328280]]
* 10:50 godog: bounce cassandra on restbase1004, death by compaction
* 20:11 zabe@deploy1002: Finished scap: Backport for [[gerrit:886135{{!}}Stop writing to cuc_user and cuc_user_text everywhere (T233004)]] (duration: 09m 39s)
* 09:43 ori: _joe_: starting reimaging of mw1153, depooling it and scheduling downtime (at 9:21 UTC)
* 20:03 zabe@deploy1002: zabe: Backport for [[gerrit:886135{{!}}Stop writing to cuc_user and cuc_user_text everywhere (T233004)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet
* 09:42 ori: Nuked /var/lib/carbon/whisper/ResourceLoader on graphite[12]001. Data prior to rollout of I55f0c44cd considered bogus.
* 20:02 bking@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host elastic2037.codfw.wmnet
* 09:42 ori: morebots, are you OK?
* 20:01 zabe@deploy1002: Started scap: Backport for [[gerrit:886135{{!}}Stop writing to cuc_user and cuc_user_text everywhere (T233004)]]
* 09:41 godog: bounce nutcracker on silver
* 19:55 bking@cumin1001: START - Cookbook sre.hosts.reboot-single for host elastic2037.codfw.wmnet
* 09:33 _joe_: starting reimaging of mw1153, depooling it and scheduling downtime (at 9:21 UTC)
* 19:54 ryankemper: [[phab:T328674|T328674]] [Elastic] With puppet disabled on elastic* fleet, `ryankemper@elastic2037:~$ sudo run-puppet-agent --force` to verify changes in https://gerrit.wikimedia.org/r/886055
* 09:26 hashar: upgraded plugins on jenkins and restarting it
* 19:30 dancy@deploy1002: rebuilt and synchronized wikiversions files: group2 wikis to 1.40.0-wmf.21  refs [[phab:T325584|T325584]]
* 09:06 hashar: Jenkins registering jobs with Zuul
* 19:28 zabe@deploy1002: say aborted:  (duration: 00m 03s)
* 08:41 hashar: Jenkins is migrating old build histories. Lot of disk IO happening
* 18:42 zabe@deploy1002: Finished scap: Backport for [[gerrit:886127{{!}}Stop writing to cuc_comment in group1 wikis (T233004)]] (duration: 08m 19s)
* 08:11 hashar: shutdowning Jenkins for upgrade.
* 18:36 zabe@deploy1002: zabe: Backport for [[gerrit:886127{{!}}Stop writing to cuc_comment in group1 wikis (T233004)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet
* 05:57 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Jul  8 05:57:10 UTC 2015 (duration 57m 9s)
* 18:34 zabe@deploy1002: Started scap: Backport for [[gerrit:886127{{!}}Stop writing to cuc_comment in group1 wikis (T233004)]]
* 05:46 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1041, warm up (duration: 00m 13s)
* 18:08 aokoth@cumin1001: END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Upgrade GitLab Production (gitlab1004) to 15.7.6-ce.0
* 02:31 logmsgbot: LocalisationUpdate completed (1.26wmf13) at 2015-07-08 02:31:24+00:00
* 18:08 bd808@deploy1002: helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply
* 02:16 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-08 02:16:50+00:00
* 18:08 bd808@deploy1002: helmfile [eqiad] START helmfile.d/services/developer-portal: apply
* 02:16 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 00m 48s)
* 18:08 jiji@cumin1001: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc2043.codfw.wmnet with OS bullseye
* 18:07 bd808@deploy1002: helmfile [codfw] DONE helmfile.d/services/developer-portal: apply
* 18:06 bd808@deploy1002: helmfile [codfw] START helmfile.d/services/developer-portal: apply
* 18:05 bd808@deploy1002: helmfile [staging] DONE helmfile.d/services/developer-portal: apply
* 18:05 bd808@deploy1002: helmfile [staging] START helmfile.d/services/developer-portal: apply
* 18:03 jiji@cumin1001: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1037.eqiad.wmnet with OS bullseye
* 17:52 jiji@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2043.codfw.wmnet with reason: host reimage
* 17:49 jiji@cumin1001: START - Cookbook sre.hosts.downtime for 2:00:00 on mc2043.codfw.wmnet with reason: host reimage
* 17:47 jiji@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1037.eqiad.wmnet with reason: host reimage
* 17:45 jiji@cumin1001: START - Cookbook sre.hosts.downtime for 2:00:00 on mc1037.eqiad.wmnet with reason: host reimage
* 17:33 jiji@cumin1001: START - Cookbook sre.hosts.reimage for host mc2043.codfw.wmnet with OS bullseye
* 17:32 jiji@cumin1001: START - Cookbook sre.hosts.reimage for host mc1037.eqiad.wmnet with OS bullseye
* 17:29 aokoth@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Upgrade GitLab Production (gitlab1004) to 15.7.6-ce.0
* 17:12 elukey@deploy1002: helmfile [eqiad] DONE helmfile.d/services/changeprop: sync
* 17:12 elukey@deploy1002: helmfile [eqiad] START helmfile.d/services/changeprop: sync
* 16:53 mvolz@deploy1002: helmfile [eqiad] DONE helmfile.d/services/zotero: apply
* 16:52 mvolz@deploy1002: helmfile [eqiad] START helmfile.d/services/zotero: apply
* 16:51 mvolz@deploy1002: helmfile [codfw] DONE helmfile.d/services/zotero: apply
* 16:50 dancy@deploy1002: Installation of scap version "4.34.0" completed for 561 hosts
* 16:50 mvolz@deploy1002: helmfile [codfw] START helmfile.d/services/zotero: apply
* 16:50 dancy@deploy1002: Installing scap version "4.34.0" for 561 hosts
* 16:50 mvolz@deploy1002: helmfile [staging] DONE helmfile.d/services/zotero: apply
* 16:49 mvolz@deploy1002: helmfile [staging] START helmfile.d/services/zotero: apply
* 16:48 mvolz@deploy1002: helmfile [staging] DONE helmfile.d/services/zotero: apply
* 16:48 mvolz@deploy1002: helmfile [staging] START helmfile.d/services/zotero: apply
* 16:47 elukey@deploy1002: helmfile [codfw] DONE helmfile.d/services/changeprop: sync
* 16:46 elukey@deploy1002: helmfile [codfw] START helmfile.d/services/changeprop: sync
* 16:25 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2007.codfw.wmnet
* 16:18 mvolz@deploy1002: helmfile [eqiad] DONE helmfile.d/services/zotero: apply
* 16:17 mvolz@deploy1002: helmfile [eqiad] START helmfile.d/services/zotero: apply
* 16:17 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2007.codfw.wmnet
* 16:17 mvolz@deploy1002: helmfile [codfw] DONE helmfile.d/services/zotero: apply
* 16:16 mvolz@deploy1002: helmfile [codfw] START helmfile.d/services/zotero: apply
* 16:16 mvolz@deploy1002: helmfile [staging] DONE helmfile.d/services/zotero: apply
* 16:15 mvolz@deploy1002: helmfile [staging] START helmfile.d/services/zotero: apply
* 16:10 volans: uploaded python3-wmflib_1.2.1 to apt.wikimedia.org buster-wikimedia,bullseye-wikimedia
* 16:10 dzahn@cumin1001: END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab Replica gitlab2002 to 15.7.6-ce.0
* 15:40 jnuche@deploy1002: Finished deploy [releng/jenkins-deploy@e38efa6] (releasing): (no justification provided) (duration: 07m 01s)
* 15:38 aokoth@cumin1001: END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab2002.wikimedia.org with reason: Security Release
* 15:37 aokoth@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Security Release
* 15:35 aokoth@cumin1001: END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab2002.wikimedia.org with reason: Security Release
* 15:35 aokoth@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Security Release
* 15:34 dzahn@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrade GitLab Replica gitlab2002 to 15.7.6-ce.0
* 15:33 jnuche@deploy1002: Started deploy [releng/jenkins-deploy@e38efa6] (releasing): (no justification provided)
* 15:24 jmm@cumin2002: END (FAIL) - Cookbook sre.network.configure-switch-interfaces (exit_code=99) for host ganeti3004
* 15:17 jmm@cumin2002: START - Cookbook sre.network.configure-switch-interfaces for host ganeti3004
* 15:06 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2006.codfw.wmnet
* 15:03 jmm@cumin2002: END (PASS) - Cookbook sre.dns.netbox (exit_code=0)
* 15:03 jmm@cumin2002: END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast3004 was renamed as ganeti4004 - jmm@cumin2002"
* 15:02 jmm@cumin2002: START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast3004 was renamed as ganeti4004 - jmm@cumin2002"
* 15:00 vgutierrez: rolling restart of varnish in cache::text - [[phab:T315676|T315676]]
* 14:59 jmm@cumin2002: START - Cookbook sre.dns.netbox
* 14:59 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2006.codfw.wmnet
* 14:55 cgoubert@deploy1002: helmfile [staging] DONE helmfile.d/services/zotero: apply
* 14:45 cgoubert@deploy1002: helmfile [staging] START helmfile.d/services/zotero: apply
* 14:39 cgoubert@deploy1002: helmfile [staging] DONE helmfile.d/services/zotero: apply
* 14:31 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2005.codfw.wmnet
* 14:29 cgoubert@deploy1002: helmfile [staging] START helmfile.d/services/zotero: apply
* 14:25 moritzm: installing containerd security updates on codfw k8s nodes
* 14:24 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2005.codfw.wmnet
* 13:34 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp1076.eqiad.wmnet,service=ats-be
* 13:34 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp1076.eqiad.wmnet,service=cdn
* 13:10 kharlan:: Deployed security patch for [[phab:T328643|T328643]]
* 13:09 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1076.eqiad.wmnet with OS bullseye
* 13:04 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' .
* 13:03 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' .
* 13:03 kharlan:: Deployed security patch for [[phab:T328643|T328643]]
* 13:02 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' .
* 13:01 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2004.codfw.wmnet
* 13:00 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' .
* 12:55 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2004.codfw.wmnet
* 12:47 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1076.eqiad.wmnet with reason: host reimage
* 12:47 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' .
* 12:46 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' .
* 12:44 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp1076.eqiad.wmnet with reason: host reimage
* 12:42 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' .
* 12:42 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' .
* 12:39 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' .
* 12:39 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' .
* 12:29 btullis@deploy1002: Finished deploy [analytics/superset/deploy@5175ad7]: Production deployment for numpy downgrade (duration: 00m 42s)
* 12:29 claime: Work ongoing on m2 and m3
* 12:29 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2003.codfw.wmnet
* 12:29 btullis@deploy1002: Started deploy [analytics/superset/deploy@5175ad7]: Production deployment for numpy downgrade
* 12:23 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp1076.eqiad.wmnet with OS bullseye
* 12:22 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2003.codfw.wmnet
* 12:08 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' .
* 12:08 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' .
* 11:46 isaranto@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' .
* 11:42 mvolz@deploy1002: helmfile [eqiad] DONE helmfile.d/services/citoid: apply
* 11:42 mvolz@deploy1002: helmfile [eqiad] START helmfile.d/services/citoid: apply
* 11:41 mvolz@deploy1002: helmfile [eqiad] DONE helmfile.d/services/citoid: apply
* 11:41 mvolz@deploy1002: helmfile [eqiad] START helmfile.d/services/citoid: apply
* 11:40 mvolz@deploy1002: helmfile [codfw] DONE helmfile.d/services/citoid: apply
* 11:39 mvolz@deploy1002: helmfile [codfw] START helmfile.d/services/citoid: apply
* 11:38 mvolz@deploy1002: helmfile [staging] DONE helmfile.d/services/citoid: apply
* 11:37 mvolz@deploy1002: helmfile [staging] START helmfile.d/services/citoid: apply
* 11:37 Lucas_WMDE: lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix {{!}} tee [[phab:T328634|T328634]]-namespaceDupes-4.out # [[phab:T328634|T328634]] – made some progress then errored out again
* 11:32 Lucas_WMDE: lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix --add-prefix=[[phab:T328634|T328634]]/ {{!}} tee [[phab:T328634|T328634]]-namespaceDupes-3.out # [[phab:T328634|T328634]] – seemed to finish the first 20 pages and then go into an infinite loop, I Ctrl+Ced it
* 11:28 Lucas_WMDE: lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix --add-prefix=[[phab:T328634|T328634]]/ {{!}} tee [[phab:T328634|T328634]]-namespaceDupes-2.out # [[phab:T328634|T328634]] – another error but made more progress
* 11:23 Lucas_WMDE: lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix {{!}} tee [[phab:T328634|T328634]]-namespaceDupes.out # [[phab:T328634|T328634]] – failed quickly, details in task
* 11:22 elukey@deploy1002: helmfile [staging] DONE helmfile.d/services/changeprop: sync
* 11:22 elukey@deploy1002: helmfile [staging] START helmfile.d/services/changeprop: sync
* 11:12 mvolz@deploy1002: helmfile [staging] DONE helmfile.d/services/zotero: apply
* 11:02 mvolz@deploy1002: helmfile [staging] START helmfile.d/services/zotero: apply
* 10:27 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2002.codfw.wmnet
* 10:19 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2002.codfw.wmnet
* 10:17 isaranto@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' .
* 10:11 moritzm: restarting FPM on mw canaries to pick up tiff security updates
* 10:04 moritzm: installing tiff security updates
* 09:59 btullis@cumin1001: END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2001.codfw.wmnet
* 09:55 elukey@deploy1002: helmfile [eqiad] DONE helmfile.d/services/eventgate-main: sync
* 09:54 elukey@deploy1002: helmfile [eqiad] START helmfile.d/services/eventgate-main: sync
* 09:51 btullis@cumin1001: START - Cookbook sre.hosts.reboot-single for host aqs2001.codfw.wmnet
* 09:40 elukey@deploy1002: helmfile [codfw] DONE helmfile.d/services/eventgate-main: sync
* 09:40 elukey@deploy1002: helmfile [codfw] START helmfile.d/services/eventgate-main: sync
* 09:19 ayounsi@cumin1001: END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 398143
* 09:19 ayounsi@cumin1001: START - Cookbook sre.network.peering with action 'configure' for AS: 398143
* 09:16 jelto@cumin1001: END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Upgrade GitLab Replica gitlab1004 to 15.7.6
* 09:13 apergos: UTC morning backport and config training window done
* 09:13 elukey@deploy1002: helmfile [staging] DONE helmfile.d/services/eventgate-main: sync
* 09:12 elukey@deploy1002: helmfile [staging] START helmfile.d/services/eventgate-main: sync
* 09:11 elukey: roll restart of eventgate-main pods in wikikube eqiad/codfw to pick up new stream configs - [[phab:T328576|T328576]]
* 08:57 ariel@deploy1002: Finished scap: Backport for [[gerrit:885927{{!}}Enable wgMinervaEnableSiteNotice for bnwiktionary (T328630)]] (duration: 10m 56s)
* 08:48 ariel@deploy1002: ariel and aishik: Backport for [[gerrit:885927{{!}}Enable wgMinervaEnableSiteNotice for bnwiktionary (T328630)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet
* 08:46 ariel@deploy1002: Started scap: Backport for [[gerrit:885927{{!}}Enable wgMinervaEnableSiteNotice for bnwiktionary (T328630)]]
* 08:39 jelto@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Upgrade GitLab Replica gitlab1004 to 15.7.6
* 08:37 tgr@deploy1002: Finished scap: Backport for [[gerrit:885928{{!}}campaigns: Donor landing page translations for sv, it, ja, fr, nl (T321370)]], [[gerrit:885929{{!}}campaigns: Donor landing page translations for sv, it, ja, fr, nl (T321370)]] (duration: 14m 26s)
* 08:27 tgr@deploy1002: tgr: Backport for [[gerrit:885928{{!}}campaigns: Donor landing page translations for sv, it, ja, fr, nl (T321370)]], [[gerrit:885929{{!}}campaigns: Donor landing page translations for sv, it, ja, fr, nl (T321370)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet
* 08:23 tgr@deploy1002: Started scap: Backport for [[gerrit:885928{{!}}campaigns: Donor landing page translations for sv, it, ja, fr, nl (T321370)]], [[gerrit:885929{{!}}campaigns: Donor landing page translations for sv, it, ja, fr, nl (T321370)]]
* 06:17 kart_: Updated cxserver to 2023-02-02-004918-production ([[phab:T129470|T129470]], [[phab:T172035|T172035]], [[phab:T327842|T327842]])
* 06:16 kartik@deploy1002: helmfile [eqiad] DONE helmfile.d/services/cxserver: apply
* 06:15 kartik@deploy1002: helmfile [eqiad] START helmfile.d/services/cxserver: apply
* 06:13 kartik@deploy1002: helmfile [codfw] DONE helmfile.d/services/cxserver: apply
* 06:12 kartik@deploy1002: helmfile [codfw] START helmfile.d/services/cxserver: apply
* 06:09 kartik@deploy1002: helmfile [staging] DONE helmfile.d/services/cxserver: apply
* 06:09 kartik@deploy1002: helmfile [staging] START helmfile.d/services/cxserver: apply
* 04:00 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp5024.eqsin.wmnet
* 03:22 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5024.eqsin.wmnet with OS bullseye
* 03:21 ejegg: payments-wiki upgraded from {{Gerrit|f20a2208}} to {{Gerrit|53d1a58d}}
* 02:49 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5024.eqsin.wmnet with reason: host reimage
* 02:46 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5024.eqsin.wmnet with reason: host reimage
* 02:14 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp5024.eqsin.wmnet with OS bullseye
* 02:14 brett@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5024.eqsin.wmnet with OS bullseye
* 01:56 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp5024.eqsin.wmnet with OS bullseye
* 01:55 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp5023.eqsin.wmnet
* 01:55 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5023.eqsin.wmnet with OS bullseye
* 01:50 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp1075.eqiad.wmnet,service=ats-be
* 01:50 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp1075.eqiad.wmnet,service=cdn
* 01:49 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1075.eqiad.wmnet with OS bullseye
* 01:27 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1075.eqiad.wmnet with reason: host reimage
* 01:24 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp1075.eqiad.wmnet with reason: host reimage
* 01:21 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5023.eqsin.wmnet with reason: host reimage
* 01:18 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5023.eqsin.wmnet with reason: host reimage
* 01:07 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp1075.eqiad.wmnet with OS bullseye
* 00:44 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp5023.eqsin.wmnet with OS bullseye
* 00:06 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp5022.eqsin.wmnet
* 00:04 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5022.eqsin.wmnet with OS bullseye


== July 7 ==
== 2023-02-01 ==
* 23:54 jgage: kafka brokers 1018 & 1021 were demoted; i have triggered a leader election and they are leaders again
* 23:45 zabe@deploy1002: Finished scap: Backport for [[gerrit:885908{{!}}Stop writing to cuc_user and cuc_user_text in group1 wikis (T233004)]] (duration: 08m 07s)
* 23:05 logmsgbot: catrope Synchronized visualeditor-default.dblist: Enable VE by default on labswiki (duration: 00m 12s)
* 23:39 zabe@deploy1002: zabe: Backport for [[gerrit:885908{{!}}Stop writing to cuc_user and cuc_user_text in group1 wikis (T233004)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet
* 21:56 hoo: Restarted hhvm on mw1003 "Fatal error: Function already defined: wmfLoadInitialiseSettings in /srv/mediawiki/wmf-config/CommonSettings.php on line 187"
* 23:37 zabe@deploy1002: Started scap: Backport for [[gerrit:885908{{!}}Stop writing to cuc_user and cuc_user_text in group1 wikis (T233004)]]
* 21:16 logmsgbot: krinkle Synchronized php-1.26wmf13/includes/resourceloader/ResourceLoader.php: T104769 (duration: 00m 13s)
* 23:31 rzl@cumin2002: dbctl commit (dc=all): 'Depool db2181', diff saved to https://phabricator.wikimedia.org/P43574 and previous config saved to /var/cache/conftool/dbconfig/20230201-233140-rzl.json
* 20:53 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf13
* 23:31 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage
* 20:00 logmsgbot: twentyafterfour Finished scap: testwiki to php-1.26wmf13 and rebuild l10n cache (duration: 39m 41s)
* 23:27 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage
* 19:47 gwicke: restarted cassandra on restbase1005
* 23:19 dzahn@cumin1001: END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab2002.wikimedia.org with reason: security release
* 19:20 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf13 and rebuild l10n cache
* 23:17 dancy@deploy1002: Synchronized php: group1 wikis to 1.40.0-wmf.21  refs [[phab:T325584|T325584]] (duration: 06m 57s)
* 19:15 moritzm: installed PHP security updates on all trusty hosts
* 23:10 dancy@deploy1002: rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.21 refs [[phab:T325584|T325584]]
* 18:58 ejegg: updated payments from a17ee221db0dbde70c92e24fc188379b6dbad613 to ec34ebf61e5962f66b807abdcb519ff323d41e8e
* 23:01 zabe@deploy1002: Finished scap: Backport for [[gerrit:885781{{!}}CachingKartographerEmbeddingHandler: Fall back to Special:BlankPage title (T328601)]] (duration: 07m 45s)
* 18:08 twentyafterfour: restarted apache2 on iridium (phab hotfix)
* 22:55 zabe@deploy1002: zabe: Backport for [[gerrit:885781{{!}}CachingKartographerEmbeddingHandler: Fall back to Special:BlankPage title (T328601)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet
* 17:10 robh: OTRS update appears to be functioning normallyAs such, ending maintenance window.
* 22:54 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye
* 17:06 robh: otrs is now using the new sha256 cert
* 22:53 zabe@deploy1002: Started scap: Backport for [[gerrit:885781{{!}}CachingKartographerEmbeddingHandler: Fall back to Special:BlankPage title (T328601)]]
* 17:00 robh: starting otrs maint window
* 22:49 zabe@deploy1002: Finished scap: Backport for [[gerrit:885898{{!}}Stop writing to cuc_comment_id in group0 wikis (T233004)]] (duration: 13m 03s)
* 16:58 _joe_: restarted HHVM on mw1026, near to OOM
* 22:47 dzahn@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: security release
* 16:47 twentyafterfour: applied hotfix for phabricator bug: https://secure.phabricator.com/D13544
* 22:40 brett@cumin2002: END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp5022.eqsin.wmnet with OS bullseye
* 16:36 mutante: protactinium - manual iptables rules replaced by puppet/ferm rules
* 22:38 zabe@deploy1002: zabe: Backport for [[gerrit:885898{{!}}Stop writing to cuc_comment_id in group0 wikis (T233004)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet
* 16:11 logmsgbot: thcipriani Synchronized php-1.26wmf12/extensions/ContentTranslation/extension.json: Remove default value for ContentTranslationCampaigns (duration: 00m 12s)
* 22:36 zabe@deploy1002: Started scap: Backport for [[gerrit:885898{{!}}Stop writing to cuc_comment_id in group0 wikis (T233004)]]
* 15:33 jynus: manually editing table mediawiki.ipblocks to fully solve a former software bug
* 22:32 kindrobot: close UTC late backport window
* 15:12 Jeff_Green: ptr records for frack/codfw and authdns-update
* 22:31 kindrobot@deploy1002: Finished scap: Backport for [[gerrit:885841{{!}}Enable client preferences for group1 (T327979)]] (duration: 10m 37s)
* 15:10 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Enable ContentTranslation in enwiki [[gerrit:222991]] (duration: 00m 13s)
* 22:22 kindrobot@deploy1002: nray and kindrobot: Backport for [[gerrit:885841{{!}}Enable client preferences for group1 (T327979)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet
* 14:21 jynus: dropping optin_survey_old table from enwiki
* 22:21 kindrobot@deploy1002: Started scap: Backport for [[gerrit:885841{{!}}Enable client preferences for group1 (T327979)]]
* 13:23 akosiaris: restarting gitblit on antimony
* 22:14 kindrobot@deploy1002: Finished scap: Backport for [[gerrit:885852{{!}}Enable Linter write namespace, tag and template for all wikis (T299612)]] (duration: 18m 14s)
* 11:31 mobrovac: restbase restarted cassandra on rb1005
* 21:57 kindrobot@deploy1002: kindrobot and sbailey: Backport for [[gerrit:885852{{!}}Enable Linter write namespace, tag and template for all wikis (T299612)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet
* 11:26 godog: restart cassandra on restbase1004, heap exhausted
* 21:57 eevans@cumin1001: END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore100*: Applying new TLS certificates — [[phab:T327675|T327675]] - eevans@cumin1001
* 10:49 godog: restarted cassandra on restbase1005, mutations through the roof
* 21:56 kindrobot@deploy1002: Started scap: Backport for [[gerrit:885852{{!}}Enable Linter write namespace, tag and template for all wikis (T299612)]]
* 08:27 godog: set operations/puppet/cassandra git submodule repo as hidden
* 21:53 aokoth@cumin1001: END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab2002.wikimedia.org with reason: Security Release
* 06:11 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Jul  7 06:11:46 UTC 2015 (duration 11m 45s)
* 21:52 kindrobot@deploy1002: Finished scap: Backport for [[gerrit:885358{{!}}Disable write old for CheckUserLog reason on group 0 (T233004)]] (duration: 14m 53s)
* 05:51 logmsgbot: krinkle Synchronized php-1.26wmf12/extensions/WikiEditor/modules/jquery.wikiEditor.toolbar.js: I3e965dda1c4 (duration: 00m 12s)
* 21:43 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-07 02:27:55+00:00
* 21:39 eevans@cumin1001: START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore100*: Applying new TLS certificates — [[phab:T327675|T327675]] - eevans@cumin1001
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 06m 09s)
* 21:39 kindrobot@deploy1002: dreamyjazz and kindrobot: Backport for [[gerrit:885358{{!}}Disable write old for CheckUserLog reason on group 0 (T233004)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet
* 01:12 ori: Re-pooled mw1152 at 20:46 UTC, did not log it then.
* 21:37 kindrobot@deploy1002: Started scap: Backport for [[gerrit:885358{{!}}Disable write old for CheckUserLog reason on group 0 (T233004)]]
* 00:41 springle: upgrade db1041 trusty
* 21:32 kindrobot@deploy1002: Finished scap: Backport for [[gerrit:865214{{!}}Disable wgParserEnableLegacyMediaDOM on group1 wikis (T314318)]] (duration: 13m 56s)
* 00:37 logmsgbot: krenair Synchronized php-1.26wmf12/extensions/CentralAuth/includes/CreateLocalAccountJob.php: https://gerrit.wikimedia.org/r/#/c/223211/ (duration: 00m 13s)
* 21:26 eevans@puppetmaster1001: conftool action : set/pooled=true; selector: dnsdisc=sessionstore,name=codfw
* 21:26 eevans@puppetmaster1001: conftool action : get/pooled=true; selector: dnsdisc=sessionstore,name=codfw
* 21:26 eevans@puppetmaster1001: conftool action : get/pooled=true; selector: dnsdisc=sessionstore,name=codfw
* 21:24 aokoth@cumin1001: START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Security Release
* 21:20 kindrobot@deploy1002: arlolra and kindrobot: Backport for [[gerrit:865214{{!}}Disable wgParserEnableLegacyMediaDOM on group1 wikis (T314318)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet
* 21:19 eevans@cumin1001: END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore200*: Applying new TLS certificates — [[phab:T327675|T327675]] - eevans@cumin1001
* 21:18 kindrobot@deploy1002: Started scap: Backport for [[gerrit:865214{{!}}Disable wgParserEnableLegacyMediaDOM on group1 wikis (T314318)]]
* 21:14 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3065.esams.wmnet
* 21:10 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3065.esams.wmnet with OS bullseye
* 21:03 kindrobot: start UTC late backport deployment window
* 21:02 eevans@cumin1001: START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore200*: Applying new TLS certificates — [[phab:T327675|T327675]] - eevans@cumin1001
* 20:46 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3065.esams.wmnet with reason: host reimage
* 20:44 eevans@puppetmaster1001: conftool action : set/pooled=false; selector: dnsdisc=sessionstore,name=codfw
* 20:43 urandom: depooling sessionstore —codfw— in preparation for Cassandra restarts — [[phab:T327675|T327675]]
* 20:42 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3065.esams.wmnet with reason: host reimage
* 20:40 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3064.esams.wmnet
* 20:38 eevans@puppetmaster1001: conftool action : get/pooled; selector: dnsdisc=$SERVICE,name=$DC
* 20:33 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3064.esams.wmnet with OS bullseye
* 20:22 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3065.esams.wmnet with OS bullseye
* 20:21 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3063.esams.wmnet
* 20:11 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3064.esams.wmnet with reason: host reimage
* 20:09 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3063.esams.wmnet with OS bullseye
* 20:08 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3064.esams.wmnet with reason: host reimage
* 20:03 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5031.eqsin.wmnet,service=ats-be
* 20:03 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5031.eqsin.wmnet,service=cdn
* 20:00 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5031.eqsin.wmnet with OS bullseye
* 19:53 dancy: The train is blocked on [[phab:T328601|T328601]]
* 19:49 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3064.esams.wmnet with OS bullseye
* 19:49 dancy@deploy1002: Synchronized php: group1 wikis to 1.40.0-wmf.20  refs [[phab:T325584|T325584]] (duration: 06m 36s)
* 19:49 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3062.esams.wmnet
* 19:48 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3062.esams.wmnet with OS bullseye
* 19:48 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3063.esams.wmnet with reason: host reimage
* 19:45 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3063.esams.wmnet with reason: host reimage
* 19:42 dancy@deploy1002: rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.20  refs [[phab:T325584|T325584]]
* 19:41 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5021.eqsin.wmnet,service=ats-be
* 19:41 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5021.eqsin.wmnet,service=cdn
* 19:37 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5021.eqsin.wmnet with OS bullseye
* 19:33 dancy@deploy1002: deploy-promote aborted:  (duration: 11m 58s)
* 19:33 dancy@deploy1002: sync-file aborted: group1 wikis to 1.40.0-wmf.21  refs [[phab:T325584|T325584]] (duration: 03m 38s)
* 19:30 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5031.eqsin.wmnet with reason: host reimage
* 19:29 dancy@deploy1002: rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.21  refs [[phab:T325584|T325584]]
* 19:27 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5031.eqsin.wmnet with reason: host reimage
* 19:26 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3062.esams.wmnet with reason: host reimage
* 19:24 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3063.esams.wmnet with OS bullseye
* 19:24 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3061.esams.wmnet
* 19:24 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3062.esams.wmnet with reason: host reimage
* 19:17 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3061.esams.wmnet with OS bullseye
* 19:04 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage
* 19:03 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3062.esams.wmnet with OS bullseye
* 19:02 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3060.esams.wmnet
* 19:02 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3060.esams.wmnet with OS bullseye
* 19:01 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage
* 18:56 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3061.esams.wmnet with reason: host reimage
* 18:55 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5031.eqsin.wmnet with OS bullseye
* 18:55 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5031.eqsin.wmnet with OS bullseye
* 18:52 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3061.esams.wmnet with reason: host reimage
* 18:47 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5031.eqsin.wmnet with OS bullseye
* 18:46 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5031.eqsin.wmnet with OS bullseye
* 18:39 jbond@cumin2002: END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts puppetmaster2003.codfw.wmnet
* 18:38 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3060.esams.wmnet with reason: host reimage
* 18:37 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5031.eqsin.wmnet with OS bullseye
* 18:35 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3060.esams.wmnet with reason: host reimage
* 18:32 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3061.esams.wmnet with OS bullseye
* 18:31 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3059.esams.wmnet
* 18:31 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3059.esams.wmnet with OS bullseye
* 18:29 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS bullseye
* 18:29 jbond@cumin2002: START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts puppetmaster2003.codfw.wmnet
* 18:29 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5021.eqsin.wmnet with OS bullseye
* 18:22 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS bullseye
* 18:21 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on cp1075.eqiad.wmnet with reason: downtimed for idrac firmware testing
* 18:20 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on cp1075.eqiad.wmnet with reason: downtimed for idrac firmware testing
* 18:19 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5030.eqsin.wmnet,service=ats-be
* 18:19 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5030.eqsin.wmnet,service=cdn
* 18:19 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5019.eqsin.wmnet,service=ats-be
* 18:19 sukhe@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp5019.eqsin.wmnet,service=cdn
* 18:13 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3060.esams.wmnet with OS bullseye
* 18:13 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3058.esams.wmnet
* 18:12 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3058.esams.wmnet with OS bullseye
* 18:10 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5030.eqsin.wmnet with OS bullseye
* 18:10 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P43573 and previous config saved to /var/cache/conftool/dbconfig/20230201-181036-root.json
* 18:10 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P43572 and previous config saved to /var/cache/conftool/dbconfig/20230201-181031-root.json
* 18:10 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P43571 and previous config saved to /var/cache/conftool/dbconfig/20230201-181024-root.json
* 18:10 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P43570 and previous config saved to /var/cache/conftool/dbconfig/20230201-181016-root.json
* 18:10 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P43569 and previous config saved to /var/cache/conftool/dbconfig/20230201-181011-root.json
* 18:06 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3059.esams.wmnet with reason: host reimage
* 18:03 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3059.esams.wmnet with reason: host reimage
* 17:55 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P43568 and previous config saved to /var/cache/conftool/dbconfig/20230201-175531-root.json
* 17:55 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P43567 and previous config saved to /var/cache/conftool/dbconfig/20230201-175526-root.json
* 17:55 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P43566 and previous config saved to /var/cache/conftool/dbconfig/20230201-175519-root.json
* 17:55 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P43565 and previous config saved to /var/cache/conftool/dbconfig/20230201-175511-root.json
* 17:55 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P43564 and previous config saved to /var/cache/conftool/dbconfig/20230201-175506-root.json
* 17:54 marostegui@cumin1001: dbctl commit (dc=all): 'db2106 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P43563 and previous config saved to /var/cache/conftool/dbconfig/20230201-175446-root.json
* 17:48 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3058.esams.wmnet with reason: host reimage
* 17:45 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3058.esams.wmnet with reason: host reimage
* 17:41 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3059.esams.wmnet with OS bullseye
* 17:40 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3057.esams.wmnet
* 17:40 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3057.esams.wmnet with OS bullseye
* 17:40 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43562 and previous config saved to /var/cache/conftool/dbconfig/20230201-174026-root.json
* 17:40 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43561 and previous config saved to /var/cache/conftool/dbconfig/20230201-174021-root.json
* 17:40 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43560 and previous config saved to /var/cache/conftool/dbconfig/20230201-174015-root.json
* 17:40 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43559 and previous config saved to /var/cache/conftool/dbconfig/20230201-174007-root.json
* 17:40 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43558 and previous config saved to /var/cache/conftool/dbconfig/20230201-174001-root.json
* 17:39 marostegui@cumin1001: dbctl commit (dc=all): 'db2106 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P43557 and previous config saved to /var/cache/conftool/dbconfig/20230201-173941-root.json
* 17:39 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5030.eqsin.wmnet with reason: host reimage
* 17:36 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5030.eqsin.wmnet with reason: host reimage
* 17:25 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43555 and previous config saved to /var/cache/conftool/dbconfig/20230201-172521-root.json
* 17:25 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43554 and previous config saved to /var/cache/conftool/dbconfig/20230201-172516-root.json
* 17:25 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43553 and previous config saved to /var/cache/conftool/dbconfig/20230201-172510-root.json
* 17:25 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43552 and previous config saved to /var/cache/conftool/dbconfig/20230201-172502-root.json
* 17:24 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43551 and previous config saved to /var/cache/conftool/dbconfig/20230201-172456-root.json
* 17:24 marostegui@cumin1001: dbctl commit (dc=all): 'db2106 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43550 and previous config saved to /var/cache/conftool/dbconfig/20230201-172436-root.json
* 17:23 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3058.esams.wmnet with OS bullseye
* 17:22 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3056.esams.wmnet
* 17:22 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3056.esams.wmnet with OS bullseye
* 17:19 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3057.esams.wmnet with reason: host reimage
* 17:17 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5019.eqsin.wmnet with OS bullseye
* 17:15 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3057.esams.wmnet with reason: host reimage
* 17:10 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43549 and previous config saved to /var/cache/conftool/dbconfig/20230201-171016-root.json
* 17:10 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43548 and previous config saved to /var/cache/conftool/dbconfig/20230201-171011-root.json
* 17:10 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43547 and previous config saved to /var/cache/conftool/dbconfig/20230201-171005-root.json
* 17:09 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43546 and previous config saved to /var/cache/conftool/dbconfig/20230201-170957-root.json
* 17:09 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43545 and previous config saved to /var/cache/conftool/dbconfig/20230201-170951-root.json
* 17:09 marostegui@cumin1001: dbctl commit (dc=all): 'db2106 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43544 and previous config saved to /var/cache/conftool/dbconfig/20230201-170931-root.json
* 16:57 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5030.eqsin.wmnet with OS bullseye
* 16:57 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5030.eqsin.wmnet with OS bullseye
* 16:57 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3056.esams.wmnet with reason: host reimage
* 16:55 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43543 and previous config saved to /var/cache/conftool/dbconfig/20230201-165512-root.json
* 16:55 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43542 and previous config saved to /var/cache/conftool/dbconfig/20230201-165506-root.json
* 16:55 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43541 and previous config saved to /var/cache/conftool/dbconfig/20230201-165500-root.json
* 16:54 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43540 and previous config saved to /var/cache/conftool/dbconfig/20230201-165452-root.json
* 16:54 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3056.esams.wmnet with reason: host reimage
* 16:54 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43539 and previous config saved to /var/cache/conftool/dbconfig/20230201-165446-root.json
* 16:54 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3057.esams.wmnet with OS bullseye
* 16:54 marostegui@cumin1001: dbctl commit (dc=all): 'db2106 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43538 and previous config saved to /var/cache/conftool/dbconfig/20230201-165426-root.json
* 16:42 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5030.eqsin.wmnet with OS bullseye
* 16:42 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5030.eqsin.wmnet with OS bullseye
* 16:40 marostegui@cumin1001: dbctl commit (dc=all): 'es2026 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P43536 and previous config saved to /var/cache/conftool/dbconfig/20230201-164007-root.json
* 16:40 marostegui@cumin1001: dbctl commit (dc=all): 'db2158 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P43535 and previous config saved to /var/cache/conftool/dbconfig/20230201-164002-root.json
* 16:39 marostegui@cumin1001: dbctl commit (dc=all): 'db2157 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P43534 and previous config saved to /var/cache/conftool/dbconfig/20230201-163955-root.json
* 16:39 marostegui@cumin1001: dbctl commit (dc=all): 'db2146 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P43533 and previous config saved to /var/cache/conftool/dbconfig/20230201-163947-root.json
* 16:39 marostegui@cumin1001: dbctl commit (dc=all): 'db2136 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P43532 and previous config saved to /var/cache/conftool/dbconfig/20230201-163941-root.json
* 16:39 marostegui@cumin1001: dbctl commit (dc=all): 'db2106 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43531 and previous config saved to /var/cache/conftool/dbconfig/20230201-163921-root.json
* 16:33 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5030.eqsin.wmnet with OS bullseye
* 16:33 brett@cumin2002: START - Cookbook sre.hosts.reimage for host cp3056.esams.wmnet with OS bullseye
* 16:31 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5030.eqsin.wmnet with OS bullseye
* 16:29 sukhe@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5019.eqsin.wmnet with reason: host reimage
* 16:26 sukhe@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp5019.eqsin.wmnet with reason: host reimage
* 16:25 jynus: reloaded apache on mailman
* 16:25 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5030.eqsin.wmnet with OS bullseye
* 16:23 elukey@deploy1002: helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' .
* 16:22 elukey@deploy1002: helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' .
* 16:15 cgoubert@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply
* 16:14 cgoubert@deploy1002: helmfile [eqiad] START helmfile.d/services/mw-debug: apply
* 16:14 cgoubert@deploy1002: helmfile [codfw] DONE helmfile.d/services/mw-debug: apply
* 16:13 cgoubert@deploy1002: helmfile [codfw] START helmfile.d/services/mw-debug: apply
* 15:53 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5019.eqsin.wmnet with OS bullseye
* 15:51 sukhe@cumin2002: END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5019.eqsin.wmnet with OS bullseye
* 15:31 sukhe@cumin2002: START - Cookbook sre.hosts.reimage for host cp5019.eqsin.wmnet with OS bullseye
* 14:56 sukhe: cp1075.eqiad.wmnet for idrac firmware upgrade testing
* 14:55 sukhe@puppetmaster1001: conftool action : set/pooled=no; selector: name=cp1075.eqiad.wmnet,service=ats-be
* 14:55 sukhe@puppetmaster1001: conftool action : set/pooled=no; selector: name=cp1075.eqiad.wmnet,service=cdn
* 14:52 awight: EU deployment window complete
* 14:48 ayounsi@deploy1002: helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.
* 14:48 awight@deploy1002: Finished scap: Backport for [[gerrit:884155{{!}}wmf-config: add new revision-score streams for EventGate main (T317768)]] (duration: 08m 25s)
* 14:47 ayounsi@deploy1002: helmfile [staging-codfw] START helmfile.d/admin 'apply'.
* 14:41 awight@deploy1002: elukey and awight: Backport for [[gerrit:884155{{!}}wmf-config: add new revision-score streams for EventGate main (T317768)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet
* 14:41 marostegui@cumin1001: dbctl commit (dc=all): 'Depool db2136 db2158 db2157 es2026 db2106 db2146 [[phab:T327404|T327404]]', diff saved to https://phabricator.wikimedia.org/P43530 and previous config saved to /var/cache/conftool/dbconfig/20230201-144152-root.json
* 14:40 ayounsi@deploy1002: helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.
* 14:40 ayounsi@deploy1002: helmfile [staging-codfw] START helmfile.d/admin 'apply'.
* 14:40 awight@deploy1002: Started scap: Backport for [[gerrit:884155{{!}}wmf-config: add new revision-score streams for EventGate main (T317768)]]
* 14:39 ayounsi@deploy1002: helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.
* 14:39 ayounsi@deploy1002: helmfile [staging-codfw] START helmfile.d/admin 'apply'.
* 14:37 awight@deploy1002: Finished scap: Backport for [[gerrit:885391{{!}}Add cswiki to desktop-improvements group. (T328154)]] (duration: 09m 22s)
* 14:29 awight@deploy1002: jdrewniak and awight: Backport for [[gerrit:885391{{!}}Add cswiki to desktop-improvements group. (T328154)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet
* 14:28 awight@deploy1002: Started scap: Backport for [[gerrit:885391{{!}}Add cswiki to desktop-improvements group. (T328154)]]
* 14:26 awight@deploy1002: Finished scap: Backport for [[gerrit:885798{{!}}Squashed diff to catch up to master]] (duration: 09m 07s)
* 14:19 awight@deploy1002: awight and mlitn: Backport for [[gerrit:885798{{!}}Squashed diff to catch up to master]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet
* 14:17 awight@deploy1002: Started scap: Backport for [[gerrit:885798{{!}}Squashed diff to catch up to master]]
* 14:11 awight@deploy1002: backport aborted:  (duration: 06m 09s)
* 14:11 awight@deploy1002: sync-world aborted: Backport for [[gerrit:885798{{!}}Squashed diff to catch up to master]] (duration: 03m 36s)
* 14:09 awight@deploy1002: mlitn and awight: Backport for [[gerrit:885798{{!}}Squashed diff to catch up to master]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet
* 14:07 awight@deploy1002: Started scap: Backport for [[gerrit:885798{{!}}Squashed diff to catch up to master]]
* 14:07 jmm@cumin2002: END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts bast3005.wikimedia.org
* 14:07 jmm@cumin2002: END (PASS) - Cookbook sre.dns.netbox (exit_code=0)
* 14:07 jmm@cumin2002: END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast3005.wikimedia.org decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 14:06 moritzm: updating perf on Bullseye hosts
* 14:05 jmm@cumin2002: START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast3005.wikimedia.org decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 13:55 jmm@cumin2002: START - Cookbook sre.dns.netbox
* 13:51 jmm@cumin2002: START - Cookbook sre.hosts.decommission for hosts bast3005.wikimedia.org
* 13:48 jmm@cumin2002: END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts bast5002.wikimedia.org
* 13:48 jmm@cumin2002: END (PASS) - Cookbook sre.dns.netbox (exit_code=0)
* 13:48 jmm@cumin2002: END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast5002.wikimedia.org decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 13:47 jmm@cumin2002: START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast5002.wikimedia.org decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 13:43 jmm@cumin2002: START - Cookbook sre.dns.netbox
* 13:36 jmm@cumin2002: START - Cookbook sre.hosts.decommission for hosts bast5002.wikimedia.org
* 13:21 moritzm: installing curl security updates on bullseye
* 13:00 stevemunene@deploy1002: helmfile [staging] DONE helmfile.d/services/datahub: sync on main
* 12:59 stevemunene@deploy1002: helmfile [staging] START helmfile.d/services/datahub: apply on main
* 12:41 jmm@cumin2002: END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts testvm2003.codfw.wmnet
* 12:41 jmm@cumin2002: END (PASS) - Cookbook sre.dns.netbox (exit_code=0)
* 12:41 jmm@cumin2002: END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: testvm2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 12:40 jmm@cumin2002: START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: testvm2003.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 12:31 jmm@cumin2002: START - Cookbook sre.dns.netbox
* 12:27 jmm@cumin2002: START - Cookbook sre.hosts.decommission for hosts testvm2003.codfw.wmnet
* 12:16 jmm@cumin2002: END (FAIL) - Cookbook sre.puppet.renew-cert (exit_code=99) for testvm2002.codfw.wmnet: Renew puppet certificate - jmm@cumin2002
* 12:15 jmm@cumin2002: START - Cookbook sre.puppet.renew-cert for testvm2002.codfw.wmnet: Renew puppet certificate - jmm@cumin2002
* 11:29 ladsgroup@deploy1002: Synchronized wmf-config/InitialiseSettings.php: Move CirrusSearch settings from IS.php to ext-CirrusSearch.php, part III ([[phab:T308932|T308932]]) (duration: 06m 43s)
* 11:27 jmm@cumin2002: END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts testvm2001.codfw.wmnet
* 11:27 jmm@cumin2002: END (PASS) - Cookbook sre.dns.netbox (exit_code=0)
* 11:27 jmm@cumin2002: END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: testvm2001.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 11:24 jmm@cumin2002: START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: testvm2001.codfw.wmnet decommissioned, removing all IPs except the asset tag one - jmm@cumin2002"
* 11:22 jgiannelos@deploy1002: Finished deploy [kartotherian/deploy@e1ca693] (codfw): Allow stylesheets through CSP (duration: 01m 45s)
* 11:21 ladsgroup@deploy1002: Synchronized multiversion/MWConfigCacheGenerator.php: Move CirrusSearch settings from IS.php to ext-CirrusSearch.php, part II ([[phab:T308932|T308932]]) (duration: 07m 04s)
* 11:21 jmm@cumin2002: START - Cookbook sre.dns.netbox
* 11:20 jgiannelos@deploy1002: Started deploy [kartotherian/deploy@e1ca693] (codfw): Allow stylesheets through CSP
* 11:17 jmm@cumin2002: START - Cookbook sre.hosts.decommission for hosts testvm2001.codfw.wmnet
* 11:17 jgiannelos@deploy1002: Finished deploy [kartotherian/deploy@e1ca693] (eqiad): Allow stylesheets through CSP (duration: 00m 51s)
* 11:16 jgiannelos@deploy1002: Started deploy [kartotherian/deploy@e1ca693] (eqiad): Allow stylesheets through CSP
* 11:14 ladsgroup@deploy1002: Synchronized wmf-config/ext-CirrusSearch.php: Move CirrusSearch settings from IS.php to ext-CirrusSearch.php, part I ([[phab:T308932|T308932]]) (duration: 07m 04s)
* 11:01 stevemunene@deploy1002: Finished deploy [analytics/refinery@a8840b0] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@a8840b0] (duration: 01m 18s)
* 11:00 stevemunene@deploy1002: Started deploy [analytics/refinery@a8840b0] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@a8840b0]
* 10:59 stevemunene@deploy1002: Finished deploy [analytics/refinery@a8840b0] (thin): Regular analytics weekly train THIN [analytics/refinery@a8840b0] (duration: 00m 05s)
* 10:59 stevemunene@deploy1002: Started deploy [analytics/refinery@a8840b0] (thin): Regular analytics weekly train THIN [analytics/refinery@a8840b0]
* 10:58 stevemunene@deploy1002: Finished deploy [analytics/refinery@a8840b0]: Regular analytics weekly train [analytics/refinery@a8840b0] (duration: 04m 29s)
* 10:54 stevemunene@deploy1002: Started deploy [analytics/refinery@a8840b0]: Regular analytics weekly train [analytics/refinery@a8840b0]
* 10:52 steve_munene: Deploying refinery for ops week
* 10:42 elukey@deploy1002: helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'.
* 10:42 elukey@deploy1002: helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'.
* 10:42 zabe: start running migrateRevisionCommentTemp in remaining sections (for now except s3) in screens # [[phab:T275246|T275246]]
* 10:42 elukey@deploy1002: helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'.
* 10:42 elukey@deploy1002: helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'.
* 10:41 elukey@deploy1002: helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'.
* 10:41 elukey@deploy1002: helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'.
* 10:23 jmm@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host krb2002.codfw.wmnet with OS bullseye
* 10:08 jmm@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on krb2002.codfw.wmnet with reason: host reimage
* 10:05 jmm@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on krb2002.codfw.wmnet with reason: host reimage
* 10:01 godog: upgrade grafana to 8.5.20 on cloudmetrics* - [[phab:T328405|T328405]]
* 09:57 godog: upgrade grafana to 8.5.20 on grafana1002 - [[phab:T328405|T328405]]
* 09:50 jmm@cumin2002: START - Cookbook sre.hosts.reimage for host krb2002.codfw.wmnet with OS bullseye
* 09:47 godog: upgrade grafana to 8.5.20 on grafana2001 - [[phab:T328405|T328405]]
* 09:15 urbanecm: Clean sign up throttle for IP 195.113.145.2 (via resetAuthenticationThrottle.php; [[phab:T328521|T328521]])
* 09:14 urbanecm@deploy1002: Finished scap: Backport for [[gerrit:885734{{!}}Add new throttle rule (T328521)]] (duration: 07m 24s)
* 09:07 urbanecm@deploy1002: Started scap: Backport for [[gerrit:885734{{!}}Add new throttle rule (T328521)]]
* 09:06 urbanecm@deploy1002: backport aborted:  (duration: 00m 01s)
* 09:05 ladsgroup@deploy1002: Finished scap: Backport for [[gerrit:883620{{!}}Create additional namespaces on shn.wikibooks (T327850)]] (duration: 15m 06s)
* 08:54 stevemunene@deploy1002: helmfile [staging] DONE helmfile.d/services/datahub: apply on main
* 08:54 stevemunene@deploy1002: helmfile [staging] START helmfile.d/services/datahub: apply on main
* 08:52 ladsgroup@deploy1002: superpes and ladsgroup: Backport for [[gerrit:883620{{!}}Create additional namespaces on shn.wikibooks (T327850)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet
* 08:50 ladsgroup@deploy1002: Started scap: Backport for [[gerrit:883620{{!}}Create additional namespaces on shn.wikibooks (T327850)]]
* 08:49 ladsgroup@deploy1002: Finished scap: Backport for [[gerrit:885321{{!}}Add a wordmark to trwiktionary (T328499)]] (duration: 08m 05s)
* 08:45 jayme@cumin1001: conftool action : set/pooled=false; selector: name=codfw,dnsdisc=k8s-ingress-staging
* 08:45 jayme@cumin1001: conftool action : set/pooled=true; selector: name=eqiad,dnsdisc=k8s-ingress-staging
* 08:42 ladsgroup@deploy1002: superpes and ladsgroup: Backport for [[gerrit:885321{{!}}Add a wordmark to trwiktionary (T328499)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet
* 08:41 ladsgroup@deploy1002: Started scap: Backport for [[gerrit:885321{{!}}Add a wordmark to trwiktionary (T328499)]]
* 08:40 ladsgroup@deploy1002: Finished scap: Backport for [[gerrit:884934{{!}}Add mobile wordmark to cswiktionary (T328357)]] (duration: 12m 26s)
* 08:29 ladsgroup@deploy1002: superpes and ladsgroup: Backport for [[gerrit:884934{{!}}Add mobile wordmark to cswiktionary (T328357)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet
* 08:27 ladsgroup@deploy1002: Started scap: Backport for [[gerrit:884934{{!}}Add mobile wordmark to cswiktionary (T328357)]]
* 08:27 elukey@deploy1002: helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'.
* 08:27 elukey@deploy1002: helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'.
* 08:27 elukey@deploy1002: helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'.
* 08:27 elukey@deploy1002: helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'.
* 08:27 ladsgroup@deploy1002: Finished scap: Backport for [[gerrit:879926{{!}}Remove former EventLogging streams for navtiming (T281103 T286703 T308621 T323623)]] (duration: 09m 42s)
* 08:19 jayme@cumin1001: END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 6 hosts
* 08:19 jayme@cumin1001: START - Cookbook sre.hosts.remove-downtime for 6 hosts
* 08:19 ladsgroup@deploy1002: ladsgroup and krinkle: Backport for [[gerrit:879926{{!}}Remove former EventLogging streams for navtiming (T281103 T286703 T308621 T323623)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet
* 08:17 ladsgroup@deploy1002: Started scap: Backport for [[gerrit:879926{{!}}Remove former EventLogging streams for navtiming (T281103 T286703 T308621 T323623)]]
* 08:14 ladsgroup@deploy1002: Finished scap: Backport for [[gerrit:726854{{!}}Remove unused eventlogging_RUMSpeedIndex stream (T286700)]] (duration: 10m 15s)
* 08:06 ladsgroup@deploy1002: phedenskog and ladsgroup: Backport for [[gerrit:726854{{!}}Remove unused eventlogging_RUMSpeedIndex stream (T286700)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet
* 08:05 moritzm: installing libarchive security updates
* 08:04 ladsgroup@deploy1002: Started scap: Backport for [[gerrit:726854{{!}}Remove unused eventlogging_RUMSpeedIndex stream (T286700)]]
* 08:01 ayounsi@cumin1001: END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 55821
* 07:57 ayounsi@cumin1001: START - Cookbook sre.network.peering with action 'configure' for AS: 55821
* 07:33 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1158 ([[phab:T310011|T310011]])', diff saved to https://phabricator.wikimedia.org/P43524 and previous config saved to /var/cache/conftool/dbconfig/20230201-073348-ladsgroup.json
* 07:18 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P43523 and previous config saved to /var/cache/conftool/dbconfig/20230201-071841-ladsgroup.json
* 07:03 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P43522 and previous config saved to /var/cache/conftool/dbconfig/20230201-070335-ladsgroup.json
* 06:48 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1158 ([[phab:T310011|T310011]])', diff saved to https://phabricator.wikimedia.org/P43521 and previous config saved to /var/cache/conftool/dbconfig/20230201-064828-ladsgroup.json
* 06:45 ladsgroup@cumin1001: dbctl commit (dc=all): 'Depooling db1158 ([[phab:T310011|T310011]])', diff saved to https://phabricator.wikimedia.org/P43520 and previous config saved to /var/cache/conftool/dbconfig/20230201-064311-ladsgroup.json
* 06:44 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance
* 06:43 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance
* 06:42 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1158.eqiad.wmnet with reason: Maintenance
* 06:42 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 12:00:00 on db1158.eqiad.wmnet with reason: Maintenance
* 00:38 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3055.esams.wmnet
* 00:37 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3055.esams.wmnet with OS bullseye
* 00:15 brett@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3055.esams.wmnet with reason: host reimage
* 00:12 brett@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on cp3055.esams.wmnet with reason: host reimage
* 00:02 brett@cumin2002: conftool action : set/pooled=yes; selector: name=cp3054.esams.wmnet
* 00:01 brett@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3054.esams.wmnet with OS bullseye


== July 6 ==
==Archives ==
* 23:50 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/221989/ (duration: 00m 12s)
See [[Server Admin Log/Archives]].
* 23:49 logmsgbot: krenair Synchronized w/static/images/project-logos/mrwikisource.png: https://gerrit.wikimedia.org/r/#/c/221989/ (duration: 00m 13s)
<noinclude>
* 23:35 logmsgbot: krenair Synchronized wmf-config/abusefilter.php: https://gerrit.wikimedia.org/r/#/c/223179/ - should be labs-only (duration: 00m 12s)
[[Category:SAL]]
* 23:32 logmsgbot: krenair Synchronized README: https://gerrit.wikimedia.org/r/#/c/222941/ - ... (duration: 00m 13s)
[[Category:Operations]]
* 23:27 logmsgbot: krenair Synchronized wmf-config: https://gerrit.wikimedia.org/r/#/c/221809/ - should be a noop, just doc changes (duration: 00m 13s)
</noinclude>
* 23:25 logmsgbot: krenair Synchronized wmf-config: https://gerrit.wikimedia.org/r/#/c/221808/ (duration: 00m 13s)
* 23:17 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings-labs.php: https://gerrit.wikimedia.org/r/#/c/223185/ (duration: 00m 12s)
* 23:06 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/220970/ (duration: 00m 14s)
* 21:46 gwicke: restarted cassandra instance on restbase1003; was low on memory and constantly writing small chunks
* 21:30 andrewbogott: rebooting labvirt1005, again.  Somehow virtualization is turned off again
* 21:12 subbu: deployed parsoid version 87a746e6
* 21:04 logmsgbot: ori Synchronized php-1.26wmf12/thumb.php: cdc75debaf: Add Content-Length header to thumb.php error responses (duration: 00m 13s)
* 21:02 mutante: purging static-bz URL on varnish ...
* 20:39 akosiaris: upload php5_5.3.10-1ubuntu3.19-wmf1 on apt.wikimedia.org/precise-wikimedia
* 20:15 gwicke: restart cassandra instance on 1005
* 20:04 mobrovac: restbase restart cassandra on rb1005
* 19:28 logmsgbot: krenair Synchronized wmf-config: https://gerrit.wikimedia.org/r/#/c/223040/ (duration: 00m 12s)
* 19:11 gwicke: reduced compaction throughput from 160 to 100 mb/s across the cassandra cluster via 'nodetool -h <host> setcompactionthroughput 100'
* 18:51 gwicke: restarted cassandra on restbase1001 with jdk8, see T104888
* 18:22 gwicke: restarted cassandra on restbase1004 with jdk8
* 17:54 Jeff_Green: authdns-update for new rigel A record
* 17:42 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: increase db2029 traffic to normal levels (duration: 00m 12s)
* 17:37 gwicke: upgraded restbase1005 to jdk8
* 17:35 gwicke: restarting cassandra instance on restbase1005: out of heap
* 17:10 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: repool db2029 again after conf upgrade(2/2) (duration: 00m 11s)
* 17:09 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: repool db2029 again after conf upgrade (duration: 00m 11s)
* 16:38 jynus: upgrade and restart of db2029
* 16:35 ori: depooled mw1152
* 15:29 logmsgbot: krenair Finished scap: https://gerrit.wikimedia.org/r/#/c/222993/ (duration: 22m 09s)
* 15:21 _joe_: repooling mw1152
* 15:20 _joe_: attempting dump-apc on mw1060
* 15:09 _joe_: depooled the HHVM imagescaler again
* 15:07 logmsgbot: krenair Started scap: https://gerrit.wikimedia.org/r/#/c/222993/
* 15:02 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/222617/ (duration: 00m 12s)
* 14:48 moritzm: installed python security updates on analytics*, lab* and virt*
* 14:46 moritzm: added python-diskimage-builder 0.1.46-1+wmf1 for jessie-wikimedia on carbon
* 14:43 _joe_: depooled the HHVM imagescaler, spitting 503s again.
* 14:18 mobrovac: restbase started thinning out parsoid data (local_group_wikipedia_T_parsoid_dataDVIsgzJSne8k) for >= 22 days
* 14:07 YuviPanda: restart apache on labcontrol1001 to pick up parser function change
* 12:57 moritzm: installed python security updates on mw*, es* and db*
* 12:18 logmsgbot: hoo Synchronized wmf-config/: Enable WikibaseQuality and WikibaseQualityConstraints on wikidata (duration: 00m 13s)
* 12:15 logmsgbot: hoo Finished scap: Update WikibaseQuality and WikibaseQualityConstraint (duration: 25m 56s)
* 11:49 logmsgbot: hoo Started scap: Update WikibaseQuality and WikibaseQualityConstraint
* 11:40 hoo: Created the `wbqc_constraints` table on wikidatawiki
* 09:02 _joe_: restarted the appserver on mw1059 with hhvm.server.apc.expire_on_sets = true, restarted the heap profiling to confirm my hypothesis on T104769
* 08:31 _joe_: restarted cassandra on rb1004. again.
* 05:01 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1034, depool db1041 (duration: 00m 12s)
* 05:00 springle: stash/pull/apply CommonSettings.php on tin, which was left with modifications
* 04:35 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Jul  6 04:35:45 UTC 2015 (duration 35m 44s)
* 02:22 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-06 02:22:12+00:00
* 02:18 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 06m 07s)
 
== July 5 ==
* 22:30 bd808: Restarted logstash on logstah1001; Hung due to OOM errors
* 22:03 mobrovac: restbase rolling restart of restbase
* 18:11 logmsgbot: krenair Synchronized docroot/noc: https://gerrit.wikimedia.org/r/#/c/222932/ (duration: 00m 12s)
* 17:49 logmsgbot: krenair Synchronized docroot/noc/conf: https://gerrit.wikimedia.org/r/#/c/222290/ (duration: 00m 13s)
* 17:44 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/221600/ (duration: 00m 12s)
* 15:16 YuviPanda: restarted nutcracker on silver.
* 12:55 mobrovac: restbase rolling restart of cassandra to apply the 16G heap change https://gerrit.wikimedia.org/r/222899
* 11:21 _joe_: restarted cassandra on restbase1004 (again), seemingly crashed for a bad request
* 11:03 _joe_: restarting cassandra on rb1003,4 and restbase on rb1002,3
* 09:43 bblack: restarted restbase on restbase1005
* 08:40 _joe_: collecting heaps on an api appserver, mw1115, as comparison
* 08:29 _joe_: restaarted HHVM on mw1059 with heap profiling enabled, collecting data (will stop this evening).
* 08:27 bblack: FYI: 08:15 < grrrit-wm> (CR) BBlack: [C: 2 V: 2] filter S:RI from wm2015register T45250 [puppet] - https://gerrit.wikimedia.org/r/222879 (owner: BBlack)
* 08:23 _joe_: restarted hhvm because of ooms, not apache
* 08:23 _joe_: restarted apache on mw1105,mw1092,90,82,78
* 07:09 bblack: restarted cassandra on restbase1004
* 07:07 bblack: restarted cassandra + restbase on restbase1005
* 07:01 jynus: Restarted HHVM for mw1112,1028,1057,1061,1069,1070,1084,1086
* 02:57 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-05 02:57:28+00:00
 
== July 4 ==
* 23:49 Krenair: Ran "mwscript updateSpecialPages.php labswiki --override --only=Wantedpages" on silver, completed in 0.44 seconds
* 23:44 Krenair: test morebots
* 21:22 YuviPanda: restarted cassandra on restbase1004 per urandom
* 19:15 YuviPanda: restarted cassandra on restbase1001
* 17:15 _joe_: restarted cassandra on restbase1001
* 16:12 logmsgbot: krenair Synchronized wmf-config/interwiki.cdb: Updating interwiki cache (duration: 10m 35s)
* 12:56 logmsgbot: krinkle Synchronized php-1.26wmf12/resources/src/mediawiki/mediawiki.Title.js: I1dae1e63e47 (duration: 00m 17s)
* 05:01 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat Jul  4 05:01:43 UTC 2015 (duration 1m 42s)
* 03:11 ori: Promoted Krinkle and Krenair to admin, cloudadmin on wikitech, because duh.
* 02:39 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-04 02:39:41+00:00
* 02:29 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 09m 59s)
* 01:00 springle: reload haproxy dbproxy1004
 
== July 3 ==
* 23:59 logmsgbot: legoktm Synchronized php-1.26wmf12/extensions/Translate/: Translate+UserMerge fixes (duration: 00m 17s)
* 23:55 logmsgbot: legoktm Synchronized php-1.26wmf12/extensions/WikiLove/: WikiLove+UserMerge fixes (duration: 00m 18s)
* 23:24 logmsgbot: ori Synchronized w/404.php: Force 'Transfer-Encoding: Chunked' header on 404 responses (duration: 00m 31s)
* 22:36 Krenair: restarted apache on silver to see if it would make https://gerrit.wikimedia.org/r/#/c/221969/ take effect for T104360. It did not.
* 21:46 ori: depooled mw1152
* 20:12 ori: restarted cassandra on restbase1001
* 17:28 ori: pooled mw1152 (HHVM image scaler) for debugging.
* 17:05 logmsgbot: krenair Synchronized php-1.26wmf12/extensions/Collection/RenderingAPI.php: https://gerrit.wikimedia.org/r/#/c/222616/ - hoping this fixes T104708 (duration: 00m 44s)
* 15:35 YuviPanda: cd /mnt/backup/others-20150703/ ; tar --acls --xattrs -cpf - . | pv -L 80M -C -p -r -e -b -t -B 32M -T | ssh -c chacha20-poly1305@openssh.com -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -C -B 32M | tar --acls --xattrs -xpf - -C /srv/backup-others-20150703" on labstore1002
* 15:35 YuviPanda: mount /dev/mapper/backup-others--20150703 /srv/backup-others-20150703/ on labstore2001
* 15:34 YuviPanda: mkdir /srv/backup-others-20150703 on labstore2001
* 15:33 YuviPanda: mkfs -t ext4 /dev/mapper/backup-others--20150703 on labstore2001 completed
* 15:33 YuviPanda: run mount -o ro /dev/mapper/labstore-others--20150703 /mnt/backup/others-20150703/ on labstore1002
* 15:32 YuviPanda: run mkdir /mnt/backup/others-20150703 on labstore1002
* 15:31 YuviPanda: run  lvcreate -L 640G -s -n others-20150703 labstore/others on labstore1002
* 15:29 YuviPanda: running mkfs -t ext4 /dev/mapper/backup-others--20150703 on labstore2001
* 15:28 YuviPanda: run lvcreate -L 3.5T -n others-20150703 backup on labstore2001
* 15:25 YuviPanda: begin process of backing up others (all labs projects except tools) on to labstore2001 from labstore1002
* 14:06 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1022 (low traffic) (duration: 00m 54s)
* 13:27 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: repool db2047 after maintenance (duration: 00m 22s)
* 13:27 YuviPanda: run cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs -cpf - . | pv -L 80M -C -p -r -e -b -t -B 32M -T | ssh -c chacha20-poly1305@openssh.com -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -C -B 32M | tar --acls --xattrs -xpf - -C /srv/backup-tools-20150703" on labstore1002
* 13:27 YuviPanda: interrupting tar |ssh | tar script and cleaning out destination again
* 13:17 YuviPanda: clean out tar | ssh | tar target on labstore2001
* 13:15 YuviPanda: /dev/null filled up on labstore1002, aborting pipe of valuable user data into it.
* 13:13 YuviPanda: run cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs -cpf - . | pv -L 80M -C -p -r -e -b -t -B 32M -T > /dev/null on labstore1002
* 13:02 YuviPanda: run cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs -cpf - . | pv -L 80M -C -p -r -e -b -t -B 32M -T | ssh -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -C -B 32M | tar --acls --xattrs -xpf - -C /srv/backup-tools-20150703" on labstore1002
* 13:02 YuviPanda: interrupt tar | ssh | tar on labstore1002 and killed dest on labstore2001
* 12:43 YuviPanda: cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs -cpf - . | pv -L 80M -p -r -e -b -t -B 32M -T | ssh -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -B 32M | tar --acls --xattrs -xpf - -C /srv/backup-tools-20150703" on screen on labstore1002
* 12:43 mobrovac: restbase deploying restbase/deploy @ 1a826a5
* 12:42 YuviPanda: interrupt tar | ssh | tar on labstore1002, clean out destination on labstore2001
* 12:36 YuviPanda: interrupted tar | ssh | tar on labstore1002 and cleaned out dest on labstore2001
* 12:35 YuviPanda: cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs -cpf - . | pv -L 80M -p -r -e -b -t -B 16M | ssh -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -B 16M | tar --acls --xattrs -xpf - -C /srv/backup-tools-20150703" in screen on labstore1002
* 12:33 YuviPanda: rm -rf /srv/backup-tools-20150703/* on labstore2001
* 12:31 mark: labstore2001: mount /srv/backup -o remount,ro
* 12:31 YuviPanda: interrupt tar | ssh | tar on labstore1002
* 12:29 YuviPanda: cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs -cpf - . | ssh -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -L 80M -p -r -e -b -t -B 16M | tar --acls --xattrs -xpf - -C /srv/backup-tools-20150703" on labstore1002
* 12:28 YuviPanda: cd /mnt/backup/tools-20150703/ ; tar --acls --xattrs cpf - . | ssh -i ~/.ssh/id_labstore root@labstore2001.codfw.wmnet "pv -L 80M -p -r -e -b -t -B 16M | tar --acls --xattrs xpf - -C /srv/backup-tools-20150703" on labstore1002
* 12:09 YuviPanda: running mount -o ro /dev/mapper/labstore-tools--20150703 /mnt/backup/tools-20150703/ now
* 11:57 YuviPanda: run  lvcreate -L 640G -s -n tools-20150703 labstore/tools on labstore1002
* 11:50 YuviPanda: running  lvcreate -L 640G -s tools -n tools-20150703 labstore on labstore1002
* 11:26 YuviPanda:  umount /mnt/backup/project/tools/ on labstore1002
* 11:24 YuviPanda: ran mount /dev/mapper/backup-tools--20150703 /srv/backup-tools-20150703/ on labstore2001
* 11:22 YuviPanda: mkdir /srv/backup-tools-20150703 on labstore2001
* 11:13 YuviPanda: run mkfs -t ext4 /dev/mapper/backup-tools--20150703  on labstore2001
* 11:09 YuviPanda: lvcreate -L 6TB -n tools-20150703 backup on labstore2001
* 11:09 jynus: reimports finished on dbstore2* hosts and puppet reenabled after T104471 was fixed
* 10:56 mobrovac: restbase disabling puppet on restbase1005 to tweak JVM params for cassandra
* 10:50 YuviPanda: started du of maps project on labstore2001
* 09:36 mobrovac: restbase restarting cassandra on rb1002
* 06:19 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri Jul  3 06:19:02 UTC 2015 (duration 19m 1s)
* 02:50 urandom: restbase rolling restart
* 02:49 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-03 02:49:31+00:00
* 02:42 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 11m 43s)
* 02:06 logmsgbot: ori Synchronized php-1.26wmf12/extensions/CentralAuth: I0e5f2d3b2: Updated mediawiki/core Project: mediawiki/extensions/CentralAuth  7f8da7139714dd5089dd03e8679aba25c2c89c4d (duration: 00m 15s)
 
== July 2 ==
* 22:34 logmsgbot: legoktm Synchronized php-1.26wmf12/extensions/CentralAuth/: Made use of new USE_MULTI_COMMIT flag in user merge jobs (duration: 00m 18s)
* 22:31 logmsgbot: legoktm Synchronized php-1.26wmf12/extensions/UserMerge/:  Added USE_MULTI_COMMIT flag to enable query batching (duration: 00m 26s)
* 21:51 logmsgbot: legoktm Synchronized php-1.26wmf12/extensions/Interwiki/Interwiki_body.php: Add missing global $wgInterwikiViewOnly declaration (duration: 00m 15s)
* 21:37 twentyafterfour: restarted apache2 or iridium after applying hotfix for phabricator css issue
* 21:22 logmsgbot: legoktm Synchronized php-1.26wmf12/extensions/CentralNotice/: https://gerrit.wikimedia.org/r/222484 (duration: 00m 15s)
* 21:16 cwdent: updated civicrm from 4fe0648ea9f36282731bf651a59ca1a617db6c08 to 04efc7d5c7bbb068f907125f2184692aee676123
* 20:47 logmsgbot: legoktm Synchronized wmf-config/CommonSettings.php: Disable global merge (duration: 00m 14s)
* 20:13 andrewbogott: restarted keystone on labcontrol1001
* 18:54 bd808: Running sync-common on mw1111; fatal log showed it to be running 1.26wmf9
* 18:30 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: all wikis to 1.26wmf12
* 18:02 YuviPanda: running exportfs -ra on labstore1002
* 16:40 bd808: Restarted logstash on logstash1001 due to OOM
* 16:05 bblack: cp1065 undowntimed/repooled
* 16:04 YuviPanda: clean out exports.d in labstore1002, will get regenerated. backup in /root/exports.backup
* 15:18 logmsgbot: anomie Synchronized php-1.26wmf12/extensions/Wikidata/: SWAT: Update Wikibase: SearchEntities return 'aliases' when not same as label [[gerrit:222311]] (duration: 00m 20s)
* 15:18 YuviPanda: killed icinga-wm again
* 15:17 bblack: depooled cp1065 in pybal/puppet
* 14:57 mutante: restarting gitblit on antimony for the 123443th time
* 14:54 mutante: restarted apache on strontium
* 14:50 YuviPanda: killed icinga-wm for a bit
* 14:43 YuviPanda: kicked puppetmaster on palladium
* 14:28 YuviPanda: restarted apache on labcontrol1001
* 14:14 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: depool db2029 again: T104573 (duration: 00m 12s)
* 13:58 urandom: restarted restbase1005.eqiad
* 13:49 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: repool db2029; depool db2047 for maintenance (duration: 00m 13s)
* 11:19 mobrovac: restbase restarting cassandra on rb1005
* 07:06 logmsgbot: krinkle Synchronized w/touch.php: T104538 (duration: 00m 11s)
* 07:05 logmsgbot: krinkle Synchronized w/favicon.php: T104538 (duration: 00m 11s)
* 06:34 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Emergency depool of db2029 (duration: 00m 12s)
* 06:27 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Jul  2 06:27:57 UTC 2015 (duration 27m 56s)
* 04:18 ori: depooled mw1152.
* 03:38 logmsgbot: krinkle Synchronized docroot/default/index.html: 6d49d229806 (duration: 00m 12s)
* 03:37 logmsgbot: krinkle Synchronized 404.html: 6d49d229806 (duration: 00m 12s)
* 03:14 logmsgbot: legoktm Synchronized wmf-config/interwiki.cdb: Updating interwiki cache (duration: 00m 12s)
* 02:54 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-02 02:54:06+00:00
* 02:52 logmsgbot: krinkle Synchronized docroot and w: 245a1ff (duration: 00m 12s)
* 02:51 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 05m 19s)
* 02:37 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-07-02 02:37:03+00:00
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 10m 23s)
* 00:44 ori: Repooling mw1152 (HHVM image scaler) for testing)
 
== July 1 ==
* 23:30 springle: restart mysqld dbstore2002 T104471
* 23:06 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/222202/ (duration: 00m 11s)
* 21:39 godog: bounce gitblit
* 20:38 jgage: restarted gitblit on antimony
* 19:50 ori: restarted gitblit on antimony
* 19:49 ori: mw1152 not actually re-pooled because of ongoing work on palladium. I'm undoing the change and hanging back now.
* 19:41 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group1 wikis to 1.26wmf12
* 19:36 logmsgbot: twentyafterfour Synchronized php-1.26wmf12: sync 1.26wmf12 branch revert of "Implement support for Google reCAPTCHA 2.0" 90665a737bc25ff3c859044755d662c6cd700573 (duration: 02m 04s)
* 19:31 jynus: replication issues for shard s7 on dbstore2001 and dbstore2002, production applications *not* affected
* 19:31 urandom: from restbase1002; node thin_out_key_rev_value_data.js `hostname -i` local_group_wikipedia_T_parsoid_html 2>&1 | pv --line-mode | gzip -c > wikipedia_T_parsoid_html.log.gz
* 19:28 ori: Repooling mw1152 for further testing of HHVM scaler
* 19:03 logmsgbot: hoo Synchronized php-1.26wmf12/extensions/Wikidata/: Update DataModel to fix SnakList (duration: 00m 20s)
* 18:42 logmsgbot: hoo Synchronized wmf-config/mobile-labs.php: consistency (duration: 00m 12s)
* 18:41 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings-labs.php: consistency (duration: 00m 31s)
* 18:02 andrewbogott: restarted keystone on labcontrol1001
* 17:03 jgage: beginning puppet CA replacement procedure
* 16:06 ejegg: enabled queue consumers
* 16:05 akosiaris: re-enabling ntp everywhere
* 15:59 ejegg: disabled queue consumers
* 15:30 logmsgbot: hoo Synchronized php-1.26wmf12/extensions/Wikidata/: Remove alias uniqueness constraints (duration: 00m 21s)
* 15:06 urandom: restbase1002: PWD=/home/eevans/restbase-mod-table-cassandra/maintenance; node thin_out_key_rev_value_data.js `hostname -i` local_group_wikimedia_T_parsoid_html 2>&1 | pv --line-mode | gzip -c > wikimedia_T_parsoid_html.log.gz
* 15:05 bblack: re-enabling puppet on caches
* 14:59 bblack: disabling puppet on caches (because puppet always breaks when you move files/modules around...)
* 13:57 bblack: rebooting cp2001 (test kernel update)
* 11:32 YuviPanda: rsync on labstore1002 finished, restarting to see what was skipped + errors
* 10:47 moritzm: installed patch security updates on 862 hosts
* 10:42 hashar: restarting Jenkins: upgrading Jenkins gearman plugin from 0.1.1-8-gf2024bd to 0.1.1-9-g08e9c42-change_192429_2  https://phabricator.wikimedia.org/T72597#1416913
* 07:48 mobrovac: restbase restarting cassandra on rb1005
* 05:28 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Jul  1 05:28:38 UTC 2015 (duration 28m 37s)
* 05:27 csteipp: deployed patch for T103765
* 04:41 logmsgbot: krinkle Synchronized php-1.26wmf12/includes/resourceloader/ResourceLoader.php: Iee884208c5c4b minify cache key (duration: 00m 11s)
* 03:10 mutante: git pull on strontium
* 03:00 logmsgbot: LocalisationUpdate completed (1.26wmf12) at 2015-07-01 03:00:21+00:00
* 02:53 logmsgbot: l10nupdate Synchronized php-1.26wmf12/cache/l10n: (no message) (duration: 10m 12s)
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-07-01 02:26:55+00:00
* 02:23 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 06m 50s)
* 02:12 springle: upgrade db1034 trusty
* 01:37 ori: Depooled mw1152. Req error dashboard shows elevated 5xx rates correlating with the server getting pooled, but the logs don't appear to corroborate it. Odd.
* 01:03 ori: Disabling Puppet on mw1152 for 12h to hack apache config to log locally
* 00:42 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I9a8018981: Double $wgMaxShellMemory on HHVM scalers (512 Mb => 1024 Mb) (duration: 00m 12s)
* 00:34 ori: pooled mw1152 (HHVM rendering) at weight 10 for testing
* 00:33 gwicke: rolling cassandra restart done
* 00:23 gwicke: starting rolling restart of cassandra nodes to apply new config
* 00:01 greg-g: we're still here
 
== June 30 ==
* 23:30 logmsgbot: hoo Synchronized php-1.26wmf12/extensions/Wikidata/: Fix EntityParserOutputGenerator (duration: 00m 21s)
* 22:55 ori: depooled mw1152
* 22:52 ori: Pooled HHVM image scaler (mw1152) at weight 1 for testing.
* 22:52 gwicke: updated restbase1004 to openjdk-8
* 22:46 bblack: restarting gitblit on antimony, because Java is so 1996
* 22:43 tgr: running eval.php (along the lines of https://gerrit.wikimedia.org/r/#/c/221783) on commonswiki to fix T104395
* 22:13 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Flow-occupy Wikipedia talk namespace on cawiki (duration: 00m 11s)
* 22:09 matt_flaschen: Done converting wikitext namespace to Flow on Catalan Wikipedia
* 22:03 matt_flaschen: Started convertNamespaceFromWikitext.php for Project_talk on Catalan Wikipedia
* 21:46 RoanKattouw: Also ran populateContentModel.php --table=archive for talk namespaces on officewiki
* 21:45 RoanKattouw: Ran populateContentModel.php --table=archive --ns=5 on officewiki
* 21:29 RoanKattouw: Ran populateContentModel.php --table=page --ns=5 on cawiki
* 21:19 logmsgbot: catrope Synchronized php-1.26wmf12/extensions/Flow: (no message) (duration: 00m 14s)
* 21:19 logmsgbot: catrope Synchronized php-1.26wmf11/extensions/Flow: (no message) (duration: 00m 14s)
* 21:14 logmsgbot: catrope Synchronized php-1.26wmf12/extensions/Flow: (no message) (duration: 00m 14s)
* 21:14 logmsgbot: catrope Synchronized php-1.26wmf11/extensions/Flow: (no message) (duration: 00m 13s)
* 21:01 RoanKattouw: Running populateContentModel.php on officewiki for page table in namespaces occupied by Flow (1,3,5,7,9,11,13,15,91,93,101,111,113,829)
* 20:58 logmsgbot: catrope Synchronized php-1.26wmf12/maintenance/: Add populateContentModel maintenance script (duration: 00m 13s)
* 20:58 logmsgbot: catrope Synchronized php-1.26wmf11/maintenance/: Add populateContentModel maintenance script (duration: 00m 17s)
* 20:53 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings.php: Log 'wbq_evaluation' (duration: 00m 12s)
* 20:46 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings.php: Enable WikibaseQuality extensions on testwikidata (duration: 00m 14s)
* 20:39 hoo: Created `wbqc_constraints` on testwikidatawiki (s3).
* 20:23 logmsgbot: thcipriani rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf12
* 20:15 logmsgbot: thcipriani Purged l10n cache for 1.26wmf6
* 20:14 logmsgbot: thcipriani Purged l10n cache for 1.26wmf7
* 20:14 logmsgbot: thcipriani Purged l10n cache for 1.26wmf8
* 20:13 logmsgbot: thcipriani Purged l10n cache for 1.26wmf9
* 20:13 logmsgbot: thcipriani Purged l10n cache for 1.26wmf10
* 20:05 logmsgbot: thcipriani Finished scap: testwiki to php-1.26wmf12 and rebuild l10n cache (duration: 34m 58s)
* 19:41 ostriches: OAI: disabled unused accounts
* 19:30 logmsgbot: thcipriani Started scap: testwiki to php-1.26wmf12 and rebuild l10n cache
* 19:00 logmsgbot: demon Synchronized php-1.26wmf11/includes/WebResponse.php: rv my test (duration: 00m 12s)
* 18:55 logmsgbot: demon Synchronized php-1.26wmf11/includes/WebResponse.php: (no message) (duration: 00m 12s)
* 18:36 cmjohnson1: labcontrol1002 going down for a few minutes
* 18:33 mutante: tendril - short downtime for switch to new repo
* 18:17 gwicke: restarted cassandra on restbase1005 with g1gc GC and larger heap
* 18:16 gwicke: restarted cassandra on restbase1004 with g1gc GC and larger heap
* 17:02 akosiaris: enabled and ran puppet on lvs400X, lvs300X, lvs100[123]. noops
* 16:58 bblack: re-enabling puppet on caches
* 16:52 bblack: disabling puppet on cache clusters
* 16:48 akosiaris: enabled an ran puppet on all lvs servers @ codfw
* 16:22 akosiaris: enabled and ran puppet on lvs1004. noop as well
* 16:19 akosiaris: enabled and running puppet on lvs1005
* 16:11 akosiaris: enabling and running puppet on lvs1006
* 16:09 akosiaris: disabling puppet on all lvs and neon
* 16:07 gwicke: restarting cassandra instance on restbase1004
* 15:12 logmsgbot: thcipriani Synchronized wmf-config: SWAT: Standardise a ton of ticket comments [[gerrit:221803]] (duration: 00m 13s)
* 15:04 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Enable CX all wikipedias except enwiki [[gerrit:221831]] (duration: 00m 13s)
* 14:46 kart_: Update cxserver to 0d21a80
* 14:10 mobrovac: restbase restarting cassandra on restbase1005
* 11:29 mobrovac: restbase restarting cassandra on restbase1005
* 10:41 mobrovac: restbase restarting on all nodes
* 09:54 mobrovac: restbase restarting cassandra on restbase1004
* 08:53 mobrovac: restbase restrting cassandra on restbase1004
* 08:05 jynus: applying schema changes for Gather extension
* 06:56 jynus: initiating query profiling on db1018
* 05:21 gwicke: restarting cassandra instance on restbase1004; was in small-write mode
* 05:17 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1034 (duration: 00m 12s)
* 04:37 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Jun 30 04:37:00 UTC 2015 (duration 36m 59s)
* 02:22 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-30 02:22:00+00:00
* 02:18 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 06m 09s)
* 02:11 logmsgbot: krenair Synchronized wmf-config/wikitech.php: (no message) (duration: 00m 12s)
* 01:56 logmsgbot: krenair Synchronized wmf-config/wikitech.php: (no message) (duration: 00m 11s)
* 01:41 logmsgbot: krinkle Synchronized php-1.26wmf11/includes/resourceloader/ResourceLoader.php: I7761242f01 (duration: 00m 14s)
* 00:37 godog: restbase1* upgrade to cassandra 2.1.7 completed
 
== June 29 ==
* 23:57 robh: mw2027 was offline (blank screen on serial console).  mgmt powercycled
* 23:48 godog: start upgrading restbase1* to cassandra 2.1.7
* 23:41 gwicke: restarted cassandra instance on restbase1004.eqiad; log showed many small writes and clients saw timeouts
* 23:29 gwicke: deployed restbase 32db4ce1e1
* 23:21 logmsgbot: ori Synchronized php-1.26wmf11/includes/resourceloader: I0e5f2d3b2: resourceloader: Add timing metrics for key operations (duration: 01m 12s)
* 23:15 logmsgbot: catrope Synchronized wmf-config/: wikitech cleanup (duration: 01m 08s)
* 23:11 RoanKattouw: ssh: connect to host mw2027.codfw.wmnet port 22: Connection timed out
* 23:11 RoanKattouw: Synced wmf-config/CommonSettings.php:  Remove survey access point in Popups
* 23:09 godog: stop ircecho on neon, icinga spam
* 22:53 gwicke: canary deploy of restbase 32db4ce1e1 on restbase1001.eqiad
* 21:30 urandom: restarting restbase1004 to apply new metrics reporting interval
* 20:19 subbu: deployed parsoid sha ea98be88
* 18:18 logmsgbot: ori Synchronized php-1.26wmf11/includes/db/LoadBalancer.php: I0e5f2d3b2: Use APC for caching slave lag times (duration: 01m 09s)
* 18:00 cmjohnson1: powering down ms-be1015
* 16:06 bblack: re-enabling puppet on caches
* 15:51 bblack: disabling puppet on caches temporarily ...
* 15:49 logmsgbot: krenair Synchronized php-1.26wmf11/extensions/OpenStackManager: https://gerrit.wikimedia.org/r/#/c/221648/ (duration: 00m 13s)
* 15:29 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/221405/ (duration: 00m 15s)
* 15:26 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/221612/ (duration: 00m 12s)
* 15:24 logmsgbot: krenair Synchronized w/static/images/project-logos/zhwiki-hans-2x.png: https://gerrit.wikimedia.org/r/#/c/221113/ (duration: 00m 14s)
* 15:24 logmsgbot: krenair Synchronized w/static/images/project-logos/zhwiki-hans-1.5x.png: https://gerrit.wikimedia.org/r/#/c/221113/ (duration: 00m 12s)
* 15:23 logmsgbot: krenair Synchronized w/static/images/project-logos/zhwiki-hans.png: https://gerrit.wikimedia.org/r/#/c/221113/ (duration: 00m 12s)
* 15:20 logmsgbot: krenair Synchronized wmf-config/wikitech.php: https://gerrit.wikimedia.org/r/#/c/221009/ (duration: 00m 11s)
* 15:18 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/221047/ (duration: 00m 13s)
* 15:12 logmsgbot: krenair Synchronized php-1.26wmf11/extensions/ContentTranslation/modules/tools/ext.cx.tools.link.js: https://gerrit.wikimedia.org/r/#/c/221605 (duration: 00m 13s)
* 15:02 logmsgbot: krenair Synchronized php-1.26wmf11/extensions/ContentTranslation/modules/tools/ext.cx.tools.formatter.js: https://gerrit.wikimedia.org/r/#/c/221604/ (duration: 00m 14s)
* 14:34 jynus: rebooting and reinstalling db1022
* 12:06 YuviPanda: restarting rsync with new exclusions file on labstore1002 to codfw
* 12:06 YuviPanda: excluded maps, mwoffliner and video project from rsync of broken FS to speed it up
* 11:59 YuviPanda: interupt rsync on labstore1001 to prevent it from copying mwofflienr files
* 11:00 _joe_: shutting down etcd1003, cleaning exported resources
* 10:32 _joe_: effectively removing etcd1003 from the cluster
* 10:17 _joe_: starting removal of etcd1003 from the etcd cluster
* 08:49 _joe_: joined conf1003 to the etcd cluster
* 08:20 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Depool db1022 for reinstall (duration: 00m 12s)
* 08:12 _joe_: adding conf1002 to the etcd cluster as a member
* 07:46 akosiaris: disabling ntp everywhere expect selected hosts in anticipation for the leap second
* 04:51 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Jun 29 04:51:48 UTC 2015 (duration 51m 47s)
* 03:08 jgage: jmxtrans filled disks on all kafka brokers, 21GB log files. removed logs and restarted services.
* 02:23 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-29 02:23:47+00:00
* 02:20 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 05m 53s)
* 00:52 springle: restart eventlogging auto-purge on m4
* 00:51 springle: restart replication on dbstore2002
* 00:00 springle: pausing replication on dbstore2002
 
== June 28 ==
* 23:51 logmsgbot: ori Synchronized php-1.26wmf11/extensions/CentralNotice/modules/ext.centralNotice.bannerController/bannerController.js: I6ffdc977e87: Parse older format of Geo cookies (duration: 00m 13s)
* 04:30 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun Jun 28 04:30:54 UTC 2015 (duration 30m 53s)
* 02:20 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-28 02:20:52+00:00
* 02:17 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 05m 56s)
 
== June 27 ==
* 23:30 bd808: Deleted corrupt shards on logstash1004 and logstash1005. Recovery in process
* 20:12 ori: Delegated full access to Google Webmaster Tools for myself (olivneh@).
* 04:58 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat Jun 27 04:58:46 UTC 2015 (duration 58m 45s)
* 02:23 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-27 02:23:40+00:00
* 02:20 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 05m 46s)
 
== June 26 ==
* 23:57 bd808: Logstash log ingestion working again after forcing recovery of replicas for logstash-2015.06.26; new logs were being rejected with only a primary shard available
* 23:54 bd808: re-enabled allocation on logstash elasticsearch cluster
* 23:05 bblack: restarted gitblit on antimony, AGAIN
* 22:57 mutante: restarted gitblit
* 22:43 logmsgbot: catrope Synchronized php-1.26wmf11/extensions/Flow: Temporarily make subpages in Flow-occupied namespaces non-Flow again (duration: 00m 14s)
* 22:36 bd808: set indices.recovery.concurrent_streams to 4 on logstash ES cluster
* 22:36 godog: set indices.recovery.max_bytes_per_sec to 10mb on logstash ES cluster
* 22:25 godog: set indices.recovery.max_bytes_per_sec to 50mb on logstash ES cluster
* 22:25 jamesofur: Reset email address of User:Chwms identity verified in person at editathon
* 22:09 bd808: restarted logstash on logstash1001
* 21:10 urandom: taking xenon down to be rebootstrapped
* 20:10 bd808: Deleted 4 corrupt indices (logstash-2015.05.30 logstash-2015.05.31 logstash-2015.06.03 logstash-2015.06.06) on logstash1004
* 19:58 bd808: stopping elasticsearch on logstash1004 to cleanup corrupt shards
* 17:05 mutante: zirconium - manual cleanup, removing planet
* 17:04 godog: reverted cronolog puppetmaster patch, restarting apache
* 14:17 Krenair: Deployed patch for T103391
* 12:23 logmsgbot: krenair Synchronized wmf-config/wikitech.php: https://gerrit.wikimedia.org/r/#/c/221105/ (duration: 00m 12s)
* 12:18 _joe_: added conf1001 to the etcd cluster
* 07:57 logmsgbot: krinkle Synchronized php-1.26wmf11/extensions/Popups: T103610 (duration: 00m 11s)
* 06:04 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri Jun 26 06:04:14 UTC 2015 (duration 4m 13s)
* 05:22 twentyafterfour: restarted apache on iridium to fix phabricator fatal
* 02:33 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-26 02:33:33+00:00
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 05m 36s)
* 00:51 gwicke: reverted restbase1001 canary to 90817c2a
* 00:36 logmsgbot: ori Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi: I0e5f2d3b2: Updated mediawiki/core Project: mediawiki/extensions/SyntaxHighlight_GeSHi (duration: 00m 11s)
* 00:16 logmsgbot: krinkle Synchronized wmf-config/InitialiseSettings.php: T102852 (duration: 00m 12s)
* 00:15 logmsgbot: krinkle Synchronized w/static/images/project-logos/zhwiki-2x.png: T102852 (duration: 00m 13s)
* 00:14 logmsgbot: krinkle Synchronized w/static/images/project-logos/zhwiki-1.5x.png: T102852 (duration: 00m 12s)
* 00:05 logmsgbot: krinkle Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi/modules/pygments.wrapper.css: I5d1510dc80d6d4712ca8411 (duration: 00m 12s)
 
== June 25 ==
* 23:53 mutante: planet1001 (ganeti) - signing puppet cert, initial run
* 23:31 mutante: apt-get upgrade on zirconium
* 23:28 logmsgbot: krenair Synchronized wmf-config/wikitech.php: https://gerrit.wikimedia.org/r/#/c/220847/ (duration: 00m 12s)
* 23:27 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/220847/ (duration: 00m 11s)
* 23:24 logmsgbot: krenair Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi: https://gerrit.wikimedia.org/r/#/c/220997/ (duration: 00m 13s)
* 23:20 gwicke: canary update of restbase on restbase1001 to 4b961f166 (deploy d1c4d9961)
* 23:16 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/218926/ (duration: 00m 12s)
* 23:11 logmsgbot: krenair Synchronized wmf-config/logging.php: https://gerrit.wikimedia.org/r/#/c/220784/ (duration: 00m 13s)
* 23:03 legoktm: fixed content models on lrcwiki for Module namespace
* 23:02 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/220485/ (duration: 00m 16s)
* 22:02 logmsgbot: hoo Synchronized php-1.26wmf11/extensions/Wikidata/: Update Wikidata: Use SELECT FOR UPDATE in SqlIdGenerator (duration: 00m 20s)
* 21:29 godog: rm /var/lib/git/operations/puppet/modules/cassandra from labcontrol1001 labcontrol1002
* 21:10 godog: rm /var/lib/git/operations/puppet/modules/cassandra from rhodium
* 21:07 godog: rm /var/lib/git/operations/puppet/modules/cassandra from strontium and palladium
* 21:06 godog: push puppet.git after module/cassandra removal T92560
* 20:41 mutante: deleted SVN monitor from watchmouse
* 20:18 mutante: bye SVN - subversion URLs now redirect to phab or doc
* 20:08 logmsgbot: nikerabbit Finished scap: T103888 CX aliases (duration: 22m 37s)
* 19:46 logmsgbot: nikerabbit Started scap: T103888 CX aliases
* 18:09 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: all wikis to 1.26wmf11
* 17:46 logmsgbot: krenair Synchronized wmf-config: (no message) (duration: 00m 31s)
* 17:43 logmsgbot: krenair Synchronized wmf-config/wikitech.php: https://gerrit.wikimedia.org/r/#/c/218098/ (duration: 00m 12s)
* 17:43 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/218098/ (duration: 00m 12s)
* 17:18 logmsgbot: ori Synchronized php-1.26wmf11/resources/src/mediawiki.skinning/elements.css: Ieab6b1473e6ce: תיקון טעות (duration: 00m 12s)
* 15:59 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/219599/ (duration: 00m 12s)
* 15:57 logmsgbot: krenair Synchronized wmf-config/CommonSettings-labs.php: https://gerrit.wikimedia.org/r/#/c/217539/ - noop for prod, labs only part (duration: 00m 12s)
* 15:56 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/217539/ (duration: 00m 13s)
* 15:51 logmsgbot: krenair Synchronized wmf-config/flaggedrevs.php: https://gerrit.wikimedia.org/r/#/c/203370/ (duration: 00m 12s)
* 15:49 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/218539/ (duration: 00m 15s)
* 15:32 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings-labs.php: https://gerrit.wikimedia.org/r/#/c/220068/ - noop for prod, just labs (duration: 00m 12s)
* 15:30 logmsgbot: krenair Synchronized commonsuploads.dblist: https://gerrit.wikimedia.org/r/#/c/220715/ (duration: 00m 12s)
* 15:24 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/220747/ (duration: 00m 12s)
* 15:16 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/220408/ (duration: 00m 12s)
* 15:12 logmsgbot: krenair Synchronized php-1.26wmf11/extensions/SemanticForms/includes/SF_AutoeditAPI.php: https://gerrit.wikimedia.org/r/#/c/220765/ (duration: 00m 12s)
* 15:04 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/220706/ (duration: 00m 12s)
* 15:02 logmsgbot: krenair Synchronized wmf-config/throttle.php: https://gerrit.wikimedia.org/r/#/c/220653/ (duration: 00m 12s)
* 13:30 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Repool es2003 (but not es2004) after maintenance (duration: 00m 12s)
* 10:57 jynus: rebooting es2003 and es2004
* 10:40 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: depool es2003 and es2004 for maintenance (duration: 00m 13s)
* 10:09 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1018 (duration: 00m 12s)
* 09:02 jynus: restarting mysqld on db1018
* 08:42 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool db1018 for maintenance (duration: 00m 13s)
* 08:33 logmsgbot: ori Synchronized php-1.26wmf11/resources/src/mediawiki.skinning/elements.css: I0e5f2d3b2: Wrap lines in <nowiki><pre></nowiki> and .mw-code by default (duration: 00m 12s)
* 06:59 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Jun 25 06:59:13 UTC 2015 (duration 59m 12s)
* 04:04 ori: restarted apache2 on palladium
* 03:11 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-25 03:11:01+00:00
* 03:04 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 10m 19s)
* 02:40 bblack: puppet re-enabled on caches
* 02:37 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-25 02:37:44+00:00
* 02:34 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 06m 44s)
* 02:04 bblack: disabling puppet on cp* caches for patch-testing
* 00:43 awight: update crm from bd8a00196071ddd04efbff7b30567dd9357c9000 to e923225e423948bd70440e2d1131460b10cefac1
* 00:38 godog: upgrade cassandra to 2.1.7 on restbase1008
* 00:30 twentyafterfour: phabricator upgrade completed
* 00:28 godog: upgrade cassandra to 2.1.7 on restbase1004
* 00:12 legoktm: <twentyafterfour> Phabricator upgrade happening now. Will be down for a few minutes.
 
== June 24 ==
* 23:18 logmsgbot: rmoen Synchronized wmf-config/mobile.php: Enable browse experiment on test and enwiki (duration: 00m 14s)
* 23:17 logmsgbot: rmoen Synchronized wmf-config/InitialiseSettings.php: Enable browse experiment on test and enwiki (duration: 00m 12s)
* 23:13 urandom: rolling restart of Cassandra staging cluster
* 23:04 logmsgbot: legoktm Synchronized php-1.26wmf11/extensions/CentralAuth: https://gerrit.wikimedia.org/r/#/c/220637/ (duration: 00m 13s)
* 23:03 logmsgbot: legoktm Synchronized php-1.26wmf11/extensions/UserMerge: https://gerrit.wikimedia.org/r/#/c/220638/ (duration: 00m 13s)
* 22:32 mutante: zirconium - stop using 443 at all, rm NameVirtualHost *:443
* 22:30 mutante: zirconium - deleting unused apache configs, bugzilla, etherpad, ...
* 21:09 godog: start cassandra on restbase1008
* 18:41 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group1 wikis to 1.26wmf11
* 18:02 logmsgbot: legoktm Synchronized php-1.26wmf11/extensions/Flow/includes/Specials/SpecialEnableFlow.php: https://gerrit.wikimedia.org/r/#/c/220514/ (duration: 00m 15s)
* 17:24 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: repool es2001 and es2002 after maintenance (duration: 00m 13s)
* 17:05 thcipriani: scap completed with the exception of snapshot1001 that's disk is full
* 17:04 logmsgbot: thcipriani scap failed: OSError [Errno 2] No such file or directory: '/var/lock/scap' (duration: 41m 33s)
* 16:22 logmsgbot: thcipriani Started scap: SWAT: Automatically add to shell group when adding to a project [[gerrit:220468]]
* 16:10 logmsgbot: ori Synchronized php-1.26wmf11/includes/page/Article.php: I0e5f2d3b2: Revert r47388 / 8d9243cf3: Use Title::getLocalURL() for rel=canonical links (duration: 00m 13s)
* 15:57 logmsgbot: thcipriani Synchronized wmf-config: SWAT: Revert Enable browse prototype on test- and enwiki (duration: 00m 15s)
* 15:49 jynus: rebooting es2001 and es2002
* 15:44 logmsgbot: thcipriani Synchronized wmf-config: SWAT: Enable browse prototype on test- and enwiki [[gerrit:219451]] (duration: 00m 12s)
* 15:24 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable ContentTranslation in testwiki [[gerrit:220385]] (duration: 00m 12s)
* 15:17 logmsgbot: thcipriani Synchronized php-1.26wmf11/extensions/ContentTranslation: SWAT: Enable publish button when the preference is not to use initial translation (duration: 00m 12s)
* 15:14 andrewbogott: disabled puppet on labcontrol1001 to hotfix https://gerrit.wikimedia.org/r/#/c/220476/
* 15:08 logmsgbot: thcipriani Synchronized php-1.26wmf10/extensions/ContentTranslation: SWAT: Enable publish button when the preference is not to use initial translation (duration: 00m 13s)
* 14:53 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: depool es2001 and es 2002 for maintenance (duration: 00m 13s)
* 14:12 logmsgbot: krenair Synchronized php-1.26wmf10/extensions/SemanticForms/includes/SF_AutoeditAPI.php: T103653 live hack (duration: 00m 13s)
* 10:44 _joe_: restarting jmxtrans on analytics1021
* 10:31 jgage: restarting kafka on analytics1021
* 10:10 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Switchover master es1008 -> es1009 (duration: 00m 12s)
* 09:24 hashar: removing java 6 from gallium and lanthanum https://phabricator.wikimedia.org/T103491
* 09:17 hashar: apt-get upgrade on gallium and lanthanum
* 09:16 jynus: performing a master failover of es1008 into es1009
* 08:27 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1004 (duration: 00m 14s)
* 05:46 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Jun 24 05:46:32 UTC 2015 (duration 46m 31s)
* 05:12 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1045 (duration: 00m 13s)
* 05:03 jgage: removed old logs and did 'apt-get clean' on analytics1021 to make space
* 03:00 logmsgbot: LocalisationUpdate completed (1.26wmf11) at 2015-06-24 03:00:45+00:00
* 02:54 logmsgbot: l10nupdate Synchronized php-1.26wmf11/cache/l10n: (no message) (duration: 10m 34s)
* 02:28 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-24 02:28:16+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 07m 21s)
* 01:39 logmsgbot: ori Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi: I0e5f2d3b2 (duration: 00m 13s)
* 01:01 gwicke: rolling restart of cassandra instances to rule out a single node in funky state causing elevated p99 latency
* 00:43 ori: experimenting with httpd on mw1041 again
* 00:19 gwicke: rolling restart of restbase instances to rule out backend connections as a source for high p99 latencies
* 00:14 ori: experimenting with HHVM shutdown via /stop on the admin server on mw1041
 
== June 23 ==
* 23:38 logmsgbot: ori Finished scap: scapping to all apaches for --restart test (duration: 07m 03s)
* 23:30 logmsgbot: ori Started scap: scapping to all apaches for --restart test
* 23:24 bblack: nginxes all updated for ssl stapling bugfix
* 23:24 logmsgbot: ori Finished scap: scapping to scap-test dsh group for --restart test (duration: 06m 02s)
* 23:18 logmsgbot: ori Started scap: scapping to scap-test dsh group for --restart test
* 23:16 logmsgbot: ori scap aborted: scapping to scap-test dsh group for --restart test (duration: 00m 06s)
* 23:16 logmsgbot: ori Started scap: scapping to scap-test dsh group for --restart test
* 22:14 logmsgbot: legoktm Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.class.php: RejectParserCacheValue may pass a WikiPage or Article (duration: 00m 13s)
* 22:07 mutante: tmp. disabling puppet on mw1033
* 21:53 logmsgbot: legoktm Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.class.php: (no message) (duration: 00m 15s)
* 21:50 logmsgbot: ori Synchronized php-1.26wmf11/includes/parser/ParserCache.php: (no message) (duration: 00m 12s)
* 21:40 mutante: starting instance planet1001 on ganeti1003 - cant get console
* 21:40 logmsgbot: legoktm Synchronized php-1.26wmf11/includes/parser/ParserCache.php: (no message) (duration: 00m 13s)
* 21:36 bd808: updated scap to 33f3002 (Ensure that the minimum batch size used by cluster_ssh is 1)
* 21:34 logmsgbot: ori Synchronized php-1.26wmf11/extensions/SyntaxHighlight_GeSHi: 3c8bb2c493: Update SyntaxHighlight_GeSHi for cherry-pick (duration: 00m 13s)
* 20:32 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 wikis to 1.26wmf11
* 20:19 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings-labs.php: Beta-only change to add Flow_test to enwiki (duration: 00m 11s)
* 19:59 logmsgbot: ori scap failed: OSError [Errno 10] No child processes (duration: 01m 46s)
* 19:58 logmsgbot: ori Started scap: (no message)
* 19:52 ori: updated scap to master
* 19:11 ori: running apache graceful-stop on mw1042 to test mod_status behavior during graceful stop
* 19:02 logmsgbot: twentyafterfour Finished scap: New deployment branch: 1.26wmf11 try #2 (13 apaches failed) (duration: 03m 50s)
* 18:58 logmsgbot: twentyafterfour Started scap: New deployment branch: 1.26wmf11 try #2 (13 apaches failed)
* 18:53 logmsgbot: twentyafterfour Finished scap: New deployment branch: 1.26wmf11 (duration: 26m 37s)
* 18:31 godog: start rolling-downgrade of cassandra to 2.1.3 T102015
* 18:27 logmsgbot: twentyafterfour Started scap: New deployment branch: 1.26wmf11
* 18:13 logmsgbot: ori Finished scap: (no message) (duration: 04m 34s)
* 18:11 paravoid: reloading nginx on all cp* for reuseport
* 18:08 logmsgbot: ori Started scap: (no message)
* 17:57 ori: repooled scap-test servers (mw1170-mw1175 and mw1270-mw1275)
* 17:16 logmsgbot: ori Finished scap: (no message) (duration: 01m 42s)
* 17:14 logmsgbot: ori Started scap: (no message)
* 17:10 logmsgbot: ori Finished scap: (no message) (duration: 01m 34s)
* 17:09 logmsgbot: ori Started scap: (no message)
* 17:06 logmsgbot: ori scap aborted: (no message) (duration: 01m 23s)
* 17:04 logmsgbot: ori Started scap: (no message)
* 16:53 logmsgbot: bd808 Finished scap: no-op sync to scap-test dsh group; Testing HHVM restart take 4 (duration: 01m 30s)
* 16:52 logmsgbot: bd808 Started scap: no-op sync to scap-test dsh group; Testing HHVM restart take 4
* 16:45 cscott: updated OCG to version db7a56965233a74c73917c78b5c8c84c867321d9
* 16:37 logmsgbot: bd808 Finished scap: no-op sync to scap-test dsh group; Testing HHVM restart take 3 (duration: 01m 12s)
* 16:35 logmsgbot: bd808 Started scap: no-op sync to scap-test dsh group; Testing HHVM restart take 3
* 16:35 bd808: updated scap to da64a65 (Cast pid read from file to an int)
* 16:26 logmsgbot: bd808 Finished scap: no-op sync to scap-test dsh group; Testing HHVM restart take 2 (duration: 01m 26s)
* 16:25 logmsgbot: bd808 Started scap: no-op sync to scap-test dsh group; Testing HHVM restart take 2
* 16:22 bd808: updated scap to 947b93f (Fix reference to _get_apache_list)
* 16:12 logmsgbot: bd808 scap failed: AttributeError 'Scap' object has no attribute '_get_apache_list' (duration: 02m 15s)
* 16:10 logmsgbot: bd808 Started scap: no-op sync to scap-test dsh group; Testing HHVM restart
* 16:01 paravoid: staggered upgrade of cp* fleet to nginx 1.9.2
* 15:57 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: Follow-up 94e5fd2: Default wmgUseContentTranslation true only on Wikipedias [[gerrit:220161]] (duration: 00m 16s)
* 15:49 jynus: rebooting es1004
* 15:09 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Enable CX as default except where it is not deployed [[gerrit:220078]] (duration: 00m 12s)
* 15:04 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable 'frwiki-recommender' campaign in frwiki [[gerrit:220071]] (duration: 00m 13s)
* 14:54 paravoid: reprepro: including nginx 1.9.2-1~bpo8+1 to jessie-wikimedia/backports
* 14:39 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1003, depool es1004 (duration: 00m 12s)
* 14:04 cscott: reverted OCG to version ca4f64852de5b1de782b292b50038fbd2dd84266 (bundler failing with exit code 8)
* 13:57 cscott: updated OCG to version d7c698d5bf730d34057945e912ac75dc542dd788
* 13:44 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/209744/ (duration: 00m 13s)
* 13:44 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/209744/ (duration: 00m 12s)
* 12:54 moritzm: ssh on precise hosts has been updated to a backport of 6.6p1-2ubuntu2 (the version from trusty). this allows us to use modern crypto (plus labs can simplify key handling)
* 12:45 jynus: rebooting es1003
* 12:18 moritzm: uploaded openssh_6.6p1-2ubuntu2~wmfprecise2 to precise-wikimedia on apt.wikimedia.org
* 12:10 logmsgbot: hoo Synchronized arbitraryaccess.dblist: Arbitrary access for ruwiki and cswiki. T102122 (duration: 00m 12s)
* 11:33 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1002, depool es1003 (part 2/2) (duration: 00m 12s)
* 11:25 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1002, depool es1003 (duration: 00m 12s)
* 09:41 moritzm: updated jsch on gallium and lanthanum to support modern SSH key exchange in Jenkins (actually that happened yesterday, but I forgot to log it back then)
* 09:41 moritzm: added jsch_0.1.50-1ubuntu1~wmfprecise1 to precise-wikimedia on carbon
* 09:09 akosiaris: failing over etherpad to db1016
* 04:53 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Jun 23 04:53:17 UTC 2015 (duration 53m 16s)
* 03:33 springle: xtrabackup clone db2023 to db1045
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-23 02:26:44+00:00
* 02:23 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 06m 47s)
* 01:17 logmsgbot: krinkle Synchronized docroot and w: (no message) (duration: 00m 12s)
* 01:00 bd808: Pruned virt1000 from trebuchet minions list: redis-cli srem "deploy:scap/scap:minions" virt1000.wikimedia.org
 
== June 22 ==
* 23:42 gwicke: restarted Cassandra on restbase1006
* 23:27 logmsgbot: catrope Synchronized php-1.26wmf10/extensions/MobileFrontend: For real this time (duration: 00m 14s)
* 23:27 logmsgbot: catrope Synchronized php-1.26wmf10/extensions/Gather: For real this time (duration: 00m 13s)
* 23:17 logmsgbot: catrope Synchronized php-1.26wmf10/extensions/Gather: SWAT (duration: 00m 12s)
* 23:17 logmsgbot: catrope Synchronized php-1.26wmf10/extensions/MobileFrontend/: SWAT (duration: 00m 15s)
* 23:12 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Enable TinyRGB ICC profile swapping on testwiki (duration: 00m 13s)
* 22:51 logmsgbot: ori Synchronized php-1.26wmf10/resources/src/mediawiki/mediawiki.Title.js: I0e5f2d3b2: Fix undeclared dependency on jquery.mwExtension (duration: 00m 12s)
* 22:45 gwicke: restarting Cassandra on restbase1005 to get the metrics back
* 22:37 gwicke: restarting Cassandra on restbase1004 to get the metrics back
* 22:33 gwicke: restarting Cassandra on restbase1003 to get the metrics back
* 22:24 gwicke: restarting Cassandra on restbase1002 to get the metrics back
* 22:19 bd808: scap error "@ERROR: access denied to common from localhost (127.0.0.1)" from mw2187 and mw2080 on sync-file test.
* 22:17 logmsgbot: bd808 Synchronized README: Testing sync-file after scap update (duration: 00m 12s)
* 22:08 RoanKattouw: Deployed patch for T103054
* 21:59 godog: reboot restbase1008
* 21:56 bd808: updated scap to 81b7c14 (Move dsh group file names to config)
* 21:55 bd808: trebuchet checkout for scap/scap failed on 23 hosts: mw1104, mw1222, mw2009, mw2011, mw2021, mw2028, mw2031, mw2034, mw2069, mw2076, mw2080, mw2086, mw2095, mw2099, mw2120, mw2127, mw2131, mw2136, mw2170, mw2187, mw2189, mw2197, virt1000
* 21:50 bd808: trebuchet fetch for scap/scap failed on mw2086.codfw.wmnet, mw1222.eqiad.wmnet and virt1000.wikimedia.org
* 21:41 gwicke: restarting Cassandra on restbase1001 to get the metrics back
* 21:20 ori: Depooled mw1170-mw1175 and mw1270-mw1275 for testing Idddcfe46
* 21:07 chasemp: rebooting mw1101 the hard way
* 20:28 cscott: updated Parsoid to version d488783e
* 19:34 akosiaris: delete pad:ips from etherpad
* 19:01 jynus: rebooting es1002
* 18:52 logmsgbot: ori Synchronized php-1.26wmf10/includes/OutputPage.php: I0e5f2d3b2: Construct clean canonical URLs for wiki pages, ignoring request URL (T67402) (duration: 00m 14s)
* 18:01 legoktm: live-hacking mw1017 to debug T103053
* 17:49 mutante: Bugzilla has left the building
* 16:31 jynus: reseting wikitech-static mysql contents to improve fragmentation
* 16:26 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1001, depool es1002 (duration: 00m 14s)
* 16:12 andrewbogott: shutting down virt1000
* 16:08 andrewbogott: disabling puppet on virt1000
* 16:07 ottomata: deploying eventlogging 0.9.  This includes changes for arbitrary eventlogging URIs in all eventlogging stages, as well as support for schema based kafka topic URIs. 
* 15:24 logmsgbot: thcipriani Synchronized php-1.26wmf10/extensions/WikiEditor: SWAT: Reduce 'Edit' EventLogging schema sampling rate to 6.25% (1/16th) [[gerrit:219837]] (duration: 00m 13s)
* 15:04 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings-labs.php: SWAT: Default wmgUseWikibaseQuality on beta to true. [[gerrit:219630]] (duration: 00m 14s)
* 14:32 hashar: restarting Jenkins
* 13:26 jynus: rebooting es1001 for regular maintenance
* 12:08 paravoid: powercycled ms-be1002, stuck at console
* 11:12 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Depool es1001 (duration: 00m 13s)
* 11:06 _joe_: restarting hhvm on the low-memory appservers (main and api)
* 09:23 hashar: upgrading Jenkins gearman plugin from 0.1.1 to latest master (f2024bd). Restarting Jenkins.
* 05:11 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Jun 22 05:11:22 UTC 2015 (duration 11m 21s)
* 02:31 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-22 02:31:32+00:00
* 02:27 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 07m 27s)
* 00:44 jgage: restarted gitblit on antimony again
 
== June 21 ==
* 11:28 jynus: restarting apache on mw1110
* 06:55 gwicke: restarted  bootstrap on restbase1009 earlier today; hardware hasn't died yet
* 05:01 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun Jun 21 05:01:07 UTC 2015 (duration 1m 6s)
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-21 02:27:13+00:00
* 02:23 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 10m 23s)
* 01:39 jgage: restarted gitblit on antimony at 00:43 UTC
* 01:37 Krenair: testing morebots
 
== June 20 ==
* 22:50 bblack: restarted gitblit java service on antimony
* 04:27 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat Jun 20 04:27:14 UTC 2015 (duration 27m 13s)
* 02:21 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-20 02:21:30+00:00
* 02:18 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 07m 02s)
 
== June 19 ==
* 23:32 gwicke: upgraded restbase1006 to cassandra 2.1.7
* 23:30 gwicke: starting cassandra bootstrap on restbase1009
* 21:37 gwicke: upgraded cassandra on 1003 to 2.1.7 (pre-release, likely going out on Monday)
* 18:32 godog: stop cassandra on restbase1008
* 17:45 logmsgbot: krenair Synchronized private/PrivateSettings.php: sync 4a30446e for wikitech cleanup - T102361 (duration: 00m 12s)
* 17:24 godog: install linux 3.19 on restbase100[789]
* 17:12 ori: salt -t30 -G 'php:hhvm' cmd.run 'rm -f /usr/local/bin/check_tc_space' (https://gerrit.wikimedia.org/r/#/c/219102/)
* 16:54 moritzm: updated/rebooted nescio/maerlant to 3.19
* 13:40 andrewbogott: test test test
* 02:19 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-19 02:19:33+00:00
* 02:16 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 05m 08s)
* 00:49 springle: killed storm of research queries on dbstore1002, load avg 90+, replag, likely explosion, etc. emailing analytics@
* 00:13 logmsgbot: ebernhardson Synchronized php-1.26wmf10/extensions/Flow/tests/: no-op sync of flow test cases in wmf10 (duration: 00m 17s)
* 00:11 logmsgbot: ebernhardson Synchronized php-1.26wmf10/skins/Vector/: Bump Vector submodule in 1.26wmf10 for swat (duration: 00m 12s)
 
== June 18 ==
* 23:37 logmsgbot: ebernhardson Synchronized php-1.26wmf9/skins/Vector: Bump Vector in 1.26wmf9 for SWAT (duration: 00m 16s)
* 23:22 logmsgbot: ebernhardson Synchronized wmf-config/: Actually enable the feedback link on Special:Search (duration: 00m 17s)
* 23:08 logmsgbot: ebernhardson Synchronized wmf-config/InitialiseSettings.php: Enable wgCirrusSearchFeedbackLink on enwiki (duration: 00m 13s)
* 21:07 godog: start (bootstrap) cassandra on restbase1008
* 20:43 akosiaris: uploaded to apt.wikimedia.org trusty-wikimedia: apertium-urd-hin_0.1.0+svn~r60389-1
* 20:17 akosiaris: restarted salt on sca1001, truncate log files. keep a sample in /tmp/
* 20:03 chasemp: apache && hhvm restart for mw 1243 1250 1254 1256 1257
* 20:00 chasemp: apache && hhvm restart for mw...1256 1255 1254 1250 1243 1242 1071 1021
* 19:58 mutante: restarting hhvm on mw1021, mw1071
* 19:27 godog: bounce cassandra on restbase1003, new logging configuration
* 19:26 akosiaris: puppet-merged on strontium
* 19:15 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedia wikis to 1.26wmf10
* 19:06 godog: upgrade cassandra to 2.1.6 on restbase1003
* 18:56 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-urd_0.1.0~r57551-1
* 18:56 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-hin_0.1.0~r57344-1
* 18:56 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-cy-en_0.1.1~r57554-1
* 18:43 legoktm: fixed content model of MediaWiki:Common.css@lrcwiki
* 18:18 YuviPanda: restarted nutcracker on wikitech
* 18:16 YuviPanda: restarted keystone on labcontrol1001
* 17:13 gwicke: bouncing cassandra on restbase1002
* 17:11 godog: restart cassandra on restbase1004
* 15:53 gwicke: updated restbase to 7ffaf94b
* 15:13 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Hovercards: Disable test release on Catalan and Greek Wikipedias [[gerrit:215932]] (duration: 00m 13s)
* 15:06 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Add wikis for deployment on 20150618 [[gerrit:218886]] (duration: 00m 14s)
* 11:14 akosiaris: powercycling labstore2001
* 09:08 moritzm: added firejail_0.9.26-1~wmfjessie1 and firejail_0.9.26-1~wmftrusty1 to apt.wikimedia.org
* 08:45 jynus: very brief replication stop for s7, already corrected
* 06:51 Coren: rebooting labstore2001
* 06:32 legoktm: live hacking mw1017 for T102915
* 05:26 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Jun 18 05:26:01 UTC 2015 (duration 26m 0s)
* 02:48 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-18 02:48:44+00:00
* 02:46 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 05m 03s)
* 02:32 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-18 02:32:45+00:00
* 02:28 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 05m 56s)
* 02:04 springle: applied T99941 scema change to all remaining affected (ie, old) wikis
* 02:01 tgr: ran https://gerrit.wikimedia.org/r/#/c/159350/7/backend/schema/mysql/developer_agreement.sql on mediawikiwiki
* 01:32 ejegg: updated payments from f33d0a8687a120a2057a7e6acad67da63b17f97e to a17ee221db0dbde70c92e24fc188379b6dbad613
* 01:20 logmsgbot: ori Synchronized php-1.26wmf10/resources/src/mediawiki.action/mediawiki.action.edit.stash.js: 0c21a14a6e: Revert StashEdit: Use postWithToken (duration: 00m 13s)
* 01:06 twentyafterfour: applied hotfix for T102276 and restarted apache on iridium
* 00:00 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf10
 
== June 17 ==
* 23:35 logmsgbot: catrope Synchronized php-1.26wmf10/extensions/Gather: SWAT (duration: 00m 14s)
* 23:35 gwicke: rolled back restbase to 90817c2a
* 23:24 logmsgbot: catrope Synchronized php-1.26wmf9/extensions/MobileFrontend: SWAT (duration: 00m 15s)
* 23:23 logmsgbot: catrope Synchronized php-1.26wmf9/extensions/Flow: SWAT (duration: 00m 15s)
* 22:45 gwicke: rolling restart of cassandra nodes
* 22:09 gwicke: rolling restart of restbase instances to apply puppet change after puppet actually ran on all nodes
* 21:58 gwicke: rolling restart of restbase instances to apply config change
* 21:56 godog: restart nutcracker on mw1145
* 21:35 gwicke: restarting cassandra on restbase1005
* 20:47 mutante: temp. stopped icinga-wm
* 20:37 gwicke: deployed RESTBase 7ffaf94bfc
* 20:24 cscott: updated Parsoid to version 402ddf66
* 20:01 ottomata: resized antimony's / LV from 30G to 100G.  looks like /var/lib/git was getting filled up
* 19:43 jynus: rolling schema changes on hewiki
* 19:29 godog: downgrade and restart cassandra to 2.1.3 on restbase1001, metrics not being pushed to graphite with 2.1.6
* 19:05 godog: bounce cassandra on xenon
* 18:46 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: Ic03b152de: Make $wgUploadPath for commons https only for benefit instant commons (duration: 00m 14s)
* 18:11 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group1 wikis to 1.26wmf10
* 17:45 godog: bounce cassandra on restbase1001
* 17:39 mutante: repooled mw1234
* 17:24 ottomata: starting reinstall of Zookeeper analytics nodes (analytics102[345]): https://phabricator.wikimedia.org/T101713
* 17:16 godog: bounce cassandra on restbase1001
* 17:14 jynus: rolling schema changes on ruwiki master
* 17:13 mutante: running puppet via salt on api appservers in batches, switch to ganglia_new and carbon
* 17:12 godog: cassandra stopped sending graphite metrics after restart, investigating (test cluster works fine tho)
* 16:58 jynus: rolling schema changes on ruwiki slaves
* 16:28 godog: start upgrading restbase1001 to cassandra 2.1.6 T102015
* 16:02 logmsgbot: thcipriani Finished scap: Wikitech-Ldap host record roll-out (duration: 24m 35s)
* 15:37 logmsgbot: thcipriani Started scap: Wikitech-Ldap host record roll-out
* 15:19 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Give patrolmarks right to "*" on dewiki [[gerrit:218901]] (duration: 00m 13s)
* 15:17 logmsgbot: anomie Synchronized wmf-config/throttle.php: SWAT: Add a throttle exception for United Islands of Prague [[gerrit:217413]] (duration: 00m 14s)
* 15:15 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Disable captcha on labswiki for now [[gerrit:218908]] (duration: 00m 13s)
* 15:10 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Add extra namespace aliases for Italian Wikipedia [[gerrit:215708]] (duration: 00m 13s)
* 15:08 anomie: SWAT: Enable anti-abuse features on labswiki [[gerrit:218903]]
* 15:08 jynus: testing some schema changes on testwiki
* 15:00 logmsgbot: aude Synchronized usagetracking.dblist: Enable Wikibase usage tracking on nowiki and plwiki (duration: 00m 13s)
* 13:56 logmsgbot: aude Synchronized usagetracking.dblist: Enable Wikibase usage tracking on fiwiki and idwiki (duration: 00m 13s)
* 13:26 logmsgbot: aude Synchronized usagetracking.dblist: Enable Wikibase usage tracking on bgwiki and eowiki (duration: 00m 13s)
* 10:52 akosiaris: reload pybal on lvs1006
* 10:50 mobrovac: finished deploying mathoid I40ef68 on SCA
* 10:48 akosiaris: repooled mathoid.svc.eqiad.wmnet: sca1002 backend
* 10:44 akosiaris: enable puppet on sca1002
* 10:43 akosiaris: enable puppet
* 10:43 akosiaris: depool sca1002 for mathoid.svc.eqiad.wmnet
* 10:43 akosiaris: reloaded pybal on lvs1003
* 10:28 akosiaris: repool sca1002, depool sca1001
* 10:18 mark: Halting pvmove of md124 on labstore1001
* 09:30 akosiaris: disable puppet on sca1001
* 09:09 akosiaris: depool sca1001, resource: mathoid
* 09:09 akosiaris: puppet disabled on sca1002
* 08:37 YuviPanda: run sudo salt -t 20 -b 100 '*' cmd.run 'sudo service salt-minion restart' on virt1000, attempt to get them to answer on labcontrol1001 instead
* 06:52 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Jun 17 06:52:58 UTC 2015 (duration 52m 57s)
* 02:56 logmsgbot: LocalisationUpdate completed (1.26wmf10) at 2015-06-17 02:56:49+00:00
* 02:55 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1045 (duration: 00m 13s)
* 02:54 springle: found wikiversions.json modified on tin since 2015-06-16 23:27 (catrope?); stashed and reapplied the file in order to do a pull
* 02:54 logmsgbot: l10nupdate Synchronized php-1.26wmf10/cache/l10n: (no message) (duration: 04m 44s)
* 02:35 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-17 02:35:23+00:00
* 02:32 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 06m 12s)
* 02:21 logmsgbot: ori Synchronized php-1.26wmf9/extensions/CentralNotice/modules/ext.centralNotice.bannerController/bannerController.js: I480cbc7ad (duration: 00m 12s)
* 02:21 logmsgbot: ori Synchronized php-1.26wmf10/extensions/CentralNotice/modules/ext.centralNotice.bannerController/bannerController.js: I480cbc7ad (duration: 00m 12s)
* 00:10 paravoid: draining esams because of upcoming network maintenance window
 
== June 16 ==
* 23:28 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Enable local upload on fawikivoyage; enable logging for T76305 (duration: 00m 13s)
* 23:28 logmsgbot: catrope Synchronized wmf-config/CommonSettings.php: Set previous values for password length policies (duration: 00m 16s)
* 23:17 logmsgbot: twentyafterfour Finished scap: testwiki to 1.26wmf10 (duration: 43m 04s)
* 23:02 godog: restore INFO cassandra logging level on restbase1003
* 22:44 godog: start cassandra on restbase1008
* 22:43 godog: enable back some cassandra debugging on restbase1003
* 22:33 logmsgbot: twentyafterfour Started scap: testwiki to 1.26wmf10
* 22:26 urandom: restored default logging level on restbase1003
* 22:22 urandom: enabling even more debugging on restbase1003
* 22:14 urandom: enable (some) debug logging on restbase1003
* 21:57 logmsgbot: twentyafterfour scap failed: CalledProcessError Command '/usr/local/bin/mwscript mergeMessageFileList.php --wiki="testwiki" --list-file="/srv/mediawiki-staging/wmf-config/extension-list" --output="/tmp/tmp.SxGNHsmVYP" ' returned non-zero exit status 1 (duration: 01m 24s)
* 21:56 logmsgbot: twentyafterfour Started scap: testwiki to 1.26wmf10
* 20:34 logmsgbot: krinkle Synchronized php-1.26wmf9/extensions/WikimediaEvents/modules/ext.wikimediaEvents.resourceloader.js: T101806 live hack (duration: 00m 12s)
* 19:24 Coren: labstore1001 pvmove of slice2 to slice 51 started; some bursts of iowait expected but should have minimal enduser impact)
* 18:36 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Fix usage tracking setting (duration: 00m 14s)
* 18:03 godog: bounce statsite on graphite1001, stuck while writing to graphite
* 17:30 ejegg: update SmashPig on listener from e1e925c9fc2a60c1e14ef01d8b653dc09512f51f to 258f2c917b1ae50b01231927bcd6f58ecaa8940b
* 17:23 logmsgbot: krinkle Synchronized php-1.26wmf9/includes/resourceloader/ResourceLoader.php: undo live hack (duration: 00m 13s)
* 17:09 logmsgbot: aude Synchronized arbitraryaccess.dblist: Enable arbitrary access on gomwiki and lrcwiki (duration: 00m 13s)
* 17:09 logmsgbot: aude Synchronized usagetracking.dblist: Enable Wikibase usage tracking on second batch of s3 wikis (duration: 00m 13s)
* 17:03 logmsgbot: bblack Synchronized wmf-config/InitialiseSettings.php: wgCanonicalServer: HTTPS for all (duration: 00m 15s)
* 16:44 logmsgbot: krenair Synchronized wmf-config/interwiki.cdb: Updating interwiki cache (duration: 00m 13s)
* 16:43 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 13s)
* 16:43 logmsgbot: krenair Synchronized w/static/images/project-logos/gomwiki.png: (no message) (duration: 00m 14s)
* 16:42 logmsgbot: krenair Synchronized langlist: gomwiki (duration: 00m 13s)
* 16:41 logmsgbot: krenair rebuilt wikiversions.cdb and synchronized wikiversions files: (no message)
* 16:40 logmsgbot: krenair Synchronized database lists: (no message) (duration: 00m 13s)
* 16:29 logmsgbot: krenair Synchronized wmf-config/interwiki.cdb: Updating interwiki cache (duration: 00m 13s)
* 16:27 logmsgbot: krenair Synchronized langlist: (no message) (duration: 00m 14s)
* 16:25 logmsgbot: krenair Synchronized w/static/images/project-logos/lrcwiki.png: (no message) (duration: 00m 13s)
* 16:21 moritzm: updated copper, oxygen, labstore2001 and labnodepool1001 to the 3.19 kernel
* 16:11 logmsgbot: krenair Synchronized wmf-config/interwiki.cdb: Updating interwiki cache (duration: 00m 13s)
* 16:10 logmsgbot: krenair Synchronized wmf-config: (no message) (duration: 00m 14s)
* 16:06 logmsgbot: krenair rebuilt wikiversions.cdb and synchronized wikiversions files: (no message)
* 16:05 logmsgbot: krenair Synchronized database lists: (no message) (duration: 00m 15s)
* 15:43 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: templateeditor: add templateeditor right in hewiki [[gerrit:218426]] (duration: 00m 13s)
* 15:09 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Turn on wgGenerateThumbnailOnParse for wikitech. [[gerrit:218553]] (duration: 00m 12s)
* 15:03 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Add wikis for CX deployment on 20150616 [[gerrit:218341]] (duration: 00m 12s)
* 14:18 cmjohnson: barium is going down for disk replacement
* 13:38 logmsgbot: aude Synchronized usagetracking.dblist: Enable Wikibase usage tracking on dewiki (duration: 00m 15s)
* 13:18 akosiaris: rebooted etherpad1001 for kernel upgrades
* 12:51 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Repool es2005, es2006 and es2007 after maintenance (duration: 00m 13s)
* 12:44 logmsgbot: aude Synchronized usagetracking.dblist: Enable Wikibase usage tracking on cswiki (duration: 00m 14s)
* 12:20 logmsgbot: aude Synchronized usagetracking.dblist: Enable usage tracking on ruwiki (duration: 00m 15s)
* 11:21 paravoid: restarting the puppetmaster
* 11:19 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1073, warm up (duration: 00m 13s)
* 10:36 akosiaris: rebooting ganeti200{1..6}.codfw.wmnet for kernel upgrades
* 09:33 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Depool es2005, es2006 and es2007 for maintenance (duration: 00m 14s)
* 09:10 YuviPanda: deleted huge puppet-master.log on labcontrol1001
* 08:05 jynus: added m5-slave to dns servers
* 07:52 paravoid: restarting hhvm on mw1121
* 07:52 moritzm: blacklisted the overlayfs kernel module (prevents a reliable local root exploit on all Ubuntu systems). no systems in the fleet had an overlaysfs mount present or the kernel module loaded, so there should be no impact on existing systems. Note: This is a bandaid, I'll create a Phab task to deploy this via puppet in the future (and to also blacklist additional desktopy kernel modules which increase our attack
* 07:39 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1005 (duration: 00m 14s)
* 06:24 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Jun 16 06:24:04 UTC 2015 (duration 24m 3s)
* 06:18 godog: restore ES replication throttling to 20mb/s
* 06:13 godog: restore ES replication throttling to 40mb/s
* 06:08 logmsgbot: filippo Synchronized wmf-config/PoolCounterSettings-common.php: unthrottle ES (duration: 00m 14s)
* 05:56 godog: bump ES replication throttling to 60mb/s
* 05:50 manybubbles: ok - we're yellow and recovering. ops can take this from here. We have a root cause and we have things I can complain about to the elastic folks I plan to meet with today anyway. I'm going to finish waking up now.
* 05:49 manybubbles: reenabling puppet agent on elasticsearch machines
* 05:46 manybubbles: I expect them to be red for another few minutes during the initial master recovery
* 05:45 manybubbles: started all elasticsearch nodes and now they are recovering.
* 05:41 godog: restart gmond on elastic1007
* 05:39 logmsgbot: filippo Synchronized wmf-config/PoolCounterSettings-common.php: throttle ES (duration: 00m 13s)
* 05:25 manybubbles: shutting down all the elasticsearch on the elasticsearch nodes against - another full cluster restart should fix it like it did last time...............
* 05:11 godog: restart elasticsearch on elastic1031
* 03:06 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1073 (duration: 00m 12s)
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-16 02:27:51+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 05m 52s)
* 00:55 tgr: running extensions/Gather/maintenance/updateCounts.php for gather wikis - https://phabricator.wikimedia.org/T101460
* 00:52 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1057, warm up (duration: 00m 13s)
* 00:46 godog: killed bacula-fd on graphite1001, shouldn't be running and consuming bandwidth (cc akosiaris)
* 00:27 godog: kill python stats on cp1052, filling /tmp
 
== June 15 ==
* 23:42 ori: Cleaning up renamed jobqueue metrics on graphite{1,2}001
* 23:01 godog: killed bacula-fd on graphite2001, shouldn't be running and consuming bandwidth (cc akosiaris)
* 22:54 logmsgbot: hoo Synchronized wmf-config/filebackend.php: Fix commons image inclusion after commons went https only (duration: 00m 14s)
* 22:18 godog: run disk stress-test on restbase1007 / restbase1009
* 22:06 logmsgbot: twentyafterfour Synchronized hhvm-fatal-error.php: deploy: Guard header() call in error page (duration: 00m 15s)
* 22:05 logmsgbot: twentyafterfour Synchronized wmf-config/InitialiseSettings-labs.php: deploy: Never use wgServer/wgCanonicalServer values from production in labs (duration: 00m 12s)
* 20:37 logmsgbot: yurik Synchronized docroot/bits/WikipediaMobileFirefoxOS: Bumping FirefoxOS app to latest (duration: 00m 14s)
* 20:30 godog: bounce cassandra on restbase1003
* 20:18 godog: start cassandra on restbase1008, bootstrapping
* 20:04 godog: sign restbase1008 key, run puppet
* 20:00 godog: powercycle restbase1007, investigate disk issue
* 19:07 logmsgbot: ori Synchronized php-1.26wmf9/includes/jobqueue: 0a32aa3be4: jobqueue: use more sensible metric key names (duration: 00m 13s)
* 16:57 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT:  Grant cloudadmins the 'editallhiera' right [[gerrit:218115]] (duration: 00m 14s)
* 16:48 logmsgbot: thcipriani Synchronized php-1.26wmf9/extensions/OpenStackManager/OpenStackManagerHooks.php: SWAT: refer to user the right way (duration: 00m 13s)
* 16:48 godog: powercycle graphite1002, no ssh, unresponsive console
* 16:19 jynus: upgrading es1005 mysql service while depooled
* 16:12 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT:  Grant cloudadmins the 'editallhiera' right [[gerrit:218115]] (duration: 00m 12s)
* 16:10 bblack: pybal restarts complete, all ok
* 16:09 logmsgbot: thcipriani Finished scap: SWAT: Openstack manager and language updates (duration: 21m 27s)
* 15:47 logmsgbot: thcipriani Started scap: SWAT: Openstack manager and language updates
* 15:46 bblack: starting pybal restart process for config changes ( https://gerrit.wikimedia.org/r/#/c/218285/ ), inactives first w/ manual verification of ok-ness
* 15:11 bblack: rebooting cp3041 (downtimed)
* 15:00 _joe_: ES is green
* 14:38 logmsgbot: aude Synchronized php-1.26wmf9/extensions/Wikidata: Fix property label constraints bug (duration: 00m 24s)
* 14:27 logmsgbot: aude Synchronized arbitraryaccess.dblist: Enable arbitrary access on s7 wikis (duration: 00m 13s)
* 13:47 jynus: enabling puppet on all elastic* nodes, should enable also ganglia
* 13:11 logmsgbot: demon Synchronized wmf-config/PoolCounterSettings-common.php: all the search (duration: 00m 12s)
* 13:04 _joe_: re-scaling down the recovery index bandwidth in ES to 20 mb/s
* 12:52 logmsgbot: demon Synchronized wmf-config/PoolCounterSettings-common.php: partially turn search back on (duration: 00m 13s)
* 11:54 _joe_: raised the ES index replica bandwidth limit to 60mb
* 11:31 akosiaris: migrating etherpad.wikimedia.org to etherpad1001.eqiad.wmnet
* 11:15 _joe_: raised the max bytes for ES recovery to 40mbps
* 10:49 manybubbles: and we're yellow right now.
* 10:49 manybubbles: the initial primaries stage - the red stage of the rolling restart - recovers quick-ish
* 10:48 manybubbles: soon we should see it go yellow and stay that way while the replicas recover
* 10:48 manybubbles: manybubbles is confident his mighty bitch slap of the elasticsearch cluster has set it further to the road to recovery
* 10:46 jynus: disabled puppet on all elasticsearch nodes to avoid restarting services and other magic
* 10:44 _joe_: disabled hot threads logging, ganglia on es nodes
* 10:44 manybubbles: started Elasticsearch on all elasticsearch nodes
* 10:38 manybubbles: stopping all elasticsearch servers - going for a full cluster resstart.
* 10:11 manybubbles: restarting elasticsearch on elasticsearch1021 - that one is in a gc death spiral
* 09:26 logmsgbot: oblivian Synchronized wmf-config/PoolCounterSettings-common.php: temporarily throttle down cirrussearch (duration: 00m 13s)
* 09:12 logmsgbot: oblivian Synchronized wmf-config/PoolCounterSettings-common.php: temporarily throttle down cirrussearch (duration: 00m 13s)
* 07:35 _joe_: attempting a fast restart of elastic1020
* 07:21 logmsgbot: ori Synchronized php-1.26wmf9/extensions/CirrusSearch/includes/Util.php: I504dac0c3: Add missing 'use \Status;' to includes/Util.php (duration: 00m 13s)
* 04:56 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Jun 15 04:56:39 UTC 2015 (duration 56m 38s)
* 03:31 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1057 (duration: 00m 12s)
* 02:22 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-15 02:22:56+00:00
* 02:19 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 05m 46s)
 
== June 14 ==
* 10:39 YuviPanda: running du -d 2 on /srv/project in a screen sesssion on labstore1001
* 04:33 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun Jun 14 04:33:20 UTC 2015 (duration 33m 19s)
* 02:42 logmsgbot: reedy Synchronized wmf-config/extension-list: noop (duration: 00m 13s)
* 02:40 logmsgbot: krenair Synchronized wmf-config/squid-labs.php: sync random labs-only file to test per irc (duration: 00m 13s)
* 02:21 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-14 02:21:28+00:00
* 02:18 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 05m 47s)
 
== June 13 ==
* 19:30 bblack: repooled cp1071, cp3040
* 18:53 bblack: rebooting cp1071, cp3040 to look at BIOS-level things (depooled, icinga-downed)
* 17:08 logmsgbot: krinkle Synchronized php-1.26wmf9/extensions/WikimediaEvents: T101806 (duration: 00m 12s)
* 15:47 paravoid: labstore1001: stopping manage-nfs-volumes daemon
* 04:41 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat Jun 13 04:41:57 UTC 2015 (duration 41m 56s)
* 03:51 Krinkle: Running deleteEqualMessages.php for sawiki (T45917)
* 03:49 Krinkle: Running deleteEqualMessages.php for cewiki (T45917)
* 02:21 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-13 02:20:58+00:00
* 02:18 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 05m 19s)
* 00:17 gwicke: restarted cassandra on restbase1001
* 00:13 gwicke: restarted cassandra on restbase1002
 
== June 12 ==
* 22:57 ejegg: rolled back SmashPig on listener from 15acdafef9d9682c417632e5ac5a5f2e5380f92e to e1e925c9fc2a60c1e14ef01d8b653dc09512f51f
* 22:40 ejegg: updated SmashPig on listener from e1e925c9fc2a60c1e14ef01d8b653dc09512f51f to 15acdafef9d9682c417632e5ac5a5f2e5380f92e
* 22:24 godog: upgrade and bounce carbon daemons on graphite2001 to investigate T101572
* 21:16 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: I3694489ba: wgCanonicalServer->https for new HTTPS domains (duration: 00m 14s)
* 20:33 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/217878/1 (duration: 00m 13s)
* 20:32 logmsgbot: krenair Synchronized w/static/images/project-logos/dawiki-200k.png: https://gerrit.wikimedia.org/r/#/c/217878/1 (duration: 00m 16s)
* 20:15 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/217670/ (duration: 00m 12s)
* 19:28 ejegg: updated SmashPig on payments-listener from f9c3eaa99fa0fe8ef098d0fc876091d3676aa039 to 5a463400bc74706ba7bf6256cd0101014e792acb
* 19:28 ejegg: updated SmashPig on payments-listener ccepting New Patients:
* 18:47 ejegg: updated SmashPig on payments-listener from 7fed22ad933a6d3e371d60dfc6f8fdd0f9131510 to f9c3eaa99fa0fe8ef098d0fc876091d3676aa039
* 18:45 logmsgbot: faidon Synchronized wmf-config/InitialiseSettings.php: remove wmgHTTPSBlacklistCountries (duration: 00m 12s)
* 18:45 logmsgbot: faidon Synchronized wmf-config/CommonSettings.php: remove CanIPUseHTTPS hook (duration: 00m 13s)
* 17:39 moritzm: updated cerium, xenon and praseodymium to 3.19 kernel
* 17:08 ejegg: enabled queue consumer
* 17:08 ejegg: updated crm from d13aaa4e9e937b0b1ae1f5de61ea7ff1f316d58f to bd8a00196071ddd04efbff7b30567dd9357c9000
* 16:53 ejegg: disabled donations queue consumer
* 15:52 logmsgbot: faidon Synchronized wmf-config/CommonSettings.php: hide prefershttps user pref (duration: 00m 13s)
* 15:40 logmsgbot: faidon Synchronized docroot/search.wikimedia.org/index.php: unbreak search.wikimedia.org due to HTTPS (duration: 00m 12s)
* 15:27 jynus: mysql load issues on labsdb1003, investigating
* 13:39 moritzm: updated etcd* to 3.19 kernel
* 12:11 jynus: restarting mariadb at labsdb1003
* 11:58 moritzm: updated rdb200* to 3.19 kernel
* 11:31 jynus: db2068 up but all services and console login unresponsive, powercycling
* 10:06 springle: killed a bunch of queries hammering labsdb1003 for days
* 09:58 moritzm: updated mc2004 to mc2016 to 3.19 kernel
* 06:06 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri Jun 12 06:06:55 UTC 2015 (duration 6m 54s)
* 04:37 logmsgbot: ori Synchronized php-1.26wmf9/extensions/FlaggedRevs: I4cfb47b41: Avoid post-redirect parse for certain edits (duration: 00m 14s)
* 02:40 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-12 02:40:36+00:00
* 02:34 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 10m 00s)
* 00:40 logmsgbot: legoktm Synchronized wmf-config/InitialiseSettings-labs.php: https://gerrit.wikimedia.org/r/217759 (duration: 00m 15s)
* 00:07 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings-labs.php: (no message) (duration: 00m 14s)
 
== June 11 ==
* 23:59 logmsgbot: legoktm Synchronized wmf-config/InitialiseSettings-labs.php: https://gerrit.wikimedia.org/r/217753 (duration: 00m 16s)
* 23:54 logmsgbot: ori Synchronized php-1.26wmf9/includes/EditPage.php: cf7df757f2: Instrument edit failures (duration: 00m 14s)
* 23:41 logmsgbot: ebernhardson Synchronized php-1.26wmf9/extensions/MobileFrontend: Bump MobileFrontend in 1.26wmf9 for SWAT (duration: 00m 14s)
* 23:40 ejegg: updated civicrm from 7ffe0cefb019828a09c9369187f14518847b5f41 to d13aaa4e9e937b0b1ae1f5de61ea7ff1f316d58f
* 23:24 logmsgbot: ebernhardson Synchronized php-1.26wmf9/extensions/CirrusSearch/: Fix prefer-recent queries in cirrussearch (duration: 00m 13s)
* 23:02 ejegg: updated SmashPig on the rest of the cluster from 477e8a8be5ea895262031c147330de5a651cc3ac to 7fed22ad933a6d3e371d60dfc6f8fdd0f9131510
* 22:17 godog: temporary bump php memory_limit on magnesium to test T102092
* 22:11 ejegg: updated SmashPig on payments-listener from 477e8a8be5ea895262031c147330de5a651cc3ac to 7fed22ad933a6d3e371d60dfc6f8fdd0f9131510
* 21:54 ori: Widespread TC cache exhaustion again, doing rolling restart of HHVMs
* 21:46 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I3d3ed7647: Test LCStoreStaticArray on test2wiki (duration: 00m 14s)
* 21:01 godog: NPE while trying to make restbase1007 (cassandra 2.1.5) join the cluster, trying matching the same cassandra version (2.1.3)
* 20:57 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: fix last commit, did not have any affect (duration: 00m 16s)
* 20:55 ejegg: updated payments from 43c7952d2a31deaea97e8319f5612d644dce43c8 to f33d0a8687a120a2057a7e6acad67da63b17f97e
* 20:54 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/217688/1 (duration: 00m 13s)
* 20:10 godog: sign restbase1007 puppet key and first puppet run
* 19:10 logmsgbot: legoktm Synchronized wmf-config/InitialiseSettings-labs.php: https://gerrit.wikimedia.org/r/217591 (duration: 00m 13s)
* 18:58 logmsgbot: legoktm Synchronized wmf-config/InitialiseSettings-labs.php: beta only change - https://gerrit.wikimedia.org/r/217560 (duration: 00m 12s)
* 18:55 logmsgbot: krinkle Synchronized php-1.26wmf9/extensions/WikimediaEvents: T101806 (duration: 00m 14s)
* 18:43 logmsgbot: twentyafterfour Synchronized php-1.26wmf9/includes/AjaxResponse.php: Hotfix Iafff9982bbbee893c13f891901dde88f998db7a6 (duration: 00m 14s)
* 18:16 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: all wikis to 1.26wmf9
* 17:44 ejegg: rolled back payments to 43c7952d2a31deaea97e8319f5612d644dce43c8
* 17:41 ejegg: updated payments from 43c7952d2a31deaea97e8319f5612d644dce43c8 to 15f24d24b150d5d774314b0c1b40ae26a73185f2
* 17:00 moritzm: updated mc200[1-3] to linux 3.19
* 16:28 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Use arbitrary access tag (duration: 00m 12s)
* 16:27 logmsgbot: aude Synchronized wmf-config/CommonSettings.php: Add arbitrary access group tag (duration: 00m 13s)
* 16:27 logmsgbot: aude Synchronized arbitraryaccess.dblist: Add dblist for arbitrary access wikis (duration: 00m 13s)
* 16:24 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Use usagetracking tag (duration: 00m 13s)
* 16:23 logmsgbot: aude Synchronized wmf-config/CommonSettings.php: Add usagetracking group tag (duration: 00m 16s)
* 16:23 ori: Scap + deployments exhausted TC cache on Apaches; performed a rolling restart of HHVM
* 16:21 logmsgbot: aude Synchronized usagetracking.dblist: Add dblist for usage tracking wikis (duration: 00m 25s)
* 16:19 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: Disable Parsoid update jobs (duration: 00m 14s)
* 16:18 logmsgbot: thcipriani Finished scap: SWAT: Update namespaces and special pages for Northern Luri (lrc) from translatewiki [[gerrit:216533]] [[gerrit:217327]] (duration: 32m 11s)
* 15:46 logmsgbot: thcipriani Started scap: SWAT: Update namespaces and special pages for Northern Luri (lrc) from translatewiki [[gerrit:216533]] [[gerrit:217327]]
* 15:27 logmsgbot: thcipriani Synchronized php-1.26wmf9/extensions/OpenStackManager: SWAT: update OpenStackManager to disable unused sudoer features [[gerrit:217407]] (duration: 00m 13s)
* 15:11 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Make VisualEditor access RESTbase directly on all public wikis [[gerrit:214833]] (duration: 00m 12s)
* 15:05 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Add wikis for deployment on 20150611 [[gerrit:217460 ]] (duration: 00m 12s)
* 14:33 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable usage tracking on jawiki (duration: 00m 12s)
* 13:40 _joe_: rolling restart of all the restbase instances
* 13:33 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable usage tracking on frwiki (duration: 00m 12s)
* 13:32 _joe_: running puppet on all restbase hosts
* 13:19 _joe_: running puppet on restbase1001
* 13:16 _joe_: disabling puppet on restbase hosts in anticipation for merging https://gerrit.wikimedia.org/r/217431
* 13:11 paravoid: removing gdnsd from apt: precise-wikimedia (1.9.0-1~precise1/2.1.0-1~precise1), trusty-wikimedia (2.1.0-1), jessie-wikimedia (2.1.2-1~deb8u1)
* 12:13 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable arbitrary access on Wikivoyage and Wikiquote (duration: 00m 13s)
* 11:48 YuviPanda: reboot labvirt1005 for kernel upgrade
* 11:46 YuviPanda: installing linux-image-generic-lts-vivid on labvirt1005 to get a 3.19 kernel
* 09:51 akosiaris: uploaded ruby-jsduck_5.3.4 and ruby-rkelly-remix_0.0.6 on apt.wikimedia.org/jessie-wikimedia/main
* 08:18 akosiaris: recreating jessie chroots on copper
* 06:21 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Jun 11 06:21:53 UTC 2015 (duration 21m 52s)
* 04:44 twentyafterfour: upgraded phabricator at 1:50 UTC (belatedly logged...)
* 03:01 logmsgbot: LocalisationUpdate completed (1.26wmf9) at 2015-06-11 03:01:48+00:00
* 03:00 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1057, warm up (duration: 01m 16s)
* 02:59 logmsgbot: l10nupdate Synchronized php-1.26wmf9/cache/l10n: (no message) (duration: 05m 59s)
* 02:43 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-11 02:43:34+00:00
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 09m 13s)
 
== June 10 ==
* 23:23 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Add www.limis.lt to $wgCopyUploadsDomains (duration: 00m 19s)
* 22:07 logmsgbot: twentyafterfour Synchronized php-1.26wmf9/extensions/MobileFrontend/includes/skins/banners.mustache: Deploying https://gerrit.wikimedia.org/r/#/c/217417/ (duration: 00m 16s)
* 20:38 logmsgbot: ori Synchronized php-1.26wmf8/includes/Hooks.php: d6802ad7d6: Avoid section profiling in Hooks::run due to high overhead (duration: 00m 14s)
* 20:37 logmsgbot: ori Synchronized php-1.26wmf9/includes/Hooks.php: e552f4942d: Avoid section profiling in Hooks::run due to high overhead (duration: 00m 17s)
* 20:36 logmsgbot: ori Synchronized php-1.26wmf9/includes/User.php: 2f4f1e279d: Fixed "wfTimestamp() fed bogus time value" errors (duration: 00m 12s)
* 20:36 logmsgbot: ori Synchronized php-1.26wmf8/includes/User.php: 55e18123ca: Fixed "wfTimestamp() fed bogus time value" errors (duration: 00m 15s)
* 18:07 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group1 wikis to 1.26wmf9
* 16:14 godog: reboot ms-be2008 to check disk swap config
* 15:50 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: retry (duration: 01m 08s)
* 15:34 Krenair: sync failed to something like 25 hosts, cannot directly log into any of them either
* 15:17 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/215030/ - no code change, just docs - should not have to wait 9 days for this (duration: 01m 08s)
* 13:16 moritzm: installed curl security updates on elastic*, wtp*, db*, virt*, labs*, labmon*, labstore*, es*
* 12:38 paravoid: zirconium: rm -rf /var/log2 (last log there from Mar 20th 2014)
* 10:55 jynus: disruption for maintenance starting on labsdb1002 https://lists.wikimedia.org/pipermail/labs-l/2015-June/003766.html
* 03:02 logmsgbot: ori Synchronized php-1.26wmf8/includes/User.php: 55e18123ca: Fixed "wfTimestamp() fed bogus time value" (duration: 01m 07s)
* 03:01 logmsgbot: ori Synchronized php-1.26wmf9/includes/User.php: 2f4f1e279d: Fixed "wfTimestamp() fed bogus time value" (duration: 01m 08s)
* 02:36 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-10 02:35:44+00:00
* 02:31 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 20s)
* 01:33 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1057 (duration: 01m 08s)
* 01:13 logmsgbot: ori Synchronized php-1.26wmf8/extensions/FlaggedRevs: 433fae7f23: Update FlaggedRevs for cherry-picks (duration: 01m 09s)
* 01:10 logmsgbot: ori Synchronized php-1.26wmf9/extensions/FlaggedRevs: 2cfc8c9f2b: Update FlaggedRevs for cherry-picks (duration: 01m 09s)
 
== June 9 ==
* 23:57 logmsgbot: catrope Synchronized php-1.26wmf8/includes/: Avoid parser cache miss that often occurs post-save (duration: 01m 14s)
* 23:29 logmsgbot: catrope Synchronized php-1.26wmf8/resources/src/mediawiki/mediawiki.js: touch (duration: 01m 08s)
* 23:23 logmsgbot: catrope Synchronized php-1.26wmf9/includes/resourceloader/ResourceLoaderOOUIImageModule.php: Fix OOUI image variants (duration: 01m 08s)
* 23:22 ori: Deleting unused metrics on graphite2001 (sum_sq and stddev) as well
* 23:21 logmsgbot: catrope Synchronized php-1.26wmf9/resources/src/mediawiki/mediawiki.js: Add logging for T101806 private modules (duration: 01m 08s)
* 23:20 ori: Deleting unused  metrics in graphite1001 (sum_sq and stddev)
* 23:19 logmsgbot: catrope Synchronized php-1.26wmf8/resources/src/mediawiki/mediawiki.js: Add logging for T101806 private modules (duration: 01m 08s)
* 23:16 logmsgbot: catrope Synchronized wmf-config/CirrusSearch-common.php: fix total breakage of search in wmf9 (duration: 01m 08s)
* 22:44 andrewbogott: moving labs-ns0 from virt1000 to labcontrol1001
* 22:43 andrewbogott: stopping almost everything on virt1000
* 20:31 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf9
* 20:27 logmsgbot: twentyafterfour Finished scap: testwiki to php-1.26wmf9 and rebuild l10n cache (duration: 29m 24s)
* 19:58 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf9 and rebuild l10n cache
* 19:42 mutante: einsteinium - no console output after reboot command, powercycled, booting again
* 19:36 mutante: rebooting einsteinium
* 19:28 mutante: restarted apache on mw1227
* 17:30 mutante: wikitech-static: installing bunch of package upgrades on the external wikitech-static VM
* 17:13 cmjohnson1: db1058 replacing failed disk 7
* 16:20 cmjohnson1: analytics1028 going down for troubleshooting
* 16:17 kart_: updated cxserver to 4a71145
* 15:37 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/Wikidata: SWAT: Update Wikidata - forward compat for usage tracking [[gerrit:216967]] (duration: 01m 17s)
* 15:20 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT take II: Enabled Guided Tour on th.wikipedia [[gerrit:216950]] (duration: 01m 08s)
* 15:19 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Enabled Guided Tour on th.wikipedia [[gerrit:216950]] (duration: 01m 08s)
* 15:05 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Add wikis for deployment on 20150609 [[gerrit:216622]] (duration: 01m 09s)
* 11:09 Krenair: Email set for User:GifTagger@commonswiki per [[phab:T100889]]
* 09:05 akosiaris: uploaded etherpad-lite_1.5.6-2 on apt.wikimedia.org/jessie-wikimedia/main component
* 08:22 akosiaris: upload etherpad-lite_1.5.6-1 on apt.wikimedia.org, jessie-wikimedia dist, main component
* 04:35 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Jun  9 04:34:08 UTC 2015 (duration 34m 7s)
* 02:28 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-09 02:27:30+00:00
* 02:23 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 12s)
* 01:42 godog: stop icinga-wm on neon
 
== June 8 ==
* 23:43 bblack: repooled cp3030/cp1065 in pybal
* 23:11 logmsgbot: ebernhardson Synchronized php-1.26wmf8/extensions/UploadWizard/: Bump UploadWizard in 1.26wmf8 for evening SWAT (duration: 01m 09s)
* 22:21 bblack: depooled cp3030, cp1065 in pybal for ipsec
* 20:17 subbu: deployed parsoid sha 131554ba
* 19:18 jynus: RAID degradation (disk failure) on s5 master (db1058), no production impact, replacement on the way
* 17:13 ottomata: restarted eventlogging services on eventlog1001 after disabling kafka pieces
* 16:13 _joe_: powercycling tmh1001, console blank, unresponsive to pings
* 16:00 logmsgbot: thcipriani Synchronized commonsuploads.dblist: SWAT: Revert Temporarily re-enable uploads on Marathi Wikipedia, for real [[gerrit:216719]] (duration: 01m 07s)
* 15:58 logmsgbot: thcipriani Synchronized commonsuploads.dblist: SWAT: Revert Temporarily re-enable uploads on Marathi Wikipedia [[gerrit:216719]] (duration: 01m 08s)
* 15:40 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/Cite: SWAT: Revert Do all of Cite's real work during unstrip and followup [[gerrit:216715]] (duration: 01m 08s)
* 15:19 Coren: T96063: process halted for now as store/backup is unmovable and on slice5
* 15:17 logmsgbot: thcipriani Synchronized w/static/images/project-logos/pflwiki.png: SWAT: Fix transparency of pflwiki logo [[gerrit:216595]] (duration: 01m 08s)
* 15:15 akosiaris: disabled ircecho on neon for a while
* 14:53 Coren: T96063: starting pvmove from slice5 to slice2
* 14:48 Coren: T96063: dropped volume slice1 from vg store
* 14:46 Coren: T96063: dropped store/project
* 14:44 Coren: starting https://phabricator.wikimedia.org/T96063 on labstore1001
* 14:24 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool es1005 (duration: 01m 08s)
* 14:23 Coren: rsync in progress between labstore1001:store/backup and labstore1002:backup/backup (at ionice idle)
* 14:13 Coren: created store/backup snapshot on labstore1001 for backup copy
* 13:03 moritzm: added strongswan_5.3.0-1+wmf2 to jessie-wikimedia on carbon
* 11:42 _joe_: purging squid cache on carbon
* 11:26 moritzm: updated mc2* to 2:2.8.17-1+deb8u1
* 10:55 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool es1007 (duration: 01m 08s)
* 10:27 akosiaris: disabled puppet on uranium, investigating ganglia problems
* 10:05 akosiaris: ganglia gmetad problems
* 05:25 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Jun  8 05:24:08 UTC 2015 (duration 24m 7s)
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-08 02:25:12+00:00
* 02:21 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 07s)
 
== June 7 ==
* 23:27 godog: reboot ms-be2008 sdg failed, xfs unhappy
* 07:03 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1073, warm up (duration: 01m 09s)
* 05:16 andrewbogott: we did a whole lot of things to labstore1001 while morebots was away
* 05:14 andrewbogott: service nfs-kernel-server restart on labstore1001
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-07 02:25:13+00:00
* 02:21 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 09s)
 
== June 6 ==
* 23:46 subbu: deployed parsoid 5172a446 (cherry-pick of 719c736f) -- hotfix for T101599
* 05:48 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat Jun  6 05:47:40 UTC 2015 (duration 47m 39s)
* 02:31 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-06 02:30:24+00:00
* 02:26 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 10s)
 
== June 5 ==
* 22:42 godog: powercycle graphite2001, no console no ssh
* 22:06 andrewbogott: restarted apache on virt1000
* 20:49 ori: Upgrading hhvm-fss on application servers to 1.1.7; expect brief 5xx spike.
* 20:14 logmsgbot: demon Synchronized php-1.26wmf8: live hack (duration: 02m 32s)
* 20:10 mutante: apt-get upgrade on terbium
* 19:52 godog: bounce redis on rdb1001/rdb1003 to pick up new slave limits
* 19:51 mutante: chown root:root / on terbium
* 19:50 godog: bounce redis on rdb1002/rdb1004 to pick up new slave limits
* 19:29 godog: bounce redis again on rdb1003 after increasing the slave limits more
* 19:17 godog: bounce redis on rdb1003 after bumping slave limits
* 19:07 godog: redis master logs shows periodic 'cmd=sync scheduled to be closed ASAP for overcoming of output buffer limits.' indicating the slave fails to sync
* 18:40 godog: spike in redis network starting at ~15.00 UTC, correlates with ocg failures
* 18:01 moritzm: restarted gerrit on ytterbium for java update
* 14:43 jynus: short lag period on db1049, traffic automatically redirected to other slave and back to normal
* 14:07 moritzm: added ubuntu-meta-1.325+wmf1 for trusty-wikimedia to apt.wikimedia.org (T100004)
* 14:07 moritzm: added ubuntu-meta-1.267.1+wmf1 for precise-wikimedia to apt.wikimedia.org (T100004)
* 12:44 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Depool es1007 (duration: 01m 08s)
* 12:08 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1009 (duration: 01m 08s)
* 11:30 _joe_: uploaded new HHVM package, installing on mw1025 for testing
* 09:17 moritzm: added redis_2.6.13-1+wmf1 to precise-wikimedia on apt.wikimedia.org
* 06:24 moritzm: added redis_2.8.4-2+wmf1 to trusty-wikimedia on apt.wikimedia.org
* 05:23 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri Jun  5 05:22:50 UTC 2015 (duration 22m 49s)
* 04:10 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1073 (duration: 01m 08s)
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-05 02:25:20+00:00
* 02:21 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 09s)
* 01:27 tgr: deploying schema changes for Gather on enwiki, enwikivoyage, hewiki (T98490, T101460)
* 00:08 logmsgbot: catrope Synchronized php-1.26wmf8/vendor/oojs/oojs-ui/php/Tag.php: Fix OOUI fatals (T99210) (duration: 00m 13s)
 
== June 4 ==
* 23:40 logmsgbot: catrope Synchronized php-1.26wmf8/extensions/MobileFrontend: SWAT (duration: 00m 13s)
* 23:28 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Disable VE A/B test for new accounts on enwiki (duration: 00m 13s)
* 22:39 ejegg: updated payments from d22e44e3fab2b937707c2776384cb93a49b4cfd3 to 43c7952d2a31deaea97e8319f5612d644dce43c8
* 22:21 ottomata: doing controlled restart of kafka brokers services to apply auto create topic config
* 21:48 jgage: analyics1013 crashed, rebooted
* 21:42 logmsgbot: ori Synchronized php-1.26wmf8/includes/libs/ReplacementArray.php: 1b20d62c26: Revert "awful hack: disable fss on zhwiki only, except on mw1017" (duration: 00m 13s)
* 21:34 ori: performing rolling restart of HHVMs for hhvm-fss upgrade
* 21:27 bd808: restarted logstash and elasticsearch on logstash100[1-3] to pick up latest jre updates
* 18:48 mutante: restarted apache on silver/wikitech
* 18:20 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Depool es1009 and master-slave switchover (duration: 00m 13s)
* 18:01 awight: Enabling PayPal audit parser job
* 17:57 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Repool es1008 (duration: 00m 15s)
* 17:44 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Repool es2008 and its slaves (duration: 00m 13s)
* 17:21 ori: Disabling Puppet and nutcracker on mw1017 to control for parser cache
* 17:18 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Depool es2008 and its slaves (duration: 00m 13s)
* 17:17 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Depool es1008 (duration: 00m 12s)
* 16:33 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 09m 17s)
* 16:23 logmsgbot: kartik Started scap: Update ContentTranslation
* 15:54 moritzm: added redis_2.8.4-2+wmf1 to trusty-wikimedia on apt.wikimedia.org
* 15:48 logmsgbot: anomie Synchronized php-1.26wmf8/includes/jobqueue/: SWAT: jobqueue: Record stats on how long it takes before a job is run [[gerrit:215748]] (duration: 00m 14s)
* 15:38 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable ApiFeatureUsage everywhere [[gerrit:215901]] (duration: 00m 19s)
* 15:36 logmsgbot: anomie Synchronized wmf-config/CommonSettings.php: SWAT: Remove obsolete 'ValidateExtendedMetadataCache' hook [[gerrit:215900]] (duration: 00m 12s)
* 15:35 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Added staff-recommender campaign [[gerrit:215865]] (duration: 00m 12s)
* 15:30 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Add wikis for deployment on 20150406 [[gerrit:215281]] (duration: 00m 12s)
* 15:12 logmsgbot: ori Synchronized php-1.26wmf8/includes/libs/ReplacementArray.php: Ia5f3dc84605: awful hack: disable fss on zhwiki only, except on mw1017 (duration: 00m 17s)
* 15:09 _joe_: puppet disabled, fss disabled on mw1017
* 14:42 YuviPanda: running sudo sed -i 's/GlobalSign_CA.pem/ca-certificates.crt/' /etc/ldap/ldap.conf on all labs nodes
* 14:36 awight: Disable PayPal audit parsing job
* 12:19 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1072, warm up (duration: 00m 13s)
* 05:12 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Jun  4 05:11:32 UTC 2015 (duration 11m 31s)
* 02:30 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-04 02:28:54+00:00
* 02:25 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 07m 22s)
 
== June 3 ==
* 23:42 logmsgbot: kaldari Synchronized wmf-config/InitialiseSettings.php: syncing ImportSource change for meta (duration: 00m 13s)
* 23:34 logmsgbot: kaldari Synchronized wmf-config/InitialiseSettings.php: syncing config change for mediawiki logo on mobile, take 2 (duration: 00m 12s)
* 23:26 logmsgbot: kaldari Synchronized wmf-config/InitialiseSettings.php: syncing config change for mediawiki logo on mobile (duration: 00m 12s)
* 23:25 logmsgbot: kaldari Synchronized images/mobile/mediawiki.png: syncing mediawiki logo for mobile (duration: 00m 12s)
* 22:02 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on ukwiki and viwiki (duration: 00m 15s)
* 21:58 mutante: restarted gitblit
* 21:53 logmsgbot: ori Synchronized php-1.26wmf8/includes/resourceloader/ResourceLoader.php: 7f49853fc9: ResourceLoader::filter: use APC when running under HHVM (did not sync correct file previously) (duration: 00m 12s)
* 21:20 andrewbogott: restarting pdns on virt1000 and labcontrol1001
* 21:05 Jamesofur: decryption key for Board Election insert into voteWiki
* 20:58 bblack: repooling ns0 -> radon AuthDNS
* 20:55 bblack: depooling ns0 -> radon AuthDNS (rebooting for kernel update)
* 20:50 hashar: restarted zuul entirely to remove some stalled jobs
* 20:29 paravoid: kafka preferred-replica-election on an1021
* 20:28 hashar: Restarting Jenkins to release a deadlock
* 20:23 logmsgbot: ori Synchronized php-1.26wmf8/resources/Resources.php: 7f49853fc9: ResourceLoader::filter: use APC when running under HHVM (duration: 00m 13s)
* 20:19 subbu: deployed parsoid sha ab675400
* 19:08 bblack: changed ops/puppet repo to ff-only in gerrit config, feel free to scream/revert if necc!
* 18:46 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: All wikis to 1.26wmf8, no new branch until next Tuesday, June 9th
* 18:42 logmsgbot: twentyafterfour Finished scap: Delete stale branch symlinks (1.26wmf1,1.26wmf2) (duration: 07m 14s)
* 18:35 logmsgbot: twentyafterfour Started scap: Delete stale branch symlinks (1.26wmf1,1.26wmf2)
* 15:16 logmsgbot: legoktm Synchronized wmf-config/InitialiseSettings.php: Remove references to $wgEchoCohortInterval (duration: 00m 12s)
* 15:16 logmsgbot: legoktm Synchronized wmf-config/CommonSettings.php: Change default extension distributor branch to REL1_25 (duration: 00m 15s)
* 15:15 bblack: repooling ns1->baham DNS traffic
* 15:07 bblack: depooling ns1->baham DNS traffic for kernel update
* 15:00 moritzm: added linux 3.19.3-5 for jessie-wikimedia on apt.wikimedia.org
* 14:46 bblack: restarted hhvm on mw1195, seems to be a case of https://phabricator.wikimedia.org/T89912
* 14:32 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on huwiki (duration: 00m 12s)
* 14:29 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Repool es2008, es2009 and es2010 (duration: 00m 14s)
* 14:10 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on eswiki (duration: 00m 13s)
* 13:38 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Depool es2008, es2009 and es2010 (duration: 00m 14s)
* 13:12 paravoid: reimaging rubidium with trusty, as spare
* 13:02 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on arwiki and cawiki (duration: 00m 15s)
* 12:56 paravoid: permanently switching ns0 to radon instead of rubidium
* 12:53 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Repool es2009 (duration: 00m 15s)
* 11:04 paravoid: kafka preferred-replica-election on an1021
* 10:55 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: Depool es2009 (duration: 00m 13s)
* 10:43 paravoid: powercycling ms-be1005
* 10:28 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: repool es2010 (duration: 00m 14s)
* 10:24 moritzm: added linux-meta 1.2 for jessie-wikimedia on carbon.wikimedia.org
* 10:09 hashar: Jenkins: refreshing all jobs to get rid of an obsolete http notification to Zuul {{bug|T93321}}
* 09:48 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool es1008 (duration: 00m 13s)
* 09:00 logmsgbot: jynus Synchronized wmf-config/db-codfw.php: depool es2010 (duration: 00m 13s)
* 08:51 moritzm: removed fuse/ntfs-3g from wtp*
* 07:47 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool es1008 (duration: 00m 14s)
* 05:42 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Jun  3 05:41:31 UTC 2015 (duration 41m 30s)
* 02:50 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-03 02:48:55+00:00
* 02:45 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 06m 37s)
* 02:28 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-06-03 02:27:38+00:00
* 02:25 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1072 (duration: 00m 12s)
* 02:23 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 07m 13s)
* 01:57 springle: replicate m3 to codfw dbstore2001
* 01:37 springle: start sync m4 eventlogging to codfw dbstore2002
* 00:35 logmsgbot: mattflaschen Synchronized php-1.26wmf8/extensions/Calendar/: Sync Calendar 1.26wmf8 for module position (duration: 00m 12s)
* 00:20 logmsgbot: mattflaschen Synchronized php-1.26wmf8/includes/User.php: Fixed $flags bit operation precedence fail in User::loadFromDatabase() (duration: 00m 14s)
 
== June 2 ==
* 23:56 logmsgbot: mattflaschen Synchronized php-1.26wmf8/extensions/Flow/: Sync Flow 1.26wmf8 for import fix (duration: 00m 15s)
* 23:43 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings.php: Disable WikiGrok (duration: 00m 13s)
* 23:33 logmsgbot: mattflaschen Synchronized php-1.26wmf8/includes/resourceloader/ResourceLoaderStartUpModule.php: Don't cache minification of user.tokens (duration: 00m 15s)
* 23:33 logmsgbot: mattflaschen Synchronized php-1.26wmf8/includes/resourceloader/ResourceLoader.php: Don't cache minification of user.tokens (duration: 00m 13s)
* 23:33 logmsgbot: mattflaschen Synchronized php-1.26wmf8/includes/OutputPage.php: Don't cache minification of user.tokens (duration: 00m 14s)
* 23:31 logmsgbot: mattflaschen Synchronized php-1.26wmf7/includes/resourceloader/ResourceLoaderStartUpModule.php: Don't cache minification of user.tokens (duration: 00m 13s)
* 23:31 logmsgbot: mattflaschen Synchronized php-1.26wmf7/includes/resourceloader/ResourceLoader.php: Don't cache minification of user.tokens (duration: 00m 14s)
* 23:31 logmsgbot: mattflaschen Synchronized php-1.26wmf7/includes/OutputPage.php: Don't cache minification of user.tokens (duration: 00m 13s)
* 21:44 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I263aa9542: Set $wgExtDistUseEventLogging = true; (duration: 00m 13s)
* 21:43 logmsgbot: ori Synchronized php-1.26wmf8/extensions/ExtensionDistributor: cdd033e7d8: Update ExtensionDistributor for cherry-picks (duration: 00m 13s)
* 19:24 logmsgbot: ori Synchronized wmf-config/StartProfiler.php: I7810b72d5: Sample profiling data at 1:10,000 (duration: 00m 12s)
* 19:19 logmsgbot: ori Synchronized wmf-config: I35255f357 and I026dfdbf68 (duration: 00m 12s)
* 19:15 logmsgbot: aude Synchronized wmf-config/Wikibase.php: bump cache epoch for wikidata (duration: 00m 13s)
* 19:06 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: wgMaxCredits to 0 (duration: 00m 13s)
* 18:53 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group1 wikis to 1.26wmf8
* 18:46 robh: sodium has resumed normal service. all items on https://phabricator.wikimedia.org/T100711 addressed
* 17:56 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool es1010 (duration: 00m 12s)
* 17:18 robh: mailing list traffic halted for list renames
* 17:07 robh: lists.wikimedia.org is now sha256 cert
* 17:04 robh: starting the lists.wikimedia.org certificate update, archives will offline during this process
* 15:44 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool es1010 (duration: 00m 13s)
* 15:03 logmsgbot: thcipriani Synchronized wmf-config/wikitech.php: SWAT: No longer set use_dnsmasq for new instances. [[gerrit:215317]] (duration: 00m 12s)
* 12:31 twentyafterfour: merged https://gerrit.wikimedia.org/r/#/c/214288/ and deployed scap
* 12:18 moritzm: installed linux-tools-3.19.8-1 for jessie-wikimedia on carbon
* 07:36 logmsgbot: nikerabbit Synchronized wmf-config/InitialiseSettings.php: Fixed wiki id for fiu_vro for CX beta feature (duration: 00m 13s)
* 05:41 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Jun  2 05:39:57 UTC 2015 (duration 39m 56s)
* 02:49 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-02 02:48:23+00:00
* 02:44 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 05m 45s)
* 02:28 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-06-02 02:27:42+00:00
* 02:23 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 26s)
* 02:06 logmsgbot: krinkle Synchronized php-1.26wmf7/resources/src/mediawiki/mediawiki.js: backport rl-fix I717b86573 (duration: 00m 14s)
* 00:33 ejegg: updated payments-wiki from a4fef65ec1dd3db1fb1d7ceb797b2c7485c722d2 to d22e44e3fab2b937707c2776384cb93a49b4cfd3
* 00:07 ori: Updated jobrunner for I1d351d8d1: Made periodictasks stats calls more useful
* 00:02 logmsgbot: ori Synchronized php-1.26wmf8/extensions/RSS/RSSParser.php: Ice44740fb: Don't rely on strip marker uniqueness (T10104) (duration: 00m 14s)
* 00:01 logmsgbot: ori Synchronized php-1.26wmf7/extensions/RSS/RSSParser.php: Ice44740fb: Don't rely on strip marker uniqueness (T10104) (duration: 00m 13s)
 
== June 1 ==
* 23:36 mutante: restarted gitblit ..
* 23:15 ori: Deployed jobchron / jobrunner change Icab05090b and restarted jobchron / jobrunner on job queue runners.
* 22:51 ejegg: updated payments from 60c160110a20cf763b82677ff1501e9ce0c919bc to a4fef65ec1dd3db1fb1d7ceb797b2c7485c722d2
* 21:36 godog: doing some local testing on carbon for T100636 fwiw, thus puppet disabled
* 21:35 ejegg: update paymentswiki from aa66797553fbcfb63f7cf29abccc44d060b65db0 to 60c160110a20cf763b82677ff1501e9ce0c919bc
* 21:13 logmsgbot: ori Synchronized php-1.26wmf7/languages/LanguageConverter.php: 1d054ce6d3: Use a fixed marker prefix string in the Parser and MWTidy (duration: 00m 14s)
* 20:40 logmsgbot: ori Synchronized php-1.26wmf8/languages/LanguageConverter.php: 1d054ce6d3: Use a fixed marker prefix string in the Parser and MWTidy (duration: 00m 13s)
* 20:29 twentyafterfour: disabled several no-longer-existent repositories in phabricator which apparently have been deleted in gerrit
* 20:26 subbu: deployed parsoid sha 73445bfd
* 20:05 twentyafterfour: restarted apache2 and phd on iridium (phabricator)
* 19:52 MaxSem: Repopulated gis.spatial_ref_sys on labsdb1004 with postgis 2.1 data, old contents backed up as spatial_ref_sys_bak
* 18:55 logmsgbot: ori Synchronized php-1.26wmf7/extensions/SemanticForms/includes/SF_FormUtils.php: I7ed3996a1: Stop using StripState (duration: 00m 13s)
* 18:55 logmsgbot: ori Synchronized php-1.26wmf8/extensions/SemanticForms/includes/SF_FormUtils.php: I7ed3996a1: Stop using StripState (duration: 00m 15s)
* 17:46 yurik: deployed graphoid service update - grafana logging cleanup
* 16:40 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool pc1003 (duration: 00m 15s)
* 16:06 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: T99491, T100925: Sysops to add users to import group on maiwiki, newiki (duration: 00m 14s)
* 15:47 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/CodeReview: SWAT: Backport CodeReview module position fix [[gerrit:215043]] (duration: 00m 13s)
* 15:24 logmsgbot: thcipriani Synchronized php-1.26wmf8/includes/resourceloader/ResourceLoaderWikiModule.php: SWAT: Make ResourceLoaderWikiModule support custom position [[gerrit:214741]] (duration: 00m 15s)
* 15:23 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/WikiEditor: SWAT: Make ResourceLoaderWikiModule support custom position [[gerrit:214741]] (duration: 00m 13s)
* 15:22 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/VectorBeta: SWAT: Make ResourceLoaderWikiModule support custom position [[gerrit:214741]] (duration: 00m 15s)
* 15:21 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/SyntaxHighlight_GeSHi: SWAT: Make ResourceLoaderWikiModule support custom position [[gerrit:214741]] (duration: 00m 14s)
* 15:20 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/MobileFrontend: SWAT: Make ResourceLoaderWikiModule support custom position [[gerrit:214741]] (duration: 00m 13s)
* 15:18 logmsgbot: thcipriani Synchronized php-1.26wmf8/extensions/Gather: SWAT: Make ResourceLoaderWikiModule support custom position [[gerrit:214741]] (duration: 00m 13s)
* 14:42 cmjohnson1: powering down analytics1028 to swap the bad DIMM
* 14:38 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool pc1003 (duration: 00m 12s)
* 13:48 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable arbitrary access on wikisource and itwiki, and make other projects sidebar feature default for ptwiki (for real) (duration: 00m 12s)
* 13:45 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable arbitrary access on wikisource and itwiki, and make other projects sidebar feature default for ptwiki (duration: 00m 15s)
* 13:31 logmsgbot: aude Synchronized php-1.26wmf8/extensions/Wikidata: css compatibility fixes for wmf8 (duration: 00m 24s)
* 13:00 logmsgbot: krenair Synchronized php-1.26wmf8/extensions/WikimediaMessages/WikimediaMessages.hooks.php: https://gerrit.wikimedia.org/r/#/c/215011/ - fix EditPageCopyrightWarning (duration: 00m 16s)
* 12:22 moritzm: added firmware-nonfree 0.44~wmf1 for jessie-wikimedia on carbon
* 09:32 yurik: deployed latest graphoid service to sca100x
* 08:18 hashar: Jenkins: upgrading git plugin from 1.5.0 to latest
* 08:12 mobrovac: restbase restart cassandra on restbase1006
* 08:09 mobrovac: restbase restart cassandra on restbase1005
* 08:07 mobrovac: restbase restart cassandra on restbase1004
* 08:05 mobrovac: restbase restart cassandra on restbase1003
* 08:00 mobrovac: restbase restart cassandra on restbase1002
* 07:59 mobrovac: restbase restart cassandra on restbase1001
* 05:19 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Jun  1 05:18:18 UTC 2015 (duration 18m 17s)
* 02:47 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-06-01 02:46:32+00:00
* 02:43 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 05m 37s)
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-06-01 02:26:03+00:00
* 02:22 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 35s)
 
== May 31 ==
* 22:35 jgage: graphite2001 keeps falling off the net due to OOM; swap 100% in use. dist-upgraded & rebooted. dmesg in ~gage/dmesg.2015-05-31
* 18:37 logmsgbot: krinkle Synchronized php-1.26wmf8/resources/src/mediawiki/mediawiki.js: rl live fix - I717b86573 (duration: 00m 12s)
* 17:36 Krinkle: Confirmed RL problem solved. The jquery|mediawiki&version=bizqqnC request was cached with an old mw.loader implementation somehow. After the touch and sync, the version is now dQAzAsdU and the implementation is up to date.
* 17:33 logmsgbot: krinkle Synchronized php-1.26wmf7/resources: touch mediawiki.js (duration: 00m 13s)
* 17:20 Krinkle: Investigating RL issues (clients are loading mediawiki.notification&version=19700101T000000Z, mw.loader.moduleRegistry contains NaN for versions)
* 17:12 gwicke: performed a rolling restart of RESTBase Cassandra nodes to address elevated request error rates apparently related to schema disagreement
* 05:35 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun May 31 05:34:36 UTC 2015 (duration 34m 35s)
* 02:47 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-05-31 02:46:41+00:00
* 02:43 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 05m 51s)
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-31 02:25:44+00:00
* 02:21 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 41s)
 
== May 30 ==
* 21:07 bd808: Upgraded Elasticsearch cluster to 1.3.9 on logstash100[1-6]
* 18:35 logmsgbot: hoo Synchronized php-1.26wmf7/extensions/UploadWizard/: Touch js… (duration: 00m 18s)
* 17:06 logmsgbot: legoktm Synchronized php-1.26wmf8/extensions/WikiEditor/extension.json: Explicitly define module position (duration: 00m 13s)
* 05:32 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat May 30 05:31:02 UTC 2015 (duration 31m 1s)
* 02:56 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-05-30 02:55:22+00:00
* 02:52 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 05m 40s)
* 02:36 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-30 02:34:55+00:00
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 50s)
* 01:15 ori: Deployed rcstream I797bc1244: Handle invalid JSON gracefully
* 00:08 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/212436/ - docs only, no code change (how was this waiting 10 days?) (duration: 00m 14s)
 
== May 29 ==
* 23:56 logmsgbot: ori Synchronized w/static/images/project-logos: Ic62747f37: Optimise project logos added since I8c9a6a56 (duration: 00m 13s)
* 21:21 logmsgbot: ori Synchronized wmf-config/throttle.php: Ife45684c5: Add another IP address for Santiago edit-a-thon (duration: 00m 13s)
* 20:43 logmsgbot: ori Synchronized robots.txt: I7b321b62d: allow robots to use RL on domains (duration: 00m 14s)
* 17:18 mutante: fix client_max_body_size syntax error in nginx config of payments1001
* 15:19 logmsgbot: anomie Synchronized php-1.26wmf8/extensions/ConfirmEdit/: Update ConfirmEdit to fix API breakage [[gerrit:214620]] (duration: 00m 14s)
* 14:52 paravoid: re-redirecting ns0 traffic back to rubidium
* 14:17 jynus: Moving pdns and designate databases from m1 to m5
* 13:30 logmsgbot: aude Synchronized php-1.26wmf8/extensions/Wikidata: touch js and css files to try to fix issues on test.wikidata (duration: 00m 26s)
* 13:17 godog: roll-restart cassandra on cerium / xenon / praseodymium following java upgrade
* 11:53 paravoid: reimaging rubidium
* 11:45 _joe_: restart nutcracker on mw1150
* 11:41 paravoid: redirecting ns0 traffic to baham (= ns1) in preparation for rubidium upgrade
* 06:52 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri May 29 06:51:45 UTC 2015 (duration 51m 44s)
* 06:13 logmsgbot: ori Synchronized php-1.26wmf7/includes/deferred/SiteStatsUpdate.php: Icc12c07ab: Update context stats in SiteStatsUpdate (duration: 00m 13s)
* 06:12 logmsgbot: ori Synchronized php-1.26wmf8/includes/deferred/SiteStatsUpdate.php: Icc12c07ab: Update context stats in SiteStatsUpdate (duration: 00m 14s)
* 06:03 apergos: salt keys regenerated on all production hosts (minions, not master key)
* 03:09 logmsgbot: LocalisationUpdate completed (1.26wmf8) at 2015-05-29 03:08:15+00:00
* 03:02 logmsgbot: l10nupdate Synchronized php-1.26wmf8/cache/l10n: (no message) (duration: 10m 08s)
* 02:36 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-29 02:35:10+00:00
* 02:31 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 54s)
* 00:07 logmsgbot: ori Synchronized php-1.26wmf7/includes/diff/UnifiedDiffFormatter.php: d95cac90c7: Make the output of UnifiedDiffFormatter match diff -u (duration: 00m 14s)
* 00:06 logmsgbot: ori Synchronized php-1.26wmf7/extensions/Echo/includes/DiffParser.php: 41d27c4a26: Update Echo for cherry-picks (duration: 00m 13s)
 
== May 28 ==
* 23:33 jgage: restarted nutcracker on mw1056 due to errors, per bd808
* 23:18 logmsgbot: catrope Synchronized php-1.26wmf7/includes/EditPage.php: Fix regression with URL-specified edit tags (duration: 00m 13s)
* 23:18 logmsgbot: catrope Synchronized php-1.26wmf6/includes/EditPage.php: Fix regression with URL-specified edit tags (duration: 00m 13s)
* 23:04 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Enable A/B test of VE for new accounts on enwiki (duration: 00m 13s)
* 22:48 logmsgbot: hoo Synchronized php-1.26wmf7/: Touching some JS, re-syncing resource definitions to rule out causes for Wikidata JS problem. (duration: 01m 00s)
* 21:52 logmsgbot: ori Synchronized php-1.26wmf7/resources/src/mediawiki/mediawiki.toc.js: Touching file on unconfirmed suspicion of stale cache (duration: 00m 16s)
* 21:51 logmsgbot: ori Synchronized php-1.26wmf8/resources/src/mediawiki/mediawiki.toc.js: Touching file on unconfirmed suspicion of stale cache (duration: 00m 15s)
* 20:24 mutante: killed nodejs on wtp1023,wtp1016
* 20:11 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on Wikivoyage (duration: 00m 13s)
* 20:03 cscott: updated Parsoid to version 497da30e ; canary restart of wtp1001; observed network TX spike (possibly UDP, possibly logging); reverted to 8ed6fd0b and restarted all parsoids.
* 19:33 mutante: temp. stopped icinga-wm
* 19:05 logmsgbot: legoktm Synchronized php-1.26wmf8/extensions/Gadgets/: Explicitly define module position (duration: 00m 14s)
* 18:32 logmsgbot: legoktm Synchronized php-1.26wmf7/extensions/GlobalCssJs/: Explicitly define module position (duration: 00m 12s)
* 18:24 logmsgbot: legoktm Synchronized php-1.26wmf8/extensions/GlobalCssJs/: Explicitly define module position (duration: 00m 13s)
* 18:22 logmsgbot: krenair Synchronized php-1.26wmf6/extensions/VisualEditor: https://gerrit.wikimedia.org/r/#/c/214397/ - in case we have to go back to wmf6 again for whatever reason (duration: 00m 15s)
* 18:20 logmsgbot: krenair Synchronized php-1.26wmf8/extensions/VisualEditor: https://gerrit.wikimedia.org/r/#/c/214396/ (duration: 00m 13s)
* 18:17 logmsgbot: krenair Synchronized php-1.26wmf7/extensions/VisualEditor: https://gerrit.wikimedia.org/r/#/c/214395/ (duration: 00m 14s)
* 17:29 logmsgbot: twentyafterfour Finished scap: Group0 to 1.26wmf8, everything else to 1.26wmf7 (duration: 28m 16s)
* 17:01 logmsgbot: twentyafterfour Started scap: Group0 to 1.26wmf8, everything else to 1.26wmf7
* 16:59 paravoid: reimaging baham
* 16:52 paravoid: redirecting ns1 traffic to rubidium (= ns0) in preparation for baham upgrade
* 15:54 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 03m 19s)
* 15:50 logmsgbot: kartik Started scap: Update ContentTranslation
* 15:47 logmsgbot: thcipriani Synchronized wmf-config/abusefilter.php: SWAT: Modify AbuseFilter block configuration on eswikibooks [[gerrit:206510]] (duration: 00m 15s)
* 15:40 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Prevent indexing of User: namespace on ukwiki [[gerrit:210680]] (duration: 00m 14s)
* 15:35 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable NewUserMessage on sa.wikipedia [[gerrit:212724]] (duration: 00m 13s)
* 15:28 godog: set operations/debs/python-statsd as hidden in gerrit -- deprecated
* 15:24 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT:  Enable Extension:NewUserMessage on ta.wikipedia [[gerrit:213841]] (duration: 00m 12s)
* 15:13 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable SandboxLink for cswiki [[gerrit:214247]] (duration: 00m 15s)
* 15:11 godog: set operations/debs/txstatsd as hidden in gerrit -- deprecated
* 15:05 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT: CX: Add wikis for CX deployment on 20150528 [[gerrit:213992]] (duration: 00m 15s)
* 15:00 bblack: merged up https://gerrit.wikimedia.org/r/214345 - look here if IPv6 problems!
* 14:37 cmjohnson1: powering down dataset1001 to add disk array
* 14:17 bblack: deploying https://gerrit.wikimedia.org/r/214341 - keep in mind if ipv6-related issues arise!
* 13:50 akosiaris: started ircecho (icinga-wm) on neon
* 13:46 hashar: upgrading Jenkins git plugin from 1.4.6+wmf1 to 1.7.1 {{bug|T100655}}  and restarting Jenkins
* 13:25 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool pc1003 (not to confuse with db1003) after warmup (duration: 00m 15s)
* 13:11 akosiaris: killed ircecho service on neon
* 09:48 _joe_: depooling the HHVM appserver. 503s reduced slightly but still non-irrelevant
* 09:37 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: Depool pc1003 (duration: 00m 15s)
* 09:35 _joe_: pooling mw1152 into the imagescalers pool after fixes made in Lyon
* 06:11 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu May 28 06:09:56 UTC 2015 (duration 9m 55s)
* 04:22 springle: reload dbstore1002 s7
* 02:41 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-28 02:40:00+00:00
* 02:36 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 06m 46s)
* 02:20 springle: set global read_only=0 on pc1001 pc1002. this config broke in the recent upgrade
* 00:59 logmsgbot: legoktm Synchronized php-1.26wmf8/resources/: Revert "Convert mediawiki.toc and mediawiki.user to using mw.cookie" (duration: 00m 17s)
* 00:58 logmsgbot: legoktm Synchronized php-1.26wmf7/resources/: Revert "Convert mediawiki.toc and mediawiki.user to using mw.cookie" (duration: 00m 13s)
* 00:07 logmsgbot: twentyafterfour Synchronized rpc/RunJobs.php: deploy I98b8a4ddbcdd58d1f2f23e4b1bf154f10b6b279e (duration: 00m 17s)
 
== May 27 ==
* 23:46 awight: updated payments from 858b87319daa3d66f62eb32e08cefc6b061748d1 to aa66797553fbcfb63f7cf29abccc44d060b65db0
* 23:31 logmsgbot: twentyafterfour Finished scap: scap, now with 10% less fail (duration: 22m 07s)
* 23:26 awight: payments rolled back to 858b87319daa3d66f62eb32e08cefc6b061748d1
* 23:24 awight: updated payments from 858b87319daa3d66f62eb32e08cefc6b061748d1 to aa66797553fbcfb63f7cf29abccc44d060b65db0
* 23:09 logmsgbot: twentyafterfour Started scap: scap, now with 10% less fail
* 22:57 logmsgbot: ori rebuilt wikiversions.cdb and synchronized wikiversions files: (no message)
* 21:49 mutante: restarted hhvm on mw1250,mw1254,mw1256
* 21:47 mutante: restarted hhvm on mw1017,mw1243,mw1244
* 21:42 bblack: restarting hhvm everywhere on 30s intervals between hosts
* 21:10 logmsgbot: twentyafterfour Synchronized php-1.26wmf8: Fix ConfirmEdit fatal Change-Id: I22353669a85391c3d9760a5253cac1263e895cf9 (duration: 01m 08s)
* 20:46 logmsgbot: twentyafterfour Purged l10n cache for 1.26wmf6
* 20:45 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf8
* 20:41 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedias to 1.26wmf7
* 20:36 logmsgbot: twentyafterfour Finished scap: testwiki to php-1.26wmf8 and rebuild l10n cache (duration: 67m 53s)
* 19:40 akosiaris: removed operations/puppet/varnish from gerrit, git.wikimedia.org and github. The repo was used as a git submodule but the workflow turned out to be cumbersome approximately a year ago and was no longer updated. Up to a few minutes ago, it only served as a source of confusion. It no longer does.
* 19:28 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf8 and rebuild l10n cache
* 19:22 logmsgbot: twentyafterfour scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="testwiki" --outdir="/tmp/scap_l10n_1863397713" --threads=4 --lang en  --quiet' returned non-zero exit status 255 (duration: 03m 38s)
* 19:18 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf8 and rebuild l10n cache
* 18:12 moritzm: Uploaded gridengine_6.2u5-4+wmf2 for precise-wikimedia to apt.wikimedia.org
* 17:55 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool pc1002 (duration: 00m 13s)
* 17:42 paravoid: rebooting asw-d2-eqiad
* 17:41 ottomata: initiating controlled shutdown of kafka broker analytics1018 in anticipation of switch reboot
* 15:33 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool pc1002 (duration: 00m 13s)
* 15:02 cmjohnson1: powering down cp1069 to relocate within the same rack
* 14:47 cmjohnson1: powering down cp1070 to relocate within the same rack
* 13:30 hashar: All Jenkins slaves are disconnected due to some ssh error. CI is down.
* 13:27 hashar: restarting Jenkins for java upgrade
* 13:13 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool pc1001 (duration: 00m 13s)
* 11:16 akosiaris: rebooting ganeti100{1..4} for bridge networking configuration
* 09:59 paravoid: powercycling ms-be1001; dead, console unresponsive
* 06:35 springle: clone dbstore2001 data to dbstore2002
* 05:48 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed May 27 05:47:25 UTC 2015 (duration 47m 24s)
* 02:53 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-27 02:52:25+00:00
* 02:48 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 52s)
* 02:29 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-27 02:28:34+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 06m 45s)
 
== May 26 ==
* 18:21 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group1 wikis to 1.26wmf7
* 17:13 logmsgbot: krenair Synchronized wmf-config/interwiki.cdb: Updating interwiki cache (duration: 00m 15s)
* 17:10 logmsgbot: krenair Synchronized multiversion/MWMultiVersion.php: open cnwikimedia (duration: 00m 13s)
* 16:27 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 13s)
* 16:12 logmsgbot: krenair rebuilt wikiversions.cdb and synchronized wikiversions files: add cnwikimedia
* 16:08 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 15s)
* 16:07 logmsgbot: krenair Synchronized database lists: (no message) (duration: 00m 15s)
* 16:07 logmsgbot: krenair Synchronized w/static/images/project-logos/cnwikimedia.png: (no message) (duration: 00m 19s)
* 15:52 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1063 (duration: 00m 14s)
* 15:32 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1063 (warm period) (duration: 00m 13s)
* 15:24 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/213652/ (duration: 00m 15s)
* 15:23 logmsgbot: krenair Synchronized wmf-config/throttle.php: https://gerrit.wikimedia.org/r/#/c/213257/ (duration: 00m 14s)
* 14:54 bblack: restarted ganglia-monitor on all cp* (many were obviously-broken, probably most recently from bad startup after the reboots last week)
* 14:14 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool db1063 (duration: 00m 12s)
* 08:24 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool pc1001 (duration: 00m 13s)
* 05:53 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue May 26 05:52:50 UTC 2015 (duration 52m 49s)
* 03:02 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-26 03:01:12+00:00
* 02:55 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 09m 31s)
* 02:29 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-26 02:28:08+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 06m 44s)
* 01:35 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1026, warm up (duration: 00m 14s)
 
== May 25 ==
* 16:36 jynus: running diagnostics on mariadb@pc1001: a very small amount of requests may experience extra latency
* 14:17 duh: intentionally not scapping right now, will let l10nupdate sync it out
* 14:16 logmsgbot: legoktm Synchronized php-1.26wmf7/extensions/WikimediaMessages/i18n/: ExtensionDistributor message updates (duration: 00m 17s)
* 13:53 logmsgbot: legoktm Synchronized php-1.26wmf7/extensions/ExtensionDistributor: Update ExtensionDistributor to master (duration: 00m 13s)
* 13:38 logmsgbot: jynus Synchronized wmf-config/InitialiseSettings-labs.php: restbase change from yurik (duration: 00m 14s)
* 13:37 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1018 (warm cache) (duration: 00m 13s)
* 13:09 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1018 (duration: 00m 14s)
* 10:31 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool db1018 (duration: 00m 13s)
* 08:36 YuviKTM: running du -d 1 -h > du-may-25-2015 on /exp/project/tools on labstore1001 to audit tools' NFS usage
* 05:12 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon May 25 05:11:47 UTC 2015 (duration 11m 46s)
* 02:50 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-25 02:49:45+00:00
* 02:45 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 32s)
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-25 02:26:39+00:00
* 02:22 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 06m 36s)
 
== May 24 ==
* 17:18 springle: stop mysqld db1002 db1003 db1004 db1005 db1006 db1007
* 10:00 ^d: gerrit: manually gc'd all repos to help with clone times
* 08:55 godog: resize existing whisper files with new retention on graphite2001
* 05:42 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun May 24 05:41:35 UTC 2015 (duration 41m 34s)
* 02:58 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-24 02:57:17+00:00
* 02:53 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 57s)
* 02:34 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-24 02:33:23+00:00
* 02:29 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 06m 34s)
 
== May 23 ==
* 23:30 logmsgbot: ori Synchronized php-1.26wmf7/extensions/Gadgets: b592efa5fe: Update Gadgets for I6da3eede0: Conversion to using WAN cache (duration: 00m 13s)
* 12:54 godog: remove MediaWiki.xhprof to pick up new retention schema
* 12:53 godog: bounce carbon on graphite1001 to pick up new retention schema
* 11:16 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: Ic258d01a7: Revert "Change StatsD port to another value temporarily" (duration: 00m 13s)
* 10:22 ori: Metrics from MediaWiki to graphite are temporarily suspended while xhprof profiling work is ongoing.
* 10:21 logmsgbot: ori Synchronized wmf-config/StartProfiler.php: Exclude xhprof.run_init from being reported (duration: 00m 13s)
* 10:03 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 13s)
* 09:57 logmsgbot: ori Synchronized wmf-config/StartProfiler.php: Ia7549d45: Re-enable xhprof profiling (duration: 00m 14s)
* 09:52 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I311c989e9: Change StatsD port to another value temporarily (duration: 00m 14s)
* 05:13 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat May 23 05:12:44 UTC 2015 (duration 12m 43s)
* 02:45 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-23 02:44:48+00:00
* 02:41 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 05m 56s)
* 02:24 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-23 02:23:36+00:00
* 02:20 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 06m 02s)
* 00:33 mutante: adding cwdent to WMF LDAP group per https://www.mediawiki.org/wiki/User:CDentinger_%28WMF%29
* 00:04 logmsgbot: ori Synchronized php-1.26wmf6/includes: 9bf0236c20, 2d3c9233ed (duration: 00m 17s)
 
== May 22 ==
* 20:59 logmsgbot: ori Synchronized php-1.26wmf7/includes: 4632aff034 (duration: 00m 18s)
* 19:19 logmsgbot: ori Synchronized php-1.26wmf6/includes/profiler: 0d9c4dd8fe, ec22d6e6c3, 4127b1a315: Profiler improvements (duration: 00m 16s)
* 19:18 logmsgbot: ori Synchronized php-1.26wmf7/includes/profiler: a69ee4a0f7, a3773b4d8b, ab19be9d99: Profiler improvements (duration: 00m 15s)
* 17:16 yuvipanda: rebooted labvirt1005 from mgmt see what's up with disk array
* 16:53 yuvipanda: rebooted labvirt1005 for T99738
* 15:01 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/211696/ - disable VE A/B test (duration: 00m 12s)
* 13:57 jynus: schema change on x1 shard https://phabricator.wikimedia.org/T94427 No downtime expected
* 10:55 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1036 (duration: 00m 12s)
* 07:58 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool db1036 (duration: 00m 13s)
* 06:48 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri May 22 06:47:25 UTC 2015 (duration 47m 23s)
* 05:50 springle: upgrade db1026 trusty mariadb 10, mydumper reload
* 03:09 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-22 03:08:51+00:00
* 03:02 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 10m 14s)
* 02:43 logmsgbot: hoo Synchronized php-1.26wmf6/extensions/Wikidata/: Update Wikidata: Make wbmergeitems respect the bot parameter (duration: 00m 19s)
* 02:38 logmsgbot: hoo Synchronized php-1.26wmf7/extensions/Wikidata/: Update Wikidata from wmf4 to wmf6 branch. (duration: 00m 22s)
* 02:36 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-22 02:35:33+00:00
* 02:32 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 05m 56s)
 
== May 21 ==
* 23:50 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings.php: Re-enable subpages for the template namespace on officewiki (duration: 00m 13s)
* 23:35 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings.php: Enable NewUserMessage on hif.wikipedia (duration: 00m 14s)
* 23:30 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings.php: Configure import sources for hif.wikipedia (duration: 00m 12s)
* 23:26 logmsgbot: hoo Synchronized wmf-config/InitialiseSettings.php: Site name configuration on ast.wiktionary (duration: 00m 12s)
* 23:08 logmsgbot: ori Synchronized php-1.26wmf6/includes: 7238213e6d: Defer some updates in doEditUpdates() (duration: 00m 16s)
* 23:07 logmsgbot: ori Synchronized php-1.26wmf7/includes: da79b19b88: Defer some updates in doEditUpdates() (duration: 00m 16s)
* 17:01 mutante: mw1123: apt-get autoclean, rebooting for kernel upgrade
* 16:57 mutante: dist-upgrade on mw1123
* 16:34 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 23m 25s)
* 16:10 logmsgbot: kartik Started scap: Update ContentTranslation
* 16:04 mutante: armed keyholder on mira
* 15:56 kart_: Updated cxserver
* 15:32 Tim: removed max-registration properties from 2015 board elections on metawiki and votewiki per my comment on T97924
* 15:09 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/212281/ (duration: 00m 10s)
* 15:06 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/211116/ (duration: 00m 16s)
* 15:00 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/205778/ - enable VE A/B test (duration: 00m 14s)
* 14:58 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/205778/ - VE A/B test on enwiki (duration: 00m 11s)
* 14:37 bblack: enabling puppet on caches for varnish retries changes...
* 11:51 logmsgbot: twentyafterfour Finished scap: 1.26wmf7 symlinks (duration: 05m 16s)
* 11:49 twentyafterfour: I'm investigating some inconsistencies in symlinks in /srv/mediawiki, ref https://phabricator.wikimedia.org/T99886
* 11:46 logmsgbot: twentyafterfour Started scap: 1.26wmf7 symlinks
* 11:31 paravoid: troubleshooting analytics1036, includes reboots
* 07:49 akosiaris: uploaded to apt.wikimedia.org trusty-wikimedia distribution jessie-wikimedia: php-luasandbox_2.0.9
* 07:21 _joe_: cleaning the bytecode cache database everywhere
* 06:43 _joe_: cleaning up the bytecode caches of a few appservers
* 06:28 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu May 21 06:27:09 UTC 2015 (duration 27m 8s)
* 04:55 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: Ia5239c1e: Unset $wgDiff, so we stop shelling out to diff (duration: 00m 12s)
* 03:10 logmsgbot: LocalisationUpdate completed (1.26wmf7) at 2015-05-21 03:09:49+00:00
* 03:06 logmsgbot: l10nupdate Synchronized php-1.26wmf7/cache/l10n: (no message) (duration: 06m 13s)
* 02:45 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-21 02:44:18+00:00
* 02:38 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 09m 36s)
* 00:38 logmsgbot: ori Synchronized php-1.26wmf7/includes/MediaWiki.php: adacd7b35c: Pass a message key to MalformedTitleException constructor (duration: 00m 11s)
* 00:37 logmsgbot: ori Synchronized php-1.26wmf6/includes/MediaWiki.php: b13721b5cb: Pass a message key to MalformedTitleException constructor (duration: 00m 12s)
* 00:20 logmsgbot: ori Synchronized php-1.26wmf6/includes/jobqueue/JobQueueGroup.php: 1e43c05283: Revert "Undefer push() in lazyPush() temporarily" (duration: 00m 12s)
 
== May 20 ==
* 23:07 logmsgbot: legoktm Synchronized php-1.26wmf7/extensions/SyntaxHighlight_GeSHi/: https://gerrit.wikimedia.org/r/212456 (duration: 00m 14s)
* 23:05 logmsgbot: legoktm Synchronized wmf-config/: Disable WikiGrok in WMF production (duration: 00m 13s)
* 22:14 logmsgbot: twentyafterfour Purged l10n cache for 1.26wmf5
* 21:51 logmsgbot: ori Synchronized php-1.26wmf6/includes: I32a3cfabc: Made pushLazyJobs() handle all queue groups (duration: 00m 18s)
* 21:25 logmsgbot: legoktm Synchronized php-1.26wmf7/extensions/SyntaxHighlight_GeSHi: https://gerrit.wikimedia.org/r/#/c/212450/ (duration: 00m 13s)
* 21:18 logmsgbot: twentyafterfour Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 14s)
* 21:01 cscott: updated OCG to version ca4f64852de5b1de782b292b50038fbd2dd84266
* 20:59 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf7
* 20:58 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedias to 1.26wmf6
* 20:50 logmsgbot: twentyafterfour Finished scap: retry: testwiki to php-1.26wmf7 and rebuild l10n cache (duration: 26m 02s)
* 20:42 ebernhardson: restarted gmond on elastic10{01..31}.eqiad.wmnet
* 20:24 logmsgbot: twentyafterfour Started scap: retry: testwiki to php-1.26wmf7 and rebuild l10n cache
* 20:12 subbu: deployed parsoid version 8ed6fd0b
* 19:35 logmsgbot: twentyafterfour scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="testwiki" --outdir="/tmp/scap_l10n_3448528422" --threads=4 --lang en  --quiet' returned non-zero exit status 255 (duration: 03m 22s)
* 19:32 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf7 and rebuild l10n cache
* 17:41 bblack: esams+eqiad upload varnish caches will be downtimed+rebooted today, experimenting with depool effects as well (next several hours)
* 16:03 logmsgbot: manybubbles Synchronized php-1.26wmf5/extensions/Flow/: SWAT update flow for wmf5 to fix two issues (duration: 00m 14s)
* 15:54 godog: rolling restart restbase on restbase1003-1006
* 15:52 mobrovac: restbase restarted on restbase1002
* 15:47 godog: restbase restarted on restbase1001
* 15:35 logmsgbot: manybubbles Synchronized php-1.26wmf6/extensions/Flow/: SWAT update flow for wmf6 to fix two issues (duration: 00m 12s)
* 15:22 logmsgbot: manybubbles Synchronized wmf-config/InitialiseSettings.php: SWAT new namespaces for ptwikinews (duration: 00m 11s)
* 15:18 logmsgbot: manybubbles Synchronized wmf-config/throttle.php: SWAT clean old throttle rule and add a new one for an upcoming festival (duration: 00m 13s)
* 15:14 logmsgbot: manybubbles Synchronized wmf-config/InitialiseSettings.php: SWAT update urwikiquote logo 2/2 (duration: 00m 11s)
* 15:13 logmsgbot: manybubbles Synchronized w/static/images/project-logos/urwikiquote.png: SWAT update urwikiquote logo 1/2 (duration: 00m 13s)
* 15:06 springle: db1045 pt-osc reindexing (should be low load, ~2hr)
* 14:36 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable usage tracking on itwiki and wikiquote (duration: 00m 16s)
* 14:25 milimetric: Deployed Event Logging Server with better batch insertion on Monday, May 18 (apologies for late notice)
* 13:13 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1045; depool db1026 (duration: 00m 13s)
* 10:18 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1063 (duration: 00m 11s)
* 09:43 _joe_: stopping puppet, fiddling with HHVM parameters on mw1114
* 09:37 Coren: tools kicked grrrit-wm in the diodes.
* 09:35 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1063 (duration: 00m 12s)
* 06:45 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool db1063 for maintenance (duration: 00m 11s)
* 06:43 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed May 20 06:42:22 UTC 2015 (duration 42m 21s)
* 03:13 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-20 03:12:31+00:00
* 03:06 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 09m 40s)
* 02:41 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-20 02:40:07+00:00
* 02:36 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 06m 30s)
* 01:14 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1045 (duration: 00m 15s)
* 00:43 logmsgbot: ebernhardson Synchronized wmf-config/: Per-user poolcounter triggered many more times than expected (duration: 00m 15s)
* 00:42 logmsgbot: ebernhardson Synchronized wmf-config/PoolCounterSettings-common.php: Enable per-user poolcounter in CirrusSearch on all wikis (duration: 00m 14s)
* 00:41 logmsgbot: ebernhardson Synchronized wmf-config/InitialiseSettings.php: Enable per-user poolcounter in CirrusSearch on all wikis (duration: 00m 12s)
* 00:40 logmsgbot: ebernhardson Synchronized php-1.26wmf5/extensions/NavigationTiming/: Update NavigationTiming for cherry-picks in 1.26wmf5 (duration: 00m 12s)
* 00:39 logmsgbot: ebernhardson Synchronized php-1.26wmf6/extensions/NavigationTiming/: Update NavigationTiming for cherry-picks in 1.26wmf6 (duration: 00m 12s)
* 00:36 logmsgbot: ebernhardson Synchronized php-1.26wmf5/extensions/CirrusSearch/: Bump CirrusSearch in 1.26wmf5 for poolcounter error message updates (duration: 00m 11s)
* 00:35 logmsgbot: ebernhardson Synchronized php-1.26wmf6/extensions/CirrusSearch/: Bump CirrusSearch in 1.26wmf6 for poolcounter error message updates (duration: 00m 13s)
* 00:34 logmsgbot: ebernhardson Synchronized php-1.26wmf5/extensions/CirrusSearch/: Bump CirrusSearch in 1.26wmf5 for poolcounter error message updates (duration: 00m 12s)
* 00:32 logmsgbot: ebernhardson Synchronized php-1.26wmf6/extensions/CirrusSearch/: Bump CirrusSearch in 1.26wmf6 for poolcounter error message updates (duration: 00m 12s)
 
== May 19 ==
* 23:35 gwicke: deployed RESTBase 90817c2a
* 23:20 logmsgbot: bd808 Synchronized wmf-config/InitialiseSettings.php: logstash: Exclude jobrunner debug messages (duration: 00m 12s)
* 23:10 logmsgbot: bd808 Synchronized wmf-config/InitialiseSettings.php: Enable NewUserMessage on maiwiki and pawiki (duration: 00m 12s)
* 22:06 ejegg: updated payment from e89d18ee20abcb1ca3c455e6a298bf8a6aa84442 to  858b87319daa3d66f62eb32e08cefc6b061748d1
* 21:16 logmsgbot: kaldari Synchronized php-1.26wmf6/extensions/MobileFrontend: syncing MobileFrontend for 1.26wmf6 (duration: 00m 11s)
* 21:15 logmsgbot: kaldari Synchronized php-1.26wmf6/extensions/Gather: syncing Gather for 1.26wmf6 (duration: 00m 12s)
* 21:07 robh: merging fixes to sodium, mailing list outage fixed
* 20:51 andrewbogott: rebooting/reimaging virt1005, virt1006, 1007
* 20:22 mutante: mailman: killed processes by user "list". started mailman
* 19:40 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: Ia6a2cb7: Removed "refreshLinks" from $wgJobBackoffThrottling (duration: 00m 12s)
* 19:37 logmsgbot: anomie Finished scap: Step 2 for deploying ApiFeatureUsage: sync the config, and l10n data again because I don't think it did last time (duration: 44m 34s)
* 19:25 robh: mailman permission errors abound!  had to take it offline again and fixing
* 19:02 robh: mailman is back to routing mail normally (still testing rename parts)
* 18:53 logmsgbot: anomie Started scap: Step 2 for deploying ApiFeatureUsage: sync the config, and l10n data again because I don't think it did last time
* 18:51 logmsgbot: anomie Finished scap: Step 1 for deploying ApiFeatureUsage: sync the code and l10n data (duration: 05m 39s)
* 18:46 logmsgbot: anomie Started scap: Step 1 for deploying ApiFeatureUsage: sync the code and l10n data
* 18:38 yuvipanda: issuing start command for all hosts on labvirt1006, just to make sure
* 18:35 yuvipanda: labvirt1006 rebooting, long POST
* 18:31 yuvipanda: restarted labvirt1006
* 18:20 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group1 to 1.26wmf6
* 18:15 robh: stopping mailman again for further planned work T99098
* 17:43 robh: mailing lists still down, scrubbing list archives is painful and error prone
* 17:33 ottomata: starting reboots of analytics worker nodes in order to enable hyperthreading Bug: https://phabricator.wikimedia.org/T90640
* 17:04 robh: puppet stopped on sodium (dont need it restarting mailman while im working)
* 17:04 robh: starting mailman downtime window to scrub content off list archive per T99098
* 16:58 bblack: automated reboots of esams/eqiad non-upload caches starting (should auto-downtime, should be no real impact)...
* 15:51 logmsgbot: anomie Synchronized php-1.26wmf5/extensions/AbuseFilter/: SWAT: Fix boolean response in API action=abusefiltercheckmatch [[gerrit:211743]] (duration: 00m 12s)
* 15:50 logmsgbot: anomie Synchronized php-1.26wmf6/extensions/AbuseFilter/: SWAT: Fix boolean response in API action=abusefiltercheckmatch [[gerrit:211744]] (duration: 00m 10s)
* 15:31 logmsgbot: anomie Synchronized php-1.26wmf5/includes/skins/SkinTemplate.php: SWAT: Revert "output mw-content-{ltr,rtl} unconditionally" [[gerrit:211893]] (duration: 00m 12s)
* 15:28 logmsgbot: anomie Synchronized php-1.26wmf6/includes/skins/SkinTemplate.php: SWAT: Revert "output mw-content-{ltr,rtl} unconditionally" [[gerrit:211894]] (duration: 00m 13s)
* 15:16 logmsgbot: anomie Synchronized php-1.26wmf5/includes/registration/ExtensionRegistry.php: SWAT: registration: Don't array_unique() over the queue before loading it [[gerrit:211948] (duration: 00m 12s)
* 15:15 logmsgbot: anomie Synchronized php-1.26wmf6/includes/registration/ExtensionRegistry.php: SWAT: registration: Don't array_unique() over the queue before loading it [[gerrit:211947] (duration: 00m 12s)
* 14:43 jynus: back to read/write after virt1000 database migration - migration seems ok
* 14:41 godog: purge cassandra system CF metrics from graphite1001
* 14:29 jynus: temporarily going read-only for virt1000 for database migration
* 14:24 mobrovac: enabled puppet on restbase1001
* 14:19 mobrovac: restbase group1 wiki keyspaces created
* 14:15 mobrovac: starting manually RB with group1 wikis enabled on restbase1001
* 14:11 mobrovac: restbase100x: removed superfluous keyspaces by hand from Cassandra
* 13:47 bblack: done with cp40xx reboot process
* 13:32 bblack: rebooting ulsfo caches (cp40xx - currently depooled from all traffic + downtimed in icinga)
* 13:09 mobrovac: disabled puppet on restbase100x
* 12:51 godog: bounce hhvm on mw1152
* 08:26 _joe_: restarting a few HHVM instances with a full TC space
* 05:05 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue May 19 05:03:56 UTC 2015 (duration 3m 55s)
* 02:46 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-19 02:45:17+00:00
* 02:43 logmsgbot: krinkle Synchronized php-1.26wmf6/includes/resourceloader/ResourceLoader.php: Ic0df4fb5cff (duration: 00m 12s)
* 02:42 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 05m 43s)
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-19 02:25:05+00:00
* 02:21 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 06m 11s)
* 00:37 logmsgbot: ebernhardson Synchronized php-1.26wmf5/includes/jobqueue/JobQueueGroup.php: Undefer push() in lazyPush() temporarily (duration: 00m 12s)
* 00:36 logmsgbot: ebernhardson Synchronized php-1.26wmf6/includes/jobqueue/JobQueueGroup.php: Undefer push() in lazyPush() temporarily (duration: 00m 12s)
 
== May 18 ==
* 23:49 yuvipanda: restarted nutcracker on mw1053 and mw1107 for bd808
* 23:47 bd808: nutcracker needs restart on mw1053 and mw1107
* 23:37 yuvipanda: restarting hhvm on mw1123
* 23:36 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: Revert "Removed "refreshLinks" from $wgJobBackoffThrottling" (duration: 00m 14s)
* 23:29 logmsgbot: ebernhardson Synchronized wmf-config/CommonSettings.php: removed refreshlinks from #wgJobBackoffThrottling (duration: 00m 14s)
* 23:21 hoo: Reverting my changes to the sites and site_identifiers tables from earlier on... apparently the export/importSites.php maintenance scripts don't work as advertised
* 23:03 logmsgbot: ori Synchronized php-1.26wmf6/extensions/Echo: 8609cb6b90: Update Echo for cherry-picks (duration: 00m 30s)
* 23:02 logmsgbot: ori Synchronized php-1.26wmf5/extensions/Echo: 8c619b99a6: Update Echo for cherry-picks (duration: 00m 57s)
* 22:46 hoo: Updating the sites table on all wikis to reflect the language code change of bhwiki (from bh to bho). I have a backup of the old table from Wikidata in my home, should things go wrong.
* 20:38 mforns: upgraded and restarted EventLogging server: 19b5b7ae719321c4b8fb112890b574051b090571
* 20:12 subbu: deployed parsoid version 8ed3e503
* 19:42 yurik: restarted graphoid service to pick up the new config https://gerrit.wikimedia.org/r/#/c/211450/
* 19:35 ori: restarted statsv on hafnium
* 18:29 logmsgbot: ori Synchronized php-1.26wmf6/includes: 335f8a257d, e3b2255d9c (for UBN! T99468) (duration: 00m 28s)
* 18:28 logmsgbot: ori Synchronized php-1.26wmf5/includes: 335f8a257d, e3b2255d9c (for UBN! T99468) (duration: 01m 26s)
* 18:06 ori: restarted HHVM on mw1107 with libjemalloc heap profiling enabled
* 17:55 ori: Enabling heap profiling on mw11107 to troubleshoot T99525
* 17:08 andrewbogott: starting all instances on labvirt1001 (well, the ones that were running before)
* 16:59 andrewbogott_: dist-upgrading labvirt1001 since it’s down anyway and we may be due for kernel updates.
* 16:53 andrewbogott_: rebooting labvirt1001, and frowning a lot
* 15:59 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/209286/ and https://gerrit.wikimedia.org/r/#/c/211407/ - should be no-ops (duration: 00m 20s)
* 15:36 logmsgbot: marktraceur Synchronized php-1.26wmf6/includes/: [SWAT] [wmf6] resourceloader: Don't cache minification of user.tokens (duration: 00m 19s)
* 15:24 logmsgbot: marktraceur Synchronized php-1.26wmf6/includes/Title.php: [SWAT] [wmf6] Log callers that trigger Title::newFromText $text type warning (duration: 00m 46s)
* 15:23 logmsgbot: marktraceur Synchronized php-1.26wmf5/includes/Title.php: [SWAT] [wmf5] Log callers that trigger Title::newFromText $text type warning (duration: 00m 15s)
* 15:07 logmsgbot: marktraceur Synchronized wmf-config/InitialiseSettings.php: [SWAT] [config] Add wikis for deployment on 2015-05-18 (duration: 00m 29s)
* 14:35 andrewbogott: disabling puppet on labnet1001 to debug dnsmasq
* 14:07 _joe_: restarting HHVM on mw1107 - memory leak probably happening
* 13:38 logmsgbot: aude Synchronized wmf-config/InitialiseSettings-labs.php: Remove beta-specific Graph settings (duration: 01m 46s)
* 13:34 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable arbitrary access on enwikivoyage, fawiki, and hewiki, and graph extension everywhere (duration: 00m 57s)
* 13:31 logmsgbot: aude Synchronized php-1.26wmf6/extensions/Wikidata: Fix rdf dump script (duration: 03m 23s)
* 13:27 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1063 after warmup period (duration: 01m 01s)
* 13:01 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: repool db1063 (duration: 00m 17s)
* 11:13 yurik: deployed graphoid update to fix https://phabricator.wikimedia.org/T99349
* 11:10 logmsgbot: jynus Synchronized wmf-config/db-eqiad.php: depool db1063 (duration: 01m 00s)
* 11:07 jynus: depooling db1063 from cluster for maintenance
* 09:02 godog: loss on ulsfo-eqiad, depooled ulsfo
* 05:18 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon May 18 05:17:50 UTC 2015 (duration 17m 49s)
* 02:46 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-18 02:45:52+00:00
* 02:42 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 05m 35s)
* 02:26 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-18 02:25:54+00:00
* 02:21 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 06m 24s)
 
== May 17 ==
* 05:06 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun May 17 05:05:16 UTC 2015 (duration 5m 15s)
* 02:44 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-17 02:43:13+00:00
* 02:39 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 05m 18s)
* 02:25 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-17 02:24:09+00:00
* 02:20 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 06m 10s)
 
== May 16 ==
* 13:27 manybubbles: that was the last server in the elasticsearch rolling restart. all done. now we have new versions of the plugins. Lets try not to do that again.
* 13:25 manybubbles: es-tool restart-fast on elastic1031
* 09:15 godog: bounce hhvm on mw1196
* 09:10 godog: bounce hhvm on mw1141
* 07:49 godog: restart hhvm on mw1234, still pushing xhprof metrics
* 06:03 _joe_: killed nrpe on labvirt1003 - see T99341
* 05:02 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat May 16 05:01:02 UTC 2015 (duration 1m 1s)
* 04:11 andrewbogott: restarting sshd and generally poking around on labvirt1003
* 02:47 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-16 02:46:08+00:00
* 02:43 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 04m 55s)
* 02:29 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-16 02:28:37+00:00
* 02:25 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 55s)
 
== May 15 ==
* 22:35 ejegg: updated crm from 03eb4cff1b009e8abaceec250f9a1c5d1f3c6b18 to 7ffe0cefb019828a09c9369187f14518847b5f41
* 19:44 manybubbles: elastic1027 es-tool restart-fast
* 19:37 awight: update crm from 2a2336655737a2cd1d3cc24624d1e8475e4cf039 to 03eb4cff1b009e8abaceec250f9a1c5d1f3c6b18
* 18:29 manybubbles: elastic1026 es-tool restart-fast
* 18:28 godog: bounce hhvm on mw1118
* 17:55 jynus: migrating of db service from virt1000 to m5-master aborted, service continues on virt1000
* 17:44 manybubbles: rolling restart almost done on elastic1025 - 1026 is next!
* 17:33 andrewbogott: updating qemu binaries on labvirt1001
* 17:29 godog: clean up remaining xhprof metrics from graphite1001
* 17:19 godog: bounce hhvm on mw1017
* 17:07 godog: still seeing metrics from xhprof creating, looking for source
* 16:29 godog: bounce carbon on graphite1001
* 16:23 manybubbles: elastic1023 and elastic1024 (skipped one log) es-tool restart-fast
* 16:16 godog: bounce statsdlb on graphite1001
* 14:49 jynus: migrating mariadb service from virt1000 to m5-master
* 14:37 manybubbles: elastic1021 es-tool restart-fast
* 14:26 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1053 in s1, warm up (duration: 00m 13s)
* 12:21 manybubbles: elastic1020 es-tool restart-fast
* 10:19 godog: bounce statsite and uwsgi on graphite1001
* 09:29 godog: restart carbon on graphite1001
* 09:15 godog: restart hhvm on mw1018, straggling
* 09:07 godog: rm MediaWiki.run_init from graphite1001 / graphite2001
* 09:04 ori: restarted hhvm / jobrunner on jobrunners to force them to pick up I6a516a0da ; re-cleared /var/lib/carbon/whisper/MediaWiki/query_* on graphite1001 and graphite2001
* 08:49 kart_: Updated cxserver to 1cb6cec
* 08:21 jynus: reenabling icinga check for MySQL on db1009
* 08:15 logmsgbot: oblivian Synchronized wmf-config/StartProfiler.php: Null-sync to touch the file (duration: 00m 12s)
* 07:20 ori: rm -rf /var/lib/carbon/whisper/MediaWiki/query_* on graphite1001 and graphite2001, as follow-up cleanup for I6a516a0da
* 07:14 logmsgbot: ori Synchronized wmf-config/StartProfiler.php: I6a516a0da: Don't send profiling data to graphite for now (duration: 00m 11s)
* 06:23 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri May 15 06:22:19 UTC 2015 (duration 22m 18s)
* 05:38 jynus: temporarily opening mysql port on firewall from db1009 to virt1000
* 04:37 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1018, warm up (duration: 00m 11s)
* 02:58 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-15 02:56:59+00:00
* 02:55 springle: xtrabackup clone db1057 to db1053
* 02:54 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 04m 37s)
* 02:42 springle: upgrade db1053 trusty
* 02:34 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-15 02:33:18+00:00
* 02:33 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1019; depool db1053 (duration: 00m 13s)
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 39s)
* 02:12 manybubbles|away: elastic1019 es-tool restart-fast
* 01:12 manybubbles|away: elastic1018 es-tool restart-fast
* 00:07 manybubbles|away: elastic1017 es-tool restart-fast
 
== May 14 ==
* 23:35 logmsgbot: demon Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 11s)
* 23:20 ori: Depooled mw1169; HHVM deadlock à la T89912. Leaving it depooled to investigate.
* 23:05 logmsgbot: demon Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 11s)
* 23:05 logmsgbot: demon Synchronized w/static/images/project-logos/urwikiquote.png: (no message) (duration: 00m 14s)
* 23:03 logmsgbot: demon Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 17s)
* 22:26 logmsgbot: ori Synchronized wmf-config/StartProfiler.php: Icbf826a7: 1:1000 request profiling via xhprof (duration: 00m 12s)
* 22:23 gwicke: deployed RESTBase v0.6.3 (fd942ac38ad)
* 22:20 logmsgbot: ori Synchronized php-1.26wmf6/extensions/CirrusSearch/includes/ElasticsearchIntermediary.php: (no message) (duration: 00m 15s)
* 21:39 manybubbles: I'm going to be done doing rolling restarts for a couple of hours. If someone wants to pick them up and do the next one after the cluster goes green again then be my guest.
* 21:35 manybubbles: es-tool restart-fast on elastic1016
* 21:27 logmsgbot: ori Synchronized php-1.26wmf6/extensions/CirrusSearch/includes/ElasticsearchIntermediary.php: (no message) (duration: 00m 12s)
* 21:27 logmsgbot: ori Synchronized php-1.26wmf5/extensions/CirrusSearch/includes/ElasticsearchIntermediary.php: (no message) (duration: 00m 12s)
* 21:14 logmsgbot: ori Synchronized php-1.26wmf6/extensions/CirrusSearch/includes/ElasticsearchIntermediary.php: I3df6713a1: Log request times to StatsD (duration: 00m 13s)
* 21:14 logmsgbot: ori Synchronized php-1.26wmf5/extensions/CirrusSearch/includes/ElasticsearchIntermediary.php: I3df6713a1: Log request times to StatsD (duration: 00m 15s)
* 21:11 manybubbles: elastic1015 es-tool restart-fast
* 19:43 robh: mass unsubcription in listadmins list, resulting in unsupressed mass unsubscribe notices to all listadmin email address (sorry about the emails!)
* 19:24 logmsgbot: legoktm Synchronized php-1.26wmf5/skins/Nostalgia/skin.json: touch (duration: 00m 17s)
* 19:15 legoktm: debugging on tin / mw1017 for nostalgiawiki issue
* 16:59 ^d: elasticsearch: set transient cluster.routing.allocation.node_concurrent_recoveries on prod cluster to 8 (default: 2) to speed up recoveries.
* 16:52 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 44m 07s)
* 16:28 andrewbogott: disabling puppet on labnet1001 for testing
* 16:13 godog: es-tool restart-fast on elastic1014
* 16:08 logmsgbot: kartik Started scap: Update ContentTranslation
* 15:46 logmsgbot: thcipriani Synchronized php-1.26wmf5/extensions/Translate: SWAT update translate to a6f0a63 [[gerrit:210919]] (duration: 00m 15s)
* 15:12 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT enable new article campaign except bawiki [[gerrit:210916]] (duration: 00m 12s)
* 15:04 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: Open external links on votewiki in new tab [[gerrit:210849]] (duration: 00m 12s)
* 15:00 godog: es-tool restart-fast on elastic1013
* 14:48 logmsgbot: andyrussg Synchronized php-1.26wmf6/extensions/CentralNotice/: Update CentralNotice (duration: 00m 13s)
* 14:34 paravoid: reimaging multatuli
* 14:34 jynus: migrating data db from virt1000 to db1009
* 14:23 bblack: restarted ganglia-monitor on eeden
* 14:21 logmsgbot: andyrussg Synchronized php-1.26wmf5/extensions/CentralNotice/: Update CentralNotice (duration: 00m 12s)
* 14:16 godog: es-tool restart-fast on elastic1012
* 14:12 paravoid: switching ns2 back to eeden
* 13:56 cmjohnson1: upgrading tellurium to trusty
* 13:41 cmjohnson1: power cycling barium
* 13:40 godog: es-root restart-fast on elastic1011
* 13:21 paravoid: reimaging eeden with jessie
* 12:59 paravoid: switching ns2 to multatuli
* 12:53 jynus: disabling temporarily Ichinga check for MySQL running on db1009 until data is migrated from virt1000 and host sent to production
* 12:40 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-pt-gl_0.9.2~r60358-1
* 12:36 godog: es-tool restart-fast on elastic1010
* 11:40 manybubbles: restarting elasticsearch on elastic1009
* 05:07 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu May 14 05:06:09 UTC 2015 (duration 6m 8s)
* 02:55 manybubbles: restarting elasticsearch on elastic1008
* 02:50 logmsgbot: LocalisationUpdate completed (1.26wmf6) at 2015-05-14 02:49:53+00:00
* 02:47 logmsgbot: l10nupdate Synchronized php-1.26wmf6/cache/l10n: (no message) (duration: 04m 16s)
* 02:44 springle: xtrabackup clone db1056 to db1019
* 02:29 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-14 02:28:02+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 51s)
* 01:48 manybubbles: sorry - restarting elasticsearch on elastic1007
* 01:47 manybubbles: restarting elastic1007
* 01:33 logmsgbot: springle Synchronized wmf-config/db-codfw.php: pool new codfw slaves (duration: 00m 11s)
* 01:28 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1060, warm up (duration: 00m 14s)
* 00:49 manybubbles: restarting elasticsearch on elastic1006
* 00:03 logmsgbot: ebernhardson Synchronized php-1.26wmf5/extensions/Gather/: SWAT Submodule bump for Gather extension (duration: 00m 12s)
 
== May 13 ==
* 23:52 awight: payments config: correct memcache location
* 23:40 logmsgbot: ebernhardson Synchronized wmf-config/CirrusSearch-common.php: SWAT deploy cirrus config change (duration: 00m 12s)
* 22:26 logmsgbot: twentyafterfour Purged l10n cache for 1.26wmf4
* 22:25 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group 0 to 1.26wmf6
* 22:21 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Wikipedias to 1.26wmf5
* 22:17 twentyafterfour: restarted phd on iridium (phabricator) to sync the daemons' configuration
* 21:28 manybubbles: restarting elasticsearch on elastic1005
* 21:12 cscott: updated OCG to version c7c75e5b03ad9096571dc6dbfcb7022c924ccb4f
* 21:03 awight: updated payments from f97f8f99268974cfdb0182f178955bd627137842 to e89d18ee20abcb1ca3c455e6a298bf8a6aa84442
* 20:28 subbu: deployed parsoid version a8108fe6
* 20:15 manybubbles: restarted elasticsearch on elastic1004
* 20:12 logmsgbot: twentyafterfour Finished scap: testwiki to php-1.26wmf6 and rebuild l10n cache (duration: 47m 24s)
* 20:11 manybubbles: cancel that - I just realized I can't do that.
* 20:10 manybubbles: elastic1003 restarted elasticsearch just fine. the cluster restart is going awesome. I'm going to rig the other 28 to restart via a script, one after the other. Expect nagios to complain about them some.
* 20:03 bblack: restarting hhvm on mw1190
* 19:25 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf6 and rebuild l10n cache
* 19:11 awight: paymens rolled back to f97f8f99268974cfdb0182f178955bd627137842
* 19:10 awight: payments updated from f97f8f99268974cfdb0182f178955bd627137842 to 5c326a521120a904a2012654e9287757dc5a8ca2
* 19:00 manybubbles: elastic1002 restart went well - starting elastic1003
* 18:45 awight: rolled back payments to f97f8f99268974cfdb0182f178955bd627137842
* 18:43 awight: update payments from f97f8f99268974cfdb0182f178955bd627137842 to 5c326a521120a904a2012654e9287757dc5a8ca2
* 18:05 logmsgbot: demon Synchronized wmf-config/CommonSettings.php: undo all the nostalgia (duration: 00m 10s)
* 17:21 logmsgbot: demon Synchronized wmf-config/CommonSettings.php: something something skins are broken (duration: 00m 11s)
* 17:14 logmsgbot: demon Synchronized wmf-config/CommonSettings.php: because sometimes moving code helps (duration: 00m 15s)
* 17:10 manybub|lunch: elastic1002 restarted and rejoined the cluster - now the cluster is repaining. hurray.
* 17:08 manybub|lunch: elastic1001 restarted and rejoined the cluster hapilly while I was at lunch. it looks good - no errors beyond the ones we have fixes in flight for. So I'm going to do elastic1002
* 17:03 hashar: Zuul clone failures solved. Was due to network traffic being interrupted between labs and prod.
* 16:53 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/209967/ (duration: 00m 14s)
* 16:51 hashar: Zuul clone failure https://phabricator.wikimedia.org/T98980
* 16:49 andrewbogott: re-enabling puppet on labnet1001
* 16:46 mutante: es2010 failed disk, reopening ticket for last fail in January
* 16:41 jynus: Enabling puppet agent in db1009.eqiad after reinstall
* 16:40 logmsgbot: ori Synchronized php-1.26wmf4/includes/resourceloader/ResourceLoader.php: I30b490e5b: ResourceLoader::filter: use APC when running under HHVM (duration: 00m 11s)
* 16:38 logmsgbot: ori Synchronized php-1.26wmf5/includes/resourceloader/ResourceLoader.php: I30b490e5b: ResourceLoader::filter: use APC when running under HHVM (duration: 00m 14s)
* 16:28 andrewbogott: disabling puppet on labnet1001 to tinker with nova config
* 15:44 mark: Disregard cr2-knams:xe-0/0/0; we're working on it
* 15:21 manybubbles: I think the elasticsearch cluster got stuck with alloation disabled after the rolling restart. Funky. Haven't seen that one before. Probably a problem with our instructions. Anyway, unstuck it and recovery is going faster now
* 15:17 logmsgbot: demon Synchronized wmf-config/InitialiseSettings.php: didn't work, undoing previous sync (duration: 00m 12s)
* 15:15 logmsgbot: demon Synchronized wmf-config/InitialiseSettings.php: trying something (duration: 00m 12s)
* 14:53 manybubbles: elasticsearch restart on elastic1001 going well. cluster still in recovering state as expect. I'll give it an hour to soak.
* 14:48 manybubbles: ok - time to start the rolling restart. I'm going to to elastic1001 first non-automated and watch it
* 14:36 manybubbles: s/gitfit/gitfat/ oh well
* 14:35 manybubbles: first attempt at syncing elasticsearch plugins didn't work 100%. syncing again. gitfit/gitdeploy is betraying me
* 14:32 manybubbles: syncing new versions of elsaticsearch plugins to prod. no restarts yet.
* 14:04 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable usage tracking for Wikisource (duration: 00m 14s)
* 13:57 aude: added wbc_entity_usage table on all Wikibase Client wikis
* 13:56 jynus: jcrespo Disabling puppet agent in db1009.eqiad in preparation for reinstall
* 13:45 logmsgbot: aude Synchronized php-1.26wmf5/extensions/Wikidata: Update maintenance script (duration: 00m 20s)
* 12:45 springle: xtrabackup clone db1060 to db1018
* 12:39 springle: upgrade and restart db1060
* 09:20 jamesofur: inserting FDC election encryption key
* 06:21 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed May 13 06:19:59 UTC 2015 (duration 19m 58s)
* 05:53 springle: reinstall db1018
* 04:50 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1018 (duration: 00m 12s)
* 03:11 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-13 03:10:31+00:00
* 03:07 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 43s)
* 02:46 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-13 02:45:28+00:00
* 02:39 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 10m 08s)
* 01:56 damagecat: Started 'jobs' screen in tin to drain refreshLinks for enwiki using --nothrottle (T98621)
* 01:29 logmsgbot: legoktm Synchronized wmf-config/CommonSettings.php: Hardcode UploadWizard max upload size - T98933 (duration: 00m 12s)
* 01:23 logmsgbot: legoktm Synchronized php-1.26wmf5/extensions/GWToolset/:  Check php max_file_size limit directly from PHP $_FILES (duration: 00m 12s)
* 01:21 logmsgbot: legoktm Synchronized php-1.26wmf4/extensions/GWToolset/:  Check php max_file_size limit directly from PHP $_FILES (duration: 00m 12s)
* 01:07 gwicke: added commons to supported projects in RESTBase API
* 00:16 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I5ebedfdfb: Set $wgGadgetsCacheType to CACHE_ACCEL (duration: 00m 12s)
* 00:13 logmsgbot: ori Synchronized php-1.26wmf4/includes/jobqueue/jobs/RefreshLinksJob.php: 914d71f3cc: Temporary hack to drain excess refreshLinks jobs (duration: 00m 14s)
* 00:12 logmsgbot: ori Synchronized php-1.26wmf4/extensions/Gadgets: 7539873979: Update Gadgets for cherry-pick (duration: 00m 12s)
* 00:10 logmsgbot: ori Synchronized php-1.26wmf5/extensions/Gadgets: cbb9b1e475: Update Gadgets for cherry-pick (duration: 00m 12s)
 
== May 12 ==
* 23:40 ori: Upgraded all Apaches to HHVM 3.6.1+dfsg1-1+wm2 and Apache 2.4.7-1ubuntu4.4
* 23:26 logmsgbot: demon Synchronized php-1.26wmf4/extensions/CirrusSearch/: (no message) (duration: 00m 12s)
* 23:24 logmsgbot: demon Synchronized php-1.26wmf4/includes/jobqueue/jobs/RefreshLinksJob.php: (no message) (duration: 00m 11s)
* 23:23 logmsgbot: demon Synchronized php-1.26wmf5/includes/jobqueue/jobs/RefreshLinksJob.php: (no message) (duration: 00m 12s)
* 23:23 logmsgbot: demon Synchronized php-1.26wmf5/includes/media/DjVu.php: (no message) (duration: 00m 12s)
* 23:18 ori: Upgrading more HHVMs; DPKG alerts likely but they will be transient.
* 23:10 logmsgbot: demon Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 11s)
* 23:03 logmsgbot: demon Synchronized wmf-config/InitialiseSettings.php: swat (duration: 00m 12s)
* 21:48 logmsgbot: kaldari Finished scap: updating i18n for Gather (1.26wmf5) (duration: 23m 17s)
* 21:25 logmsgbot: kaldari Started scap: updating i18n for Gather (1.26wmf5)
* 21:24 logmsgbot: kaldari Synchronized php-1.26wmf5/extensions/Gather: Updating Gather for 1.26wmf5 (duration: 00m 12s)
* 21:06 apergos: manually installed trigger-trebuchet update on tin after accidental salt upgrade there woops :-D
* 20:56 mutante: upgrading salt packages on tin
* 19:50 ori: Upgrading several app servers to new version of HHVM, expect transient 'DPKG CRITICAL' alerts
* 18:19 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group1 wikis to 1.26wmf5
* 17:38 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: Ie4641b6e4: Set $wgWMEStatsdBaseUri to host-relative beacon/ path (duration: 00m 12s)
* 16:24 yurik: graphoid service synced, now supports Cache Control headers
* 16:19 ori: restarted HHVM on mw1061; T89912
* 15:20 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT Add *.sl.nsw.gov.au to wgCopyUploadsDomains [[gerrit:210356]] (duration: 00m 11s)
* 15:15 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT Namespaces configuration on or.wiktionary [[gerrit:210350]] (duration: 00m 12s)
* 15:10 hashar: mediawiki-phpunit-hhvm Jenkins job is broken due to an hhvm upgrade {{bug|T98876}}
* 15:07 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT enable NewUserMessage on bh.wikipedia [[gerrit:209146]] (duration: 00m 13s)
* 13:55 akosiaris: temporarily blocked an IP on uranium firewall. It was the cause of requests causing CPU load. http://ganglia.wikimedia.org/latest/graph.php?r=day&z=xlarge&h=uranium.wikimedia.org&m=cpu_report&s=descending&mc=2&g=cpu_report&c=Miscellaneous+eqiad
* 11:06 twentyafterfour: restarted apache on iridium to clear php opecode cache
* 09:53 akosiaris: restarted gitblit on antimony
* 06:58 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue May 12 06:57:17 UTC 2015 (duration 57m 16s)
* 06:15 springle: pt-kill on 3600s running on dbstore1002 until repl streams recover
* 06:05 springle: killed 100+ 3-day unindexed research queries on dbstore1002, all repl streams lagging and /tmp unhappy
* 03:01 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-12 03:00:22+00:00
* 02:57 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 47s)
* 02:35 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-12 02:34:30+00:00
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 33s)
* 00:39 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings.php: Update Wikipedia word mark and related config (duration: 00m 11s)
* 00:38 logmsgbot: mattflaschen Synchronized images/mobile/wikipedia-wordmark-en.png: Update Wikipedia word mark and related config (duration: 00m 13s)
* 00:30 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings.php: Add www.jacar.go.jp to wgCopyUploadsDomains (duration: 00m 11s)
* 00:30 yuvipanda: restarted nutcracker on silver
* 00:28 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings.php: Deploy Catalan Wikinews flood group (duration: 00m 13s)
* 00:19 logmsgbot: mattflaschen Synchronized php-1.26wmf5/includes/page/WikiPage.php: Job queue changes for triggerOpportunisticLinksUpdate (duration: 00m 12s)
* 00:18 logmsgbot: mattflaschen Synchronized php-1.26wmf5/includes/jobqueue/: Job queue changes for triggerOpportunisticLinksUpdate (duration: 00m 12s)
* 00:17 logmsgbot: mattflaschen Synchronized php-1.26wmf4/includes/jobqueue/: Job queue changes for triggerOpportunisticLinksUpdate (duration: 00m 13s)
* 00:15 yuvipanda: restarted apache on silver
* 00:01 logmsgbot: mattflaschen Synchronized php-1.26wmf4/includes/page/WikiPage.php: Job queue changes for triggerOpportunisticLinksUpdate (duration: 00m 11s)
* 00:00 logmsgbot: mattflaschen Synchronized php-1.26wmf4/includes/jobqueue/: Job queue changes for triggerOpportunisticLinksUpdate (duration: 00m 12s)
 
== May 11 ==
* 23:46 logmsgbot: mattflaschen Synchronized wmf-config: Sync wmf-config for CirrusSearch PoolCounter change; applies to group 0 initially (duration: 00m 12s)
* 23:37 logmsgbot: kaldari Synchronized wmf-config/InitialiseSettings-labs.php: sync InitialiseSettings-labs.php for Browse experiment in mobile (duration: 00m 13s)
* 23:34 logmsgbot: mattflaschen Synchronized php-1.26wmf5/extensions/Flow/: Deploy Flow metadataonly fix (duration: 00m 14s)
* 23:32 yuvipanda: andrewbogott_afk playing around with upgrading virt*** boxes, which are non-live labs boxen.
* 23:31 logmsgbot: mattflaschen Synchronized php-1.26wmf4/extensions/Flow/: Deploy Flow metadataonly fix (duration: 00m 13s)
* 23:17 logmsgbot: mattflaschen Synchronized wmf-config/CommonSettings.php: Make VE default editor for Flow (duration: 00m 13s)
* 23:13 legoktm: manually renamed and migrated User:~~@nlwiki --> User:~~-~nlwiki@global (T98155)
* 22:55 logmsgbot: ori Synchronized php-1.26wmf4/extensions/Josa: dd2db67d9b: Update Josa for cherry-picks (duration: 00m 13s)
* 22:54 logmsgbot: ori Synchronized php-1.26wmf5/extensions/Josa: a0b561da25: Update Josa for cherry-picks (duration: 00m 11s)
* 22:05 twentyafterfour: removed /var/run/phab_repo_lock_libext_Sprint on iridium to allow sprint repo sync
* 22:01 logmsgbot: bd808 Synchronized wmf-config/InitialiseSettings-labs.php: Add common wikitag for all beta cluster wikis (duration: 00m 12s)
* 21:54 ori: Restarting HHVM on mw1036; threads stuck on HPHP::StatCache::refresh
* 21:48 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: I45c1c76d4: Deploy Josa extension to production (enabling) (duration: 00m 13s)
* 21:47 logmsgbot: ori Finished scap: I45c1c76d4: Deploy Josa extension to production (but not enabling yet) (duration: 46m 54s)
* 21:43 ori: Restarting HHVM on mw1110; threads stuck on HPHP::StatCache::refresh
* 21:00 logmsgbot: ori Started scap: I45c1c76d4: Deploy Josa extension to production (but not enabling yet)
* 20:49 hoo: Resolved T98695 by setting the email of the global account to the former enwiki email address.
* 19:37 hoo: Updated Wikidata's property suggester with data from today's json dump
* 18:49 legoktm: renamed a bunch more invalid usernames (https://phabricator.wikimedia.org/T5507)
* 18:41 ori: Deployed I4e3f42ea7, which increases jobrunner::runners_basic from 14 -> 20
* 18:41 logmsgbot: yurik Synchronized wmf-config: patch 210111 - Cleaned Graph, enabled wmgGraphImgServiceAlways (duration: 00m 13s)
* 18:15 logmsgbot: yurik Synchronized php-1.26wmf4/extensions/Graph: Bump Graph to master (duration: 00m 11s)
* 18:14 logmsgbot: yurik Synchronized php-1.26wmf5/extensions/Graph: Bump Graph to master (duration: 00m 14s)
* 17:16 logmsgbot: manybubbles Finished scap: SWAT js config vargs changes (duration: 14m 55s)
* 17:01 logmsgbot: manybubbles Started scap: SWAT js config vargs changes
* 17:01 logmsgbot: manybubbles scap aborted: SWAT js config vargs changes (duration: 27m 58s)
* 16:33 logmsgbot: manybubbles Started scap: SWAT js config vargs changes
* 15:59 manybubbles: waiting a few minutes after that last set of patches before we're sure that the load is down and then, hopefully, we'll scap to get the core changes that are already merged and sitting on tin that we had to ignore while we handled the trafic spike.
* 15:53 logmsgbot: manybubbles Synchronized php-1.26wmf4/includes/media/DjVu.php: SWAT: 10 mb djvu files are expensive to thumbnail (wmf4) (duration: 00m 13s)
* 15:52 logmsgbot: manybubbles Synchronized php-1.26wmf5/includes/media/DjVu.php: SWAT: 10 mb djvu files are expensive to thumbnail (wmf5) (duration: 00m 11s)
* 15:33 manybubbles: stopping SWAT due to some incident that just picked up. Right now Ib990f00ebe974008cea4dccbaa212ec20c846674 and Ida3fd5f8808202892001f66c4a534c1725e769a6 are merged awaiting a scap.
* 15:26 logmsgbot: manybubbles Synchronized wmf-config/CommonSettings.php: SWAT cleanup wgGraphImgServiceAlways 3/3 (duration: 00m 12s)
* 15:26 logmsgbot: manybubbles Synchronized wmf-config/InitialiseSettings.php: SWAT cleanup wgGraphImgServiceAlways 2/3 (duration: 00m 12s)
* 15:25 logmsgbot: manybubbles Synchronized wmf-config/InitialiseSettings-labs.php: SWAT cleanup wgGraphImgServiceAlways 1/3 (duration: 00m 12s)
* 15:05 logmsgbot: manybubbles Synchronized wmf-config/InitialiseSettings.php: SWAT: send all mediawiki events from all wikis to logstash (duration: 00m 12s)
* 15:03 logmsgbot: manybubbles Synchronized wmf-config/InitialiseSettings-labs.php: SWAT: enable graph extension in beta. this should be a noop (duration: 00m 13s)
* 14:01 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable arbitrary Wikibase access for nlwiki and frwikisource (duration: 00m 16s)
* 13:49 logmsgbot: aude Synchronized php-1.26wmf4/extensions/Wikidata: Fix interaction with AbuseFilter (duration: 00m 20s)
* 13:46 logmsgbot: aude Synchronized php-1.26wmf5/extensions/Wikidata: Fix interaction with AbuseFilter (duration: 00m 19s)
* 05:10 ori: upgrading canary appservers to 3.6.1+dfsg1-1+wm2
* 04:55 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon May 11 04:53:58 UTC 2015 (duration 53m 57s)
* 04:17 springle: restarted hhvm on mw1020. lots of fatal noise about N4HPHP13DataBlockFullE
* 02:43 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-11 02:42:42+00:00
* 02:39 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 37s)
* 02:23 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-11 02:22:25+00:00
* 02:18 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 19s)
 
== May 10 ==
* 17:45 ori: App server traffic coincides with spike on S4 dbs, lots of commons sleeper queries, fatal log contains many references to User:Richenza/gallery, so nuking.
* 17:20 ori: Inbound app server traffic more than doubled over the past 12 hrs: http://ganglia.wikimedia.org/latest/graph.php?r=week&z=xlarge&c=Application+servers+eqiad&m=cpu_report&s=by+name&mc=2&g=network_report
* 05:17 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun May 10 05:16:10 UTC 2015 (duration 16m 9s)
* 02:45 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-10 02:44:48+00:00
* 02:41 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 26s)
* 02:25 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-10 02:24:40+00:00
* 02:20 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 16s)
 
== May 9 ==
* 20:55 logmsgbot: krenair Synchronized php-1.26wmf4/extensions/VisualEditor/modules/ve-mw/ui/tools/ve.ui.MWEditModeTool.js: https://gerrit.wikimedia.org/r/#/c/209950/ (duration: 00m 12s)
* 20:53 logmsgbot: krenair Synchronized php-1.26wmf5/extensions/VisualEditor/modules/ve-mw/ui/tools/ve.ui.MWEditModeTool.js: https://gerrit.wikimedia.org/r/#/c/209949/ (duration: 00m 11s)
* 05:06 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat May  9 05:05:16 UTC 2015 (duration 5m 15s)
* 02:44 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-09 02:43:07+00:00
* 02:39 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 21s)
* 02:24 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-09 02:23:15+00:00
* 02:19 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 11s)
 
== May 8 ==
* 23:45 logmsgbot: bd808 Synchronized wmf-config/CommonSettings.php: beta: switch $wmfUdp2logDest to deployment-fluorine.eqiad.wmflabs (duration: 00m 12s)
* 22:11 mutante: gzipping some user data on lutetium
* 21:17 logmsgbot: yurik Synchronized wmf-config/CommonSettings.php: Disable security header for Graphs on zerowiki (duration: 00m 12s)
* 21:14 logmsgbot: yurik Synchronized wmf-config/InitialiseSettings.php: Disable security header for Graphs on zerowiki (duration: 00m 12s)
* 21:02 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings-labs.php: Sync out change that only affects Beta Cluster (duration: 00m 11s)
* 19:18 logmsgbot: yurik Synchronized php-1.26wmf4/extensions/CentralAuth: Bumping CentralAuth (duration: 00m 13s)
* 19:18 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I6236f5e2c: Use $wgServer to construct static-asset URLs (duration: 00m 12s)
* 19:12 logmsgbot: yurik Synchronized php-1.26wmf5/extensions/CentralAuth: Bumping CentralAuth (duration: 00m 12s)
* 18:42 csteipp: deployed patch for T98313 for wmf4/5
* 18:14 logmsgbot: yurik Synchronized php-1.26wmf4/extensions/Graph/: Bumping graph (duration: 00m 14s)
* 18:14 logmsgbot: yurik Synchronized php-1.26wmf5/extensions/Graph/: Bumping graph (duration: 00m 14s)
* 16:53 logmsgbot: bd808 Synchronized w/static/images/project-logos/labswiki.png: Add missing labswiki.png (duration: 00m 13s)
* 15:37 Krenair: restarted apache on silver -again- to deal with reports of session errors
* 15:28 greg-g: wikitech's session data errors are transient, hitting save multiple times will eventually work
* 15:26 greg-g: multiple independent reports of wikitech wiki having session data errors
* 14:13 logmsgbot: bblack Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 13s)
* 13:17 logmsgbot: faidon Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 12s)
* 13:17 logmsgbot: faidon Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 14s)
* 13:14 logmsgbot: faidon Synchronized wmf-config/InitialiseSettings.php: revert bits.wm.org change (duration: 00m 12s)
* 13:14 logmsgbot: faidon Synchronized wmf-config/CommonSettings.php: revert bits.wm.org change (duration: 00m 12s)
* 13:03 logmsgbot: faidon Synchronized wmf-config/CommonSettings.php: Switch assets back to bits.wikimedia.org (duration: 00m 15s)
* 13:03 logmsgbot: faidon Synchronized wmf-config/InitialiseSettings.php: Switch assets back to bits.wikimedia.org (duration: 00m 14s)
* 11:49 godog: deploy librenms 2fa805ff
* 09:39 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-kaz_0.1.0~r60155-1
* 09:39 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-dan-nor_1.0.0~r48173-1
* 05:14 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri May  8 05:13:23 UTC 2015 (duration 13m 22s)
* 04:16 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: I4c70ce4d0: Fix wikiname: roa-rupwiki -> roa_rupwiki (duration: 00m 12s)
* 03:33 logmsgbot: legoktm Synchronized w/static/images/project-logos/wikimania2015wiki.png: Use png for wikimania2015wiki logo (duration: 00m 12s)
* 02:49 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-08 02:48:15+00:00
* 02:45 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 05m 47s)
* 02:29 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-08 02:28:07+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 06s)
* 00:00 logmsgbot: rmoen Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 11s)
 
== May 7 ==
* 23:54 logmsgbot: rmoen Synchronized php-1.26wmf4/extensions/VisualEditor/: Update VE with Cherry-picks (duration: 00m 12s)
* 23:51 logmsgbot: rmoen Synchronized php-1.26wmf5/extensions/VisualEditor/: Update VE for cherry-picks (duration: 00m 11s)
* 23:41 logmsgbot: rmoen Synchronized php-1.26wmf4/extensions/Flow/: Bump flow with cherry-picks (duration: 00m 13s)
* 23:39 logmsgbot: rmoen Synchronized php-1.26wmf5/extensions/Flow: Bump Flow with cherry-picks (duration: 00m 14s)
* 23:31 logmsgbot: rmoen Synchronized php-1.26wmf4/extensions/Gather: Update Gather with cherry-picks (duration: 00m 14s)
* 23:20 logmsgbot: rmoen Synchronized php-1.26wmf5/extensions/Gather/: Update Gather with Cherry-picks (duration: 00m 15s)
* 22:58 andrewbogott: restarting all instances on labvirt1008, crossing fingers
* 22:38 andrewbogott: rebooting labvirt1008, running dist-upgrade, rebooting again
* 21:29 awight: updated payments from 3ab89e2b14eb449f7ceddf2325493d6235395ecd to f97f8f99268974cfdb0182f178955bd627137842
* 21:25 gwicke: deployed RESTBase 6043e3ada (v0.6.2)
* 21:01 apergos: dumps are interrupted on snapshot1004 while I do a manual run for testing/debugging purposes. please let it run and don't start any other processes on the box, thanks
* 20:53 bd808: Updated kibana to bb9fcf6 (Merge remote-tracking branch 'upstream/kibana3')
* 20:36 legoktm: renaming users with invalid usernames (https://phabricator.wikimedia.org/T5507)
* 20:18 logmsgbot: ori Synchronized wmf-config: I3846e34ed, I1fcb3f17d, I8c9a6a567, I1a73c83f7, and Iacbd92931: serve optimized, cacheable logos from /static (duration: 00m 19s)
* 20:14 bd808: updated scap to 5d681af (Better handling for php lint checks)
* 20:14 bd808: Trebuchet checkout failed for scap/scap on mw1222.eqiad.wmnet, mw1113.eqiad.wmnet, mw1104.eqiad.wmnet
* 20:13 bd808: Trebuchet fetch for scap/scap failed on mw1222.eqiad.wmnet
* 19:17 logmsgbot: legoktm Synchronized php-1.26wmf4/extensions/CentralAuth/: https://gerrit.wikimedia.org/r/209538 and https://gerrit.wikimedia.org/r/209539 (duration: 00m 16s)
* 19:16 logmsgbot: legoktm Synchronized php-1.26wmf5/extensions/CentralAuth/: https://gerrit.wikimedia.org/r/209538 and https://gerrit.wikimedia.org/r/209539 (duration: 00m 16s)
* 16:56 bd808: sync-common on snapshot1004 finished in 12:36
* 16:49 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: Enable shortURL on saprojects [[gerrit:201216]] (duration: 00m 14s)
* 16:43 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: Enable ShortUrl on newiki [[gerrit:206736]] (duration: 00m 21s)
* 16:37 bd808: Running sync-common manually on snapshot1004.eqiad.wmnet
* 16:36 thcipriani: create shorturl table in sawiki, sawikisource, sawikiquote, sawiktionary, sawikibooks
* 16:36 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 16m 21s)
* 16:23 thcipriani: populateShortUrlTable on newiki
* 16:20 thcipriani: creating newiki shorturl table
* 16:19 logmsgbot: kartik Started scap: Update ContentTranslation
* 15:48 logmsgbot: thcipriani Synchronized php-1.26wmf4/extensions/CentralAuth/includes/LocalRenameJob/LocalRenameUserJob.php: Update CentralAuth [[gerrit:209493]] (duration: 00m 21s)
* 15:34 logmsgbot: thcipriani Synchronized php-1.26wmf5/extensions/CentralAuth/includes/LocalRenameJob/LocalRenameUserJob.php: Update CentralAuth [[gerrit:209492]] (duration: 00m 17s)
* 15:27 springle: db connection EINTR noise in logs, see T98489
* 15:16 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: CX enable content translations [[gerrit:209207]] (duration: 00m 12s)
* 14:39 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1019 (duration: 00m 14s)
* 13:55 moritzm: uploaded to apt.wikimedia.org jessie-wikimedia: linux-meta_1.1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-tat_0.1.0~r57462-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-pt-gl_0.9.2~r57551-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-oc-es_1.0.6~r60161-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-oc-ca_1.0.6~r60158-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-fr-es_0.9.2~r27040-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-eus_0.1.0-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-eu-es_0.3.3~r56159-1
* 13:02 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-eu-en_0.3.1~r60155-1
* 13:01 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-es-gl_1.0.8~r57542-1
* 13:01 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-es-ast_1.1.0~r60158-1
* 13:01 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-es-an_0.3.0~r60158-1
* 13:01 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-en-gl_0.5.2~r57551-1
* 13:01 akosiaris: uploaded to apt.wikimedia.org jessie-wikimedia: apertium-dan_0.1.0-1
* 12:30 bblack: rebooting cp1070
* 12:26 godog: bounce uwsgi on graphite1001
* 12:25 godog: bounce uwsgi on graphite1001
* 10:26 godog: bounce uwsgi on graphite1001
* 10:01 mark: Decreased labstore1001 md125 sync_speed_min from 80000 to 40000
* 09:35 mark: Increased /sys/block/md125/md/sync_speed_min from 4000 to 40000
* 09:29 mark: Increased /sys/block/md125/md/sync_speed_min from 1000 to 4000
* 05:40 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu May  7 05:39:36 UTC 2015 (duration 39m 35s)
* 03:03 logmsgbot: LocalisationUpdate completed (1.26wmf5) at 2015-05-07 03:02:50+00:00
* 02:59 logmsgbot: l10nupdate Synchronized php-1.26wmf5/cache/l10n: (no message) (duration: 08m 35s)
* 02:36 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-07 02:35:43+00:00
* 02:35 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1054 in s2, warm up (duration: 01m 09s)
* 02:29 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 09m 27s)
* 02:14 logmsgbot: krenair Synchronized wmf-config: update interwiki.cdb, T98429 (duration: 00m 24s)
* 01:50 bblack: we're still hitting cap on Zayo as of shortly-ago in graphs and seeing smokeping loss, moved california to eqiad
* 00:13 mutante: running refreshLinks.php for s2
* 00:11 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/MobileFrontend/: SWAT (duration: 00m 42s)
* 00:11 gwicke: deployed RESTBase 8865b9c48
 
== May 6 ==
* 23:43 logmsgbot: catrope Synchronized php-1.26wmf5/extensions/VisualEditor: SWAT (duration: 00m 18s)
* 23:43 logmsgbot: catrope Synchronized php-1.26wmf5/extensions/MobileFrontend: SWAT (duration: 00m 34s)
* 23:19 RoanKattouw: Running populateShortUrl.phg on knwiki
* 23:16 RoanKattouw: Running namespaceDupes.php on tewikiquote
* 23:15 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: SWAT (duration: 00m 17s)
* 23:12 RoanKattouw: Created shorturls table on knwiki
* 20:39 logmsgbot: twentyafterfour Purged l10n cache for 1.26wmf3
* 20:37 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf5
* 20:32 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedias to 1.26wmf4
* 20:29 apergos: salt upgraded to 2014.7.5 on all precise/trusty/jessie hosts in production except for: labcontrol2001, tin, virt1000 (deferred) and dysprosium/labvirt1005/labstore1002 (down)
* 20:15 logmsgbot: twentyafterfour Synchronized php-1.26wmf5/extensions/MobileFrontend/javascripts/modules/search/init.js: Temporarily disable MobileWebSearch logging (duration: 00m 36s)
* 20:14 twentyafterfour: ignore all rumors of scap failures, the scaps were successful, with the exception of snapshot1004.eqiad.wmnet which hangs every time
* 20:14 logmsgbot: twentyafterfour Synchronized php-1.26wmf4/extensions/MobileFrontend/javascripts/modules/search/init.js: Temporarily disable MobileWebSearch logging (duration: 00m 37s)
* 20:12 logmsgbot: twentyafterfour scap failed: OSError [Errno 2] No such file or directory: '/var/lock/scap' (duration: 27m 49s)
* 19:44 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf5 and rebuild l10n cache
* 18:39 mutante: restarting apache on rhodium
* 18:34 bblack: rebooting cp3030
* 18:14 andrewbogott: restarted gmetad on uranium
* 17:41 andrewbogott: powering down virt1005 and virt1006
* 17:38 andrewbogott: depuppeting and decommissioning virt1005 and virt1006
* 17:24 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on enwikivoyage, fawiki and hewiki (duration: 00m 18s)
* 17:03 jgage: hadoop active namenode switched back to analytics1001 after rack C4 switch replacement
* 16:43 apergos: done with all trusty salt updates in pro except for labcontrol1002 (?), doing jessie now in very tiny batches, it's being trouble
* 15:29 bd808: Stashed uncommitted change to scap on tin that disabled php opening tag check for sync-file
* 15:27 bd808: Updated scap to 57036d2 (Update statsd events)
* 15:27 bd808: trebuchet checkout for scap/scap failed for mw1113.eqiad.wmnet, mw1222.eqiad.wmnet, mw1104.eqiad.wmnet
* 15:25 bd808: trebuchet fetch for scap/scap failed on mw1222.eqiad.wmnet
* 15:04 logmsgbot: bd808 Synchronized wmf-config/InitialiseSettings.php: Send group0 + group1 MediaWiki events to logstash {{gerrit|209170}} (duration: 00m 16s)
* 14:32 cmjohnson1: shutting down db1054 for maintenance
* 14:22 _joe_: depooling the HHVM imagescaler
* 14:20 Nemo_bis: phabricator went down again for some minutes, seems ok now?
* 14:17 _joe_: pooling the HHVM imagescalers to test if the issues are solved now.
* 14:15 andrewbogott: rebooting labvirt1009 one last time
* 13:53 _joe_: upgrading the hhvm imagescaler (mw1152) to HHVM 3.6.1
* 13:47 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1021 in s2, warm up (duration: 00m 27s)
* 13:42 apergos: all precise hosts are upgraded to salt except for tin and virt1000; in the middle of trusty updates now, in batches
* 13:38 _joe_: uploading HHVM 3.6.1 and all the related extensions to apt.wikimedia.org
* 13:01 paravoid: replacing asw-c4-eqiad (T93730)
* 12:45 logmsgbot: krenair Synchronized php-1.26wmf4/extensions/SemanticMediaWiki/specials/QueryPages/SMW_QueryPage.php: https://gerrit.wikimedia.org/r/#/c/209212/ (duration: 00m 21s)
* 08:12 logmsgbot: legoktm Synchronized wmf-config/CommonSettings-labs.php: no-op (duration: 00m 24s)
* 07:20 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I019944f42: Change EventLogging endpoint to /beacon/event (duration: 00m 14s)
* 06:51 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed May  6 06:50:27 UTC 2015 (duration 50m 26s)
* 03:14 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-06 03:13:28+00:00
* 03:09 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 08m 46s)
* 02:46 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-05-06 02:45:26+00:00
* 02:36 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 10m 46s)
* 02:27 springle: xtrabackup clone db1060 to db1021
* 02:04 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I83ad6d060: Remove wmgUseBits setting, now that the migration is complete (duration: 00m 18s)
* 02:02 logmsgbot: aude Synchronized php-1.26wmf4/extensions/Wikidata: Fix Wikibase api error output bug - update submoduled (duration: 00m 28s)
* 01:59 logmsgbot: aude Synchronized php-1.26wmf4/extensions/Wikidata: Fix Wikibase api error output bug (duration: 01m 08s)
* 01:52 logmsgbot: ori Synchronized multiversion/MWWikiversions.php: Ib08e36901: MWWikiversions::readDbListFile: allow single-line ("#" or "//") comments (duration: 00m 18s)
* 01:40 springle: upgrade db1021 trusty
* 00:51 springle: schema change running T95179 wikidata, bit unusual, dropping a not-null field
* 00:46 logmsgbot: bd808 Synchronized wmf-config/CommonSettings.php: Add AffCom user group application contact page on meta {{gerrit|207332}} (duration: 00m 20s)
* 00:45 logmsgbot: bd808 Synchronized docroot/noc/createTxtFileSymlinks.sh: Add AffCom user group application contact page on meta {{gerrit|207332}} (duration: 00m 17s)
* 00:45 logmsgbot: bd808 Synchronized docroot/noc/conf/AffComContactPages.php.txt: Add AffCom user group application contact page on meta {{gerrit|207332}} (duration: 00m 15s)
* 00:44 logmsgbot: bd808 Synchronized wmf-config/AffComContactPages.php: Add AffCom user group application contact page on meta {{gerrit|207332}} (duration: 00m 33s)
* 00:15 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/Flow: SWAT (duration: 00m 23s)
* 00:15 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/WikiEditor: SWAT (duration: 00m 33s)
* 00:13 bd808: Aborted sync-common on snapshot1004; host is starved for RAM and using swap heavily
* 00:06 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/CirrusSearch: SWAT (duration: 00m 28s)
* 00:06 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/Flow: SWAT (duration: 00m 52s)
* 00:04 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/WikiEditor: SWAT (duration: 00m 42s)
 
== May 5 ==
* 23:57 bd808: aborted and restarted sync-common on snapshot1004.eqiad.wmnet manually after waiting 24 minutes with no progress
* 23:49 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Use Wiki.svg for wikimania2015wiki logo (duration: 00m 19s)
* 23:47 jgage: switched hadoop active namenode from analytics1001 to analytics1002 for rack C4 switch replacement tomorrow morning (T93730)
* 23:39 logmsgbot: rmoen Finished scap: Updates for Gather and MobileFrontend (duration: 41m 11s)
* 23:33 bd808: running sync-common on snapshot1004.eqiad.wmnet manually after it was aborted in scap by rmoen
* 23:30 bd808: snapshot1004.eqiad.wmnet hanging scap yet again
* 23:23 mutante: deleted 8G recurring_blocked.tsv from lutetium
* 22:58 logmsgbot: rmoen Started scap: Updates for Gather and MobileFrontend
* 22:54 logmsgbot: rmoen Synchronized php-1.26wmf3/extensions/Gather/: Update Gather to master (duration: 00m 36s)
* 22:53 logmsgbot: rmoen Synchronized php-1.26wmf3/extensions/MobileFrontend/: Update MobileFrontend (duration: 00m 31s)
* 22:52 logmsgbot: rmoen Synchronized php-1.26wmf4/extensions/Gather/: Update Gather to master (duration: 00m 25s)
* 22:52 mutante: gzip lutetium-slow.log on lutetium to save disk space
* 22:52 logmsgbot: rmoen Synchronized php-1.26wmf4/extensions/MobileFrontend/: Update MobileFrontend (duration: 00m 39s)
* 22:23 mutante: apt-get clean on lutetium to free disk space
* 19:53 twentyafterfour: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group1 wikis to 1.26wmf4 (actual time 18:12 UTC)
* 19:44 logmsgbot: aude Synchronized php-1.26wmf4/extensions/Wikidata: Fix usage tracking issue on Wikidata - with submodule update (duration: 00m 33s)
* 19:41 logmsgbot: aude Synchronized php-1.26wmf4/extensions/Wikidata: Fix usage tracking issue on Wikidata (duration: 00m 40s)
* 19:35 bblack: rebooting cp3030 ...
* 19:23 yuvipanda: disabled puppet on zookeeper hosts
* 18:49 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I5978a3910: Update $wgULSFontRepositoryBasePath for post-bits world (duration: 00m 18s)
* 18:43 logmsgbot: ori Synchronized wmf-config: Ia98fc4c5d: wmgUseBits: false for enwiki (duration: 00m 17s)
* 18:33 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: I2ee277293: wmgUseBits: false for all but enwiki (duration: 00m 13s)
* 17:50 logmsgbot: yurik Synchronized wmf-config/InitialiseSettings.php: Enable graph extension on all wikis except wikidata (duration: 00m 19s)
* 17:43 logmsgbot: yurik Synchronized php-1.26wmf3/extensions/Graph: Cherrypicked Graph ext 209004 (duration: 00m 16s)
* 17:42 logmsgbot: yurik Synchronized php-1.26wmf4/extensions/Graph: Cherrypicked Graph ext 209004 (duration: 00m 20s)
* 17:00 logmsgbot: yurik Synchronized wmf-config/CommonSettings.php: Enable graphoid noscript fallback for graph ext (duration: 00m 20s)
* 16:50 yurik_: deployed latest graphoid 0.1.3 service
* 15:16 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: Add medialib.naturalis.nl to wgCopyUploadsDomains [[gerrit:208634]] (duration: 00m 26s)
* 14:07 godog: shut fluorine to replace sdb
* 13:13 akosiaris: restarted apache2 on palladium
* 13:04 Tim: updating voter list for the FDC election for T97924
* 08:47 paravoid: repooling ulsfo
* 07:59 godog: test reboot fluorine with new disk
* 05:51 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue May  5 05:50:01 UTC 2015 (duration 50m 0s)
* 05:07 logmsgbot: tstarling Synchronized php-1.26wmf3/extensions/SecurePoll/cli/wm-scripts/bv2015/voterList.php: (no message) (duration: 00m 16s)
* 04:43 logmsgbot: tstarling Synchronized php-1.26wmf3/extensions/SecurePoll/cli/wm-scripts/bv2015/voterList.php: (no message) (duration: 00m 19s)
* 02:59 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-05 02:57:54+00:00
* 02:54 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 07m 06s)
* 02:31 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-05-05 02:30:45+00:00
* 02:26 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 08m 20s)
* 01:41 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1021, move s5 api to db1049 (duration: 00m 15s)
* 01:20 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1070, warm up (duration: 00m 19s)
* 00:32 yuvipanda: restarted hhvm on mw1197
* 00:24 logmsgbot: aude Synchronized wmf-config/Wikibase.php: Enable Wikibase subscription tracking (duration: 00m 12s)
 
== May 4 ==
* 23:59 logmsgbot: catrope Finished scap: (no message) (duration: 24m 34s)
* 23:34 logmsgbot: catrope Started scap: (no message)
* 23:15 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/MassMessage/: SWAT (duration: 00m 12s)
* 23:14 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/MassMessage/: SWAT (duration: 00m 12s)
* 23:14 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/VisualEditor/: SWAT (duration: 00m 12s)
* 23:13 logmsgbot: catrope Synchronized php-1.26wmf4/includes/skins/SkinTemplate.php: SWAT (duration: 00m 11s)
* 22:37 Krenair: silver: apache2ctl restart for T98084
* 22:26 Tim: on terbium: running voterList.php again, with corrected edit counts
* 21:55 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: Id56e33263: wmgUseBits: false for ru and eswiki (duration: 00m 12s)
* 21:40 logmsgbot: bd808 Finished scap: Update 1.26wmf4 ContactPage and WikimediaMessages for AffCom contact form (duration: 22m 11s)
* 21:34 paravoid: cr{1,2}-{eqiad,ulsfo}: swapping metrics for ulsfo's transport links
* 21:18 logmsgbot: bd808 Started scap: Update 1.26wmf4 ContactPage and WikimediaMessages for AffCom contact form
* 21:03 Coren: checking raid consistency from labstore1002
* 21:03 ottomata: rebooting analytics1037
* 20:27 Coren: Starting NFS server switch - graceful labstore1001 shutdown.
* 20:11 gwicke: deployed restbase v0.6.0 / 76583a07
* 19:56 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: I62dffd271: wmgUseBits: false for nl and dewiki (duration: 00m 11s)
* 19:24 logmsgbot: ori Synchronized w/5xx.php: (no message) (duration: 00m 14s)
* 19:12 awight: update crm from 514e7ea41acd14e1565b31b76621ea840d209e07 to 2a2336655737a2cd1d3cc24624d1e8475e4cf039
* 19:12 logmsgbot: ori Synchronized multiversion: I2d93ede75: Remove FormatJson from mediawiki-config (duration: 00m 13s)
* 18:51 logmsgbot: ori Synchronized multiversion/FormatJson.php: Ice8f1796c: Update FormatJson to 532337e6ff from mediawiki/core (duration: 00m 12s)
* 18:44 cscott: updated Parsoid to version b53a7272
* 18:26 logmsgbot: ori Synchronized wmf-config: I81df3a614, I02b06f8e2, I366561a0f: Use MWWikiversions::readDbListFile to read dblist files; Allow computed dblist expressions; Add group1.dblist (duration: 00m 14s)
* 17:53 legoktm: running delete-wmf-tags (https://phabricator.wikimedia.org/P531) on all extension repos
* 16:58 andrewbogott: reimaging/renaming virt1011 -> labvirt1007
* 15:40 logmsgbot: thcipriani Synchronized php-1.26wmf4/extensions/ContentTranslation: Update ContentTranslation to 0bd91b6 [[gerrit:208607]] (duration: 00m 30s)
* 15:32 logmsgbot: thcipriani Synchronized php-1.26wmf3/extensions/ContentTranslation: Sync-dir for ContentTranslation to 6f81619 [[gerrit:208605]] (duration: 00m 18s)
* 15:23 logmsgbot: thcipriani Synchronized php-1.26wmf3/extensions/ContentTranslation/modules/tools/ext.cx.tools.formatter.js: Update ContentTranslation to 6f81619 [[gerrit:208605]] (duration: 00m 25s)
* 15:17 ottomata: starting upgrade of Analytics Cluster to CDH 5.4: https://phabricator.wikimedia.org/T97453
* 15:05 andrewbogott: halting virt1011 pending its rename to labvirt1007
* 14:51 godog: halt fluorine to fix console and swap sda
* 14:50 paravoid: draining ulsfo, network troubles (internal network packet loss)
* 13:49 paravoid: draining all traffic from the Giglinx/Zayo link to ulsfo
* 05:56 Tim: on terbium: running populateEditCount-fixup.php on all wikis
* 05:53 logmsgbot: tstarling Synchronized php-1.26wmf4/extensions/SecurePoll: Iae874c0403a8362929362ca645f4aca18feb0269 (duration: 00m 19s)
* 05:52 logmsgbot: tstarling Synchronized php-1.26wmf3/extensions/SecurePoll: Iae874c0403a8362929362ca645f4aca18feb0269 (duration: 00m 22s)
* 05:36 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon May  4 05:35:29 UTC 2015 (duration 35m 28s)
* 02:49 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-04 02:48:16+00:00
* 02:44 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 07m 33s)
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-05-04 02:26:00+00:00
* 02:22 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 08m 58s)
* 01:13 bd808: Started logstash cluster relocating indices off of logstash100[1-3] to logstash100[4-6]
 
== May 3 ==
* 19:28 yuvipanda:  chown www-data: /var/log/mediawiki/refreshLinks/s3@3.log and s2@2.log for Reedy
* 16:23 logmsgbot: hoo Synchronized wmf-config/: Re-enable global renames (duration: 00m 12s)
* 15:17 _joe_: restarted jobchron, not jobcron, this time for real
* 14:37 bblack: dewiki jobqueue:*:rootjob wipe complete
* 14:37 bblack: enwiki + commonswiki jobqueue:*:rootjob wipe complete
* 14:19 bblack: deleting :rootjob: entries for enwiki from redis too
* 14:16 bblack: deleting :rootjob: entries for commonswiki from redis
* 13:54 _joe_: restarting jobcron on the jobrunners
* 13:27 logmsgbot: hoo Synchronized wmf-config/: Temporary disable global renames (duration: 00m 16s)
* 12:47 _joe_: restarting redis server on rdb1001, lagging on the most basic queries
* 12:38 _joe_: deploying I969fe8d329c1bcbb919a54cb225200ba0e006a03 to the jobrunners trying to make them work again
* 05:14 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun May  3 05:13:13 UTC 2015 (duration 13m 12s)
* 04:28 springle: xtrabackup clone db1049 to db1070
* 04:01 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: depool db1070 (duration: 00m 16s)
* 02:48 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-03 02:47:30+00:00
* 02:47 logmsgbot: springle Synchronized wmf-config/db-eqiad.php: repool db1068, warm up (duration: 00m 15s)
* 02:44 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 07m 11s)
* 02:27 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-05-03 02:26:02+00:00
* 02:22 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 08m 11s)
 
== May 2 ==
* 22:16 ori: Deployed change I3bc87f3a5 to fix UBN! bug T97912. Bug was affecting ability to translate messages needed for running upcoming board election.
* 22:16 logmsgbot: ori Synchronized php-1.26wmf4/extensions/Translate/api/ApiQueryMessageGroups.php: I3bc87f3a5: ApiQueryMessageGroups: mark '_canchange' and '_name' as non-API-metadata (duration: 00m 30s)
* 22:09 logmsgbot: ori Synchronized php-1.26wmf3/extensions/Translate/api/ApiQueryMessageGroups.php: I3bc87f3a5: ApiQueryMessageGroups: mark '_canchange' and '_name' as non-API-metadata (duration: 00m 31s)
* 20:25 windowcat: Updated jobrunners to c95d565e242e6fa3706c088ddab1cc6f716408e1
* 19:31 springle: xtrabackup clone db2048, db2049, db2050, db2051, db2052, db2053, db2054 from codfw masters
* 19:09 springle: upgrade db1068 trusty, xtrabackup clone from db1056
* 19:02 ottomata: resinstalling analytics1004 and analytics1010 as trusty
* 06:08 yuvipanda: signed puppet certs manually on virt1000
* 05:19 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat May  2 05:18:29 UTC 2015 (duration 18m 28s)
* 03:24 ori: Granted self admin rights on metawiki temporarily to debug a CentralNotice issue.
* 02:53 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-02 02:52:36+00:00
* 02:48 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 07m 01s)
* 02:32 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/WikiEditor: Fix data gathering bug (duration: 00m 25s)
* 02:32 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-05-02 02:31:00+00:00
* 02:27 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 08m 11s)
* 02:15 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/WikiEditor: Fix data gathering bug (duration: 00m 15s)
* 00:02 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 16s)
 
== May 1 ==
* 23:53 logmsgbot: aaron Synchronized php-1.26wmf4/includes/media/DjVu.php: caa2efc0e76c2ba849d465006600d131dc2f78b5 (duration: 00m 21s)
* 23:52 logmsgbot: aaron Synchronized php-1.26wmf3/includes/media/DjVu.php: 6cdb23c5d662151a2b578c2acc8823bc975fc22a (duration: 00m 15s)
* 23:40 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: I02e28db61: Update apple-touch to use static (duration: 00m 23s)
* 21:08 matt_flaschen: Ran FlowUpdateWorkflowPageId.php for all production Flow wikis for https://phabricator.wikimedia.org/T96888
* 20:37 logmsgbot: andyrussg Synchronized php-1.26wmf4/extensions/EducationProgram/: Update EducationProgram (duration: 00m 21s)
* 20:01 logmsgbot: demon Synchronized wmf-config/CommonSettings.php: less realm stuff (duration: 00m 17s)
* 20:00 logmsgbot: andyrussg Synchronized php-1.26wmf3/extensions/EducationProgram/: Update EducatiDonProgram (duration: 00m 30s)
* 18:54 logmsgbot: legoktm Synchronized wikiversions-labs.json: https://gerrit.wikimedia.org/r/#/c/208170/ no-op (duration: 00m 25s)
* 18:53 logmsgbot: legoktm Synchronized all-labs.dblist: https://gerrit.wikimedia.org/r/#/c/208170/ no-op (duration: 00m 18s)
* 18:11 logmsgbot: legoktm Synchronized all-labs.dblist: https://gerrit.wikimedia.org/r/208154 - no-op (duration: 00m 19s)
* 15:58 logmsgbot: anomie Synchronized php-1.26wmf3/includes/: Deploy [[gerrit:208109]] to reduce the complaining about the new feature (duration: 00m 28s)
* 15:50 logmsgbot: anomie Synchronized php-1.26wmf4/includes/: Deploy [[gerrit:208109]] to reduce the complaining about the new feature (duration: 00m 24s)
* 15:29 gwicke: finished restarting cassandra nodes on restbase100*.eqiad
* 15:21 ottomata: doing java security update on kafka brokers, doing rolling restarts
* 14:50 gwicke: slowly restarting restbase100*.eqiad to apply new gen size change
* 10:47 godog: bounce apache2 on strontium
* 10:47 godog: bounce apache2 on palladium, mod_passenger died
* 05:45 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri May  1 05:44:23 UTC 2015 (duration 44m 22s)
* 03:05 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-05-01 03:04:21+00:00
* 03:01 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 06m 45s)
* 02:38 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-05-01 02:37:20+00:00
* 02:31 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 09m 46s)
* 00:18 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/PageTriage/: SWAT (duration: 00m 30s)
* 00:13 logmsgbot: ori Synchronized wmf-config: Iae2e55a11: wmgUseBits: false for itwiki (duration: 00m 19s)
 
== April 30 ==
* 23:59 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/PageTriage/: SWAT (duration: 00m 30s)
* 23:39 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/CentralAuth: SWAT (duration: 00m 15s)
* 23:39 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/Flow: SWAT (duration: 00m 51s)
* 23:38 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/MobileFrontend: SWAT (duration: 00m 58s)
* 23:35 logmsgbot: catrope Synchronized php-1.26wmf4/includes/skins/SkinTemplate.php: Add mw-content-ltr/rtl for missing pages (duration: 00m 35s)
* 23:33 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/CentralAuth/: SWAT (duration: 00m 31s)
* 23:32 ori: EventLogging events logged client-side appear not to be making it to eventlog1001.eqiad.wmnet; Ori investigating.
* 23:29 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/MobileFrontend/: SWAT (duration: 01m 43s)
* 23:04 RoanKattouw: Created wikilove tables on hywiki
* 23:03 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Enable WikiLove on hywiki (duration: 00m 49s)
* 22:48 logmsgbot: mattflaschen Finished scap: Deploy Flow changes to 1.26wmf4 facilitate LQT->Flow conversion (duration: 33m 35s)
* 22:19 awight: payments redeployed, revision for payments-wiki changed... from df8aeb5d1c5f595348f77cb56d3975eca19a65a2 to 3ab89e2b14eb449f7ceddf2325493d6235395ecd
* 22:17 awight: payments rolled back from 3ab89e2b14eb449f7ceddf2325493d6235395ecd to df8aeb5d1c5f595348f77cb56d3975eca19a65a2
* 22:14 logmsgbot: mattflaschen Started scap: Deploy Flow changes to 1.26wmf4 facilitate LQT->Flow conversion
* 22:10 awight: updating payments from df8aeb5d1c5f595348f77cb56d3975eca19a65a2 to 3ab89e2b14eb449f7ceddf2325493d6235395ecd
* 21:46 awight: update payments from 83d09e09178c634ad35dbb684d1c3aebbb709969 to df8aeb5d1c5f595348f77cb56d3975eca19a65a2
* 21:05 bd808: Finally got sync-common to run to completion on snapshot1004; runtime 45 minutes!
* 20:43 legoPanda: renaming <2k users who were missed in the original run (SUL finalization)
* 19:23 awight: enabling Thank You job
* 19:23 awight: updated crm from 59f03df6b689ef443cc7b7e31e6f5b2986bc8bc9 to 514e7ea41acd14e1565b31b76621ea840d209e07
* 19:07 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: I93cdc4a2e and I9ee6bec1f: Define $wgAssetsHost based on wmgUseBits; use it to reference standard chrome (duration: 00m 16s)
* 18:46 Coren: rebooting labstore1002 in prevision of switch to make sure it starts up cleanly.
* 18:14 K4-713: disabled Thank You mail send
* 17:41 bd808: sync-common on snapshot1004 failed after 33 minutes with rsync timeout
* 17:04 logmsgbot: demon Synchronized php-1.26wmf3/includes/Setup.php: meh, didn't work (duration: 00m 27s)
* 17:01 logmsgbot: demon Synchronized php-1.26wmf3/includes/Setup.php: trying something (duration: 00m 18s)
* 16:59 bd808: aborted sync-common on snapshot1004.eqiad.wmnet after 15 minutes for inactivity; trying again
* 16:44 bd808: started sync-common on snapshot1004 to fix aborted sync
* 16:42 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 26m 42s)
* 16:16 logmsgbot: kartik Started scap: Update ContentTranslation
* 15:22 logmsgbot: anomie Synchronized php-1.26wmf3/extensions/EducationProgram/: SWAT: EducationProgram: ApiListStudents: Use XML-friendly tag names [[gerrit:207778]] (duration: 00m 39s)
* 15:12 logmsgbot: anomie Synchronized php-1.26wmf4/extensions/EducationProgram/: SWAT: EducationProgram: ApiListStudents: Use XML-friendly tag names [[gerrit:207779]] (duration: 00m 25s)
* 15:09 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable GeoData at cawikibooks [[gerrit:199930]] (duration: 00m 19s)
* 15:08 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Restrict local uploads on mai.wikipedia [[gerrit:207725]] (duration: 00m 14s)
* 15:05 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Content Translation for Deployment 20150430 [[gerrit:207472]] (duration: 00m 18s)
* 15:03 logmsgbot: anomie Synchronized wmf-config/CommonSettings.php: SWAT: Bump timestamp in 'ValidateExtendedMetadataCache' hook for T97469 [[gerrit:207769]] (duration: 00m 30s)
* 12:27 godog: upgrade statsite on ms-be1*
* 12:25 godog: upgrade statsite on ms-fe1*
* 12:09 hashar: restarting Jenkins https://phabricator.wikimedia.org/T96183
* 10:53 godog: delete old /tmp/ganglia-graph from uranium
* 10:36 godog: upgrade statsite on labmon1001
* 08:16 paravoid: repooling esams, network maintenance is over
* 05:48 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Apr 30 05:47:26 UTC 2015 (duration 47m 25s)
* 05:15 paravoid: draining esams, planned upsteam network maintenance
* 03:04 logmsgbot: LocalisationUpdate completed (1.26wmf4) at 2015-04-30 03:03:09+00:00
* 03:00 logmsgbot: l10nupdate Synchronized php-1.26wmf4/cache/l10n: (no message) (duration: 07m 09s)
* 02:39 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-30 02:38:03+00:00
* 02:31 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 10m 59s)
* 01:18 logmsgbot: legoktm Synchronized php-1.26wmf3/includes/api/ApiOpenSearch.php: Restore B/C for ApiOpenSearch json output if warnings are present (duration: 00m 20s)
* 01:17 logmsgbot: legoktm Synchronized php-1.26wmf4/includes/api/ApiOpenSearch.php: Restore B/C for ApiOpenSearch json output if warnings are present (duration: 00m 30s)
 
== April 29 ==
* 23:58 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Enable direct RESTbase load on all Wikipedias (duration: 00m 21s)
* 23:57 logmsgbot: catrope Synchronized php-1.26wmf4/extensions/MobileFrontend: SWAT (duration: 00m 33s)
* 23:50 logmsgbot: catrope Synchronized php-1.26wmf3/resources/lib/jquery/jquery.js: Update jQuery to 1.11.3 (duration: 00m 31s)
* 23:49 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/VisualEditor: SWAT (duration: 00m 39s)
* 23:49 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/WikiEditor: SWAT (duration: 00m 23s)
* 23:48 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/Gather: SWAT (duration: 00m 32s)
* 23:24 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Enable Graph extension on sewikimedia (duration: 00m 21s)
* 23:21 logmsgbot: catrope Synchronized wmf-config/CommonSettings.php: Disable Graph namespace on all wikis except the ones that already have it (duration: 00m 22s)
* 23:20 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Add wmgUseGraphWithNamespace (duration: 00m 28s)
* 23:18 logmsgbot: catrope Synchronized wmf-config/Wikibase.php: Enable use of subscriptions table on testwikidata (duration: 00m 31s)
* 22:48 logmsgbot: legoktm Synchronized php-1.26wmf3/includes/MovePage.php: MovePage: Move target existence check into isValidMove() - https://gerrit.wikimedia.org/r/#/c/207557/ (duration: 00m 26s)
* 22:48 springle: dbstore1002 /srv/tmp filled up. killed queries, fixed mount point, restarted mysqld
* 21:27 logmsgbot: twentyafterfour Purged l10n cache for 1.26wmf2
* 21:23 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf4
* 21:20 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedias to 1.26wmf3
* 21:15 logmsgbot: twentyafterfour Finished scap: testwiki to php-1.26wmf4 and rebuild l10n cache - attempt #2 (duration: 33m 13s)
* 21:04 bd808: load avg on snapshot04 11.11; scap slow waiting on it
* 20:41 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf4 and rebuild l10n cache - attempt #2
* 20:41 logmsgbot: twentyafterfour scap aborted: testwiki to php-1.26wmf4 and rebuild l10n cache (duration: 26m 52s)
* 20:34 bd808: /etc/dsh/group/scap-proxies is borken on tin
* 20:17 subbu: reverted deploy to ebdac59b
* 20:17 subbu: attempted deploy of 45b54f63 (failed)
* 20:14 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf4 and rebuild l10n cache
* 20:03 logmsgbot: ori Synchronized README: testing deploy 2 (duration: 00m 22s)
* 20:03 logmsgbot: ori Synchronized README: testing deploy script (duration: 00m 25s)
* 16:22 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 30s)
* 15:58 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable assigning "accountcreator" for newiki [[gerrit:206093]] (duration: 00m 30s)
* 15:55 logmsgbot: anomie Synchronized wmf-config/abusefilter.php: SWAT: Add abusefilter-modify-restricted right to sysop user group for idwiki [[gerrit:206080]] (duration: 00m 25s)
* 15:53 logmsgbot: anomie Synchronized php-1.26wmf2/extensions/MobileFrontend: SWAT: Ah, git rebasing was rebasing the reverted commits on top of the revert... (duration: 00m 21s)
* 15:51 logmsgbot: anomie Synchronized php-1.26wmf2/extensions/MobileFrontend: SWAT: Resync? (duration: 00m 36s)
* 15:47 logmsgbot: anomie Synchronized php-1.26wmf2/extensions/MobileFrontend: SWAT: Revert previous, broke stuff on wmf2 (duration: 00m 39s)
* 15:44 logmsgbot: anomie Synchronized php-1.26wmf2/extensions/MobileFrontend: SWAT: MobileFrontend: API: "editable" is a legacy boolean, don't convert it [[gerrit:207403]] (duration: 00m 23s)
* 15:43 _joe_: restarting HHVM on mw1132 too, same reason.
* 15:41 logmsgbot: anomie Synchronized php-1.26wmf3/extensions/MobileFrontend: SWAT: MobileFrontend: API: "editable" is a legacy boolean, don't convert it [[gerrit:207403]] (duration: 00m 37s)
* 15:40 _joe_: restarting HHVM on mw1232, stuck on __lll_lock_wait from HPHP::StatCache::refresh ()
* 15:30 logmsgbot: anomie Synchronized php-1.26wmf3/includes/api/ApiResult.php: SWAT: API: ApiResult must validate even when using numeric auto-indexes [[gerrit:207456]] (duration: 00m 26s)
* 15:20 logmsgbot: anomie Synchronized php-1.26wmf3/extensions/Wikidata: SWAT: Update Wikidata - fix change subscriptions script [[gerrit:207448]] (duration: 00m 53s)
* 15:08 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Remove sampling of api.log [[gerrit:206865]] (duration: 00m 29s)
* 15:05 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Load HTML directly from RESTBase on all wikipedias [[gerrit:206320]] (duration: 00m 17s)
* 13:03 paravoid: disabling netflows on cr1/2-ulsfo
* 07:12 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Wed Apr 29 07:11:38 UTC 2015 (duration 11m 37s)
* 05:28 logmsgbot: tstarling Synchronized php-1.26wmf3/extensions/SecurePoll: (no message) (duration: 00m 13s)
* 03:47 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-29 03:46:05+00:00
* 03:40 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 39m 55s)
* 02:48 springle: killed eight stalled commonswiki.transcode transactions on db1040
* 02:45 logmsgbot: LocalisationUpdate completed (1.26wmf2) at 2015-04-29 02:43:54+00:00
* 02:40 logmsgbot: aude Synchronized wmf-config/InitialiseSettings.php: Enable Wikibase usage tracking on nlwiki and frwikisource (duration: 00m 12s)
* 02:40 logmsgbot: l10nupdate Synchronized php-1.26wmf2/cache/l10n: (no message) (duration: 25m 50s)
* 00:38 springle: xtrabackup clone db2029 to db2047
* 00:38 springle: xtrabackup clone db2028 to db2046
* 00:20 logmsgbot: gwicke Synchronized wmf-config/InitialiseSettings.php: VE: Load HTML directly from RESTBase for enwiki (duration: 00m 22s)
* 00:07 logmsgbot: bd808 Synchronized docroot/noc/createTxtFileSymlinks.sh: Revert of AffCom contact form {{gerrit|207328}} (duration: 00m 35s)
* 00:06 logmsgbot: bd808 Synchronized wmf-config/CommonSettings.php: Revert of AffCom contact form {{gerrit|207328}} (duration: 00m 19s)
 
== April 28 ==
* 23:57 logmsgbot: bd808 Synchronized docroot/noc/conf/AffComContactPages.php.txt: Add AffCom user group application contact page on meta {{gerrit|207319}} (duration: 00m 28s)
* 23:51 logmsgbot: bd808 Synchronized wmf-config/CommonSettings.php: Add AffCom user group application contact page on meta {{gerrit|204205}} (duration: 00m 11s)
* 23:50 logmsgbot: bd808 Synchronized docroot/noc/createTxtFileSymlinks.sh: Add AffCom user group application contact page on meta {{gerrit|204205}} (duration: 00m 21s)
* 23:48 logmsgbot: bd808 Synchronized wmf-config/AffComContactPages.php: Add AffCom user group application contact page on meta {{gerrit|204205}} (duration: 00m 25s)
* 23:35 bd808|deploy: mw2031.codfw.wmnet syncing very slowly for SWAT
* 23:35 logmsgbot: bd808 Synchronized wmf-config/InitialiseSettings.php: Shell bugs {{gerrit|207162}} {{gerrit|206731}} {{gerrit|203783}} {{gerrit|207273}} {{gerrit|207170}} (duration: 01m 12s)
* 23:32 logmsgbot: bd808 Synchronized commonsuploads.dblist: Restrict local uploads on mai.wikipedia {{gerrit|207273}} (duration: 00m 32s)
* 23:26 logmsgbot: bd808 Synchronized php-1.26wmf3/extensions/VisualEditor: Update VisualEditor for two icon issues {{gerrit|207299}} (duration: 00m 27s)
* 23:06 logmsgbot: hoo Synchronized wmf-config/: Do Wikibase setting overrides for test wikis in Wikibase-production.php (duration: 00m 24s)
* 22:58 logmsgbot: legoktm Synchronized php-1.26wmf3/extensions/EventLogging/includes/ApiJsonSchema.php: https://gerrit.wikimedia.org/r/#/c/207297/ (duration: 00m 15s)
* 22:07 Tim: running bv2015/voterList.php on terbium
* 22:05 logmsgbot: tstarling Synchronized php-1.26wmf2/extensions/SecurePoll: for new voterList.php (duration: 00m 23s)
* 21:32 logmsgbot: demon Synchronized wmf-config/CommonSettings.php: expire old metadata cache entries (duration: 00m 26s)
* 21:30 logmsgbot: rmoen Synchronized php-1.26wmf3/extensions/Gather/: Updating gather (duration: 00m 44s)
* 20:58 logmsgbot: anomie Synchronized php-1.26wmf3/includes/media/FormatMetadata.php: Unbreak API imageinfo with extmetadata (mainly on Commons) (duration: 00m 25s)
* 19:34 twentyafterfour: Deployed patch for T97391
* 19:25 logmsgbot: twentyafterfour Synchronized php-1.26wmf3/thumb.php: (no message) (duration: 00m 19s)
* 19:22 logmsgbot: twentyafterfour Synchronized php-1.26wmf2/thumb.php: (no message) (duration: 00m 33s)
* 19:21 mutante: tmp. stopped icinga-wm because puppetmaster fail spam
* 19:21 mutante: restarting apache on palladium
* 18:47 robh: stopping puppet on carbon - livehacking partman recipe testing
* 18:46 legoktm: force merged User:Js@ruwiki to User:Js@global per global-renamers list
* 18:34 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: Group1 wikis to 1.26wmf3
* 18:32 milimetric: upgraded and restarted Eventlogging on eventlog1001 (now at be1e055)
* 18:22 milimetric: upgraded and restarted Eventlogging on hafnium (now at be1e055)
* 17:54 mutante: tungsten - disable in icinga. scheduled the longest downtime. shutdown -h now (T97274)
* 17:49 mutante: tungsten - revoke puppet cert, delete salt-key, delete from stored configs
* 15:55 logmsgbot: anomie Synchronized php-1.26wmf2/extensions/ContentTranslation: SWAT: Update ContentTranslation [[gerrit:207092]] (duration: 00m 58s)
* 15:45 logmsgbot: anomie Synchronized php-1.26wmf3/extensions/ContentTranslation: SWAT: Update ContentTranslation [[gerrit:207098]] (duration: 00m 46s)
* 15:35 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Content Translation in cs, el, kk and zu [[gerrit:207048]] (duration: 00m 27s)
* 15:31 logmsgbot: anomie Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Content Translation in cs, el, kk and zu [[gerrit:207048]] (duration: 00m 21s)
* 15:23 logmsgbot: anomie Synchronized php-1.26wmf3/includes/api/ApiQuery.php: SWAT: API: Remove metadata keys from indexpageids output [[gerrit:206861]] (duration: 00m 17s)
* 15:13 logmsgbot: anomie Synchronized php-1.26wmf2/extensions/CentralAuth/: SWAT: CentralAuth: Fix missing "&" in onMakeGlobalVariablesScript signature [[gerrit:207023]] (duration: 00m 24s)
* 15:11 logmsgbot: anomie Synchronized php-1.26wmf3/extensions/CentralAuth/: SWAT: CentralAuth: Fix missing "&" in onMakeGlobalVariablesScript signature [[gerrit:207021]] (duration: 00m 29s)
* 14:58 akosiaris: restart pybal on lvs1003
* 14:51 akosiaris: restarted pybal on lvs1006
* 13:51 ottomata: powercycling analytics1015 after crash
* 12:38 springle: xtrabackup clone db2023 to db2045
* 12:36 springle: xtrabackup clone db2019 to db2044
* 12:34 springle: xtrabackup clone db2018 to db2043
* 05:31 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Tue Apr 28 05:30:47 UTC 2015 (duration 30m 46s)
* 02:52 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-28 02:51:34+00:00
* 02:50 ottomata: 'kafka preferred-replica-election'
* 02:48 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 08m 32s)
* 02:29 logmsgbot: LocalisationUpdate completed (1.26wmf2) at 2015-04-28 02:28:11+00:00
* 02:25 bblack: restarted apache2 on palladium - it was throwing infinite 500 errors due to some mod_passenger issue...
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf2/cache/l10n: (no message) (duration: 10m 17s)
* 01:45 bblack: rebooting analytics1013 (not 1016)
* 01:45 bblack: rebooting analytics1016
* 00:37 bblack: rebooting cp3030
* 00:13 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/Flow: SWAT (duration: 00m 28s)
* 00:12 logmsgbot: catrope Synchronized php-1.26wmf2/extensions/Flow: SWAT (duration: 00m 41s)
* 00:11 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/WikimediaEvents/: SWAT (duration: 00m 45s)
 
== April 27 ==
* 23:33 logmsgbot: catrope Synchronized wmf-config/CommonSettings.php: Re-enable same-domain RESTbase entry point for VE (duration: 00m 22s)
* 23:29 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: SWAT (duration: 00m 26s)
* 23:28 logmsgbot: catrope Synchronized wmf-config/flaggedrevs.php: Remove autoreview group on frwikinews (duration: 00m 35s)
* 22:43 mutante: racreset on analytics1016 because no console
* 22:36 ottomata: powercycled analytics1016 after it is unreachable.
* 20:36 subbu: deployed parsoid sha ebdac59b
* 19:47 mutante: apt-get upgrade on iron (incl. apt itself, gnupg, ssl)
* 17:53 mutante: temp stopped icinga-wm
* 17:22 logmsgbot: aaron Synchronized php-1.26wmf2/includes/media/DjVu.php: 40d702b8d2d023d6f701e4aeb082b62b7adf2f0f (duration: 00m 19s)
* 17:20 logmsgbot: aaron Synchronized php-1.26wmf3/includes/media/DjVu.php: b980b0a9457b2f98a502cfe36edfc75300c7952f (duration: 00m 27s)
* 17:05 logmsgbot: aaron Synchronized wmf-config/db-eqiad.php: Lowered innodb_lock_wait_timeout from defaults (duration: 00m 27s)
* 17:03 logmsgbot: aaron Synchronized wmf-config/db-codfw.php: Lowered innodb_lock_wait_timeout from defaults (duration: 00m 22s)
* 17:03 logmsgbot: aaron Synchronized wmf-config/jobqueue-eqiad.php: Set  to .1 (duration: 00m 11s)
* 17:02 logmsgbot: aaron Synchronized wmf-config/jobqueue-codfw.php: Set  to .1 (duration: 00m 27s)
* 16:24 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: Extended SWAT [[gerrit:206822]] (duration: 00m 26s)
* 16:16 godog: boostrap cassandra on xenon
* 16:02 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT [[gerrit:201897]] (duration: 00m 22s)
* 15:55 logmsgbot: thcipriani Synchronized wmf-config/flaggedrevs.php: SWAT [[gerrit:199321]] (duration: 00m 17s)
* 15:51 logmsgbot: thcipriani Synchronized wmf-config/flaggedrevs.php: SWAT [[gerrit:206650]] no-op whitespace changes (duration: 00m 22s)
* 15:42 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT [[gerrit:206727]] and [[gerrit:206786]] (duration: 00m 16s)
* 15:27 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT [[gerrit:204467]] (duration: 00m 29s)
* 15:20 logmsgbot: thcipriani Synchronized wmf-config/flaggedrevs.php: SWAT [[gerrit:206647]] (duration: 00m 14s)
* 15:09 logmsgbot: thcipriani Synchronized wmf-config/InitialiseSettings.php: SWAT [[gerrit:206648]] (duration: 00m 51s)
* 14:50 godog: upgrade statsite on graphite1001
* 14:04 bblack: puppet disabled on caches while apt upgrades run...
* 13:28 paravoid: upgrading pfw-codfw to newer junos
* 12:31 paravoid: upgrading pfw-eqiad to newer junos
* 08:19 godog: ms-be101[678] object weight to 3000
* 05:09 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Mon Apr 27 05:08:23 UTC 2015 (duration 8m 22s)
 
== April 26 ==
* 23:31 paravoid: draining esams for planned upstream network maintenance (00:00-04:00 UTC)
* 08:16 jgage: ms-be1007 was unresponsive for ~6 hours, "soft lockup" output on console. rebooted.
* 05:29 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sun Apr 26 05:28:11 UTC 2015 (duration 28m 10s)
* 03:37 ori: Previous sync-file was for: If296f3d3c: Set max_execution_time in CommonSettings.php
* 03:36 logmsgbot: ori Synchronized wmf-config/CommonSettings.php: (no message) (duration: 00m 14s)
* 03:05 jgage: mw2027 rebooted unexpectedly, no clues in syslog. afterward i dist-upgraded, including new kernel.
* 02:56 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-26 02:55:00+00:00
* 02:52 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 06m 38s)
* 02:35 logmsgbot: LocalisationUpdate completed (1.26wmf2) at 2015-04-26 02:33:59+00:00
* 02:30 logmsgbot: l10nupdate Synchronized php-1.26wmf2/cache/l10n: (no message) (duration: 07m 39s)
 
== April 25 ==
* 15:26 subbu: deployed parsoid version fca17070 (cherry-pick of d2135c6b on parsoid master)
* 09:57 _joe_: nuked User:Niteshift/MVneu/2015_April_21-30 on commonswiki
* 05:18 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Sat Apr 25 05:17:41 UTC 2015 (duration 17m 40s)
* 04:30 logmsgbot: mattflaschen Synchronized wmf-config/CommonSettings-labs.php: Sync Beta Cluster-only change (for MW UI beta feature) (duration: 00m 16s)
* 04:30 logmsgbot: mattflaschen Synchronized wmf-config/InitialiseSettings-labs.php: Sync Beta Cluster-only change (for MW UI beta feature) (duration: 00m 16s)
* 02:42 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-25 02:41:54+00:00
* 02:39 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 05m 56s)
* 02:24 logmsgbot: LocalisationUpdate completed (1.26wmf2) at 2015-04-25 02:23:33+00:00
* 02:20 logmsgbot: l10nupdate Synchronized php-1.26wmf2/cache/l10n: (no message) (duration: 07m 48s)
 
== April 24 ==
* 22:14 logmsgbot: krinkle Synchronized php-1.26wmf2/includes/resourceloader/ResourceLoaderModule.php: Ibedc31659ed (duration: 00m 14s)
* 22:13 logmsgbot: krinkle Synchronized php-1.26wmf3/includes/resourceloader/ResourceLoaderModule.php: Ibedc31659ed (duration: 00m 17s)
* 21:11 ottomata: started hdfs balancer run
* 20:34 ori: Deployed I1fa012ca1: HHVM: Limit wall execution time of FCGI reqs to 290s
* 19:53 logmsgbot: aaron Synchronized wmf-config/db-codfw.php: Removed unused "max threads" stuff (duration: 00m 15s)
* 19:52 subbu: revert parsoid deploy to 3311936a
* 19:52 logmsgbot: aaron Synchronized wmf-config/db-eqiad.php: Removed unused "max threads" stuff (duration: 00m 14s)
* 19:42 logmsgbot: demon Synchronized php-1.26wmf2/extensions/CirrusSearch/includes/Searcher.php: undo debugging (duration: 00m 14s)
* 19:40 logmsgbot: demon Synchronized php-1.26wmf2/extensions/CirrusSearch/includes/Searcher.php: debugging (duration: 00m 17s)
* 18:58 ori: restarted puppetmaster on palladium as well
* 18:56 ori: restarted apache2 on palladium
* 16:06 andrewbogott: dist-upgrade (including kernel upgrade to 3.13.0-49-generic) on labvirt1004, rebooting
* 15:56 logmsgbot: demon Synchronized wmf-config/: logging cleanup, mostly for labs (duration: 00m 21s)
* 15:42 andrewbogott: dist-upgrade (including kernel upgrade to 3.13.0-49-generic) on labvirt1003, rebooting
* 15:08 andrewbogott: dist-upgrade (including kernel upgrade to 3.13.0-49-generic) on labvirt1005, rebooting
* 14:24 andrewbogott: dist-upgrade (including kernel upgrade to 3.13.0-49-generic) on labvirt1006, rebooting
* 10:31 akosiaris: nova migrated a couple of etcd's project VMs
* 09:09 _joe_: parsoid restart done
* 08:59 _joe_: restarting parsoid cluster-wide
* 08:47 ori: deployed parsoid/deploy 8b5de6aba / I4d55f6d50: Bump src to d2135c6b69 for deploy
* 08:09 logmsgbot: tstarling Synchronized php-1.26wmf2/includes/filerepo/file/LocalFile.php: reverting live hack (duration: 00m 16s)
* 06:40 ori: nuked http://commons.wikimedia.org/wiki/User:Niteshift/MVneu/2015_April_21-30
* 05:44 logmsgbot: ori Synchronized php-1.26wmf1/includes/filerepo/file/LocalFile.php: Undo local hack on version that is inactive (1.26wmf1). No-op. (duration: 00m 17s)
* 05:35 ori: restart hhvm on mw1222; locked up in pthread_cond_wait, backtrace: https://phabricator.wikimedia.org/P552
* 05:28 ori: nuked https://commons.wikimedia.org/wiki/User:Niteshift/MVneu/2015_April_21-20
* 05:18 logmsgbot: ori Synchronized wmf-config/InitialiseSettings.php: $wgExportAllowHistory default false, $wgExportMaxHistory default 1000 -> 10 (duration: 00m 16s)
* 05:05 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Fri Apr 24 05:04:42 UTC 2015 (duration 4m 41s)
* 04:47 logmsgbot: ori Synchronized php-1.26wmf2/includes/filerepo/file/LocalFile.php: Short-circuit LocalFile::loadExtraFromDB in attempt to mitigate outage (duration: 00m 12s)
* 04:42 springle: killing LocalFile::loadExtraFromDB wholesale on s4
* 04:32 logmsgbot: ori Synchronized php-1.26wmf1/includes/filerepo/file/LocalFile.php: Short-circuit LocalFile::loadExtraFromDB in attempt to mitigate outage (duration: 00m 14s)
* 04:25 ori: Did a cluster-wide 'service hhvm restart'.
* 02:48 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-24 02:47:12+00:00
* 02:44 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 06m 00s)
* 02:30 logmsgbot: LocalisationUpdate completed (1.26wmf2) at 2015-04-24 02:28:58+00:00
* 02:25 logmsgbot: l10nupdate Synchronized php-1.26wmf2/cache/l10n: (no message) (duration: 06m 35s)
* 00:47 logmsgbot: catrope Synchronized wmf-config/CommonSettings.php: Revert RESTbase URL change (duration: 00m 13s)
* 00:21 logmsgbot: catrope Synchronized php-1.26wmf3/extensions/VisualEditor: Fix RESTbase revid bug (duration: 00m 18s)
* 00:21 logmsgbot: catrope Synchronized php-1.26wmf2/extensions/VisualEditor: Fix RESTbase revid bug (duration: 00m 17s)
* 00:07 logmsgbot: catrope Synchronized wmf-config/CommonSettings.php: Use same-domain entry point for RESTbase (duration: 00m 13s)
* 00:07 logmsgbot: catrope Synchronized wmf-config/InitialiseSettings.php: Temp disable direct RESTbase on enwiki (duration: 00m 17s)
 
== April 23 ==
* 23:54 logmsgbot: rmoen Synchronized php-1.26wmf3/extensions/Flow: Bump flow for cherry-pick (duration: 00m 23s)
* 23:46 logmsgbot: mattflaschen Synchronized wmf-config/CommonSettings.php: Bump Flow cache version to 4.7 (1e28cf78e64eb860d6eade775abae43d11c1dd75) (duration: 00m 16s)
* 23:41 andrewbogott: updating labvirt1002 to 3.13.0-49-generic, dist-upgrade, rebooting
* 23:41 andrewbogott: reverted labvirt1001 to 3.13.0-49-generic because 3.16 wouldn’t mount the fs
* 23:30 logmsgbot: rmoen Synchronized php-1.26wmf2/extensions/MobileFrontend/: Update MobileFrontend to cherry picks (duration: 00m 20s)
* 23:30 logmsgbot: rmoen Synchronized php-1.26wmf3/extensions/MobileFrontend/: Update MobileFrontend to cherry picks (duration: 00m 38s)
* 22:48 andrewbogott: upgrading labvirt1001 to linux-image-3.16.0-34-generic, dist-upgrading, and rebooting
* 22:10 logmsgbot: bd808 Synchronized wmf-config/logging.php: logstash: Fix log level detection (c09014d) (duration: 00m 17s)
* 21:56 ori: Additional (planned) outcome of Ie22658727 and Ice65e7e70: xff log flowing to fluorine, causing bytes-in to climb from ~1.2M/s to ~2.1M/s
* 21:54 ori: Syncing Ie22658727 and Ice65e7e70 (which introduce new InitialiseSettings vars) in one go caused a small burst of 500s (peaking at 500/sec and lasting a few seconds) on four app servers.
* 21:42 logmsgbot: ori Synchronized wmf-config: Ie22658727 and Ice65e7e70: use Monolog to configure logging (duration: 00m 15s)
* 21:04 awight: update payments from 88b9f621bfee1de14a8cdef556a90e5567721754 to 83d09e09178c634ad35dbb684d1c3aebbb709969
* 19:31 mutante: restarting icinga-wm for config change
* 18:05 andrewbogott: rebooting labvirt1006
* 17:51 logmsgbot: kartik Synchronized php-1.26wmf2/extensions/ContentTranslation: (no message) (duration: 00m 15s)
* 17:29 logmsgbot: kartik Finished scap: Update ContentTranslation (duration: 26m 11s)
* 17:28 ori: scap stuck on snapshot1004; not accepting mwdeploy key
* 17:03 logmsgbot: kartik Started scap: Update ContentTranslation
* 16:53 logmsgbot: aaron Synchronized php-1.26wmf2/includes/jobqueue/JobRunner.php: d23777e6832f660984ce4445ab04f98b7ff0d25f (duration: 00m 12s)
* 16:33 andrewbogott: rebooting labvirt1005
* 15:03 logmsgbot: manybubbles Synchronized wmf-config/CommonSettings.php: swat: Re-enable Special:SupportedLanguages (duration: 00m 11s)
* 12:29 godog: investigating icinga UNKNOWN for hhvm queue/threads
* 09:15 godog: restart carbon on graphite1001, replace with carbon-c-relay
* 08:31 godog: restart carbon on labmon1001, replace with carbon-c-relay
* 05:22 logmsgbot: LocalisationUpdate ResourceLoader cache refresh completed at Thu Apr 23 05:21:17 UTC 2015 (duration 21m 16s)
* 02:49 logmsgbot: LocalisationUpdate completed (1.26wmf3) at 2015-04-23 02:48:40+00:00
* 02:46 logmsgbot: l10nupdate Synchronized php-1.26wmf3/cache/l10n: (no message) (duration: 03m 46s)
* 02:28 logmsgbot: LocalisationUpdate completed (1.26wmf2) at 2015-04-23 02:27:39+00:00
* 02:24 logmsgbot: l10nupdate Synchronized php-1.26wmf2/cache/l10n: (no message) (duration: 05m 46s)
* 00:15 logmsgbot: kaldari Synchronized wmf-config/InitialiseSettings.php: Turning on WikiGrok on English Wikipedia (for 2 week test) (duration: 00m 11s)
* 00:07 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/206024 (duration: 00m 14s)
* 00:05 logmsgbot: krenair Synchronized php-1.26wmf2/extensions/ZeroBanner/includes/ZeroSpecialPage.php: https://gerrit.wikimedia.org/r/#/c/206023/ (duration: 00m 13s)
 
== April 22 ==
* 23:47 logmsgbot: krenair Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/206015 (duration: 00m 12s)
* 23:44 logmsgbot: krenair Synchronized php-1.26wmf3/extensions/ZeroBanner/includes/ZeroSpecialPage.php: https://gerrit.wikimedia.org/r/#/c/206017/ (duration: 00m 13s)
* 23:26 logmsgbot: krenair Synchronized php-1.26wmf3/extensions/Flow: https://gerrit.wikimedia.org/r/#/c/206008/ (duration: 00m 13s)
* 23:17 logmsgbot: krenair Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/205889/ (duration: 00m 12s)
* 23:16 logmsgbot: krenair Synchronized php-1.26wmf2/extensions/OpenStackManager/nova/OpenStackNovaUser.php: https://gerrit.wikimedia.org/r/#/c/205887/ (duration: 00m 12s)
* 22:55 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: group0 to 1.26wmf3
* 22:52 logmsgbot: twentyafterfour rebuilt wikiversions.cdb and synchronized wikiversions files: wikipedias to 1.26wmf2
* 22:47 logmsgbot: twentyafterfour Finished scap: testwiki to php-1.26wmf3 and rebuild l10n cache (duration: 37m 11s)
* 22:44 hoo: Killed demon's "sudo -u www-data php /srv/mediawiki-staging/multiversion/MWScript.php refreshLinks.php --wiki=ptwiki" on terbium, sending the box into swap
* 22:10 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf3 and rebuild l10n cache
* 21:31 Coren: reboot round of deployment-prep done
* 21:05 Coren: Starting deployment-prep rolling reboots
* 20:13 logmsgbot: twentyafterfour scap failed: CalledProcessError Command '/usr/local/bin/mwscript mergeMessageFileList.php --wiki="testwiki" --list-file="/srv/mediawiki-staging/wmf-config/extension-list" --output="/tmp/tmp.KaXyRl6UJi" ' returned non-zero exit status 1 (duration: 02m 10s)
* 20:10 logmsgbot: twentyafterfour Started scap: testwiki to php-1.26wmf3 and rebuild l10n cache
* 20:08 subbu: deployed parsoid version 3311936a
* 19:51 hashar: Zuul / Jenkins back up and processing the 1+ hour backlog of changes.  Will take a while.  Multiple causes:  Zuul gearmand being stalled on a socket that has no more data to emit and  Jenkins being deadlocked due to an IRC plugin
* 19:44 hashar: Killing Jenkins cause .... we know
* 19:27 hashar: zuul gearman server is stalled
* 15:30 gwicke: stopped restbase on restbase1002 in preparation for cmjohnson1 checking the hardware
* 15:30 logmsgbot: demon Finished scap: 1.26wmf2 was tracking master. should be fixed, being paranoid and doing full sync + i18n rebuild (duration: 08m 11s)
* 15:21 logmsgbot: demon Started scap: 1.26wmf2 was tracking master. should be fixed, being paranoid and doing full sync + i18n rebuild
* 15:19 logmsgbot: demon Synchronized php-1.26wmf2/extensions/VisualEditor