Server Admin Log: Difference between revisions
imported>Stashbot (ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance)
imported>Stashbot (mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply)
Line 1: | Line 1:
== 2022-06-02 ==
* 01:47 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 01:38 krinkle@deploy1002: Synchronized multiversion/: {{Gerrit|Id9b34b755230}} no-op (duration: 03m 12s)
* 01:37 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 01:36 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 01:36 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 01:35 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 01:15 krinkle@deploy1002: Synchronized src/Profiler.php: {{Gerrit|I257b41a45}} (duration: 03m 15s)
* 01:15 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 01:14 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 01:14 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 01:13 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 01:09 krinkle@deploy1002: Synchronized wmf-config/PhpAutoPrepend.php: {{Gerrit|Iebd29aaa}} (duration: 02m 57s)
* 01:07 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 01:07 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 01:06 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 01:05 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 01:05 krinkle@deploy1002: Synchronized src/Profiler.php: {{Gerrit|I93b3e43d32}} (duration: 03m 16s)
* 00:50 krinkle@deploy1002: Synchronized wmf-config/MetaContactPages.php: {{Gerrit|Ief1368fd959f428}} (duration: 02m 56s)
* 00:46 krinkle@deploy1002: Synchronized php-1.39.0-wmf.14/extensions/WikimediaMessages/: {{Gerrit|I5a700cd3648}} (duration: 03m 01s)
* 00:40 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 00:39 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 00:39 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 00:38 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 00:28 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 00:24 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 00:24 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 00:23 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
== 2022-06-01 ==
* 22:13 ryankemper: [[phab:T309720|T309720]] Downtimed cloudelastic until Monday while we perform maintenance across the next couple days (will manually lift downtime later)
* 21:33 bking@cumin1001: END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: restart to enable S3 plugin - bking@cumin1001 - [[phab:T309720|T309720]]
* 21:33 bking@cumin1001: START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: restart to enable S3 plugin - bking@cumin1001 - [[phab:T309720|T309720]]
* 21:10 ebernhardson: restart wdqs-blazegraph on wdqs1007 to resolve BlazegraphFreeAllocatorsDecreasingRapidly
* 21:05 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 21:05 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 21:05 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 21:04 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 20:58 urbanecm@deploy1002: Synchronized wmf-config/InitialiseSettings.php: {{Gerrit|5a8e7586bcb0933c96e8294e389c31270edb134e}}: Revert "Start writing to cuc_actor everywhere except s4 and s8" ([[phab:T233004|T233004]]) (duration: 00m 32s)
* 20:38 andrew@deploy1002: helmfile [codfw] DONE helmfile.d/services/toolhub: apply
* 20:36 andrew@deploy1002: helmfile [codfw] START helmfile.d/services/toolhub: apply
* 20:36 andrew@deploy1002: helmfile [eqiad] DONE helmfile.d/services/toolhub: apply
* 20:35 andrew@deploy1002: helmfile [eqiad] START helmfile.d/services/toolhub: apply
* 20:32 cjming: end of UTC late backport window
* 20:32 andrew@deploy1002: helmfile [staging] DONE helmfile.d/services/toolhub: apply
* 20:31 andrew@deploy1002: helmfile [staging] START helmfile.d/services/toolhub: apply
* 20:28 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 20:26 cjming@deploy1002: Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:800278{{!}}Start writing to cuc_actor everywhere except s4 and s8 (T233004)]] (duration: 03m 01s)
* 20:25 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 20:25 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 20:24 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 20:21 andrew@deploy1002: helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply
* 20:21 andrew@deploy1002: helmfile [eqiad] START helmfile.d/services/developer-portal: apply
* 20:21 andrew@deploy1002: helmfile [codfw] DONE helmfile.d/services/developer-portal: apply
* 20:20 andrew@deploy1002: helmfile [codfw] START helmfile.d/services/developer-portal: apply
* 20:19 andrew@deploy1002: helmfile [staging] DONE helmfile.d/services/developer-portal: apply
* 20:19 andrew@deploy1002: helmfile [staging] START helmfile.d/services/developer-portal: apply
* 20:15 andrew@deploy1002: helmfile [staging] DONE helmfile.d/services/developer-portal: apply
* 20:15 andrew@deploy1002: helmfile [staging] START helmfile.d/services/developer-portal: apply
* 20:14 ladsgroup@cumin1001: dbctl commit (dc=all): 'Depooling db1147 ([[phab:T298560|T298560]])', diff saved to https://phabricator.wikimedia.org/P29323 and previous config saved to /var/cache/conftool/dbconfig/20220601-201402-ladsgroup.json
* 20:13 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1147.eqiad.wmnet with reason: Maintenance
* 20:13 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1147.eqiad.wmnet with reason: Maintenance
* 20:09 SandraEbele: Successfully deployed refinery using scap, then deployed onto hdfs.
* 19:42 ebysans@deploy1002: Finished deploy [analytics/refinery@13f791b] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@13f791b] (duration: 07m 06s)
* 19:35 ebysans@deploy1002: Started deploy [analytics/refinery@13f791b] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@13f791b]
* 19:35 ebysans@deploy1002: Finished deploy [analytics/refinery@13f791b] (thin): Regular analytics weekly train THIN [analytics/refinery@13f791b] (duration: 00m 07s)
* 19:35 ebysans@deploy1002: Started deploy [analytics/refinery@13f791b] (thin): Regular analytics weekly train THIN [analytics/refinery@13f791b]
* 19:19 ebysans@deploy1002: Finished deploy [analytics/refinery@13f791b]: Regular analytics weekly train [analytics/refinery@13f791b] (duration: 23m 12s)
* 18:56 ebysans@deploy1002: Started deploy [analytics/refinery@13f791b]: Regular analytics weekly train [analytics/refinery@13f791b]
* 18:52 SandraEbele: About to deploy analytics/refinery (weekly deployment train)
* 18:14 jhuneidi@deploy1002: Synchronized php: group1 wikis to 1.39.0-wmf.14 refs [[phab:T308067|T308067]] (duration: 03m 02s)
* 18:12 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 18:11 jhuneidi@deploy1002: rebuilt and synchronized wikiversions files: group1 wikis to 1.39.0-wmf.14 refs [[phab:T308067|T308067]]
* 18:11 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 18:11 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 18:10 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 18:05 ladsgroup@deploy1002: Synchronized php-1.39.0-wmf.14/extensions/Wikibase/client: Backport: [[gerrit:802114{{!}}Don't call saveSettings in EchoNotificationsHandlers::doLocalUserCreated (T306636)]] (duration: 03m 11s)
* 18:00 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 17:59 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 17:59 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 17:58 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 15:58 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1047.eqiad.wmnet with OS bullseye
* 15:43 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1047.eqiad.wmnet with reason: host reimage
* 15:40 mvernon@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1047.eqiad.wmnet with reason: host reimage
* 15:24 mvernon@cumin2002: START - Cookbook sre.hosts.reimage for host ms-be1047.eqiad.wmnet with OS bullseye
* 15:02 ladsgroup@deploy1002: Synchronized php-1.39.0-wmf.14/extensions/Thanks/includes/Hooks.php: Backport: [[gerrit:802110{{!}}Don't call saveOptions in Hooks::onAccountCreated (T306636)]] (duration: 03m 10s)
* 15:01 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 15:00 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 15:00 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 14:59 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 14:55 ladsgroup@deploy1002: Synchronized php-1.39.0-wmf.13/extensions/Thanks/includes/Hooks.php: Backport: [[gerrit:802109{{!}}Don't call saveOptions in Hooks::onAccountCreated (T306636)]] (duration: 03m 10s)
* 14:54 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 14:54 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 14:54 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 14:53 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 14:46 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1046.eqiad.wmnet with OS bullseye
* 14:31 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1046.eqiad.wmnet with reason: host reimage
* 14:28 mvernon@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1046.eqiad.wmnet with reason: host reimage
* 14:25 aikochou@deploy1002: Finished deploy [ores/deploy@3d541df]: Deploy revscoring 2.11.4 to ORES - [[phab:T309536|T309536]] (duration: 45m 07s)
* 14:10 mvernon@cumin2002: START - Cookbook sre.hosts.reimage for host ms-be1046.eqiad.wmnet with OS bullseye
* 13:40 aikochou@deploy1002: Started deploy [ores/deploy@3d541df]: Deploy revscoring 2.11.4 to ORES - [[phab:T309536|T309536]]
* 13:32 kevinbazira@deploy1002: helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' .
* 13:32 kevinbazira@deploy1002: helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' .
* 12:41 moritzm: installing ruby-nokogiri security updates
* 12:24 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1136 ([[phab:T309617|T309617]])', diff saved to https://phabricator.wikimedia.org/P29320 and previous config saved to /var/cache/conftool/dbconfig/20220601-122426-ladsgroup.json
* 12:09 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1136', diff saved to https://phabricator.wikimedia.org/P29318 and previous config saved to /var/cache/conftool/dbconfig/20220601-120921-ladsgroup.json
* 11:54 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1136', diff saved to https://phabricator.wikimedia.org/P29317 and previous config saved to /var/cache/conftool/dbconfig/20220601-115416-ladsgroup.json
* 11:44 marostegui@cumin1001: dbctl commit (dc=all): 'Repool db1137 in x1 with minimal weight to test 10.6.8 [[phab:T309679|T309679]] ', diff saved to https://phabricator.wikimedia.org/P29315 and previous config saved to /var/cache/conftool/dbconfig/20220601-114418-marostegui.json
* 11:39 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1136 ([[phab:T309617|T309617]])', diff saved to https://phabricator.wikimedia.org/P29314 and previous config saved to /var/cache/conftool/dbconfig/20220601-113911-ladsgroup.json
* 11:30 ladsgroup@cumin1001: dbctl commit (dc=all): 'Depooling db1136 ([[phab:T309617|T309617]])', diff saved to https://phabricator.wikimedia.org/P29313 and previous config saved to /var/cache/conftool/dbconfig/20220601-113017-ladsgroup.json
* 11:30 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1136.eqiad.wmnet with reason: Maintenance
* 11:30 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1136.eqiad.wmnet with reason: Maintenance
* 11:21 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 11:21 ladsgroup@deploy1002: Synchronized php-1.39.0-wmf.13/extensions/PageTriage/includes/Hooks.php: Backport: [[gerrit:802107{{!}}Don't call saveOptions in LocalUserCreated (T306636)]] (duration: 03m 16s)
* 11:20 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 11:20 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 11:19 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 11:18 marostegui@cumin1001: dbctl commit (dc=all): 'Repool db1137 in x1 with minimal weight to test 10.6.8 [[phab:T309679|T309679]] ', diff saved to https://phabricator.wikimedia.org/P29312 and previous config saved to /var/cache/conftool/dbconfig/20220601-111805-marostegui.json
* 11:16 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1045.eqiad.wmnet with OS bullseye
* 11:15 ladsgroup@deploy1002: Synchronized php-1.39.0-wmf.14/extensions/PageTriage/includes/Hooks.php: Backport: [[gerrit:802106{{!}}Don't call saveOptions in LocalUserCreated (T306636)]] (duration: 03m 01s)
* 11:14 mwdebug-deploy@deploy1002: helmfile [codfw] DONE helmfile.d/services/mwdebug: apply
* 11:14 mwdebug-deploy@deploy1002: helmfile [codfw] START helmfile.d/services/mwdebug: apply
* 11:14 mwdebug-deploy@deploy1002: helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply
* 11:13 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 11:04 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance
* 11:04 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance
* 10:54 XioNoX: upgrade fastnetmon to 1.2.1 in eqsin - [[phab:T271228|T271228]]
* 10:51 XioNoX: upgrade fastnetmon to 1.2.1 in esams - [[phab:T271228|T271228]]
* 10:49 XioNoX: upgrade fastnetmon to 1.2.1 in eqiad - [[phab:T271228|T271228]]
* 10:48 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1045.eqiad.wmnet with reason: host reimage
* 10:45 mvernon@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1045.eqiad.wmnet with reason: host reimage
* 10:28 mvernon@cumin2002: START - Cookbook sre.hosts.reimage for host ms-be1045.eqiad.wmnet with OS bullseye
* 10:13 moritzm: installing openldap security updates
* 10:11 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1044.eqiad.wmnet with OS bullseye
* 09:56 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1044.eqiad.wmnet with reason: host reimage
* 09:39 mwdebug-deploy@deploy1002: helmfile [eqiad] START helmfile.d/services/mwdebug: apply
* 09:36 mvernon@cumin2002: START - Cookbook sre.hosts.reimage for host ms-be1044.eqiad.wmnet with OS bullseye
* 09:08 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1043.eqiad.wmnet with OS bullseye
* 08:56 marostegui@cumin1001: dbctl commit (dc=all): 'Repool db1137 in x1 with minimal weight to test 10.6.8 [[phab:T309679|T309679]] ', diff saved to https://phabricator.wikimedia.org/P29307 and previous config saved to /var/cache/conftool/dbconfig/20220601-085620-marostegui.json
* 08:49 moritzm: installing idp1002 [[phab:T308214|T308214]]
* 08:48 mvernon@cumin2002: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1043.eqiad.wmnet with reason: host reimage
* 08:45 mvernon@cumin2002: START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1043.eqiad.wmnet with reason: host reimage
* 08:43 elukey@deploy1002: helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'.
* 08:43 elukey@deploy1002: helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'.
* 08:41 elukey@deploy1002: helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'.
* 08:41 elukey@deploy1002: helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'.
* 08:39 elukey: powercycle an-worker1094 - OEM event registered in `racadm getsel`, host frozen
* 08:30 mvernon@cumin2002: START - Cookbook sre.hosts.reimage for host ms-be1043.eqiad.wmnet with OS bullseye
* 08:30 mvernon@cumin2002: END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-be1043.eqiad.wmnet with OS bullseye
* 08:20 moritzm: installing openssl security updates
* 08:19 mvernon@cumin2002: START - Cookbook sre.hosts.reimage for host ms-be1043.eqiad.wmnet with OS bullseye
* 08:11 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1139.eqiad.wmnet with reason: Maintenance
* 08:11 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1139.eqiad.wmnet with reason: Maintenance
* 08:11 marostegui@cumin1001: dbctl commit (dc=all): 'Add some weight to x1 master', diff saved to https://phabricator.wikimedia.org/P29306 and previous config saved to /var/cache/conftool/dbconfig/20220601-081130-marostegui.json
* 08:10 marostegui@cumin1001: dbctl commit (dc=all): 'Depool db1137 for migration to 10.6 [[phab:T309679|T309679]]', diff saved to https://phabricator.wikimedia.org/P29305 and previous config saved to /var/cache/conftool/dbconfig/20220601-081044-root.json
* 08:00 moritzm: installing idp2002 [[phab:T308214|T308214]]
* 07:34 moritzm: installing libxml2 security updates
* 03:14 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1172 ([[phab:T60674|T60674]])', diff saved to https://phabricator.wikimedia.org/P29301 and previous config saved to /var/cache/conftool/dbconfig/20220601-031406-ladsgroup.json
* 02:59 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P29300 and previous config saved to /var/cache/conftool/dbconfig/20220601-025901-ladsgroup.json
* 02:43 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P29299 and previous config saved to /var/cache/conftool/dbconfig/20220601-024356-ladsgroup.json
* 02:28 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1172 ([[phab:T60674|T60674]])', diff saved to https://phabricator.wikimedia.org/P29298 and previous config saved to /var/cache/conftool/dbconfig/20220601-022851-ladsgroup.json
* 02:03 ladsgroup@cumin1001: dbctl commit (dc=all): 'Depooling db1172 ([[phab:T60674|T60674]])', diff saved to https://phabricator.wikimedia.org/P29297 and previous config saved to /var/cache/conftool/dbconfig/20220601-020339-ladsgroup.json
* 02:03 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1172.eqiad.wmnet with reason: Maintenance
* 02:03 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 6:00:00 on db1172.eqiad.wmnet with reason: Maintenance
* 01:41 ladsgroup@cumin1001: END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance
* 01:40 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance
Line 14: | Line 177:
* 00:04 ladsgroup@cumin1001: START - Cookbook sre.hosts.downtime for 6:00:00 on db1116.eqiad.wmnet with reason: Maintenance
* 00:04 ladsgroup@cumin1001: dbctl commit (dc=all): 'Repooling after maintenance db1177 ([[phab:T60674|T60674]])', diff saved to https://phabricator.wikimedia.org/P29289 and previous config saved to /var/cache/conftool/dbconfig/20220601-000448-ladsgroup.json