You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Nova Resource:Tools/SAL: Revision history

Jump to navigation Jump to search

Diff selection: Mark the radio buttons of the revisions to compare and hit enter or the button at the bottom.
Legend: (cur) = difference with latest revision, (prev) = difference with preceding revision, m = minor edit.

(newest | oldest) View ( | older 100) (20 | 50 | 100 | 250 | 500)

16 February 2018

  • curprev 18:2118:21, 16 February 2018imported>Stashbot 199,148 bytes +937 arturo: upgrading tools-proxy-01 and tools-paws-master-01, same as others

15 February 2018

  • curprev 13:5413:54, 15 February 2018imported>Stashbot 198,211 bytes +672 arturo: cleanup ferm (deinstall) in tools-services-01 for T187435

14 February 2018

  • curprev 13:0913:09, 14 February 2018imported>Stashbot 197,539 bytes +236 arturo: the reboot was OK, the server seems working and kubectl sees all the pods running in the deployment (T187315)

11 February 2018

  • curprev 01:2801:28, 11 February 2018imported>Stashbot 197,303 bytes +367 zhuyifei1999_: `# find /home/ -maxdepth 1 -perm -o+w \! -uid 0 -exec chmod -v o-w {} \;` Affected: only /home/tr8dr, mode 0777 -> 0775

9 February 2018

  • curprev 10:3510:35, 9 February 2018imported>Stashbot 196,936 bytes +989 arturo: deploy https://gerrit.wikimedia.org/r/#/c/409226/ T179343 T182562 T186846

8 February 2018

  • curprev 18:3818:38, 8 February 2018imported>Stashbot 195,947 bytes +1,291 arturo: aborrero@tools-k8s-master-01:~$ sudo kubectl uncordon tools-worker-1002.tools.eqiad.wmflabs

6 February 2018

  • curprev 13:1513:15, 6 February 2018imported>Stashbot 194,656 bytes +302 arturo: deploy https://gerrit.wikimedia.org/r/#/c/408529/ to tools-services-01

5 February 2018

  • curprev 17:5817:58, 5 February 2018imported>Stashbot 194,354 bytes +325 arturo: publishing/unpublishing trusty-tools repo in tools-services-01 to address T186539

3 February 2018

  • curprev 01:0401:04, 3 February 2018imported>Stashbot 194,029 bytes +129 chicocvenancio: killed io intensive process in bastion-03 "vltools python3 ./broken_ref_anchors.py"

31 January 2018

29 January 2018

28 January 2018

  • curprev 22:4922:49, 28 January 2018imported>Stashbot 193,665 bytes +165 chicocvenancio: killed compromised session generating miner processes

27 January 2018

  • curprev 00:5500:55, 27 January 2018imported>Stashbot 193,500 bytes +209 arturo: at tools-static-11 the kernel OOM killer stopped git gc at about 20% :-(

25 January 2018

  • curprev 23:4723:47, 25 January 2018imported>Stashbot 193,291 bytes +422 arturo: fix last deprecation warnings in tools-elastic-03, tools-elastic-02, tools-proxy-01 and tools-proxy-02 by replacing by hand configtimeout with http_configtimeout in /etc/puppet/puppet.conf

23 January 2018

22 January 2018

  • curprev 18:3218:32, 22 January 2018imported>Stashbot 192,662 bytes +538 arturo: T181948 T185314 deploying jobutils and misctools v1.28 in the cluster

19 January 2018

18 January 2018

  • curprev 16:1116:11, 18 January 2018imported>Stashbot 191,768 bytes +877 arturo: aborrero@tools-clushmaster-01:~$ sudo aptitude purge vblade vblade-persist runit (for something similar to T182781)

17 January 2018

  • curprev 18:4818:48, 17 January 2018imported>Stashbot 190,891 bytes +692 arturo: aborrero@tools-clushmaster-01:~$ clush -w @all 'apt-show-versions | grep upgradeable | grep trusty-wikimedia' | tee pending-upgrades-report-trusty-wikimedia.txt

16 January 2018

  • curprev 22:0122:01, 16 January 2018imported>Stashbot 190,199 bytes +3,082 chasemp: qstat -explain E -xml | grep 'name' | sed 's/<name>//' | sed 's/<\/name>//' | xargs qmod -cq

11 January 2018

  • curprev 20:3320:33, 11 January 2018imported>Stashbot 187,117 bytes +848 andrewbogott: repooling tools-exec-1411, tools-exec-1440, tools-webgrid-lighttpd-1419, tools-webgrid-lighttpd-1420, tools-webgrid-lighttpd-1421

10 January 2018

  • curprev 15:1415:14, 10 January 2018imported>Stashbot 186,269 bytes +1,549 chasemp: tools-clushmaster-01:~$ clush -f 1 -w @k8s-worker "sudo puppet agent --enable && sudo puppet agent --test"

9 January 2018

  • curprev 23:2123:21, 9 January 2018imported>Stashbot 184,720 bytes +2,117 yuvipanda: paws new cluster master is up, re-adding nodes by executing same sequence of commands for upgrading

8 January 2018

  • curprev 20:3420:34, 8 January 2018imported>Stashbot 182,603 bytes +219 madhuvishy: Restart kube services and uncordon tools-worker-1001

6 January 2018

  • curprev 00:3500:35, 6 January 2018imported>Stashbot 182,384 bytes +1,399 madhuvishy: Run `clush -w @paws-worker -b 'sudo iptables -L FORWARD'`

4 January 2018

  • curprev 17:2417:24, 4 January 2018imported>Stashbot 180,985 bytes +120 andrewbogott: rebooting tools-paws-worker-1019 to verify repair of T184018

3 January 2018

31 December 2017

  • curprev 02:0102:01, 31 December 2017imported>Stashbot 180,671 bytes +102 bd808: Killed some pwb.py and qacct processes running on tools-bastion-03

21 December 2017

  • curprev 17:5717:57, 21 December 2017imported>Stashbot 180,569 bytes +199 bd808: PAWS: deleted hub-deployment pod stuck in crashloopbackoff

19 December 2017

18 December 2017

  • curprev 12:0412:04, 18 December 2017imported>Stashbot 180,173 bytes +621 arturo: it seems jupyterhub tries to use a database which doesn't exists: [E 2017-12-18 11:59:49.896 JupyterHub app:904] Failed to connect to db: sqlite:///jupyterhub.sqlite

15 December 2017

14 December 2017

  • curprev 16:5816:58, 14 December 2017imported>Stashbot 179,263 bytes +188 arturo: running clush -w @all 'sudo puppet agent --test' from tools-clushmaster-01.eqiad.wmflabs due to https://gerrit.wikimedia.org/r/#/c/394572/ being merged

13 December 2017

11 December 2017

  • curprev 19:3219:32, 11 December 2017imported>Stashbot 178,492 bytes +261 bd808: git gc on tools-static-11; --aggressive was killed by system (T182604)

1 December 2017

  • curprev 15:3315:33, 1 December 2017imported>Stashbot 178,231 bytes +218 chasemp: put the weird mess of untracked files on tools puppetmaster into stash to see what breaks as they should not be there?

30 November 2017

20 November 2017

  • curprev 20:3420:34, 20 November 2017imported>Stashbot 177,867 bytes +94 chasemp: backup crons tools-cron-01:/var/spool/cron# cp -Rp crontabs/ /root/20112017/
  • curprev 00:5200:52, 20 November 2017imported>Stashbot 177,773 bytes +128 andrewbogott: cherry-picking https://gerrit.wikimedia.org/r/#/c/392172/ onto the tools puppetmaster

17 November 2017

  • curprev 21:3321:33, 17 November 2017imported>Stashbot 177,645 bytes +232 valhallasw`cloud: also g-w'ed those files, and sent emails to all the affected users

16 November 2017

  • curprev 17:4017:40, 16 November 2017imported>Stashbot 177,413 bytes +291 chasemp: tools-clushmaster-01:~$ clush -w @all 'sudo puppet agent --enable && sudo puppet agent --test && sudo unattended-upgrades -d'

15 November 2017

7 November 2017

  • curprev 01:2101:21, 7 November 2017imported>Stashbot 176,922 bytes +290 bd808: Removed all non-directory files from /home (via labstore1004 direct access)

5 November 2017

  • curprev 23:4823:48, 5 November 2017imported>Stashbot 176,632 bytes +391 bd808: Cleaned up 2 huge /tmp files left by tools.croptool (~6.5G)

3 November 2017

2 November 2017

1 November 2017

  • curprev 07:1107:11, 1 November 2017imported>Stashbot 176,084 bytes +213 madhuvishy: Clear nscd cache across all projects post labsdb dns switchover T179464

31 October 2017

  • curprev 16:5016:50, 31 October 2017imported>Stashbot 175,871 bytes +93 bd808: tools-bastion-03 (tools-login, login.tools) is overloaded

30 October 2017

  • curprev 17:3517:35, 30 October 2017imported>Stashbot 175,778 bytes +485 madhuvishy: Clear dns caches across tools hosts `sudo nscd -i hosts`

24 October 2017

  • curprev 18:0918:09, 24 October 2017imported>Stashbot 175,293 bytes +201 madhuvishy: Disable puppet on tools-package-builder-01 temporarily (T178920)

23 October 2017

  • curprev 14:4914:49, 23 October 2017imported>Stashbot 175,092 bytes +92 chasemp: wall message and scheduled reboot in 5m for bastion-03

18 October 2017

  • curprev 21:3621:36, 18 October 2017imported>Stashbot 175,000 bytes +275 chasemp: stop basebot -- it is going crazy and spamming email w/ failing to log to error.log. Need to figure out how to notify but it's clearly in a failure loop.

12 October 2017

  • curprev 16:5716:57, 12 October 2017imported>Stashbot 174,725 bytes +163 bd808: Rebuilding all Kubernetes Docker images to include toollabs-webservice 0.38

6 October 2017

5 October 2017

  • curprev 15:4615:46, 5 October 2017imported>Stashbot 174,353 bytes +169 chasemp: tools-bastion-03 has tons of local tools running long lived NFS intensive processes. I'm rebooting rather than playing whackamole.

3 October 2017

  • curprev 19:3019:30, 3 October 2017imported>Stashbot 174,184 bytes +103 bd808: `kubectl --namespace=prod delete pod --all` on tools-paws-master-01

1 October 2017

  • curprev 21:4621:46, 1 October 2017imported>Stashbot 174,081 bytes +108 madhuvishy: Cold migrating tools-clushmaster-01 from labvirt1015 to labvirt1017

29 September 2017

25 September 2017

  • curprev 15:1415:14, 25 September 2017imported>Stashbot 173,885 bytes +224 andrewbogott: rebooting tools-paws-worker-1006 since I can't access it

20 September 2017

  • curprev 16:5216:52, 20 September 2017imported>Stashbot 173,661 bytes +922 madhuvishy: apt-get install --only-upgrade apache2; service apache2 restart on tools-puppetmaster-01

13 September 2017

31 August 2017

  • curprev 20:3320:33, 31 August 2017imported>Stashbot 172,342 bytes +503 madhuvishy: Updated certs and ran puppet, restarted nginx on tools-proxy-* and tools-static-* (T174611)

24 August 2017

22 August 2017

  • curprev 19:2019:20, 22 August 2017imported>Stashbot 171,699 bytes +108 andrewbogott: deleted tools-puppetmaster-02, it was replaced a month ago by -01

12 August 2017

11 August 2017

10 August 2017

  • curprev 14:5914:59, 10 August 2017imported>Stashbot 171,463 bytes +117 chasemp: 'become stimmberechtigung && restart' && 'become intersect-contribs && restart'

9 August 2017

3 August 2017

  • curprev 00:4700:47, 3 August 2017imported>Stashbot 171,272 bytes +244 bd808: tools-bastion-03 not usably responsive to interactive commands; will reboot

31 July 2017

  • curprev 15:2815:28, 31 July 2017imported>Stashbot 171,028 bytes +82 chasemp: remove python-keystoneclient from bastion-03

27 July 2017

  • curprev 23:2723:27, 27 July 2017imported>Stashbot 170,946 bytes +353 bd808: Killed python procs owned by sdesabbata on tools-login that were stealing all cpu/io

26 July 2017

  • curprev 22:3322:33, 26 July 2017imported>Stashbot 170,593 bytes +95 chasemp: hotpatching an hiera value on tools master to see effects

20 July 2017

  • curprev 19:4819:48, 20 July 2017imported>Stashbot 170,498 bytes +694 bd808: Clearing all Eqw state jobs in all queues with: qstat -u '*' | grep Eqw | awk '{print $1;}' | xargs -L1 qmod -cj

19 July 2017

  • curprev 23:5223:52, 19 July 2017imported>Stashbot 169,804 bytes +302 bd808: Restarted cron on tools-cron-01; toolschecker job showing user not found errors

18 July 2017

  • curprev 19:5119:51, 18 July 2017imported>Stashbot 169,502 bytes +112 andrewbogott: enabling puppet on tools-proxy-02. I don't know why it was disabled.

17 July 2017

  • curprev 01:4301:43, 17 July 2017imported>Stashbot 169,390 bytes +182 bd808: Uncordoned tools-worker-1020 after it deleted pods with local storage that were filling the entire disk

13 July 2017

12 July 2017

7 July 2017

  • curprev 18:2618:26, 7 July 2017imported>Stashbot 168,710 bytes +88 bd808: Forced puppet runs on tools-redis-* for security fix

3 July 2017

  • curprev 04:2604:26, 3 July 2017imported>Stashbot 168,622 bytes +224 bd808: cdnjs on tools-static-10 is up to date

1 July 2017

  • curprev 19:4019:40, 1 July 2017imported>Stashbot 168,398 bytes +175 bd808: Disabled puppet on tools-k8s-master-01 to try and fix maintain-kubeusers

30 June 2017

  • curprev 01:3301:33, 30 June 2017imported>Stashbot 168,223 bytes +144 chasemp: time for i in `cat tools-hosts`; do ssh -i ~/.ssh/labs_root_id_rsa root@$i.eqiad.wmflabs 'hostname -f; uptime; tc-setup'; done
  • curprev 01:2901:29, 30 June 2017imported>Stashbot 168,079 bytes +1,063 andrewbogott: rebooting tools-cron-01

27 June 2017

  • curprev 21:3221:32, 27 June 2017imported>Stashbot 167,016 bytes +128 andrewbogott: moving all tools nodes to new puppetmaster, tools-puppetmaster-01.tools.eqiad.wmflabs

25 June 2017

24 June 2017

  • curprev 16:0116:01, 24 June 2017imported>Stashbot 166,810 bytes +133 bd808: Created and provisioned elasticsearch password for tools.wmde-uca-test (T167971)

23 June 2017

  • curprev 20:2020:20, 23 June 2017imported>Stashbot 166,677 bytes +175 bd808: Reindexing various elasticsearch indexes created before we upgraded to v2.x

22 June 2017

  • curprev 17:0317:03, 22 June 2017imported>Stashbot 166,502 bytes +287 bd808: Rolled back attempt at Elasticsearch upgrade. Indices need to be rebuilt with 2.x before 5.x can be installed. T164842
  • curprev 00:1200:12, 22 June 2017imported>Stashbot 166,215 bytes +3,075 bd808: Set ownership and permissions on $HOME/.kube for all tools (T165875)

14 June 2017

  • curprev 22:0922:09, 14 June 2017imported>Stashbot 163,140 bytes +83 bd808: Restarted apache2 proc on tools-puppetmaster-02

8 June 2017

  • curprev 18:1418:14, 8 June 2017imported>Stashbot 163,057 bytes +369 madhuvishy: Also delete from /tmp on tools-webgrid-lighttpd-1411 xvfb-run.*, calibre_* and ws-*.epub

7 June 2017

  • curprev 19:0519:05, 7 June 2017imported>Stashbot 162,688 bytes +94 madhuvishy: Killed scp job run by user torin8 on tools-bastion-02

6 June 2017

  • curprev 20:3020:30, 6 June 2017imported>Stashbot 162,594 bytes +142 chasemp: rebooting tools-bastion-02 as unresponsive (up 76 days and lots of seemingly left behind things running)

5 June 2017

  • curprev 23:4423:44, 5 June 2017imported>Stashbot 162,452 bytes +390 bd808: Deleted tools.iabot crontab that somehow got locally installed on tools-exec-1412 on 2017-05-24T20:55Z

1 June 2017

  • curprev 15:1515:15, 1 June 2017imported>Stashbot 162,062 bytes +124 andrewbogott: depooling/rebooting/repooling tools-exec-1403 as part of old kernel-purge testing

31 May 2017

  • curprev 19:2919:29, 31 May 2017imported>Stashbot 161,938 bytes +668 bd808: Rebuiding all Docker images to pick up toollabs-webservice v0.37 (T163355)

30 May 2017

  • curprev 22:3222:32, 30 May 2017imported>Stashbot 161,270 bytes +1,004 andrewbogott: migrating tools-webgrid-lighttpd-1406, tools-exec-1410 from labvirt1006 to labvirt1009 to balance cpu usage

26 May 2017

  • curprev 20:3220:32, 26 May 2017imported>Stashbot 160,266 bytes +160,266 bd808: Added tools-webgrid-lighttpd-14{19,2[0-8]} as submit hosts
(newest | oldest) View ( | older 100) (20 | 50 | 100 | 250 | 500)