You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org
Server Lifecycle/reclaim checklist
< Server LifecycleJump to navigation Jump to search
Revision as of 17:18, 7 August 2018 by (make it more copypaste friendly)
This checklist is able to be copied and pasted into phabricator hardware request tasks for reclaiming systems to spare or decom.
 - all system services confirmed offline from production use  - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.  - remove system from all lvs/pybal active configuration  - any service group puppet/hiera/dsh config removed  - remove site.pp (replace with role::spare::system if system isn't shut down immediately during this process.) START NON-INTERRUPPTABLE STEPS  - disable puppet on host  - remove all remaining puppet references (include role::spare)  - power down host  - disable switch port  - switch port assignment noted on this task (for later removal)  - remove production dns entries  - puppet node clean, puppet node deactivate END NON-INTERRUPPTABLE STEPS  - system disks wiped (by onsite)  - IF DECOM: system unracked and decommissioned (by onsite), update racktables with result  - IF DECOM: switch port configration removed from switch once system is unracked.  - IF DECOM: mgmt dns entries removed.  - IF RECLAIM: system added back to spares tracking (by onsite)