You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Difference between revisions of "SRE/Infrastructure Foundations/OKR"

From Wikitech-static
Jump to navigation Jump to search
imported>Jobo
 
imported>Krinkle
(remove table that was causing weird alignment and semantic/a11y conflict. The placement of this content in a table heading cell had as only visible effect that it was bolded, which I've preserved for now)
 
Line 3: Line 3:
==== For Fiscal Year 2021/2022 The Infrastructure Foundations has a number of focus areas: ====
==== For Fiscal Year 2021/2022 The Infrastructure Foundations has a number of focus areas: ====
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 1: Elevate Wikimedia's infrastructure security'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 1: Elevate Wikimedia's infrastructure security'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px;font-weight:bold">
{|class="sortable"
!


KR 1: Evaluate and select additional DDoS scrubbing provider.
KR 1: Evaluate and select additional DDoS scrubbing provider.
Line 11: Line 9:
KR 2: Implement session termination for applications.
KR 2: Implement session termination for applications.


|-
|}
</div></div></div>
</div></div></div>
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 2: Eliminate errors and delays due to manual interventions while assuring the continuous availability of the environment'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 2: Eliminate errors and delays due to manual interventions while assuring the continuous availability of the environment'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px;font-weight:bold">
{|class="sortable"
!


KR 1: Eliminate errors and delays due to manual interventions while assuring the continuous availability of the environment.
KR 1: Eliminate errors and delays due to manual interventions while assuring the continuous availability of the environment.
Line 23: Line 17:
KR 2: Reduce the manual work required for bios configuration by 33%
KR 2: Reduce the manual work required for bios configuration by 33%


|-
|}
</div></div></div>
</div></div></div>
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 3: Enable infrastructural self governance'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 3: Enable infrastructural self governance'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px;font-weight:bold">
{|class="sortable"
!


KR 1: Framework that would allow SRE upgrade and restart any service and server autonomously.
KR 1: Framework that would allow SRE upgrade and restart any service and server autonomously.


KR 2: Pilot new framework with successful cookbook adoption for three externally owned services.  
KR 2: Pilot new framework with successful cookbook adoption for three externally owned services. \
 
|-
|}
</div></div></div>
</div></div></div>
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 4: Reduce network capacity risks and improve latency.'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 4: Reduce network capacity risks and improve latency.'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px;font-weight:bold">
{|class="sortable"
!


KR 1: Decrease mean latency to the fastest root server by 5 ms for each Mediterranean, African and Middle Eastern country on average.  
KR 1: Decrease mean latency to the fastest root server by 5 ms for each Mediterranean, African and Middle Eastern country on average.  
Line 47: Line 32:
KR 2: Eqiad DC expansion measured by new equipment configured and ready for service.
KR 2: Eqiad DC expansion measured by new equipment configured and ready for service.


|-
|}
</div></div></div>
</div></div></div>
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 5: Reduce technical debt and allow the Wikimedia Foundation to benefit from technical innovations quicker.'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div class="mw-collapsible mw-collapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #07689f">'''Objective 5: Reduce technical debt and allow the Wikimedia Foundation to benefit from technical innovations quicker.'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px">
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px;font-weight:bold">
{|class="sortable"
!


KR 1: Established process for tracking OS updates in line with our policy resulting with: a. Capability to generate on fly report of each service OS version and days to EOL tracker. b. Stakeholders approved agreement on OS upgrade policy.
KR 1: Established process for tracking OS updates in line with our policy resulting with: a. Capability to generate on fly report of each service OS version and days to EOL tracker. b. Stakeholders approved agreement on OS upgrade policy.
Line 65: Line 46:
KR5: Setup a test cluster for Ganeti and use it to test the update to Debian 10  
KR5: Setup a test cluster for Ganeti and use it to test the update to Debian 10  


|-
</div>
|}

Latest revision as of 22:44, 13 September 2021

For Fiscal Year 2021/2022 The Infrastructure Foundations has a number of focus areas:

Objective 1: Elevate Wikimedia's infrastructure security

KR 1: Evaluate and select additional DDoS scrubbing provider.

KR 2: Implement session termination for applications.

Objective 2: Eliminate errors and delays due to manual interventions while assuring the continuous availability of the environment

KR 1: Eliminate errors and delays due to manual interventions while assuring the continuous availability of the environment.

KR 2: Reduce the manual work required for bios configuration by 33%

Objective 3: Enable infrastructural self governance

KR 1: Framework that would allow SRE upgrade and restart any service and server autonomously.

KR 2: Pilot new framework with successful cookbook adoption for three externally owned services. \

Objective 4: Reduce network capacity risks and improve latency.

KR 1: Decrease mean latency to the fastest root server by 5 ms for each Mediterranean, African and Middle Eastern country on average.

KR 2: Eqiad DC expansion measured by new equipment configured and ready for service.

Objective 5: Reduce technical debt and allow the Wikimedia Foundation to benefit from technical innovations quicker.

KR 1: Established process for tracking OS updates in line with our policy resulting with: a. Capability to generate on fly report of each service OS version and days to EOL tracker. b. Stakeholders approved agreement on OS upgrade policy.

KR 2: Two main mail servers/MXes migrated to Bullseye.

KR3: Reduce the risk of introducing breaking changes in WMCS (WikiMedia Cloud Services) while making changes on production puppet.

KR4: Prepare plan for our infrastructure resource modelling application resulted by selection of one IRM and documented evaluation of improvements.

KR5: Setup a test cluster for Ganeti and use it to test the update to Debian 10