You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Incident documentation/2021-09-26 appserver latency: Difference between revisions

From Wikitech-static
Jump to navigation Jump to search
imported>Krinkle
(Created page with "{{irdoc|status=review}} ==Summary== Increased db load for enwiki (s1) resulted in slower responses, which in turn resulted in overall php-fpm worker limits being reached and thus affecting requests for all wikis. For requests above the limit, the error was "''upstream connect error or disconnect/reset before headers. reset reason: overflow''". '''Impact''': For about 15 minutes, backend appservers were slower or unable to respond for all wikis. This mainly affected logg...")
 
imported>Krinkle
 
Line 1: Line 1:
{{irdoc|status=review}}
#REDIRECT [[Incidents/2021-09-26 appserver latency]]
==Summary==
Increased db load for enwiki (s1) resulted in slower responses, which in turn resulted in overall php-fpm worker limits being reached and thus affecting requests for all wikis. For requests above the limit, the error was "''upstream connect error or disconnect/reset before headers. reset reason: overflow''".
 
'''Impact''': For about 15 minutes, backend appservers were slower or unable to respond for all wikis. This mainly affected logged-in users and most bot/API queries. Some page views from unregistered users were affected, for pages that were recently edited or otherwise expired from the CDN cache. 
 
'''Documentation''':
 
* Public incident task: [[phab:T291767|T291767]]
 
* Similar to [[Incident documentation/2021-09-04 appserver latency]] and [[Incident documentation/2021-09-18 appserver latency]].
 
==Actionables==
 
* [[phab:T291767|T291767]] (restricted)
* [[phab:T251885|T251885]] (restricted)

Latest revision as of 17:49, 8 April 2022