Postmortem Index

Explore incident reports from various companies

Category

Cloud

Outages of, or caused by, public cloud providers (AWS, GCP, Azure, etc.) and their managed services.

Postmortems
108
Companies
52
Years covered
19
Date range
Dec 2007 – Oct 2025
Title Company Date Other categories
incident.io service disruption during AWS us-east-1 outage on October 20, 2025 incident.io 2025-10-20
LaunchDarkly service disruption due to AWS us-east-1 outage and internal cascading failures (October 2025) Launchdarkly 2025-10-20 – 2025-10-21
Amazon DynamoDB US-EAST-1 outage of October 2025 Amazon 2025-10-20
Global Google Cloud API outage due to Service Control null pointer exception Google 2025-06-12 – 2025-06-13
Amazon Kinesis Data Streams US-EAST-1 Degradation July 2024 Amazon 2024-07-30 – 2024-07-31
Turso free tier data leak and loss Turso 2023-12-01 – 2023-12-04
Cloudflare 1.1.1.1 lookup failures on October 4, 2023 Cloudflare 2023-10-04
Honeycomb total outage on July 25th, 2023 Honeycomb 2023-07-25
AWS Lambda Service Event in Northern Virginia (US-EAST-1) Region on June 13th, 2023 Amazon 2023-06-13
CircleCI jobs not starting due to Kubernetes networking failure CircleCI 2023-03-14 – 2023-03-15
Datadog Infrastructure Connectivity Issue March 2023 Datadog 2023-03-08 – 2023-03-10
Cloudflare service token incident on January 24, 2023 Cloudflare 2023-01-24
CircleCI security incident and data exfiltration (December 2022) CircleCI 2022-12-16 – 2022-12-22
Intermittent downtime from repeated crashes incident.io 2022-11-18
BigQuery Storage WriteAPI elevated error rates in US Multi-Region Google 2022-10-13 – 2022-10-14
Cloud Filestore ListInstances API failed with error code 429 globally Google 2022-09-13
Honeycomb Ingest System Outage: Shepherd Cache Delays Honeycomb 2022-09-08
Google Cloud europe-west2 outage due to cooling system failure Google 2022-07-19 – 2022-07-21
Google Cloud Networking, Storage, and BigQuery reduced capacity for lower priority traffic Google 2022-07-15
Google Cloud Networking packet loss May 2022 Google 2022-05-20
Atlassian April 2022 customer site deletion outage Atlassian 2022-04-05 – 2022-04-18
Slack’s Incident on 2-22-22 Slack 2022-02-22
Firefox HTTP/3 network stack outage Firefox 2022-01-13
AWS US-EAST-1 Internal Network Congestion on December 7, 2021 Amazon 2021-12-07 – 2021-12-08
GitHub November 2021 Availability Incident due to MySQL Schema Migration Github 2021-11-27
Google Cloud Networking and Load Balancing outage of November 2021 Google 2021-11-16
Google Cloud Networking issues in Europe and other regions on November 12, 2021 Google 2021-11-12
AWS Direct Connect disruption in Tokyo (AP-NORTHEAST-1) on September 2, 2021 Amazon 2021-09-01 – 2021-09-02
Delay in starting Docker Jobs. Machine & remote Docker environments blocked CircleCI 2021-05-21 – 2021-05-22
Slack Outage on January 4th 2021 Slack 2021-01-04
Amazon Kinesis US-EAST-1 outage November 2020 Amazon 2020-11-25 – 2020-11-26
Datadog US region infrastructure connectivity issue DataDog 2020-09-24 – 2020-09-25
PythonAnywhere storage volume failure on 7 July 2020 PythonAnywhere 2020-07-07
Flowdock outage and cross-organization data leak Broadcom (CA Technologies) 2020-04-21 – 2020-04-22
Zerodha Order Management System overload on August 29, 2019 Zerodha 2019-08-29
Amazon EC2 and EBS Issues in Tokyo (AP-NORTHEAST-1) on August 23, 2019 Amazon 2019-08-23
Google Cloud Network Outage in Eastern USA, June 2019 Google 2019-06-02
Google Cloud internal blob storage disruption March 2019 Google 2019-03-13
Elastic Cloud AWS us-east-1 outage of February 2019 Elastic 2019-02-04
Amazon EC2 DNS Resolution Issues in AP-NORTHEAST-2 Amazon 2018-11-21 – 2018-11-22
GitHub October 2018 Service Degradation due to MySQL Failover GitHub 2018-10-21 – 2018-10-22
Trading and hanging orders on 12th April 2018 Zerodha 2018-04-12
Travis CI production database truncation TravisCI 2018-03-13
Fortnite service outages of February 3-4, 2018 Epic Games 2018-02-03 – 2018-02-05
Unavailable Guilds & Connection Issues Discord 2017-10-13
GoCardless API and Dashboard outage on 10 October 2017 GoCardless 2017-10-10
Google Cloud HTTP(S) Load Balancer 502 errors on April 5, 2017 Google 2017-04-05
Discord Connectivity Issues (March 2017) Discord 2017-03-20
Square service disruption of March 16, 2017 Square 2017-03-16
Amazon S3 US-EAST-1 outage of February 2017 Amazon 2017-02-28
Instapaper AWS RDS MySQL 2TB File Size Limit Outage Instapaper 2017-02-09 – 2017-02-14
Travis CI container-based Linux builds outage due to worker rollback failure TravisCI 2017-02-02 – 2017-02-05
GitLab.com database outage of January 31, 2017 Gitlab 2017-01-31 – 2017-02-01
Google Compute Engine, Cloud VPN, and Network Load Balancer connectivity issues Google 2017-01-30
Buildkite outage of August 22nd, 2016 Buildkite 2016-08-22
Reddit outage and degraded performance on August 11, 2016 Reddit 2016-08-11 – 2016-08-12
Tarsnap outage 2016-07-24 Tarsnap 2016-07-24
AWS Sydney Region EC2 and EBS power disruption Amazon 2016-06-05
Google Compute Engine global connectivity loss April 2016 Google 2016-04-12
GitHub January 28th, 2016 datacenter power disruption GitHub 2016-01-28
Google Compute Engine Persistent Disk issue in europe-west1-b Google 2015-08-13 – 2015-08-16
EVE Online long downtime on July 15th, 2015 CCP Games 2015-07-15
Azure Storage service interruption Microsoft 2014-11-19
Azure Storage service interruption November 2014 Microsoft 2014-11-19
BrowserStack security incident due to Shellshock vulnerability on prototype machine BrowserStack 2014-11-09 – 2014-11-10
Yeller network partition causes processing delays Yeller 2014-07-29 – 2014-07-30
Google logged-in services outage due to incorrect configuration Google 2014-01-24
AWS SA-EAST-1 Availability Zone Power and Network Incident, December 2013 Amazon 2013-12-18
Stackdriver Intelligent Monitoring application outage on October 23, 2013 Stackdriver 2013-10-23 – 2013-10-26
Healthcare.gov launch failure Centers for Medicare & Medicaid Services (CMS) 2013-10-01 – 2013-12-31
Twilio billing system incident of July 2013 Twilio 2013-07-18 – 2013-07-20
PagerDuty notification dispatch system outage of April 2013 Pagerduty 2013-04-13
Kickstarter MySQL replication failure Kickstarter 2013-03-07
Amazon ELB Service Event in US-East Region on December 24, 2012 Amazon 2012-12-24 – 2012-12-25
AWS US-East Region Service Event of October 22, 2012 Amazon 2012-10-22 – 2012-10-23
Netflix's response to October 2012 AWS EBS degradation Netflix 2012-10-22
Knight Capital SMARS algorithmic trading incident of August 2012 Knight Capital 2012-08-01
Linux kernel leap second futex timer issue Linux 2012-07-01
AWS US East-1 power failure and service disruption in June 2012 Amazon 2012-06-30
Windows Azure Service Disruption on Feb 29th, 2012 Azure 2012-02-29 – 2012-03-01
Amazon EC2 and Amazon RDS Service Disruption in US East Region Amazon 2011-04-21 – 2011-04-24
Linux kernel leap second deadlock crash on New Year's 2008-2009 Linux 2009-01-01
Amazon S3 Availability Event: July 20, 2008 Amazon 2008-07-20
EVE Online: Trinity installer deletes boot.ini CCP Games 2007-12-05 – 2007-12-06
Amazon EC2, EBS, and RDS EU West Region Service Event Amazon
Amazon SimpleDB US East Region Disruption on June 13 Amazon
Untitled postmortem Elastic
Engineering Archives Heroku
Untitled postmortem Etsy
Etsy site outage caused by multicast rsync Etsy
EVE Online Stackless Python tasklet memory reuse bug CCP Games
Foursquare MongoDB memory exhaustion outage Foursquare
GitHub availability incidents in February and March 2026 GitHub
Google Cloud GCVE deletion incident impacting UniSuper Google
Google Code Jam 2014 Repeated Email Incident Google
Honeycomb operational burden and scaling issues in September and October Honeycomb
Honeycomb query performance and alerting incident (August 2022) Honeycomb
How I Broke `git push heroku main` Heroku
incident.io GKE Dataplane V2 `anetd` CPU saturation causes connection timeouts incident.io
Incident.io intermittent database connection pool timeouts incident.io
Razorpay RDS Multi-AZ Failover and Data Loss in December 2019 Razorpay
Untitled postmortem Salesforce
Steam client recursively deleted user files on Linux Valve
Untitled postmortem Stripe
Summary of the Amazon DynamoDB Service Disruption and Related Impacts in the US-East Region Amazon
Supermarket Intermittent Unresponsiveness Chef.io
TUI reservation system miscalculates G-TAWG takeoff weight TUI
Untitled postmortem WebKit code repository