Coinbase released a post-mortem report on the large-scale service outage on May 7, stating that the failure was caused by a cooling system failure at the AWS data center in the US East region, leading to cabinet thermal protection shutdowns and affecting multiple internet services. This incident caused core functions such as Coinbase trading, deposits, and withdrawals to be interrupted for about 8 hours, with full recovery taking approximately 12 hours. Coinbase stated that the event exposed deficiencies in its cross-availability zone automatic failover and middleware disaster recovery capabilities, and will upgrade its cross-region hot standby architecture, strengthen failure drills, and expand the Kafka system from dual-availability zone deployment to triple-availability zone deployment.

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned