How long has cross-region hot standby been promised? When a real incident hits, recovery still depends on manually migrating partitions. Disaster recovery drills need more funding and more rigor.

CoinNetwork
Coinbase releases May outage incident review report, exposing architectural risks
CryptoWorld News: Coinbase suffered a major outage on May 7, 2026 that lasted approximately 8 hours, with full recovery taking about 12 hours. A cooling system failure in an AWS us-east-1 availability zone took EC2/EBS offline and affected multiple services. The matching engine lost a majority of its nodes and therefore lost quorum, so the node group had to be rebuilt and code changes made. Separately, a failure in the managed Kafka control plane prevented automatic election of partition leaders, disrupting quotes and data streams; service was restored only after partitions were migrated manually. Coinbase stated it will strengthen cross-region hot standby and disaster recovery drills.
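To illustrate why losing a majority of nodes halts a quorum-based cluster, here is a minimal sketch of the usual strict-majority rule. The cluster sizes and function names are hypothetical; the report as summarized here does not give the matching engine's node count or its consensus implementation.

```python
def has_quorum(healthy_nodes: int, cluster_size: int) -> bool:
    """Return True if a strict majority of the cluster is still healthy."""
    return healthy_nodes > cluster_size // 2


def max_tolerated_failures(cluster_size: int) -> int:
    """Largest number of node failures that still leaves a strict majority."""
    return (cluster_size - 1) // 2


if __name__ == "__main__":
    cluster_size = 5  # hypothetical size, not taken from the report
    for failed in range(cluster_size + 1):
        healthy = cluster_size - failed
        print(f"failed={failed} healthy={healthy} quorum={has_quorum(healthy, cluster_size)}")
```

Under this rule, a cluster whose nodes mostly sit in one availability zone loses quorum as soon as that zone goes down, which is the case for spreading nodes across zones or regions.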