In Chapter 10, Optimizing Observability, we covered work and resource metrics and learned how they provide autonomous teams with the information they need to fail forward fast.

Throughout this book, we have focused on creating bulkheads within our systems to control the blast radius when things go wrong. Our autonomous services have inbound and outbound bulkheads to protect them from upstream and downstream services, and we use the natural bulkheads of cloud accounts to protect autonomous subsystems from each other.

Now we turn our attention to multi-regional deployment. We will use cloud regions as another bulkhead to protect our systems from cloud provider disruptions. We will learn how to fail over fast from an unhealthy region to a healthy region, so that our systems continue to run without downtime.

In this chapter, we're going to cover the following main topics:
- Justifying multi-regional deployment
- Preparing for regional...