Summary
Some of the keys to having a successful DR solution are testing, monitoring and alerting, backups, and automation. You can introduce a solution such as a Game Day (where you can simulate the failure and failover to a replica) to exercise a failover in the DR environment. You can also introduce a modern discipline such as chaos engineering, where you can experiment on a software system in production to build confidence that your system can withstand any unexpected conditions. A good monitoring system will enable you to get a notification as soon as your system is down. If you go one step up, then your monitoring system, such as CloudWatch, will be tightly integrated with an automation solution, such as Lambda, which will spin up your new instance. Automation is the key, as it will reduce your downtime. Last but not least, you should take your backup regularly and restore it on a regular basis.
In this chapter, we have learned about the various DR solutions offered by AWS...