In this chapter, you learned about various principles for making your system reliable. These principles include making your system self-healing by applying rules of automation, and reducing the impact of a failure by designing a distributed system in which the workload spans multiple resources.
Overall system reliability depends heavily on your system's availability and its ability to recover from disaster events. You learned about synchronous and asynchronous data replication and how each type affects your system's reliability. You also learned about various data replication methods, including array-based, network-based, host-based, and hypervisor-based replication. Each method has its pros and cons, and products from multiple vendors are available to achieve the desired data replication.
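As a reminder of why the replication type matters, the following is a minimal sketch, not a real replication product. It assumes a toy in-memory primary and replica, and every name in it (`ReplicatedStore`, `replicate_pending`, `records_lost_if_primary_fails`) is hypothetical. It only illustrates that a synchronous write reaches the replica before it is acknowledged, while an asynchronous write can leave a backlog that is lost if the primary fails.

```python
from collections import deque


class ReplicatedStore:
    """Toy primary/replica pair; all names here are hypothetical."""

    def __init__(self, synchronous: bool):
        self.synchronous = synchronous
        self.primary = []
        self.replica = []
        self.pending = deque()  # writes not yet copied to the replica

    def write(self, record: str) -> None:
        self.primary.append(record)
        if self.synchronous:
            # Synchronous: the replica has the record before the write returns.
            self.replica.append(record)
        else:
            # Asynchronous: return immediately and copy to the replica later.
            self.pending.append(record)

    def replicate_pending(self) -> None:
        """Drain the backlog, as a background replication job would."""
        while self.pending:
            self.replica.append(self.pending.popleft())

    def records_lost_if_primary_fails(self) -> int:
        """Records present on the primary but missing from the replica."""
        return len(self.primary) - len(self.replica)


sync_store = ReplicatedStore(synchronous=True)
async_store = ReplicatedStore(synchronous=False)
for store in (sync_store, async_store):
    for i in range(3):
        store.write(f"order-{i}")

print(sync_store.records_lost_if_primary_fails())   # 0
print(async_store.records_lost_if_primary_fails())  # 3
async_store.replicate_pending()
print(async_store.records_lost_if_primary_fails())  # 0
```

The sketch leaves out the cost side of the trade-off: in a real system, the synchronous path adds the replica's round-trip time to every write, which is why asynchronous replication is often preferred over long distances.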
You learned about various disaster planning methods, chosen according to the organization's needs and its recovery time objective (RTO) and recovery point objective (RPO). You learned about the backup and restore method, which has high RTO and RPO...