Redundancy and fault tolerance in compute
Before engaging in an in-depth exploration of resilient architecture design implementation, it is imperative to gain an understanding of the various types of interruptions that have the potential to destabilize a system operating on AWS.
The factors that disrupt the stability of a system may originate from both internal (those that you, as a user, have control over) and external (factors that are controlled by AWS that you have little or no control over) sources. Furthermore, they can be categorized as either controllable or uncontrollable factors. For instance, a sudden surge in usage resulting from the unexpected popularity of a piece of software is an occurrence that can’t be fully anticipated. However, it is also true that proactive planning for such spikes can be accomplished by implementing appropriate measures during the design phase of the application architecture.
Let’s consider some of the important factors that...