Reliability through scale, performance, availability
As discussed earlier, reliability can be broadly defined as the probability that the system will function as per its expected behavior under the specified environmental conditions within a specified time.
As you know from the previous chapters, API platforms are the backbone of the digital channels for an enterprise. Hence, ensuring the reliability of APIs is fundamental to the adoption and usage of digital experiences. In fact, this topic is so important that a complete engineering discipline, named Site Reliability Engineering (SRE), has emerged, and organizations are hiring professionals to tackle the complexity associated with it, to achieve appropriate levels of reliability for their digital services.
In the following sections, we will briefly touch upon the important topics related to SRE. For a more comprehensive study, please make use of the references listed in the Further reading section.
Note:
In this chapter...