Introduction to SRE
The term SRE was first coined by Ben Treynor Sloss at Google (https://sre.google/sre-book/introduction/). SRE has enabled Google to manage large-sized complex systems and massive infrastructure in the most efficient, reliable, scalable, and sustainable way.
Tip
SRE is primarily focused on the reliability of service.
Why is reliability so important?
Reliability is defined as the likelihood of services performing predictably under specified operational conditions. The most dependable systems will be more accessible, which will result in a better client experience. The reliability of your services is an important quality indicator.
Reliability and availability are interlinked; however, the difference is in the way they are measured. Although availability and reliability go hand in hand, the measures taken might produce different results. A system’s availability may be modeled mathematically as a measure of its reliability. In other words...