Chapter 4: Building SRE Teams and Applying Cultural Practices
The last three chapters introduced the fundamentals of Site Reliability Engineering (SRE), traced its origins, laid out how SRE is different than DevOps, introduced SRE jargon along with its key technical practices such as Service Level Agreements (SLAs), Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets, and focused on monitoring and alerting concepts to target reliability.
This chapter will focus on the fundamentals required to build SRE teams and apply cultural practices such as handling facets of incident management, being on call, achieving psychological safety, promoting communication, collaboration and knowledge sharing. These fundamentals and cultural practices can be used as a blueprint for teams or organizations that want to start their SRE journey.
In this chapter, we're going to cover the following main topics:
- Building SRE teams – Staffing, creating...