Preface
Site reliability engineering relates to constant improvement, bridging business and product issues as per customer requirements and technology limitations, thereby generating higher revenue. Quantifying and understanding reliability, resource handling, and developer needs can sometimes be overwhelming. Becoming a Rockstar SRE explores reliability from an infrastructure and coding perspective and uses real-world examples to bring forth the site reliability engineer (SRE) persona.
This book will acquaint you with who an SRE is, followed by discussions on the why and how of site reliability engineering. It walks you through the jobs of an SRE, from automation of continuous integration/continuous delivery (CI/CD) pipelines and reducing toil to the details of reliability and the best practices to excel in it. You’ll learn why harmful code is created and how to circumvent that with reliable designs and patterns. You’ll explore how to interact and negotiate with businesses and vendors on various technical matters. You’ll then deep dive into observability, outage, and why and how to craft an excellent runbook. Finally, you’ll learn how to elevate your site reliability engineering career, including certifications, interview tips, and questions.
By the end of this book, you’ll be able to identify and measure reliability, reduce downtime, troubleshoot outages, and enhance productivity to become a true rockstar SRE!