Resource Monitoring and Application Performance Management on Google Cloud
The previous chapter introduced the concepts of DevOps and SRE, highlighting the importance of operational excellence in managing development and technology infrastructure. In this chapter, we’ll dig into operationalizing those practices through resource monitoring and application performance management. We’ll also highlight the impact of downtime on businesses and explore the Google Cloud-specific tools that support monitoring and performance management.
By the end of this chapter, you will be able to do the following:
- Describe the impact of outages on customers
- Understand SRE practices and definitions for operating cloud environments
- Describe Google Cloud tools for observability and application performance practices
This chapter covers the following topics:
- Downtime and its impact on business
- Cloud operations – monitoring, logging, and observability
- Google Cloud...