Running distributed applications in production
So far, we’ve been discussing KPIs for running an application in production, taking inspiration from SRE principles. Now, let’s understand how we will put these thoughts in a single place to run a distributed application in production.
A distributed application or a microservice is inherently different from a monolith. While managing a monolith revolves around ensuring all operational aspects of one application, the complexity increases manyfold with microservices. Therefore, we should take a different approach to it.
From the perspective of SRE, running a distributed application in production entails focusing on ensuring the application’s reliability, scalability, and performance. Here’s how SREs approach this task:
- SLOs: SREs begin by defining clear SLOs that outline the desired level of reliability for the distributed application. SLOs specify the acceptable levels of latency, error rates, and...