Scaling is increasing or decreasing the resources available for a microservice. A big part of the appeal of microservices is that we can individually scale services up and down, just how we need them. When the traffic or workload of a microservice increases, the service should get more resources, either in CPU and RAM upgrades or additional servers.
We, of course, would like this to happen automatically, and without us doing anything. In this section, we want to take a look at how scaling and monitoring work with microservices in general.
Let's look into the following topics:
- Monitoring: This tells us when we need to scale.
- Scaling: Adding resources to our setup.
Scaling begins with monitoring, so let's take a closer look.