Scaling applications and achieving higher availability
The Kubernetes container orchestration platform provides a wide range of functionality to help us deploy our applications in a scalable and highly available way. When designing architecture that will support horizontally scalable services and applications, we need to be aware of some common strategies that will help to successfully scale our applications on Kubernetes clusters.
In the previous section, Learning application deployment strategies, we covered some strategies that would help us to scale our applications, including deployment strategies and implementing health checks using container probes. In this section, we will learn about scaling applications using the Horizontal Pod Autoscaler (HPA).
When we first deploy our application on Kubernetes clusters, applications will very likely not get accessed immediately and usage will gradually increase over time. In that case, rolling out a deployment with many replicas...