We have seen the use of Compute Engine as IaaS, including Kubernetes clusters, which also rely on Compute Engine VM instances, and persistent disks. We don't have to worry about issues such as the proper distribution of internet traffic, requests, or processing when we use only one VM instance. Even in the case of overload, we can always scale our instances vertically and fulfill the processing requirements. But when we use more than one VM for a heavily accessed application or multi-tier application, load balancing becomes essential.
We may have to guide the traffic to the machines serving the corresponding tier, or we may have to guide it towards the same tier, while making sure that none of our machines get overloaded while other machines are chilling out. Fortunately, GCP provisions quite a sophisticated load balancer, which allows the routing of internet...