Gremlin is a Chaos Engineering service that prevents outages and builds more reliable systems. In this section, we will run chaos attacks in production to validate the resiliency of the systems using Gremlin. You will learn how to create CPU and node shutdown attacks to test the resiliency of your infrastructure.
Getting ready
For this recipe, we need to have the Kubernetes command-line tool, kubectl, and helm installed.
All the operations mentioned here require a Gremlin account. If you don't have one, go to https://app.gremlin.com/signup and create one.
How to do it…
This section is further divided into the following subsections to make this process easier:
- Setting up Gremlin credentials
- Installing Gremlin on Kubernetes
- Creating a CPU attack against a Kubernetes worker
- Creating a node shutdown attack against a Kubernetes worker
- Running predefined scenario-based attacks
- Deleting Gremlin from your cluster