Packt+ | Advance your knowledge in tech

You're reading from The DevOps 2.1 Toolkit: Docker Swarm The next level of building reliable and scalable software unleashed

Product type Paperback

Published in May 2017

Publisher

ISBN-13 9781787289703

Length 436 pages

Edition 1st Edition

Tools

Docker

Concepts

DevOps

Author (1):

Viktor Farcic

View More author details

Table of Contents (17) Chapters

Preface

1. Continuous Integration with Docker Containers FREE CHAPTER

2. Setting Up and Operating a Swarm Cluster

3. Docker Swarm Networking and Reverse Proxy

4. Service Discovery inside a Swarm Cluster

5. Continuous Delivery and Deployment with Docker Containers

6. Automating Continuous Deployment Flow with Jenkins

7. Exploring Docker Remote API

8. Using Docker Stack and Compose YAML Files to Deploy Swarm Services

9. Defining Logging Strategy

10. Collecting Metrics and Monitoring the Cluster

11. Embracing Destruction: Pets versus Cattle

What now?

12. Creating and Managing a Docker Swarm Cluster in Amazon Web Services

13. Creating and Managing a Docker Swarm Cluster in DigitalOcean

14. Creating and Managing Stateful Services in a Swarm Cluster

15. Managing Secrets in Docker Swarm Clusters

16. Monitor Your GitHub Repos with Docker and Prometheus

Failover

Fortunately, failover strategies are part of Docker Swarm. Remember, when we execute a service command, we are not telling Swarm what to do but the state we desire. In turn, Swarm will do its best to maintain the specified state no matter what happens.

To test a failure scenario, we'll destroy one of the nodes:

docker-machine rm -f node-3

Swarm needs a bit of time until it detects that the node is down. Once it does, it will reschedule containers. We can monitor the situation through service ps command:

docker service ps go-demo

The output (after rescheduling) is as follows (ID is removed for brevity):

As you can see, after a short period, Swarm rescheduled containers among healthy nodes (node-1 and node-2) and changed the state of those that were running on the failed node to Shutdown. If your output still shows that some instances are running on the node-3, please wait for a few moments and repeat the service ps command.