Managing repositories
Elasticsearch provides a built-in system to rapidly ot and restore your data. When working with live data, keeping a backup is complex, due to the large number of concurrency problems.
An Elasticsearch snapshot allows for the creation of snapshots of individual indices (or aliases), or an entire cluster, into a remote repository.
Before starting to execute a snapshot, a repository must be created--this is where your backups/snapshots will be stored.
Getting ready
You need an up-and-running Elasticsearch installation as we described in the Downloading and installing Elasticsearch recipe in Chapter 2, Downloading and Setup.
To execute curl
via the command line you need to install curl
for your operating system.
We need to edit config/elasticsearch.yml
and add the directory of your backup repository:
path.repo: /tmp/
For our examples, we'll be using the /tmp
directory available in every Unix system. Generally, in a production cluster, this directory should be a shared repository...