Writing a MapReduce job for Redis
If you are a big data engineer, Redis can play an important role in your application design and development. In a batch job scenario, you can retrieve data in Redis to perform some complex computing algorithms in a distributed way. For an online query, you may store the resulting dataset on a Redis Server to achieve better performance.
In the last recipes of this chapter, we'll show you how to manipulate data in Redis using MapReduce and Spark, both of which are extremely popular distributed computing frameworks in the big data world.
Getting ready…
You need to finish the installation of the Redis Server as we described in the Downloading and installing Redis recipe in Chapter 1, Getting Started with Redis. You need to use the FLUSHALL
command to flush all the data in your Redis instance before moving on to the next section.
The requirements of IDE and JDK are the same as in the previous recipe, Connecting to Redis with Spring Data Redis.
A Hadoop cluster is...