Chapter 4. Intermediate Examples
This chapter walks through real Scalding implementations of non-trivial applications, using the operations presented in the previous chapter. We cover the data analysis, design, implementation, and optimization of data-transformation jobs for two applications:
- Logfile analysis
- Ad targeting
Analyzing logfiles that have been stored for some time is a common first application for a new Hadoop team in an organization. The value to extract from the logfiles depends on the use case. Our example is a case that requires careful thought about how to manage the data.
The second example, ad targeting, makes us look at how to structure and store the data so that we can run daily jobs. It involves input from data scientists and deep analysis of customer behavior in order to recommend personalized advertisements.