This chapter explains how to design and architect machine learning applications on the Hadoop platform. It addresses some of the common machine learning challenges you may face in Hadoop and how to solve them. We will walk through different machine learning libraries and processing engines, cover the common steps involved in machine learning, and elaborate on those steps with a case study.
In this chapter, we will cover the following topics:
- Machine learning steps
- Common machine learning challenges
- Spark machine learning
- Hadoop and R
- Mahout
- Case study in Spark