This chapter introduces the data developer to mining (not to be confused with querying) data, providing an understanding of exactly what data mining is and why it is an integral part of data science.
We'll provide working examples to help the reader feel comfortable using R for the most common statistical data mining methods: dimensional reduction, frequent patterns, and sequences.
In this chapter, we've broken things into the following topics:
- Definition and purpose of data mining
- Preparing the developer for data mining rather than data querying
- Using R for dimensional reduction, frequent patterns, and sequence mining