Chapter 6. Supervised Machine Learning
Data science is often equated with machine learning, as if the only thing we do in data science is train machine learning models. But data science is much more than that. It involves understanding data, gathering data, munging data, extracting meaning from that data, and then applying machine learning if needed.
In my opinion, machine learning is the most exciting field that exists today. With huge amounts of data readily available, we can extract invaluable knowledge. Many companies have made their machine learning libraries accessible, and many open source alternatives exist as well.
In this chapter, you will study the following topics:
- What is machine learning?
- Types of machine learning
- What are overfitting and underfitting?
- Bias-variance trade-off
- Feature extraction and selection
- Decision trees
- Naïve Bayes classifier