Chapter 9: Getting Started with Machine Learning in Python
This chapter will expose us to the vernacular of machine learning and the common tasks that machine learning can be used to solve. Afterward, we will learn how we can prepare our data for use in machine learning models. We have discussed data cleaning already, but only for human consumption—machine learning models require different preprocessing (cleaning) techniques. There are quite a few nuances here, so we will take our time with this topic and discuss how we can use scikit-learn
to build preprocessing pipelines that streamline this procedure, since our models will only be as good as the data they are trained on.
Next, we will walk through how we can use scikit-learn
to build a model and evaluate its performance. Scikit-learn has a very user-friendly API, so once we know how to build one model, we can build any number of them. We won't be going into any of the mathematics behind the models; there are entire...