Reviewing the scikit-learn package
scikit-learn is a very popular Python package for machine learning. You have already encountered this package in previous chapters. In particular, you have focused on some examples using supervised learning and model selection. However, the scikit-learn package also provides other classes and methods, as shown in the following figure:
Figure 8.4 – An overview of the scikit-learn package
The package is divided into the following subpackages:
- Preprocessing
- Dimensionality reduction
- Model selection
- Supervised learning
- Unsupervised learning
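To make these five areas concrete, the following minimal sketch touches each of them once. It is only an illustration: the Iris dataset, the specific estimators (StandardScaler, PCA, LogisticRegression, KMeans), and all parameter values are assumptions chosen for brevity, not examples taken from this chapter. Note also that the importable module names in scikit-learn (for instance, sklearn.decomposition for dimensionality reduction or sklearn.cluster for one family of unsupervised methods) do not match the category names above one to one.

```python
# Illustrative sketch only: dataset, estimators, and parameters are assumptions.
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler       # preprocessing
from sklearn.decomposition import PCA                  # dimensionality reduction
from sklearn.model_selection import train_test_split   # model selection
from sklearn.linear_model import LogisticRegression    # supervised learning
from sklearn.cluster import KMeans                     # unsupervised learning

# Load a small built-in dataset: 150 samples, 4 features, 3 classes.
X, y = load_iris(return_X_y=True)

# Model selection: hold out 30% of the samples for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y
)

# Preprocessing: fit the scaler on the training split only, then apply it to both.
scaler = StandardScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Dimensionality reduction: project the 4 scaled features onto 2 components.
pca = PCA(n_components=2).fit(X_train_scaled)
X_train_2d = pca.transform(X_train_scaled)
X_test_2d = pca.transform(X_test_scaled)

# Supervised learning: train a classifier with labels and score it on held-out data.
clf = LogisticRegression(max_iter=200).fit(X_train_2d, y_train)
accuracy = clf.score(X_test_2d, y_test)

# Unsupervised learning: group the same points without using the labels.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42).fit(X_train_2d)
cluster_labels = kmeans.labels_

print(f"Test accuracy: {accuracy:.2f}")
print(f"Distinct clusters found: {len(set(cluster_labels))}")
```

Every step follows the same fit-then-transform (or fit-then-predict) pattern, which is why objects from different areas of the package can be combined so easily.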
Let’s investigate each subpackage briefly, starting with the first one: preprocessing. For a more in-depth analysis of each subpackage, you can refer to the Further reading section at the end of this chapter.
Preprocessing
Preprocessing contains all of the classes and methods that permit us to manipulate the dataset before giving it as input...