Scikit-learn is a Python machine learning library built on the top of NumPy, SciPy, and Matplotlib. It provides simple tools for data mining and data analysis. According to the description on its website (see https://scikit-learn.org/stable/), we can use it in six major areas:
- Classification: A supervised learning approach for learning given data and using it to generate a model for a classifier. Then, we use the model to predict new data in order to identify the category with the classifier.
- Regression: Using a statistical methodology to predict continuous values using a given set of data.
- Clustering: Grouping data into different categories.
- Dimensionality reduction: Reducing the dimension of the data.
- Model selection: Tuning the hyperparameters of the model.
- Preprocessing: Feature extraction and normalization.
In the last...