Technical requirements
In this chapter, we will use the Matplotlib
, pandas
, NumPy
, scikit-learn
, feature-engine
, and Category Encoders Python libraries. If you need to install Python, the free Anaconda Python distribution (https://www.anaconda.com/) includes most numerical computing libraries.
feature-engine
can be installed with pip
:
pip install feature-engine
If you use Anaconda, you can install feature-engine
with conda
:
conda install -c conda-forge feature_engine
To install Category Encoders, use pip
as follows:
pip install category_encoders
We will use the Credit Approval dataset from the UCI Machine Learning Repository (https://archive.ics.uci.edu/), licensed under the CC BY 4.0 creative commons attribution: https://creativecommons.org/licenses/by/4.0/legalcode. You’ll find the dataset at this link: http://archive.ics.uci.edu/dataset/27/credit+approval.
I downloaded and modified the data as shown in this notebook: https://github.com/PacktPublishing...