In this chapter, we will use the following Python libraries: pandas, NumPy, SciPy, and scikit-learn. I recommend installing the free Anaconda Python distribution (https://www.anaconda.com/distribution/), which contains all of these packages. For details on how to install the Python Anaconda distribution, visit the Technical requirements section in Chapter 1, Foreseeing Variable Problems in Building ML Models.
We will also use the open source Python library Feature-engine, which I created and which can be installed using pip:
pip install feature-engine
To find out more about Feature-engine, visit its documentation at https://feature-engine.readthedocs.io.
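If you want to confirm that the packages are available before starting the recipes, a quick check along the following lines may help. This is only a minimal sketch using the Python standard library: the helper name missing_packages and the REQUIRED list are illustrative and not part of any of these libraries. It assumes the usual import names, where scikit-learn is imported as sklearn and Feature-engine as feature_engine.

```python
from importlib.util import find_spec

# Import names assumed for the libraries used in this chapter.
# scikit-learn is imported as "sklearn" and Feature-engine as "feature_engine".
REQUIRED = ["pandas", "numpy", "scipy", "sklearn", "feature_engine"]


def missing_packages(names):
    """Return the import names that cannot be found in the current environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages were found.")
```

Any name reported as missing can then be installed with Anaconda or pip, as described above.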