In this chapter, we will use the Python libraries pandas, NumPy, and scikit-learn. I recommend installing the free Anaconda Python distribution (https://www.anaconda.com/distribution/), which contains all these packages.
For details on how to install the Python Anaconda distribution, visit the Technical requirements section in Chapter 1, Foreseeing Variable Problems When Building ML Models.
We will also use Feature-engine, an open source Python library that I created, which can be installed using pip:
pip install feature-engine
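After installation, one way to confirm that the packages are available is to query their installed versions. The following is a minimal sketch using the standard library's importlib.metadata (Python 3.8 or later). The distribution names in PACKAGES are assumed to be the names used by pip, and installed_versions is a hypothetical helper written for this illustration, not part of any of the libraries.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as used by pip (an assumption based on the packages named above).
PACKAGES = ["pandas", "numpy", "scikit-learn", "feature-engine"]


def installed_versions(packages):
    """Map each package name to its installed version string, or None if it is missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in the current environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(PACKAGES).items():
        print(f"{name}: {ver if ver is not None else 'not installed'}")
```

Any package reported as not installed can be added with pip or conda before continuing.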
To learn more about Feature-engine, visit the following sites:
- Home page: www.trainindata.com/feature-engine
- Docs: https://feature-engine.readthedocs.io
- GitHub: https://github.com/solegalli/feature_engine/
Check that you have installed the right versions of the numerical Python libraries, which...