In this chapter, we will use the following Python libraries: pandas, NumPy, Matplotlib, and scikit-learn. I recommend installing the free Anaconda Python distribution, which contains all of these packages.
For details on how to install the Anaconda Python distribution, visit the Technical requirements section in Chapter 1, Foreseeing Variable Problems in Building ML Models.
We will also use the open source Python libraries Feature-engine and Category Encoders, which can be installed using pip:
pip install feature-engine
pip install category_encoders
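Once the installation finishes, a quick check like the following can confirm that each library is visible to your Python interpreter. This is only a minimal sketch: the helper name check_installed and the REQUIRED_MODULES mapping are made up for illustration, and the import names listed (for example, sklearn for scikit-learn and feature_engine for Feature-engine) are the ones I assume these packages expose.

```python
from importlib.util import find_spec

# Display name -> import name (assumed). Note that the import name can
# differ from the pip package name, e.g. scikit-learn is imported as sklearn.
REQUIRED_MODULES = {
    "pandas": "pandas",
    "NumPy": "numpy",
    "Matplotlib": "matplotlib",
    "scikit-learn": "sklearn",
    "Feature-engine": "feature_engine",
    "Category Encoders": "category_encoders",
}


def check_installed(modules=REQUIRED_MODULES):
    """Return {display name: True/False} without importing the packages."""
    return {
        name: find_spec(module) is not None
        for name, module in modules.items()
    }


if __name__ == "__main__":
    # Print one status line per library.
    for name, found in check_installed().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Any library reported as MISSING can be installed with pip or through Anaconda before you continue with the chapter.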
To learn more about Feature-engine, visit the following sites:
- Home page: https://www.trainindata.com/feature-engine
- GitHub: https://github.com/solegalli/feature_engine/
- Documentation: https://feature-engine.readthedocs.io
To learn more about category encoders...