Chapter 7: Discovering Exoplanets with XGBoost
In this chapter, you will journey through the stars in an attempt to discover exoplanets with XGBClassifier
as your guide.
The reason for this chapter is twofold. The first is that it's important to gain practice in a top-to-bottom study using XGBoost since for all practical purposes, that is what you will normally do with XGBoost. Although you may not discover exoplanets with XGBoost on your own, the strategies that you implement here, which include choosing the correct scoring metric and carefully fine-tuning hyperparameters with that scoring metric in mind, apply to any practical use of XGBoost. The second reason for this particular case study is that it's essential for all machine learning practitioners to be proficient at competently handling imbalanced datasets, which is the key theme of this particular chapter.
Specifically, you will gain new skills in using the confusion matrix and the classification report, understanding...