Introducing the business scenario
In this section, we'll introduce the business scenario that will be tackled with the XGBoost classification algorithm.
The business scenario is very similar to the use case presented and used in Chapter 6, Classifying Trees with Multiclass Logistic Regression. In this chapter, we'll use the same dataset but will apply a more advanced ML algorithm.
We can summarize and remember that the goal of the ML model is to automatically classify the trees of New York City into different species according to their characteristics, such as their position, their size, and their health status.
As we've done in Chapter 9, Suggesting the Right Product by Using Matrix Factorization, we can focus our attention only on the five most common species of trees present in the city.
Now that we've explained and understood the business scenario, let's take a look at the ML technique that we can use to automatically classify trees according...