Test your knowledge
Now that we've seen how to use some tree-based and boosting models, try using them on the full set of data for the housing loan dataset we worked on in this chapter. The full dataset has many more features. As always, you may want to perform some data cleaning and preparation before using some of the tree-based ML algorithms, although pycaret
can auto-clean and prepare data, and catboost
and lightgbm
can handle missing values and categorical columns (you need to convert the datatype from object
to category
for lightgbm
, however). Explore the feature importance using the methods we learned in the chapter. Be sure to write a summary of your results.