- Regarding ways to find the best hyperparameters, compare the advantages and disadvantages of grid search, random search, and Bayesian optimization as they are applied to hyperparameter tuning.
- Why do we typically need three splits of data when we do hyperparameter tuning?
- Which metric do you think would be best for our xgboost example: validation:auc or training:auc?




















































