The approach
You have decided to do the following:
- Train a proxy model: You don’t have the original features or model, but all is not lost because you have the COMPAS scores—the labels. And we also have features relevant to the problem we can connect to these labels with models. By approximating the COMPAS model via the proxies, you can assess the fairness of the COMPAS decisions. In this chapter, we will train a CatBoost model.
- Anchor explanations: Using this method will unearth insights into why the proxy model makes specific predictions using a series of rules called anchors, which tell you where the decision boundaries lie. The boundaries are relevant for our mission because we want to know why the defendant has been wrongfully predicted to recidivate. It’s an approximate boundary to the original model, but there’s still some truth to it.
- Counterfactual explanations: While Anchor explains why a decision was made, counterfactuals...