Monitoring data drift and model performance
In this section, we will run through an example that you can follow in the notebook available in the GitHub repository (at https://github.com/PacktPublishing/Machine-Learning-Engineering-with-MLflow/tree/master/Chapter11/model_performance_drifts) of the code of the package. We will run through the process of calculating different types of drift and exploring its integration with MLflow.
One emergent open source tool in the space of monitoring model performance is called Evidently (https://evidentlyai.com/). Evidently aids us in analyzing ML models during the production and validation phases. It generates handy reports integrated with pandas
, JSON, and CSV. It allows us to monitor multiple drifts in ML models and their performance. The GitHub repository for Evidently is available at https://github.com/evidentlyai/evidently/.
In this section, we will explore the combination of Evidently with MLflow, in order to monitor data drift and...