You're reading from Practical Deep Learning at Scale with MLflow Bridge the gap between offline experimentation and online production

Product type Paperback

Published in Jul 2022

Publisher Packt

ISBN-13 9781803241333

Length 288 pages

Edition 1st Edition

Tools

MLflow

Concepts

Deep Learning

Author (1):

Yong Liu

View More author details

Table of Contents (17) Chapters

Preface

1. Section 1 - Deep Learning Challenges and MLflow Prime FREE CHAPTER

2. Chapter 1: Deep Learning Life Cycle and MLOps Challenges

3. Chapter 2: Getting Started with MLflow for Deep Learning

4. Section 2 – Tracking a Deep Learning Pipeline at Scale

5. Chapter 3: Tracking Models, Parameters, and Metrics

6. Chapter 4: Tracking Code and Data Versioning

7. Section 3 – Running Deep Learning Pipelines at Scale

8. Chapter 5: Running DL Pipelines in Different Environments

9. Chapter 6: Running Hyperparameter Tuning at Scale

10. Section 4 – Deploying a Deep Learning Pipeline at Scale

11. Chapter 7: Multi-Step Deep Learning Inference Pipeline

12. Chapter 8: Deploying a DL Inference Pipeline at Scale

13. Section 5 – Deep Learning Model Explainability at Scale

14. Chapter 9: Fundamentals of Deep Learning Explainability

15. Chapter 10: Implementing DL Explainability with MLflow

16. Other Books You May Enjoy

Chapter 4: Tracking Code and Data Versioning

DL models are not just models – they are intimately tied to the code that trains and tests the model and the data that's used for training and testing. If we don't track the code and data that's used for the model, it is impossible to reproduce the model or improve it. Furthermore, there have been recent industry-wide awakenings and paradigm shifts toward a data-centric AI (https://www.forbes.com/sites/gilpress/2021/06/16/andrew-ng-launches-a-campaign-for-data-centric-ai/?sh=5cbacdc574f5), where the importance of data is being lifted to a first-class artifact in building ML and, especially, DL models. Due to this, in this chapter, we will learn how to track code and data versioning using MLflow. We will learn about the different ways we can track code and pipeline versioning and how to use Delta Lake for data versioning. By the end of this chapter, you will be able to understand and implement tracking techniques for...