You're reading from Engineering MLOps Rapidly build, test, and manage production-ready machine learning life cycles at scale

Product type Paperback

Published in Apr 2021

Publisher Packt

ISBN-13 9781800562882

Length 370 pages

Edition 1st Edition

Tools

Azure Functions

Concepts

Machine Learning

Author (1):

Emmanuel Raj

View More author details

Table of Contents (18) Chapters

Preface

1. Section 1: Framework for Building Machine Learning Models

2. Chapter 1: Fundamentals of an MLOps Workflow FREE CHAPTER

3. Chapter 2: Characterizing Your Machine Learning Problem

4. Chapter 3: Code Meets Data

5. Chapter 4: Machine Learning Pipelines

6. Chapter 5: Model Evaluation and Packaging

7. Section 2: Deploying Machine Learning Models at Scale

8. Chapter 6: Key Principles for Deploying Your ML System

9. Chapter 7: Building Robust CI/CD Pipelines

10. Chapter 8: APIs and Microservice Management

11. Chapter 9: Testing and Securing Your ML Solution

12. Chapter 10: Essentials of Production Release

13. Section 3: Monitoring Machine Learning Models in Production

14. Chapter 11: Key Principles for Monitoring Your ML System

15. Chapter 12: Model Serving and Monitoring

16. Chapter 13: Governing the ML System for Continual Learning

17. Other Books You May Enjoy

Exploring different modes of serving ML models

In this section, we will consider how a model can be served for users (both humans and machines) to consume the ML service efficiently. Model serving is a critical area, which an ML system needs to succeed at to fulfill its business impact, as any lag or bug in this area can be costly in terms of serving users. Robustness, availability, and convenience are key factors to keep in mind while serving ML models. Let's take a look at some ways in which ML models can be served: this can be via batch service or on-demand mode (for instance, when a query is made on demand in order to get a prediction). A model can be served to either a machine or a human user in on-demand mode. Here is an example of serving a model to a user:

Figure 12.2 – Serving a model to users

In a typical scenario (in on-demand mode), a model is served as a service for users to consume, as shown in Figure 12.2. Then, an external application...