You're reading from Engineering MLOps Rapidly build, test, and manage production-ready machine learning life cycles at scale

Product type Paperback

Published in Apr 2021

Publisher Packt

ISBN-13 9781800562882

Length 370 pages

Edition 1st Edition

Tools

Azure Functions

Concepts

Machine Learning

Author (1):

Emmanuel Raj

View More author details

Table of Contents (18) Chapters

Preface

1. Section 1: Framework for Building Machine Learning Models

2. Chapter 1: Fundamentals of an MLOps Workflow FREE CHAPTER

3. Chapter 2: Characterizing Your Machine Learning Problem

4. Chapter 3: Code Meets Data

5. Chapter 4: Machine Learning Pipelines

6. Chapter 5: Model Evaluation and Packaging

7. Section 2: Deploying Machine Learning Models at Scale

8. Chapter 6: Key Principles for Deploying Your ML System

9. Chapter 7: Building Robust CI/CD Pipelines

10. Chapter 8: APIs and Microservice Management

11. Chapter 9: Testing and Securing Your ML Solution

12. Chapter 10: Essentials of Production Release

13. Section 3: Monitoring Machine Learning Models in Production

14. Chapter 11: Key Principles for Monitoring Your ML System

15. Chapter 12: Model Serving and Monitoring

16. Chapter 13: Governing the ML System for Continual Learning

17. Other Books You May Enjoy

Data ingestion and feature engineering

Data is essential to train ML models; without data, there is no ML. Data ingestion is a trigger step for the ML pipeline. It deals with the volume, velocity, veracity, and variety of data by extracting data from various data sources and ingesting the needed data for model training.

The ML pipeline is initiated by ingesting the right data for training the ML models. We will start by accessing the preprocessed data we registered in the previous chapter. Follow these steps to access and import the preprocessed data and get it ready for ML training:

Using the Workspace() function from the Azure ML SDK, access the data from the datastore in the ML workspace as follows:

from azureml.core import Workspace, Dataset
subscription_id = 'xxxxxx-xxxxxx-xxxxxxx-xxxxxxx'
resource_group = 'Learn_MLOps'
workspace_name = 'MLOps_WS'
workspace = Workspace(subscription_id, resource_group, workspace_name)

Note

Insert your own...