Summary
In this chapter, you have learned how to build enterprise-grade ETL pipelines and data transformations in Azure Machine Learning, as well as how to manage datasets.
You have learned how to load data into the cloud using blob storage and how to extract it from various other data formats. If you model your data as abstract data stores and datasets, your users don't need to know where the data is located, how it is encoded, or which protocol and permissions are required to access it. This is an essential part of an ETL pipeline. It also helps to see your dataset definitions as contracts describing what users can expect from the data, much like an API. Therefore, it makes sense to follow a specific life cycle: creating datasets, then updating and versioning them, before deprecating and archiving them once they are no longer used.
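This dataset-as-contract life cycle can be sketched in plain Python. Note that this is an illustrative model only, not the Azure Machine Learning SDK API; the class and method names (`DatasetDefinition`, `add_version`, `deprecate`, `archive`) are hypothetical and exist purely to show the idea that consumers reference a named, versioned contract rather than raw storage paths:

```python
class DatasetDefinition:
    """A dataset definition treated as a contract: consumers rely on
    its name and version, never on storage location or encoding."""

    def __init__(self, name, location):
        self.name = name
        self.versions = []          # list of (version_number, location)
        self.state = "active"       # active -> deprecated -> archived
        self.replaced_by = None
        self.add_version(location)

    def add_version(self, location):
        """Register a new version; older versions stay resolvable."""
        version = len(self.versions) + 1
        self.versions.append((version, location))
        return version

    def latest(self):
        """Consumers ask for the latest version by name only."""
        return self.versions[-1]

    def deprecate(self, replaced_by=None):
        """Signal to consumers that they should migrate."""
        self.state = "deprecated"
        self.replaced_by = replaced_by

    def archive(self):
        """Final state once no consumers remain."""
        self.state = "archived"


# Usage: the caller never touches blob paths directly after creation.
titanic = DatasetDefinition("titanic", "https://myaccount.blob.core.windows.net/data/titanic_v1.csv")
titanic.add_version("https://myaccount.blob.core.windows.net/data/titanic_v2.csv")
version, _ = titanic.latest()   # resolves to version 2
titanic.deprecate(replaced_by="passengers")
titanic.archive()
```

The key design choice mirrored here is that versioning is additive: publishing a new version never breaks consumers pinned to an older one, just as a well-managed API keeps old versions available during a deprecation window.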
Using the Azure DataPrep SDK, you acquired the skills to build scalable data transformation pipelines using dataflows. We looked into how to create...