You're reading from Mastering Azure Machine Learning Execute large-scale end-to-end machine learning with Azure

Product type Paperback

Published in May 2022

Publisher Packt

ISBN-13 9781803232416

Length 624 pages

Edition 2nd Edition

Tools

Azure

Concepts

Machine Learning

Authors (2):

Marcel Alsdorf

Christoph Körner

Building a simple bag-of-words model

In this section, we will tackle the shortcomings of label encoding for textual data with a surprisingly simple technique called bag-of-words, which will serve as the foundation for a simple NLP pipeline. Don't worry if these techniques look too simple when you read through them; we will gradually build on them with tweaks, optimizations, and improvements until we arrive at a modern NLP pipeline.

A naïve bag-of-words model using counting

The main concept we will build in this section is the bag-of-words model. The idea is simple: we model any document as the collection of words that appear in it, together with the frequency of each word. In doing so, we throw away sentence structure, word order, punctuation marks, and more, and reduce the document to a raw count of words. Following this, we can vectorize this word count into a numeric vector representation, which can then be used for ML...
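To make the counting idea concrete, here is a minimal sketch in plain Python using only the standard library. The helper names (`tokenize`, `build_vocabulary`, `vectorize`), the regular expression used for tokenization, and the two example sentences are our own assumptions for illustration and are not taken from the book's code; a real pipeline would likely use a more careful tokenizer.

```python
import re
from collections import Counter


def tokenize(document):
    # Hypothetical tokenizer: lowercase the text and keep runs of letters,
    # digits, and apostrophes. Punctuation is dropped.
    return re.findall(r"[a-z0-9']+", document.lower())


def build_vocabulary(documents):
    # Collect every distinct token across all documents and sort it, so that
    # each word gets a fixed position in the output vectors.
    return sorted({token for doc in documents for token in tokenize(doc)})


def vectorize(document, vocabulary):
    # Count the tokens of one document and read the counts out in vocabulary
    # order. A Counter returns 0 for words the document does not contain.
    counts = Counter(tokenize(document))
    return [counts[word] for word in vocabulary]


documents = [
    "The cat sat on the mat.",
    "The dog chased the cat!",
]

vocabulary = build_vocabulary(documents)
vectors = [vectorize(doc, vocabulary) for doc in documents]

print(vocabulary)
for doc, vector in zip(documents, vectors):
    print(vector, "<-", doc)
```

Each document becomes a vector with one entry per vocabulary word. Because only counts are kept, a shuffled version of the first sentence, such as "mat the on sat cat the", would map to exactly the same vector, which illustrates how word order is discarded.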