Leveraging term importance and semantics
Everything we have done up to now has been relatively simple and based on word stems, or so-called tokens. The bag-of-words model was nothing but a dictionary of tokens that counts how often each token occurs per field. In this section, we will take a look at a common technique to further improve matching between documents using n-gram and skip-gram combinations of terms.
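The following is a minimal sketch of these two kinds of term combinations, assuming scikit-learn and NLTK are available; the sample sentences and parameter values are purely illustrative:

```python
from sklearn.feature_extraction.text import CountVectorizer
from nltk.util import skipgrams

docs = [
    "the quick brown fox jumps over the lazy dog",
    "the lazy dog sleeps all day",
]

# Bag-of-words extended with bigrams: every unigram and every adjacent
# word pair becomes its own entry in the dictionary.
vectorizer = CountVectorizer(ngram_range=(1, 2))
counts = vectorizer.fit_transform(docs)
print(vectorizer.get_feature_names_out())  # unigrams and bigrams
print(counts.toarray())                    # occurrence counts per document

# Skip-grams relax the adjacency constraint: here we pair words that may
# skip up to two intermediate tokens.
tokens = docs[0].split()
print(list(skipgrams(tokens, n=2, k=2)))
```

Even on these two short sentences, the bigram dictionary is noticeably larger than the plain token dictionary, which foreshadows the problem discussed next.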
Combining terms in multiple ways will explode the size of your dictionary. This becomes a problem if you have a large corpus, for instance, one containing 10 million words. Hence, we will look at a common preprocessing technique to reduce the dimensionality of a large dictionary through singular value decomposition (SVD).
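As a rough sketch of how this reduction can look in practice (again assuming scikit-learn; the number of components and the toy corpus are arbitrary choices for illustration):

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import TruncatedSVD

docs = [
    "the quick brown fox jumps over the lazy dog",
    "the lazy dog sleeps all day",
    "a fast auburn fox leaps over a sleepy hound",
]

# The n-gram dictionary grows quickly with corpus size and n-gram length.
counts = CountVectorizer(ngram_range=(1, 2)).fit_transform(docs)
print(counts.shape)  # (number of documents, dictionary size)

# Truncated SVD projects each document onto a handful of latent
# components instead of the full dictionary.
svd = TruncatedSVD(n_components=2, random_state=0)
reduced = svd.fit_transform(counts)
print(reduced.shape)                  # (number of documents, 2)
print(svd.explained_variance_ratio_)  # variance retained per component
```

The documents are now represented by a few dense dimensions rather than one count per dictionary entry, which keeps downstream matching tractable even for very large dictionaries.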
While this approach is now a lot more complicated, it is still based on a bag-of-words model, which already works well on a large corpus in practice. Of course, we can do better by trying to understand the importance of individual words. Therefore, we will tackle another popular technique in NLP to compute the...