You're reading from Data Science with .NET and Polyglot Notebooks Programmer's guide to data science using ML.NET, OpenAI, and Semantic Kernel

Product type Paperback

Published in Aug 2024

Publisher Packt

ISBN-13 9781835882962

Length 404 pages

Edition 1st Edition

Languages

Tools

Codespaces

Concepts

Artificial Intelligence

Author (1):

Matt Eland

View More author details

Table of Contents (22) Chapters

Preface

1. Part 1: Data Analysis in Polyglot Notebooks

2. Chapter 1: Data Science, Notebooks, and Kernels FREE CHAPTER

3. Chapter 2: Exploring Polyglot Notebooks

4. Chapter 3: Getting Data and Code into Your Notebooks

5. Chapter 4: Working with Tabular Data and DataFrames

6. Chapter 5: Visualizing Data

7. Chapter 6: Variable Correlations

8. Part 2: Machine Learning with Polyglot Notebooks and ML.NET

9. Chapter 7: Classification Experiments with ML.NET AutoML

10. Chapter 8: Regression Experiments with ML.NET AutoML

11. Chapter 9: Beyond AutoML: Pipelines, Trainers, and Transforms

12. Chapter 10: Deploying Machine Learning Models

13. Part 3: Exploring Generative AI with Polyglot Notebooks

14. Chapter 11: Generative AI in Polyglot Notebooks

15. Chapter 12: AI Orchestration with Semantic Kernel

16. Part 4: Polyglot Notebooks in the Enterprise

17. Chapter 13: Enriching Documentation with Mermaid Diagrams

18. Chapter 14: Extending Polyglot Notebooks

19. Chapter 15: Adopting and Deploying Polyglot Notebooks

20. Index

Why subscribe?

21. Other Books You May Enjoy

Summary

In this chapter, we covered high-level concepts in supervised machine learning in general and, more specifically, in classification.

Classification involves predicting a categorical variable, such as the presence or absence of cancer, predicting which football position a player might perform best in, predicting whether a customer will unsubscribe/churn in the next six months, or determining whether a social media post is likely to “go viral.”

In machine learning, we train models by providing training data to the model training process and selecting the machine learning algorithm and its hyperparameters to use. Pre-processing data may also be necessary to get data into a standardized form.

Once a model is trained, we can evaluate its performance by getting predictions for data we’ve reserved for testing purposes. This test data gives us an idea of whether our model accurately predicts values for data it hasn’t seen before.

Models are evaluated...