You're reading from Machine Learning with BigQuery ML Create, execute, and improve machine learning models in BigQuery using standard SQL queries

Product type Paperback

Published in Jun 2021

Publisher Packt

ISBN-13 9781800560307

Length 344 pages

Edition 1st Edition

Languages

Python

Tools

BigQuery

Concepts

Machine Learning

Author (1):

Alessandro Marrandino

View More author details

Table of Contents (20) Chapters

Preface

1. Section 1: Introduction and Environment Setup

2. Chapter 1: Introduction to Google Cloud and BigQuery FREE CHAPTER

3. Chapter 2: Setting Up Your GCP and BigQuery Environment

4. Chapter 3: Introducing BigQuery Syntax

5. Section 2: Deep Learning Networks

6. Chapter 4: Predicting Numerical Values with Linear Regression

7. Chapter 5: Predicting Boolean Values Using Binary Logistic Regression

8. Chapter 6: Classifying Trees with Multiclass Logistic Regression

9. Section 3: Advanced Models with BigQuery ML

10. Chapter 7: Clustering Using the K-Means Algorithm

11. Chapter 8: Forecasting Using Time Series

12. Chapter 9: Suggesting the Right Product by Using Matrix Factorization

13. Chapter 10: Predicting Boolean Values Using XGBoost

14. Chapter 11: Implementing Deep Neural Networks

15. Section 4: Further Extending Your ML Capabilities with GCP

16. Chapter 12: Using BigQuery ML with AI Notebooks

17. Chapter 13: Running TensorFlow Models with BigQuery ML

18. Chapter 14: BigQuery ML Tips and Best Practices

19. Other Books You May Enjoy

Chapter 7: Clustering Using the K-Means Algorithm

In this chapter, we'll introduce unsupervised machine learning, and you'll learn how to use BigQuery ML to build K-Means algorithms to cluster similar data into multiple categories.

Unsupervised machine learning is particularly useful when we have datasets without any labels, and we need to infer the structure of the data without any initial knowledge.

In different industries, it can be very valuable to identify similar events, objects, and people according to a specific set of features. K-Means clustering is typically used to identify similar customers, documents, products, events, or items according to a specific set of characteristics.

In this chapter, we'll focus our attention on the K-Means clustering algorithm, which is widely used to reveal similarities in structured and unstructured data. We'll go through all the steps required to build a K-Means clustering model, leveraging BigQuery ML.

With...