Optimizing for AI/ML at the edge
Serving ML models at the edge refers to running your models directly on user devices such as smartphones or IoT devices. The term “edge” comes from traditional network architecture terminology: the core of the network sits in the network owner’s data centers, while the edge is where user devices connect to the network. Running models and other systems at the edge can provide benefits such as lower latency, increased privacy, and reduced server costs. However, edge devices usually have limited computing power, memory, and energy budgets, so we often need to adapt our models before they can run efficiently on those devices. There are several techniques for optimizing models to run at the edge, and we will discuss them in this section.
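To give a taste of the kind of adaptation involved, here is a minimal, illustrative sketch of post-training weight quantization, one common way to shrink a model’s memory footprint for edge devices. The function names are hypothetical and the sketch uses plain NumPy rather than any particular deployment framework:

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Map float32 weights to int8 plus a single scale factor.

    This is symmetric per-tensor quantization: the largest absolute
    weight is mapped to 127, and everything else scales linearly.
    """
    scale = np.abs(weights).max() / 127.0
    q = np.round(weights / scale).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float32 weights from the int8 representation."""
    return q.astype(np.float32) * scale

# Example: a 256x256 float32 weight matrix shrinks 4x when stored as int8.
rng = np.random.default_rng(0)
w = rng.standard_normal((256, 256)).astype(np.float32)
q, scale = quantize_int8(w)

print(w.nbytes // q.nbytes)                            # 4x smaller in memory
print(float(np.abs(w - dequantize(q, scale)).max()))   # small rounding error
```

Real edge-deployment toolchains apply far more sophisticated versions of this idea (per-channel scales, calibration data, quantization-aware training), but the core trade-off is the same: a smaller, cheaper representation in exchange for a bounded loss of precision.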
Model optimization
Let’s start by discussing the measures we can take to optimize our models so that they run well at the edge.
Model selection
First...