Deploying a vision model to a Vertex AI endpoint
In the previous section, we completed our experiment of training a TF-based vision model to identify defects from product images. We now have a trained model that can identify defective or broken bangle images. To make this model usable in downstream applications, we need to deploy it to an endpoint so that we can query that endpoint, getting outputs for new input images on demand. There are certain things that are important to consider while deploying a model, such as expected traffic, expected latency, and expected cost. Based on these factors, we can choose the best infrastructure to deploy our models. If there are strict low-latency requirements, we can deploy our model to machines with accelerators (such as Graphics Processing Units (GPUs) or Tensor Processing Units (TPUs)). Conversely, if we don't need online or on-demand predictions, we don't need to deploy our model to an endpoint at all. Offline batch...
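As a rough sketch of how this looks in code, the snippet below uploads an exported TF SavedModel and deploys it with the `google-cloud-aiplatform` SDK, picking a GPU-backed machine only when low latency is required. The project ID, artifact URI, display name, and machine/accelerator choices are illustrative assumptions, not values from our experiment.

```python
# Sketch only: deploying a trained TF vision model to a Vertex AI endpoint.
# All resource names below (display name, machine types, container images)
# are placeholder assumptions to be adapted to your project.

def deploy_vision_model(project: str, location: str, artifact_uri: str,
                        low_latency: bool = False):
    """Upload a SavedModel from GCS and deploy it to a Vertex AI endpoint.

    When low_latency is True, attach a GPU accelerator and a GPU serving
    image; otherwise serve on a CPU-only machine to keep costs down.
    """
    from google.cloud import aiplatform  # pip install google-cloud-aiplatform

    aiplatform.init(project=project, location=location)

    serving_image = (
        "us-docker.pkg.dev/vertex-ai/prediction/tf2-gpu.2-12:latest"
        if low_latency
        else "us-docker.pkg.dev/vertex-ai/prediction/tf2-cpu.2-12:latest"
    )

    model = aiplatform.Model.upload(
        display_name="bangle-defect-detector",   # placeholder name
        artifact_uri=artifact_uri,               # GCS path to the SavedModel
        serving_container_image_uri=serving_image,
    )

    deploy_kwargs = {"machine_type": "n1-standard-4"}
    if low_latency:
        deploy_kwargs.update(
            machine_type="n1-standard-8",
            accelerator_type="NVIDIA_TESLA_T4",
            accelerator_count=1,
        )

    # model.deploy() creates an endpoint (if none is given) and returns it;
    # the endpoint can then be queried with endpoint.predict(instances=[...]).
    endpoint = model.deploy(**deploy_kwargs)
    return endpoint
```

Once deployed, new images can be sent to `endpoint.predict()` for on-demand classification; for purely offline workloads, a batch prediction job avoids the cost of a standing endpoint.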