You're reading from Learn TensorFlow Enterprise Build, manage, and scale machine learning workloads seamlessly using Google's TensorFlow Enterprise

Product type Paperback

Published in Nov 2020

Publisher Packt

ISBN-13 9781800209145

Length 314 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Artificial Intelligence

Author (1):

KC Tung

View More author details

Table of Contents (15) Chapters

Preface

1. Section 1 – TensorFlow Enterprise Services and Features

2. Chapter 1: Overview of TensorFlow Enterprise FREE CHAPTER

3. Chapter 2: Running TensorFlow Enterprise in Google AI Platform

4. Section 2 – Data Preprocessing and Modeling

5. Chapter 3: Data Preparation and Manipulation Techniques

6. Chapter 4: Reusable Models and Scalable Data Pipelines

7. Section 3 – Scaling and Tuning ML Works

8. Chapter 5: Training at Scale

9. Chapter 6: Hyperparameter Tuning

10. Section 4 – Model Optimization and Deployment

11. Chapter 7: Model Optimization

12. Chapter 8: Best Practices for Model Training and Performance

13. Chapter 9: Serving a TensorFlow Model

14. Other Books You May Enjoy

Leave a review - let other readers know what you think

Chapter 7: Model Optimization

In this chapter, we will learn about the concept of model optimization through a technique known as quantization. This is important because even though capacity, such as compute and memory, are less of an issue in a cloud environment, latency and throughput are always a factor in the quality and quantity of the model's output. Therefore, model optimization to reduce latency and maximize throughput can help reduce the compute cost. In the edge environment, many of the constraints are related to resources such as memory, compute, power consumption, and bandwidth.

In this chapter, you will learn how to make your model as lean and mean as possible, with acceptable or negligible changes in the model's accuracy. In other words, we will reduce the model size so that we can have the model running on less power and fewer compute resources without overly impacting its performance. In this chapter, we are going to take a look at recent advances and...