Converting a full model to a reduced hybrid quantization model
In the previous section, we converted a full model into a reduced float16 TFLite model and demonstrated its scoring and evaluation processes. Now we will try the second type of supported quantization: a hybrid approach.
Hybrid quantization optimizes the model by converting the weights to 8-bit integers while keeping the biases and activations as 32-bit floats. Because inference then mixes integer and floating-point computations, this approach is known as hybrid quantization. It is intended as a trade-off between accuracy and optimization.
Only one line of the conversion code needs to change for hybrid quantization, as explained below. In the previous section, this is how we quantized the full model to a reduced float16 TFLite model:
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]
tflite_model = converter.convert()
For hybrid quantization, we simply omit the supported_types line, keeping only the default optimization flag.
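As a minimal sketch of the complete hybrid conversion, assuming the same trained Keras model (referred to here as model) from the earlier sections and a hypothetical output filename:

import tensorflow as tf

# Assumes `model` is the trained Keras model from the earlier sections.
converter = tf.lite.TFLiteConverter.from_keras_model(model)

# Hybrid quantization: set only the default optimization flag and do not
# specify target_spec.supported_types, so the weights are quantized to
# 8-bit integers while biases and activations remain 32-bit floats.
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

# Write the quantized model to disk (hypothetical filename).
with open('model_hybrid.tflite', 'wb') as f:
    f.write(tflite_model)

Scoring and evaluation then proceed exactly as with the float16 model from the previous section, using the TFLite interpreter.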