Summary
From the examples covered in this chapter, we learned how to leverage a distributed training strategy on TPUs and GPUs through AI Platform, which runs TensorFlow Enterprise 2.2 distributions. AI Platform is a service that wraps TPU and GPU accelerator hardware and manages the configuration and setup of your training job.
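As a minimal local sketch of the distributed training pattern, the snippet below uses `tf.distribute.MirroredStrategy`, which replicates a model across local GPUs and falls back to the CPU when none are present; on AI Platform the same pattern applies with a TPU- or multi-worker strategy. The model shape and synthetic data here are illustrative, not from the chapter's examples.

```python
import numpy as np
import tensorflow as tf

# MirroredStrategy mirrors variables across all local accelerators; with no
# GPU available it runs on a single CPU device, so this sketch works anywhere.
strategy = tf.distribute.MirroredStrategy()

# Variable creation must happen inside the strategy scope so that each
# replica receives a synchronized copy of the weights.
with strategy.scope():
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(4, activation="relu", input_shape=(8,)),
        tf.keras.layers.Dense(1),
    ])
    model.compile(optimizer="adam", loss="mse")

# Tiny synthetic dataset, purely for illustration.
x = np.random.rand(16, 8).astype("float32")
y = np.random.rand(16, 1).astype("float32")
history = model.fit(x, y, batch_size=8, epochs=1, verbose=0)
```

The key design point is that only variable creation needs to live inside `strategy.scope()`; `model.fit` then handles distributing each batch across replicas.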
Currently, in Google AI Platform, the data ingestion pipeline relies on TFRecordDataset to stream training data in batches into the model training workflow. We also learned how to leverage a prebuilt model downloaded from TensorFlow Hub by means of the TFHUB_CACHE_DIR environment variable; this is also the way to import your own saved model from an offline estate into Google AI Platform. Overall, on this platform, we used a TensorFlow Enterprise 2.2 distribution to achieve scalable data streaming and distributed training on Google Cloud's TPUs and GPUs, and serialized all the model checkpoints and assets back to...
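The TFRecordDataset-based ingestion described above can be sketched end to end as follows: write a handful of `tf.train.Example` records to a local file (a stand-in for a Cloud Storage object), then stream them back in batches with `tf.data.TFRecordDataset`. The file path and feature names are illustrative.

```python
import os
import tempfile

import tensorflow as tf

# Hypothetical local file standing in for a TFRecord file in Cloud Storage.
path = os.path.join(tempfile.mkdtemp(), "train.tfrecord")

# Write a few tf.train.Example records with a float feature "x" and an
# integer label "y".
with tf.io.TFRecordWriter(path) as writer:
    for i in range(4):
        example = tf.train.Example(features=tf.train.Features(feature={
            "x": tf.train.Feature(
                float_list=tf.train.FloatList(value=[float(i)])),
            "y": tf.train.Feature(
                int64_list=tf.train.Int64List(value=[i % 2])),
        }))
        writer.write(example.SerializeToString())

# Stream the records back in batches, the pattern the ingestion pipeline
# uses to feed the training workflow.
feature_spec = {
    "x": tf.io.FixedLenFeature([1], tf.float32),
    "y": tf.io.FixedLenFeature([1], tf.int64),
}

def parse(record):
    return tf.io.parse_single_example(record, feature_spec)

dataset = (tf.data.TFRecordDataset(path)
           .map(parse)
           .batch(2)
           .prefetch(tf.data.AUTOTUNE))

seen = []
for batch in dataset:
    seen.extend(batch["x"].numpy().ravel().tolist())
```

In a real training job the same code points at `gs://...` paths; `TFRecordDataset` reads Cloud Storage objects directly.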
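The TFHUB_CACHE_DIR mechanism can be sketched as below. The cache path and the hub model handle are illustrative assumptions; the actual download is left commented out because it needs network access, but the essential point, setting the variable before `tensorflow_hub` resolves a handle, is shown.

```python
import os

# TFHUB_CACHE_DIR must be set before tensorflow_hub resolves a model handle;
# hub then reads modules from (and caches downloads into) this directory.
# Pointing it at a Cloud Storage path such as "gs://your-bucket/tfhub-cache"
# (hypothetical bucket) is how an AI Platform training job can load a model
# you staged there offline.
os.environ["TFHUB_CACHE_DIR"] = "/tmp/tfhub-cache"  # local path for this sketch

# With the cache directory set, a hub layer resolves against it, e.g.:
#   import tensorflow_hub as hub
#   layer = hub.KerasLayer("https://tfhub.dev/google/nnlm-en-dim50/2")
# (commented out here because it downloads the module on first use)
```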