In this chapter, we learned about inference engines and why they are an essential tool for the final deployment of a trained Caffe2 model on accelerators. We focused on two popular types of accelerators: NVIDIA GPUs and Intel CPUs. We looked at how to install and use TensorRT to deploy our Caffe2 model on NVIDIA GPUs. We also looked at the installation and use of OpenVINO to deploy our Caffe2 model on Intel CPUs and accelerators.
Many other companies, such as Google, Facebook, and Amazon, as well as start-ups such as Habana and Graphcore, are developing new accelerator hardware for the inference of DL models. There are also efforts, such as ONNX Runtime, that bring together the inference engines from multiple vendors under one umbrella. Please evaluate these options and choose the accelerator hardware and software that works best for deploying your Caffe2 model.
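As a brief illustration of that umbrella idea, here is a minimal, hedged sketch using ONNX Runtime's Python API. It assumes an ONNX Runtime build in which the TensorRT and OpenVINO execution providers are available (they ship in separate builds, such as onnxruntime-gpu), and a hypothetical model.onnx file exported from Caffe2 with an example image-sized input; your model path and input shape will differ:

```python
import numpy as np
import onnxruntime as ort

# Preferred execution providers, in order: TensorRT (NVIDIA GPUs),
# OpenVINO (Intel CPUs/accelerators), then the default CPU provider.
preferred = [
    "TensorrtExecutionProvider",
    "OpenVINOExecutionProvider",
    "CPUExecutionProvider",
]

# Keep only the providers actually available in this build, so the
# session creation does not fail on a machine without TensorRT/OpenVINO.
available = ort.get_available_providers()
providers = [p for p in preferred if p in available]

# "model.onnx" is a hypothetical path to a model exported from Caffe2.
session = ort.InferenceSession("model.onnx", providers=providers)

# Run inference on a dummy input; shape (1, 3, 224, 224) is an assumption.
input_name = session.get_inputs()[0].name
x = np.random.rand(1, 3, 224, 224).astype(np.float32)
outputs = session.run(None, {input_name: x})
print(outputs[0].shape)
```

The point of this design is that the same session code runs unchanged on different hardware; only the provider list decides whether TensorRT, OpenVINO, or the plain CPU backend executes the model.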
In...