Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Computer Vision with TensorFlow 2 Leverage deep learning to create powerful image processing apps with TensorFlow 2.0 and Keras

Product type Paperback

Published in May 2019

Publisher Packt

ISBN-13 9781788830645

Length 372 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Computer Vision

Authors (2):

Eliot Andres

Benjamin Planche

View More author details

Table of Contents (16) Chapters

Preface

1. Section 1: TensorFlow 2 and Deep Learning Applied to Computer Vision FREE CHAPTER

2. Computer Vision and Neural Networks

3. TensorFlow Basics and Training a Model

4. Modern Neural Networks

5. Section 2: State-of-the-Art Solutions for Classic Recognition Problems

6. Influential Classification Tools

7. Object Detection Models

8. Enhancing and Segmenting Images

9. Section 3: Advanced Concepts and New Frontiers of Computer Vision

10. Training on Complex and Scarce Datasets

11. Video and Recurrent Neural Networks

12. Optimizing Models and Deploying on Mobile Devices

13. Migrating from TensorFlow 1 to TensorFlow 2

14. Assessments

15. Other Books You May Enjoy

Leave a review - let other readers know what you think

RoI pooling

The goal of the RoI pooling layer is simple—to take a part of the activation map of variable size and convert it into a fixed size. The input activation map sub-window is of size h × w. The target activation map is of size H × W. RoI pooling works by dividing its input into a grid where each cell is of size h/H × w/W.

Let's use an example. If the input is of size h × w = 5 × 4, and the target activation map is of size H × W = 2 × 2, then each cell should be of size 2.5 × 2. Because we can only use integers, we will make some cells of size 3 × 2 and others of size 2 × 2. Then, we will take the maximum of each cell:

Figure 5.13: Example of RoI pooling with an RoI of size 5 × 4 (from B3 to E7) and an output of size 2 × 2 (from J4 to K5)

An RoI pooling layer is very similar to a max-pooling layer. The difference is that RoI pooling works with inputs of variable size, while max-pooling...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Benjamin Planche

Dr. Benjamin Planche is a passionate research scientist in computer vision and machine learning. His main research efforts focus on data scarcity problems and industrial vision systems, leading to numerous patents and publications at international conferences. He worked in various research labs around the world (including in France, Japan, Germany, and the USA). Benjamin obtained his Ph.D. summa cum laude from the Faculty of Computer Science and Mathematics at the University of Passau, under the supervision of Prof. Dr. Harald Kosch. He also has a double master's degree from INSA-Lyon (France) and the University of Passau (Germany), with first-class honors and a multinational excellence award. He also likes sharing his knowledge and experience on various platforms or applying them to the creation of aesthetic demos.

See other products by Benjamin Planche

Eliot Andres

Eliot Andres is a freelance deep learning and computer vision engineer. He has more than 3 years' experience in the field, applying his skills to a variety of industries, such as banking, health, social media, and video streaming. Eliot has a double master's degree from cole des Ponts and Tlcom, Paris. His focus is industrialization: delivering value by applying new technologies to business problems. Eliot keeps his knowledge up to date by publishing articles on his blog and by building prototypes using the latest technologies.

See other products by Eliot Andres