You're reading from OpenCV 4 with Python Blueprints: Build creative computer vision projects with the latest version of OpenCV 4 and Python 3

Product type: Paperback

Published in: Mar 2020

Publisher: Packt

ISBN-13: 9781789801811

Length: 366 pages

Edition: 2nd Edition

Languages: Python

Tools: OpenCV

Concepts: Computer Vision

Authors (4):

Dr. Menua Gevorgyan

Michael Beyeler

Arsen Mamikonyan

Localizing with CNNs

Building your own localizer is a good way to develop intuition for how an object detection network works. The only conceptual difference between the two is that a localization network predicts a single bounding box, while an object detection network predicts multiple boxes. It is also a good starting point for understanding how to build neural networks for other regression tasks.
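To make the regression framing concrete, here is a minimal sketch in plain Python. It assumes, purely for illustration, that a bounding box is encoded as four corner coordinates scaled to the range [0, 1] and that the prediction is scored with a mean squared error; the helper names `normalize_box` and `mean_squared_error` and the example numbers are hypothetical and do not come from the book.

```python
def normalize_box(box, image_width, image_height):
    """Scale a pixel-space (x_min, y_min, x_max, y_max) box to the [0, 1] range."""
    x_min, y_min, x_max, y_max = box
    return (x_min / image_width, y_min / image_height,
            x_max / image_width, y_max / image_height)


def mean_squared_error(predicted, target):
    """Average squared difference between two equally long coordinate tuples."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)


# Hypothetical ground-truth box, given in pixels on a 224x224 image.
target = normalize_box((56, 28, 168, 196), image_width=224, image_height=224)

# Hypothetical network output: one box, that is, four numbers per image.
prediction = (0.25, 0.125, 0.75, 0.75)

# The regression loss shrinks as the predicted box approaches the target box.
loss = mean_squared_error(prediction, target)
print(target, loss)
```

Under this encoding, a localizer only has to output four numbers per image, whereas a detector has to output a set of such boxes.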

In this section, we are going to use the same pretrained classifier network as in the previous section, MobileNetV2. This time, however, we are going to use the network to localize objects instead of classifying them. Let's import the required modules and the base model in the same way that we did in the previous section. The difference is that, this time, we are not going to freeze the layers of the base model...
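The setup described above might look roughly like the following sketch. It assumes TensorFlow's Keras API and a 224x224 RGB input. The regression head (global average pooling followed by a four-unit sigmoid layer), the Adam optimizer, the mean squared error loss, and the function name `build_localizer` are illustrative assumptions, not necessarily the architecture used in the book.

```python
import tensorflow as tf


def build_localizer(input_shape=(224, 224, 3), weights="imagenet"):
    """Return a MobileNetV2-based model that regresses one bounding box."""
    # Pretrained MobileNetV2 without its classification layers.
    base_model = tf.keras.applications.MobileNetV2(
        input_shape=input_shape, include_top=False, weights=weights)

    # The base layers are not frozen, so they are fine-tuned during training.
    base_model.trainable = True

    # Assumed head: pool the feature map, then predict four coordinates.
    # The sigmoid keeps each coordinate in [0, 1], which presumes that the
    # target boxes are normalized by the image size.
    x = tf.keras.layers.GlobalAveragePooling2D()(base_model.output)
    box = tf.keras.layers.Dense(4, activation="sigmoid", name="box")(x)

    model = tf.keras.Model(inputs=base_model.input, outputs=box)
    # Mean squared error is one common choice for a regression loss.
    model.compile(optimizer="adam", loss="mse")
    return model
```

Calling `build_localizer()` downloads the ImageNet weights on first use; passing `weights=None` builds the same architecture with random initialization, which is convenient for a quick shape check.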

Authors (4)

Michael Beyeler

Michael Beyeler is a postdoctoral fellow in neuroengineering and data science at the University of Washington, where he is working on computational models of bionic vision in order to improve the perceptual experience of blind patients implanted with a retinal prosthesis (bionic eye). His work lies at the intersection of neuroscience, computer engineering, computer vision, and machine learning. He is also an active contributor to several open source software projects, and has professional programming experience in Python, C/C++, CUDA, MATLAB, and Android. Michael received a PhD in computer science from the University of California, Irvine, and an MSc in biomedical engineering and a BSc in electrical engineering from ETH Zurich, Switzerland.

Dr. Menua Gevorgyan

Dr. Menua Gevorgyan is an experienced researcher with a demonstrated history of working in the information technology and services industry. He is skilled in computer vision, deep learning, machine learning, and data science, and has extensive experience with OpenCV and Python programming. He is interested in machine perception and machine understanding problems, and wonders whether it is possible to make a machine perceive the world as a human does.

Arsen Mamikonyan

Arsen Mamikonyan is an experienced machine learning specialist with demonstrated work experience in Silicon Valley and London, and teaching experience at the American University of Armenia. He is skilled in applied machine learning and data science and has built real-life applications using Python and OpenCV, among others. He holds a master's degree in engineering (MEng) with a concentration on artificial intelligence from the Massachusetts Institute of Technology.
