Learn OpenCV 4 by Building Projects: Build real-world computer vision and image processing applications with OpenCV and C++

Product type: Paperback
Published in: Nov 2018
Publisher: Packt
ISBN-13: 9781789341225
Length: 310 pages
Edition: 2nd Edition
Languages: C++
Tools: OpenCV
Concepts: Computer Vision
Authors (3): David Millán Escrivá, Prateek Joshi, Vinícius G. Mendonça

Table of Contents (14 chapters)

Preface

1. Getting Started with OpenCV

2. An Introduction to the Basics of OpenCV

3. Learning Graphical User Interfaces

4. Delving into Histogram and Filters

5. Automated Optical Inspection, Object Segmentation, and Detection

6. Learning Object Classification

7. Detecting Face Parts and Overlaying Masks

8. Video Surveillance, Background Modeling, and Morphological Operations

9. Learning Object Tracking

10. Developing Segmentation Algorithms for Text Recognition

11. Text Recognition with Tesseract

12. Deep Learning with OpenCV

13. Other Books You May Enjoy

Computer vision and the machine learning workflow

Computer vision applications with machine learning have a common basic structure. This structure is divided into different steps:

Pre-process
Segmentation
Feature extraction
Classification result
Post-process

These steps are common to almost all computer vision applications, although some applications omit one or more of them. In the following diagram, you can see the different steps that are involved:

Almost all computer vision applications start with a Pre-process step applied to the input image, which consists of removing lighting effects and noise, filtering, blurring, and so on. Once all the required pre-processing has been applied to the input image, the second step is Segmentation. In this step, we extract the regions of interest in the image and isolate each one as a unique object of interest. For example, in a face detection system, we have to separate the faces...
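To make the first steps of this workflow concrete, here is a minimal sketch in plain C++ that uses only the standard library, so it can be compiled without OpenCV. It is an illustration under stated assumptions, not the book's implementation: `GrayImage`, `preprocess`, `segment`, `extractAreas`, and `runPipeline` are hypothetical names, the mean filter and fixed threshold are arbitrary choices, and object area is just one possible feature. In an OpenCV program, functions such as `cv::blur`, `cv::threshold`, and `cv::connectedComponentsWithStats` would typically play these roles. Classification and post-processing are left out.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical minimal 8-bit grayscale image (row-major); stands in for cv::Mat.
struct GrayImage {
    int rows;
    int cols;
    std::vector<std::uint8_t> pixels;

    std::uint8_t at(int r, int c) const { return pixels[r * cols + c]; }
};

// Step 1, pre-process: 3x3 mean filter to suppress isolated noise.
// Border pixels average only the neighbours that fall inside the image.
GrayImage preprocess(const GrayImage& in) {
    assert(in.rows > 0 && in.cols > 0);
    assert(in.pixels.size() == static_cast<std::size_t>(in.rows) * in.cols);
    GrayImage out{in.rows, in.cols, std::vector<std::uint8_t>(in.pixels.size())};
    for (int r = 0; r < in.rows; ++r) {
        for (int c = 0; c < in.cols; ++c) {
            int sum = 0;
            int count = 0;
            for (int dr = -1; dr <= 1; ++dr) {
                for (int dc = -1; dc <= 1; ++dc) {
                    const int rr = r + dr;
                    const int cc = c + dc;
                    if (rr < 0 || rr >= in.rows || cc < 0 || cc >= in.cols) continue;
                    sum += in.at(rr, cc);
                    ++count;
                }
            }
            out.pixels[r * in.cols + c] = static_cast<std::uint8_t>(sum / count);
        }
    }
    return out;
}

// Step 2, segmentation: threshold, then label 4-connected foreground regions.
// Returns one label per pixel (0 = background, 1..objectCount = objects).
std::vector<int> segment(const GrayImage& in, std::uint8_t threshold, int& objectCount) {
    std::vector<int> labels(in.pixels.size(), 0);
    objectCount = 0;
    const int dr[4] = {-1, 1, 0, 0};
    const int dc[4] = {0, 0, -1, 1};
    for (int r = 0; r < in.rows; ++r) {
        for (int c = 0; c < in.cols; ++c) {
            if (in.at(r, c) <= threshold || labels[r * in.cols + c] != 0) continue;
            ++objectCount;  // found an unlabelled foreground pixel: new object
            std::queue<std::pair<int, int>> frontier;
            frontier.push({r, c});
            labels[r * in.cols + c] = objectCount;
            while (!frontier.empty()) {  // breadth-first flood fill
                const std::pair<int, int> cur = frontier.front();
                frontier.pop();
                for (int k = 0; k < 4; ++k) {
                    const int nr = cur.first + dr[k];
                    const int nc = cur.second + dc[k];
                    if (nr < 0 || nr >= in.rows || nc < 0 || nc >= in.cols) continue;
                    if (in.at(nr, nc) <= threshold || labels[nr * in.cols + nc] != 0) continue;
                    labels[nr * in.cols + nc] = objectCount;
                    frontier.push({nr, nc});
                }
            }
        }
    }
    return labels;
}

// Step 3, feature extraction: area in pixels of each labelled object.
std::vector<int> extractAreas(const std::vector<int>& labels, int objectCount) {
    std::vector<int> areas(objectCount, 0);
    for (int label : labels) {
        if (label > 0) ++areas[label - 1];
    }
    return areas;
}

// Chains the three steps; the returned areas would feed a classifier.
std::vector<int> runPipeline(const GrayImage& input, std::uint8_t threshold) {
    int objectCount = 0;
    const GrayImage smoothed = preprocess(input);
    const std::vector<int> labels = segment(smoothed, threshold, objectCount);
    return extractAreas(labels, objectCount);
}
```

Calling `runPipeline(image, 127)` returns one area per detected object. Segmenting the raw image instead of the smoothed one would also report isolated noise pixels as separate objects, which is why the pre-processing step comes first.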

Authors (3)

David Millán Escrivá

David Millán Escrivá was 8 years old when he wrote his first program on an 8086 PC in Basic, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT with honors at the Universitat Politécnica de Valencia, specializing in human-computer interaction supported by computer vision with OpenCV (v0.96). He has worked with Blender, an open source 3D software project, and on its first commercial movie, Plumiferos, as a computer graphics software developer. David has more than 10 years' experience in IT, covering computer vision, computer graphics, pattern recognition, and machine learning, gained on different projects at various start-ups and companies. He currently works as a researcher in computer vision.

Joshi

Vijay Joshi is a full stack web developer with more than a decade of experience working with PHP and JavaScript.

Vinícius G. Mendonça

Vinícius G. Mendonça is a professor at PUCPR and a mentor at Apple Developer Academy. He has a master's degree in Computer Vision and Image Processing (PUCPR) and a specialization degree in Game Development (Universidade Positivo). He is one of the authors of the book Learn OpenCV 4 by Building Projects, published by Packt Publishing. He has been in this field since 1996. His former experience includes designing and programming a multithreaded framework for PBX tests at Siemens, coordination of Aurélio Dictionary software (including its apps for Android, iOS, and Windows phones), and coordination of an augmented reality educational activity for Positivo's Mesa Alfabeto, presented at CEBIT. Currently, he works with server-side Node.js at a company called Tenet Tech.