Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Newsletter Hub

Free Learning

You're reading from Learn OpenCV 4 by Building Projects Build real-world computer vision and image processing applications with OpenCV and C++

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781789341225

Length 310 pages

Edition 2nd Edition

Languages

C++

Tools

OpenCV

Concepts

Computer Vision

Authors (3):

Millán Escrivá

Vinícius G. Mendonça

Joshi

View More author details

Table of Contents (14) Chapters

Preface

1. Getting Started with OpenCV FREE CHAPTER

2. An Introduction to the Basics of OpenCV

3. Learning Graphical User Interfaces

4. Delving into Histogram and Filters

5. Automated Optical Inspection, Object Segmentation, and Detection

6. Learning Object Classification

7. Detecting Face Parts and Overlaying Masks

8. Video Surveillance, Background Modeling, and Morphological Operations

9. Learning Object Tracking

10. Developing Segmentation Algorithms for Text Recognition

11. Text Recognition with Tesseract

12. Deep Learning with OpenCV

13. Other Books You May Enjoy

Leave a review – let other readers know what you think

Introducing optical character recognition

Identifying text in an image is a very popular application for computer vision. This process is commonly called optical character recognition, and is divided as follows:

Text preprocessing and segmentation: During this step, the computer must deal with image noise, and rotation (skewing), and identify what areas are candidate text.
Text identification: This is the process of identifying each letter in text. Although this is also a computer vision topic, we will not show how you to do this in this book purely using OpenCV. Instead, we will show you how to use the Tesseract library to do this step, since it was integrated in OpenCV 3.0. If you are interested in learning how to do what Tesseract does by yourself, take a look at Packt's Mastering OpenCV book, which presents a chapter on car plate recognition.

The preprocessing and segmentation...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (3)

Millán Escrivá

David Millán Escrivá was 8 years old when he wrote his first program on an 8086 PC in Basic, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT with honors, through the Universitat Politécnica de Valencia, in human-computer interaction supported by computer vision with OpenCV (v0.96). He has worked with Blender, an open source, 3D software project, and on its first commercial movie, Plumiferos, as a computer graphics software developer. David has more than 10 years' experience in IT, with experience in computer vision, computer graphics, pattern recognition, and machine learning, working on different projects, and at different start-ups, and companies. He currently works as a researcher in computer vision.

See other products by Millán Escrivá

Vinícius G. Mendonça

Vinícius G. Mendonça is a professor at PUCPR and a mentor at Apple Developer Academy. He has a master's degree in Computer Vision and Image Processing (PUCPR) and a specialization degree in Game Development (Universidade Positivo). He is also one of the authors of the book Learn OpenCV 4 by Building Projects, also by Packt Publishing. He has been in this field since 1996. His former experience includes designing and programming a multithreaded framework for PBX tests at Siemens, coordination of Aurélio Dictionary software (including its apps for Android, IOS, and Windows phones), and coordination of an augmented reality educational activity for Positivo's Mesa Alfabeto, presented at CEBIT. Currently, he works with server-side Node.js at a company called Tenet Tech.

See other products by Vinícius G. Mendonça

Joshi

Prateek Joshi is the founder of Plutoshift and a published author of 9 books on Artificial Intelligence. He has been featured on Forbes 30 Under 30, NBC, Bloomberg, CNBC, TechCrunch, and The Business Journals. He has been an invited speaker at conferences such as TEDx, Global Big Data Conference, Machine Learning Developers Conference, and Silicon Valley Deep Learning. Apart from Artificial Intelligence, some of the topics that excite him are number theory, cryptography, and quantum computing. His greater goal is to make Artificial Intelligence accessible to everyone so that it can impact billions of people around the world.

See other products by Joshi