You're reading from Learn OpenCV 4 by Building Projects: Build real-world computer vision and image processing applications with OpenCV and C++

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781789341225

Length 310 pages

Edition 2nd Edition

Languages

C++

Tools

OpenCV

Concepts

Computer Vision

Authors (3):

David Millán Escrivá

Prateek Joshi

Vinícius G. Mendonça

Table of Contents (14) Chapters

Preface

1. Getting Started with OpenCV

2. An Introduction to the Basics of OpenCV FREE CHAPTER

3. Learning Graphical User Interfaces

4. Delving into Histogram and Filters

5. Automated Optical Inspection, Object Segmentation, and Detection

6. Learning Object Classification

7. Detecting Face Parts and Overlaying Masks

8. Video Surveillance, Background Modeling, and Morphological Operations

9. Learning Object Tracking

10. Developing Segmentation Algorithms for Text Recognition

11. Text Recognition with Tesseract

12. Deep Learning with OpenCV

13. Other Books You May Enjoy

Summary

In this chapter, we presented a brief introduction to OCR applications. We saw that the preprocessing phase of such systems must be adjusted according to the type of document we are planning to identify. We learned about common preprocessing operations for text images, such as thresholding, cropping, skew correction, and text region segmentation. Finally, we learned how to install and use Tesseract OCR to convert our image into text.
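
As an illustration of the thresholding step mentioned above, here is a minimal sketch in plain C++ with no library dependencies. It assumes Otsu's method, which the summary does not name, so treat the choice of algorithm as an assumption; the function names `otsuThreshold` and `binarize` are hypothetical and do not come from the book. In OpenCV itself, the same effect is usually obtained by passing the `cv::THRESH_OTSU` flag to `cv::threshold`.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: returns the Otsu threshold t for 8-bit grayscale
// pixels, where values <= t form one class and values > t the other.
// Returns 0 when the input is empty or contains a single gray level.
int otsuThreshold(const std::vector<std::uint8_t>& pixels) {
    // Build the 256-bin intensity histogram.
    std::array<std::size_t, 256> hist{};
    for (std::uint8_t p : pixels) ++hist[p];

    const double total = static_cast<double>(pixels.size());
    double sumAll = 0.0;
    for (int i = 0; i < 256; ++i) sumAll += i * static_cast<double>(hist[i]);

    double weightLow = 0.0;  // number of pixels with value <= t
    double sumLow = 0.0;     // sum of intensities of those pixels
    double bestVariance = -1.0;
    int best = 0;

    for (int t = 0; t < 256; ++t) {
        weightLow += static_cast<double>(hist[t]);
        sumLow += t * static_cast<double>(hist[t]);
        if (weightLow == 0.0) continue;        // low class still empty
        const double weightHigh = total - weightLow;
        if (weightHigh == 0.0) break;          // high class empty from here on

        const double meanLow = sumLow / weightLow;
        const double meanHigh = (sumAll - sumLow) / weightHigh;
        const double diff = meanLow - meanHigh;
        // Between-class variance (up to a constant factor).
        const double variance = weightLow * weightHigh * diff * diff;
        if (variance > bestVariance) {
            bestVariance = variance;
            best = t;
        }
    }
    return best;
}

// Hypothetical helper: maps pixels above the threshold to 255, others to 0.
std::vector<std::uint8_t> binarize(const std::vector<std::uint8_t>& pixels,
                                   int threshold) {
    std::vector<std::uint8_t> out;
    out.reserve(pixels.size());
    for (std::uint8_t p : pixels) {
        out.push_back(p > threshold ? 255 : 0);
    }
    return out;
}
```

A caller would compute the threshold once per image (or per cropped text region) and then pass it to `binarize` before handing the result to the OCR engine. Whether a global threshold is adequate depends on the document type, as the summary notes.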

In the next chapter, we'll use a more sophisticated OCR technique to identify text in a casually taken picture or video, a situation known as scene text recognition. This is a much more complex scenario, since the text can be anywhere, in any font, and under different illumination and orientations. There may even be no text at all! We'll also learn how to use the OpenCV 3.0 text contribution module, which is fully integrated...

Authors (3)

David Millán Escrivá

David Millán Escrivá was 8 years old when he wrote his first program on an 8086 PC in BASIC, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT with honors at the Universitat Politécnica de Valencia, in human-computer interaction supported by computer vision with OpenCV (v0.96). He has worked with Blender, an open source 3D software project, and on its first commercial movie, Plumiferos, as a computer graphics software developer. David has more than 10 years' experience in IT, covering computer vision, computer graphics, pattern recognition, and machine learning, across different projects, start-ups, and companies. He currently works as a researcher in computer vision.

Joshi

Vijay Joshi is a full-stack web developer with more than a decade of experience working with PHP and JavaScript.

Vinícius G. Mendonça

Vinícius G. Mendonça is a professor at PUCPR and a mentor at Apple Developer Academy. He has a master's degree in Computer Vision and Image Processing (PUCPR) and a specialization degree in Game Development (Universidade Positivo). He is also one of the authors of the book Learn OpenCV 4 by Building Projects, also by Packt Publishing. He has been in this field since 1996. His former experience includes designing and programming a multithreaded framework for PBX tests at Siemens, coordination of Aurélio Dictionary software (including its apps for Android, iOS, and Windows phones), and coordination of an augmented reality educational activity for Positivo's Mesa Alfabeto, presented at CeBIT. Currently, he works with server-side Node.js at a company called Tenet Tech.
