Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Learn OpenCV 4 by Building Projects Build real-world computer vision and image processing applications with OpenCV and C++

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781789341225

Length 310 pages

Edition 2nd Edition

Languages

C++

Tools

OpenCV

Concepts

Computer Vision

Authors (3):

David Millán Escrivá

Prateek Joshi

Vinícius G. Mendonça

View More author details

Table of Contents (14) Chapters

Preface

1. Getting Started with OpenCV FREE CHAPTER

2. An Introduction to the Basics of OpenCV

3. Learning Graphical User Interfaces

4. Delving into Histogram and Filters

5. Automated Optical Inspection, Object Segmentation, and Detection

6. Learning Object Classification

7. Detecting Face Parts and Overlaying Masks

8. Video Surveillance, Background Modeling, and Morphological Operations

9. Learning Object Tracking

10. Developing Segmentation Algorithms for Text Recognition

11. Text Recognition with Tesseract

12. Deep Learning with OpenCV

13. Other Books You May Enjoy

Leave a review – let other readers know what you think

Developing Segmentation Algorithms for Text Recognition

In the previous chapters, we learned about a wide range of image processing techniques such as thresholding, contours descriptors, and mathematical morphology. In this chapter, we will discuss common problems that you may face while dealing with scanned documents, such as identifying where the text is or adjusting its rotation. We will also learn how to combine techniques presented in the previous chapters to solve those problems. By the end of this chapter, we will have segmented regions of text that can be sent to an optical character recognition (OCR) library.

By the end of this chapter, you should be able to answer the following questions:

What kind of OCR applications exists?
What are the common problems while writing an OCR application?
How do I identify regions of documents?
How do I deal with problems like skewing...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (3)

Millán Escrivá

David Millán Escrivá was 8 years old when he wrote his first program on an 8086 PC in Basic, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT with honors, through the Universitat Politécnica de Valencia, in human-computer interaction supported by computer vision with OpenCV (v0.96). He has worked with Blender, an open source, 3D software project, and on its first commercial movie, Plumiferos, as a computer graphics software developer. David has more than 10 years' experience in IT, with experience in computer vision, computer graphics, pattern recognition, and machine learning, working on different projects, and at different start-ups, and companies. He currently works as a researcher in computer vision.

See other products by Millán Escrivá

Joshi

Vijay Joshi is a full stack web developer having more than a decade of experience in working with PHP and JavaScript.

See other products by Joshi

Vinícius G. Mendonça

Vinícius G. Mendonça is a professor at PUCPR and a mentor at Apple Developer Academy. He has a master's degree in Computer Vision and Image Processing (PUCPR) and a specialization degree in Game Development (Universidade Positivo). He is also one of the authors of the book Learn OpenCV 4 by Building Projects, also by Packt Publishing. He has been in this field since 1996. His former experience includes designing and programming a multithreaded framework for PBX tests at Siemens, coordination of Aurélio Dictionary software (including its apps for Android, IOS, and Windows phones), and coordination of an augmented reality educational activity for Positivo's Mesa Alfabeto, presented at CEBIT. Currently, he works with server-side Node.js at a company called Tenet Tech.

See other products by Vinícius G. Mendonça