In the previous chapters, we did a lot of work with videos and cameras. We created applications (Gazer and Facetious) with which we can play video from webcams attached to our computers. We can also record videos, take photos, detect motion and faces, and apply masks to faces detected in the video feed in real time with these apps.
Now we will move our focus to the text in images. There are many situations in which we want to extract the text or characters from an image. In the area of computer vision, there is a technology called Optical Character Recognition (OCR) to do this kind of work automatically instead of transcribing the text manually. In this chapter, we will build a new application to extract text from images and scanned documents with Qt and a number of OCR libraries.
We will cover the following topics in this chapter:
- Extracting text...