Extracting text and images
In this section, we will look at how to extract any text from a PDF file and save it to a text file. The walk-through will also show how a PDF can be saved as an image file.
Extracting text from a PDF file
When working with PDF files, we often have to read the text contained within them in order to process the text. A good example would be extracting the text from an invoice in PDF format. This text includes product information, including a description, the quantity, and the costs. As part of a business role, you may then validate the information before posting it to a purchase ledger. In the following walk-through, you will extract the text from the Chapter14_Letter.pdf
sample PDF file. You may remember this file; it's one of the sample loan letters used in Chapter 12, Automation Using Word. You will begin by adding the comments as usual.
Let's start this walk-through by executing the following steps:
- Log in to Control Room. ...