Extracting information from PDF documents
The ctx.pdf library is the SDK extension library for extracting data from PDF documents. It supports only text-based PDF documents, from which it can extract the complete text, extract part of the text from a given position, or search for text that matches a regular expression. Image-based PDF documents, such as scanned documents, are not supported at the time of writing.
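To make the regular-expression search idea concrete, here is a minimal sketch in plain JavaScript. It does not call ctx.pdf at all: findMatches is a hypothetical helper, and the sample string stands in for text that is assumed to have already been extracted from a text-based PDF.

```javascript
// Hypothetical helper: not part of ctx.pdf. It works on plain text that is
// assumed to have already been extracted from a text-based PDF.
function findMatches(text, pattern) {
  // Force the global flag so matchAll can iterate over every occurrence.
  const flags = pattern.flags.includes("g") ? pattern.flags : pattern.flags + "g";
  const regex = new RegExp(pattern.source, flags);
  // Keep the matched value and its character position in the text.
  return Array.from(text.matchAll(regex), (m) => ({ value: m[0], index: m.index }));
}

// Sample text standing in for the extracted content of a PDF page.
const extractedText = "Invoice No: INV-1001\nDate: 2020-03-15\nOrder Ref: INV-1002";

// Search for invoice numbers of the (assumed) form INV-<digits>.
const invoiceNumbers = findMatches(extractedText, /INV-\d+/);
console.log(invoiceNumbers.map((m) => m.value));
```

Returning the position alongside each match mirrors the idea of locating text within a document, but the shape of the result here is an illustrative choice, not the format the library itself returns.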
The ctx.pdf library provides many methods for extracting information from a PDF document. Developers can either call these methods directly in source code, by adding the Custom activity to a workflow, or use the activities available under the PDF activity group in the Activity panel. To use the ctx.pdf library directly in source code, ensure that the library is included in the project by selecting the PDF library under the Framework Extensions folder in the Libraries tab of the Edit Project dialog. Including this library manually in...