Overview of existing Document AI processors
As discussed previously, the Document AI platform provides prebuilt parsers for general-purpose, as well as some specialized, use cases. As these processors are prebuilt, they are readily available to use in any relevant use case with very little effort. Before jumping into an example of how these processors work, let’s first look at the list of available processors as part of Google Cloud’s Document AI platform:
- Document OCR: Identify and extract both machine-printed as well as handwritten text from documents in over 200 languages
- Form Parser: Extract key-value pairs (entity and checkbox), tables, and generic entities in addition to OCR text
- Intelligent Document Quality Processor: Assesses the quality of documents based on their readability and provides a quality score
- Document Splitter: Automatically splits documents based on logical boundaries
Document AI provides us with numerous specialized processors...