You're reading from Intelligent Document Processing with AWS AI/ML A comprehensive guide to building IDP pipelines with applications across industries

Product type Paperback

Published in Oct 2022

Publisher Packt

ISBN-13 9781801810562

Length 246 pages

Edition 1st Edition

Languages Processing

Tools AWS

Concepts Data Science

Author (1):

Sonali Sahu

Table of Contents (16) Chapters

Preface

1. Part 1: Accurate Extraction of Documents and Categorization

2. Chapter 1: Intelligent Document Processing with AWS AI and ML

3. Chapter 2: Document Capture and Categorization

4. Chapter 3: Accurate Document Extraction with Amazon Textract

5. Chapter 4: Accurate Extraction with Amazon Comprehend

6. Part 2: Enrichment of Data and Post-Processing of Data

7. Chapter 5: Document Enrichment in Intelligent Document Processing

8. Chapter 6: Review and Verification of Intelligent Document Processing

9. Chapter 7: Accurate Extraction, and Health Insights with Amazon HealthLake

10. Part 3: Intelligent Document Processing in Industry Use Cases

11. Chapter 8: IDP Healthcare Industry Use Cases

12. Chapter 9: Intelligent Document Processing – Insurance Industry

13. Chapter 10: Intelligent Document Processing – Mortgage Processing

14. Index

Why subscribe?

15. Other Books You May Enjoy

Summary

In this chapter, we discussed the extraction stage of an IDP pipeline and how Amazon Textract can be used to accurately extract elements from documents. Documents come in different types: unstructured documents made up of dense text, semi-structured documents such as forms, and structured documents such as tables. We walked through sample code and its API responses for accurately extracting elements from any type of scanned document.
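As a reminder of what working with an extraction response looks like, here is a minimal sketch, not the chapter's own sample code. It assumes the boto3 Textract client's detect_document_text call and the Blocks / BlockType / Text / Confidence fields of its response. The helper names, the confidence threshold, and the sample response are hypothetical and written by hand for illustration.

```python
from typing import Any


def extract_lines(response: dict[str, Any], min_confidence: float = 0.0) -> list[str]:
    """Return the text of every LINE block at or above min_confidence."""
    return [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE"
        and block.get("Confidence", 0.0) >= min_confidence
    ]


def detect_text_in_s3(bucket: str, key: str) -> dict[str, Any]:
    """Call Textract on a document stored in S3 (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so extract_lines can be used offline

    client = boto3.client("textract")
    return client.detect_document_text(
        Document={"S3Object": {"Bucket": bucket, "Name": key}}
    )


# Hand-written sample in the shape of a Textract response (not real output).
sample_response = {
    "Blocks": [
        {"BlockType": "PAGE", "Id": "p1"},
        {"BlockType": "LINE", "Id": "l1", "Text": "Patient Intake Form", "Confidence": 99.1},
        {"BlockType": "WORD", "Id": "w1", "Text": "Patient", "Confidence": 99.3},
        {"BlockType": "LINE", "Id": "l2", "Text": "Name: Jane Doe", "Confidence": 62.4},
    ]
}

print(extract_lines(sample_response))
print(extract_lines(sample_response, min_confidence=90.0))
```

Filtering on Confidence is one simple way to decide which lines to trust automatically and which to route to a later review step; the right threshold depends on the documents involved.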

We then reviewed the need for accurate extraction of elements from specialized document types: ID documents, such as a US driver’s license or a US passport, and invoices and receipts. We discussed Amazon Textract’s analyze_id and analyze_expense APIs, which extract elements from ID documents and from invoices and receipts, respectively. We also walked through sample code for accurate extraction from these specialized document types.
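The two specialized APIs return normalized field types alongside the detected values. The sketch below is a hedged illustration, not the book's sample code. It assumes the IdentityDocuments / IdentityDocumentFields shape of an analyze_id response and the ExpenseDocuments / SummaryFields shape of an analyze_expense response. The helper names and both sample responses are hypothetical and hand-written.

```python
from typing import Any


def id_fields(response: dict[str, Any]) -> dict[str, str]:
    """Map field types (e.g. FIRST_NAME) to non-empty values for the first ID document."""
    documents = response.get("IdentityDocuments", [])
    if not documents:
        return {}
    fields = {}
    for field in documents[0].get("IdentityDocumentFields", []):
        value = field.get("ValueDetection", {}).get("Text", "")
        if value:  # skip fields that are absent on the document
            fields[field["Type"]["Text"]] = value
    return fields


def expense_summary(response: dict[str, Any]) -> list[dict[str, str]]:
    """Return one {field type: value} mapping per expense document."""
    summaries = []
    for document in response.get("ExpenseDocuments", []):
        summaries.append(
            {
                field["Type"]["Text"]: field.get("ValueDetection", {}).get("Text", "")
                for field in document.get("SummaryFields", [])
            }
        )
    return summaries


# Hand-written samples in the shape of Textract responses (not real output).
# With boto3 and AWS credentials, real responses would come from, for example:
#   client.analyze_id(DocumentPages=[{"S3Object": {"Bucket": ..., "Name": ...}}])
#   client.analyze_expense(Document={"S3Object": {"Bucket": ..., "Name": ...}})
sample_id_response = {
    "IdentityDocuments": [
        {
            "DocumentIndex": 1,
            "IdentityDocumentFields": [
                {"Type": {"Text": "FIRST_NAME"}, "ValueDetection": {"Text": "JANE", "Confidence": 98.7}},
                {"Type": {"Text": "LAST_NAME"}, "ValueDetection": {"Text": "DOE", "Confidence": 98.9}},
                {"Type": {"Text": "MIDDLE_NAME"}, "ValueDetection": {"Text": "", "Confidence": 99.0}},
            ],
        }
    ]
}

sample_expense_response = {
    "ExpenseDocuments": [
        {
            "ExpenseIndex": 1,
            "SummaryFields": [
                {"Type": {"Text": "VENDOR_NAME"}, "ValueDetection": {"Text": "Example Store"}},
                {"Type": {"Text": "TOTAL"}, "LabelDetection": {"Text": "Total"}, "ValueDetection": {"Text": "$42.50"}},
            ],
            "LineItemGroups": [],
        }
    ]
}

print(id_fields(sample_id_response))
print(expense_summary(sample_expense_response))
```

Keying on the normalized Type text rather than on the label printed on the document is what lets the same downstream code handle IDs or receipts with different layouts.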

In the next chapter, we will extend the extraction stage...
