You're reading from Intelligent Document Processing with AWS AI/ML A comprehensive guide to building IDP pipelines with applications across industries

Product type Paperback

Published in Oct 2022

Publisher Packt

ISBN-13 9781801810562

Length 246 pages

Edition 1st Edition

Author (1):

Sonali Sahu

Table of Contents (16) Chapters

Preface

1. Part 1: Accurate Extraction of Documents and Categorization

2. Chapter 1: Intelligent Document Processing with AWS AI and ML

3. Chapter 2: Document Capture and Categorization

4. Chapter 3: Accurate Document Extraction with Amazon Textract

5. Chapter 4: Accurate Extraction with Amazon Comprehend

6. Part 2: Enrichment of Data and Post-Processing of Data

7. Chapter 5: Document Enrichment in Intelligent Document Processing

8. Chapter 6: Review and Verification of Intelligent Document Processing

9. Chapter 7: Accurate Extraction, and Health Insights with Amazon HealthLake

10. Part 3: Intelligent Document Processing in Industry Use Cases

11. Chapter 8: IDP Healthcare Industry Use Cases

12. Chapter 9: Intelligent Document Processing – Insurance Industry

13. Chapter 10: Intelligent Document Processing – Mortgage Processing

14. Index

15. Other Books You May Enjoy

Understanding data capture with Amazon S3

Document capture, or ingestion, is the process of aggregating all our data in a secure, centralized, scalable data store. When building the data capture stage of your IDP pipeline, you have to consider the data sources, the data formats, and the data store.

Data store

The first step is to store our documents for transformation. We can use any type of document store for this, such as a local filesystem or Amazon S3. Because this IDP pipeline leverages AWS AI services, we recommend Amazon S3 as an easier, more secure, and more scalable document store. Amazon S3 is an object storage service that offers industry-leading scalability, data availability, security, and performance. It is designed for 99.999999999% (11 nines) of durability, and millions of customers around the world use it as their data store.
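As a rough illustration of storing a captured document in Amazon S3, the following minimal sketch uses the `upload_file` method of a boto3 S3 client. The helper names (`build_object_key`, `upload_document`), the `raw/<source>/<filename>` key layout, and the choice of SSE-S3 (`AES256`) encryption are assumptions made for this example, not something the book prescribes.

```python
from pathlib import Path, PurePosixPath


def build_object_key(source: str, filename: str, prefix: str = "raw") -> str:
    """Build an object key of the form '<prefix>/<source>/<file name>'.

    The layout is a hypothetical convention that groups documents by
    the data source they were captured from.
    """
    # Keep only the final path component so local directories
    # do not leak into the object key.
    name = Path(filename).name
    return str(PurePosixPath(prefix) / source / name)


def upload_document(s3_client, local_path: str, bucket: str, source: str) -> str:
    """Upload one local document and return its S3 URI.

    `s3_client` is expected to be a boto3 S3 client, for example
    boto3.client("s3"); it is passed in so the function stays easy to test.
    """
    key = build_object_key(source, local_path)
    s3_client.upload_file(
        local_path,
        bucket,
        key,
        # Assumption: request server-side encryption with S3-managed keys.
        ExtraArgs={"ServerSideEncryption": "AES256"},
    )
    return f"s3://{bucket}/{key}"
```

With boto3 installed and AWS credentials configured, a call such as `upload_document(boto3.client("s3"), "claim.pdf", "my-idp-bucket", "scanner")` would place the file under `raw/scanner/claim.pdf`; the bucket name here is a placeholder.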

Many companies in regulated industries, such as GE Healthcare, use Amazon S3 for data storage during their digital transformation...
