Intelligent Document Processing with AWS AI/ML

A comprehensive guide to building IDP pipelines with applications across industries

Product type Paperback
Published in Oct 2022
Publisher Packt
ISBN-13 9781801810562
Length 246 pages
Edition 1st Edition
Author: Sonali Sahu

Table of Contents

Preface
1. Part 1: Accurate Extraction of Documents and Categorization
2. Chapter 1: Intelligent Document Processing with AWS AI and ML
3. Chapter 2: Document Capture and Categorization
4. Chapter 3: Accurate Document Extraction with Amazon Textract
5. Chapter 4: Accurate Extraction with Amazon Comprehend
6. Part 2: Enrichment of Data and Post-Processing of Data
7. Chapter 5: Document Enrichment in Intelligent Document Processing
8. Chapter 6: Review and Verification of Intelligent Document Processing
9. Chapter 7: Accurate Extraction, and Health Insights with Amazon HealthLake
10. Part 3: Intelligent Document Processing in Industry Use Cases
11. Chapter 8: IDP Healthcare Industry Use Cases
12. Chapter 9: Intelligent Document Processing – Insurance Industry
13. Chapter 10: Intelligent Document Processing – Mortgage Processing
14. Index
15. Other Books You May Enjoy

Understanding data capture with Amazon S3

Document capture, or ingestion, is the process of aggregating all our data in a secure, centralized, scalable data store. When building the data capture stage of your IDP pipeline, you have to consider three things: the data sources, the data formats, and the data store.

Data store

The first step is to store the documents that the pipeline will transform. Any type of document store works, such as a local filesystem or Amazon S3. Because this IDP pipeline leverages AWS AI services, we recommend Amazon S3 as an easier, more secure, and more scalable option. Amazon S3 is an object storage service that offers industry-leading scalability, data availability, security, and performance. It is designed for 99.999999999% (11 nines) of durability, and millions of customers around the world use it as their data store.
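
As an illustration of storing a captured document in Amazon S3, the following is a minimal sketch rather than the book's own code. It assumes the boto3 SDK is installed and AWS credentials are configured. The helper names build_object_key and upload_document, the raw/<source>/<yyyy>/<mm>/<dd>/ key layout, and the choice of SSE-S3 encryption are all hypothetical choices made for this example.

```python
from datetime import date
from pathlib import PurePosixPath


def build_object_key(source: str, filename: str, received: date) -> str:
    """Return an S3 object key such as 'raw/email/2022/10/05/claim.pdf'.

    The 'raw/<source>/<yyyy>/<mm>/<dd>/' layout is a hypothetical convention
    chosen for this sketch; Amazon S3 does not require any particular layout.
    """
    # Normalize Windows separators, then keep only the file name itself.
    name = PurePosixPath(filename.replace("\\", "/")).name
    if not name:
        raise ValueError("filename must contain a file name")
    return f"raw/{source.strip('/').lower()}/{received:%Y/%m/%d}/{name}"


def upload_document(local_path: str, bucket: str, key: str) -> None:
    """Upload one local file to S3, requesting server-side encryption (SSE-S3)."""
    # Imported lazily so build_object_key can be used without the AWS SDK.
    import boto3

    s3 = boto3.client("s3")
    s3.upload_file(
        local_path,
        bucket,
        key,
        ExtraArgs={"ServerSideEncryption": "AES256"},
    )
```

A call might look like upload_document("claim.pdf", "my-idp-bucket", build_object_key("email", "claim.pdf", date.today())), where my-idp-bucket is a placeholder bucket name. Grouping keys by data source and arrival date is one way to keep documents from different sources separated for later pipeline stages.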

Many organizations in regulated industries, such as GE Healthcare, use Amazon S3 for data storage during their digital transformation...
