Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
Intelligent Document Processing with AWS AI/ML

You're reading from   Intelligent Document Processing with AWS AI/ML A comprehensive guide to building IDP pipelines with applications across industries

Arrow left icon
Product type Paperback
Published in Oct 2022
Publisher Packt
ISBN-13 9781801810562
Length 246 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Sonali Sahu Sonali Sahu
Author Profile Icon Sonali Sahu
Sonali Sahu
Arrow right icon
View More author details
Toc

Table of Contents (16) Chapters Close

Preface 1. Part 1: Accurate Extraction of Documents and Categorization
2. Chapter 1: Intelligent Document Processing with AWS AI and ML FREE CHAPTER 3. Chapter 2: Document Capture and Categorization 4. Chapter 3: Accurate Document Extraction with Amazon Textract 5. Chapter 4: Accurate Extraction with Amazon Comprehend 6. Part 2: Enrichment of Data and Post-Processing of Data
7. Chapter 5: Document Enrichment in Intelligent Document Processing 8. Chapter 6: Review and Verification of Intelligent Document Processing 9. Chapter 7: Accurate Extraction, and Health Insights with Amazon HealthLake 10. Part 3: Intelligent Document Processing in Industry Use Cases
11. Chapter 8: IDP Healthcare Industry Use Cases 12. Chapter 9: Intelligent Document Processing – Insurance Industry 13. Chapter 10: Intelligent Document Processing – Mortgage Processing 14. Index 15. Other Books You May Enjoy

Intelligent Document Processing with AWS AI and ML

It was a Wednesday evening – I was busy collecting all my receipts and filling out my insurance claim document. I wanted my health insurance to provide reimbursement for the COVID-19 test kits that I had purchased. The next day, I went to the post office to send the documents through postal mail to my insurance provider. This made me think how we are still working with physical documents in the 21st century. With my approximate math, this month alone, we will use 650 million documents per month, considering that 2% of the entire US population buys a test kit and applies for reimbursement using a paper-based application. This is a ton of documents in this instance. In addition to physical copies, we may have tons of documents that might just be scanned documents – we are looking at manual processing for these documents too. Can we do any better in the 21st century to automate the processing of these documents?

Besides this particular instance, we use documents for many other use cases across industries, such as claims processing in the insurance industry, loan, and mortgage documents in the financial industry, and legal and contract documents. If you have bought a house or refinanced a house, you will already be aware of the number of documents that you need to use for loan processing. IDC predicts worldwide data to exceed 175 zettabytes by 2025. The volume of data is huge. On top of the volume of data, we are talking about data of different formats and unstructured – some are forms, as with insurance claims, and some can be dense text, as with legal contractual documents. The volume and varying formats of documents make manual processing time-consuming, error-prone, and expensive. According to IDC, there is a 23% growth in data every year. The immense scale and format of documents make it a challenge to process them. Moreover, the legacy or traditional document extraction technologies can work well for pristine documents, but when document quality varies, the performance of those early-generation systems frequently does not meet customer needs. Manual document extraction carried out by a human workforce introduces variability into the process since people make mistakes and double-checking all work is not cost-effective. The most important of these factors is the ability to get the key information from the documents into your decision-making systems to make high-quality decisions more quickly and based on accurate information. Hence, we are all looking for efficient, less time-consuming, cost-effective ways to process our documents for better insights.

In this introductory chapter, we will be establishing the basic context to familiarize you with some of the underlying concepts of document processing, the challenges in document processing, and how AWS Artificial Intelligence (AI)/Machine Learning (ML) services can help solve these problems.

We will be covering the following topics in this chapter:

  • Understanding common document processing use cases across industries
  • Understanding the AWS ML and AI stack
  • Introducing Intelligent Document Processing pipeline
You have been reading a chapter from
Intelligent Document Processing with AWS AI/ML
Published in: Oct 2022
Publisher: Packt
ISBN-13: 9781801810562
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image