Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Intelligent Document Capture with Ephesoft, Second Edition
Intelligent Document Capture with Ephesoft, Second Edition

Intelligent Document Capture with Ephesoft, Second Edition: Automate the processing of scanned and digital documents by improving accuracy using web-based open and modern intelligent document capture software , Second Edition

Arrow left icon
Profile Icon Pat Myers Profile Icon Ike Kavas Profile Icon Michael Muller Profile Icon Jon Solove
Arrow right icon
£16.99 per month
Full star icon Full star icon Full star icon Full star icon Full star icon 5 (1 Ratings)
Paperback Aug 2015 164 pages 2nd Edition
eBook
£23.99 £26.99
Paperback
£32.99
Subscription
Free Trial
Renews at £16.99p/m
Arrow left icon
Profile Icon Pat Myers Profile Icon Ike Kavas Profile Icon Michael Muller Profile Icon Jon Solove
Arrow right icon
£16.99 per month
Full star icon Full star icon Full star icon Full star icon Full star icon 5 (1 Ratings)
Paperback Aug 2015 164 pages 2nd Edition
eBook
£23.99 £26.99
Paperback
£32.99
Subscription
Free Trial
Renews at £16.99p/m
eBook
£23.99 £26.99
Paperback
£32.99
Subscription
Free Trial
Renews at £16.99p/m

What do you get with a Packt Subscription?

Free for first 7 days. £16.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing
Table of content icon View table of contents Preview book icon Preview Book

Intelligent Document Capture with Ephesoft, Second Edition

Chapter 1. A Quick Tour of Ephesoft

As an introduction to Ephesoft, we will first walk you through the user interface and then examine the installation folder. The locations of certain files and folders within the Ephesoft installation are important because an administrator must make changes here to enable some features.

In this chapter, we will examine the following aspects of Ephesoft:

  • The user interface
  • The installation folder

The user interface

After logging in, users can access Ephesoft's features from an automatically hiding menu of navigation items that we will refer to as the side navigation. To display this menu, simply move your mouse cursor to the left-hand side of the browser window.

Ephesoft has organized this side navigation so that administrative features are separate from the common functions that operators use. Operators typically submit batches and review and validate Ephesoft's output, supplying additional information about the document images being processed.

Administrators enable these activities by defining the operations to be performed on each type of batch. Administrators also monitor and control the processing of the batches.

The user interface

Ephesoft's navigation menu

Administrative features

The side navigation provides links to five areas of the system that are commonly used by administrators:

  • Batch class management
  • Batch instance management
  • Folder management
  • System configuration
  • Reports

Batch class management

A batch class defines a set of operations that should be performed on the page images that are provided as input. A batch class consists of document types, document fields, batch class fields, e-mail configuration, and workflow/plugin configuration. The Batch Class Management interface allows administrators to create, modify, edit, and delete batch classes.

Batch class management

Ephesoft's batch class management user interface

The batch class management interface displays a list of batch classes. Administrators can open a batch class to configure the following:

  • Document types: The documents that will be processed in the batch class are configured here. Each document type is described by a distinct set of properties called fields. Rules can be configured to extract information from the document into the fields, thereby automating the process of indexing the document.
  • Modules: Modules are the major steps in the processing of documents. Each module is implemented by a series of plugins.
  • E-mail configuration: In this portion of the administrative interface, users can provide connection information for an e-mail account, and Ephesoft will process e-mails sent to this address. Ephesoft processes both the e-mail body and the attached documents.
  • Scanner profiles: This is where administrators can associate one or more scanner configurations with each batch class. These profiles are available in the web scanner.
  • CMIS import: CMIS is a standard protocol for communicating with document repositories. Ephesoft can use CMIS to monitor a standards-compliant document repository for input.
  • Batch class fields: Ephesoft can associate information with a batch (the group of page images that are processed together) as a whole. Each piece of information associated with a batch is called a batch class field. Batch class fields are applied to batches and should not be confused with document fields, which contain information that applies to individual documents.

Batch instance management

A batch instance is a set of page images processed together. The terms batch and batch instance are usually interchangeable. This area within the administrative interface allows administrators to see the status of batches, reprioritize batches, and restart batches in a previous processing step.

Batch instance management

Ephesoft's batch instance management user interface

Folder management

The folder management interface allows the administrator to upload files for batch class configuration. These files are also accessible from the installation folder, but this is often a more convenient way to manipulate these files.

Folder management

Ephesoft's folder management user interface

System configuration

This administrative interface allows users to manage Ephesoft in ways that are not specific to a batch class or instance.

System configuration

Ephesoft's system configuration user interface

System configuration allows the modification of the following features:

  • Regex pool: The regular expression pool is a library of regular expressions that administrators can access when creating extraction rules for a batch class.
  • Workflow management: Ephesoft's features are implemented in components called plugins. The workflow is the sequence in which these plugins are executed. This portion of the user interface allows an administrator to specify what plugins are available when configuring the workflow for a batch class.
  • Connection manager: The connection manager allows you to create and test database connections. These connections are used by plugins to access databases.
  • License details: This allows administrators to see the expiration date of the license and the features that are enabled.

Reports

Reporting can be enabled to provide administrators with statistics on the system and throughput. The administrator can filter reports by criteria such as batch class or start date. Advanced reports are also available, including correction reporting. Correction reporting identifies when operators made corrections to Ephesoft's automated processing. This information can be used to optimize the configuration over time.

Reports

Ephesoft's reporting user interface

The operator user interface

The side navigation provides links to the following four areas of the system that are commonly used by operators:

  • Batch list
  • Review validate
  • Web scanner
  • Upload batch

Batch list

The batch list shows the batch instances that require review or validation.

The review process involves documents that could not be identified as being of a certain type. In Ephesoft, as with most image capture systems, we say that these documents could not be classified. The review interface allows operators to split and merge pages of documents and specify the document type.

The validation process involves fields for which data could not be extracted from the document, or fields where the extracted values do not comply with the previously specified rules.

Batch list

Ephesoft's batch list user interface

Review validate

The review validate screen will present the operator with the next available batch for processing according to priority and batch date.

Review validate

Ephesoft's review validate user interface

Web scanner

Ephesoft is capable of capturing content from a scanner attached to the user's workstation. What is unique about the web scanner is that no software needs to be installed on the workstation; Ephesoft uses a Java applet to send content directly to the server from any TWAIN-enabled scanner.

Web scanner

Ephesoft's web scanner user interface

The first time a user logs into the operator interface and selects the Web Scanner link on the side navigation, the user will have to choose a scanner. When the user selects the Source button, the user will be shown all TWAIN devices that have been installed on the user's workstation. Once the scanner is selected, the user can select the batch class to be used for processing and start the scan job.

Upload batch

Operators can submit PDF and TIF files directly to Ephesoft for processing by using the upload batch feature. Once the documents are selected and uploaded, the operator can select the appropriate batch class and start the batch processing.

Upload batch

Ephesoft's upload batch user interface

File system

The following are some important folders that are created when Ephesoft is installed. These are subfolders beneath the Ephesoft installation folder:

  • Apache 2.2: Apache can be used in front of Ephesoft for load balancing and failover. It is included in the installation but not configured.
  • Application: The Ephesoft Java web application is installed in this folder.
  • Application/i18n, themes: These folders contain files to customize and localize the Ephesoft application.
  • Application/native/RecostarPlugin: This plugin provides the image OCR functionality.
  • Application/WEB-INF/classes/META-INF: System configuration property files are stored in this folder.
  • Dependencies/gs, ImageMagick: Applications that Ephesoft uses for image manipulation are installed here.
  • Dependencies/licence-util, licensing: These folders contain tools to collect the information needed to generate and install license keys.
  • Dependencies/luke: Luke is a tool that helps troubleshoot problems with Lucene indexes.
  • JavaAppServer: This folder contains the Tomcat configuration for Ephesoft.
  • JavaAppServer/conf: This is where the contexts are defined for Ephesoft; it is where URLs are bound to java code.
  • EphesoftReports: The configuration and binaries for reporting are stored here.
  • SharedFolders/BC99: The configuration for each batch class is stored here. The contents of the batch class folder can be modified through the Folder Management interface by a batch class or system administrator.

Summary

In this chapter, we looked at the administrative and the operator functionality of Ephesoft. We also looked at the installation folder on the filesystem. It's time to put Ephesoft to work.

In the next chapter, you'll learn how to train the system to recognize your documents, extract content from them, and test the configuration.

Left arrow icon Right arrow icon

Description

Every organization, public or private, processes documents in various formats, especially paper and fax formats. Processing documents manually is an expensive and time-consuming endeavor. Ephesoft Enterprise is a modern document capture solution that allows an organization to automate the business process. It uses powerful technology to classify and capture the vital information from the document's content. This helps to minimize the time your company spends on reviewing and processing any physical and electronic documents. This book teaches you about document capture in general and implementation of document capture using Ephesoft. Start by learning about document capture and how Ephesoft revolutionized the industry. Progress to a tour of key features, including operator and administrator interfaces and then learn to configure Ephesoft to process your business’s specific document types and extract content from those documents. You will also get to know the advanced customization techniques that make Ephesoft accommodate your unique business needs. Finally, the book concludes by teaching you how to embed the classification and extraction functionality using Ephesoft’s web services. By the end, you will learn to optimize the processing of your documents, saving your company time and money.

Who is this book for?

This book is intended for information technology professionals interested in installing and configuring Ephesoft Enterprise for their organization, but it is a valuable resource for anyone interested in learning about intelligent document capture.

What you will learn

  • Discover the benefits of using intelligent document capture in your work place
  • Learn to capture, classify, and separate any type of document
  • Extract important information from your documents
  • Transfer the documents and data into your content management system
  • Customize Ephesoft to meet your unique business requirements
  • Understand the integration techniques using the Ephesoft web services API
  • Convert your paper archive to electronic records efficiently
  • Automate business processes that depend on documents in paper, fax, or email attachment format
  • Implement distributed capture for mailroom automation

Product Details

Country selected
Publication date, Length, Edition, Language, ISBN-13
Publication date : Aug 24, 2015
Length: 164 pages
Edition : 2nd
Language : English
ISBN-13 : 9781783558582

What do you get with a Packt Subscription?

Free for first 7 days. £16.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing

Product Details

Publication date : Aug 24, 2015
Length: 164 pages
Edition : 2nd
Language : English
ISBN-13 : 9781783558582

Packt Subscriptions

See our plans and pricing
Modal Close icon
£16.99 billed monthly
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Simple pricing, no contract
£169.99 billed annually
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just £5 each
Feature tick icon Exclusive print discounts
£234.99 billed in 18 months
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just £5 each
Feature tick icon Exclusive print discounts

Frequently bought together


Stars icon
Total £ 102.97
Intelligent Document Capture with Ephesoft, Second Edition
£32.99
Getting Started with Powershell
£32.99
Alfresco One 5.x Developer???s Guide
£36.99
Total £ 102.97 Stars icon

Table of Contents

7 Chapters
1. A Quick Tour of Ephesoft Chevron down icon Chevron up icon
2. Creating a Batch Class Chevron down icon Chevron up icon
3. Core Ephesoft Features Chevron down icon Chevron up icon
4. Ephesoft's Advanced Features Chevron down icon Chevron up icon
5. Tips Chevron down icon Chevron up icon
A. References Chevron down icon Chevron up icon
Index Chevron down icon Chevron up icon

Customer reviews

Rating distribution
Full star icon Full star icon Full star icon Full star icon Full star icon 5
(1 Ratings)
5 star 100%
4 star 0%
3 star 0%
2 star 0%
1 star 0%
Brad Gillespie Dec 02, 2017
Full star icon Full star icon Full star icon Full star icon Full star icon 5
I highly recommend this book to get an understanding of Ephesoft. Working with Ephesoft and it’s intelligent document capture after reading this book will be much easier. The book does a solid job of helping the reader understand the capabilities, terms, and areas to find settings within the system.You will still need to work with a system implementing Ephesoft, but this will get you in the door. Get it, read it, try it.
Amazon Verified review Amazon
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

What is included in a Packt subscription? Chevron down icon Chevron up icon

A subscription provides you with full access to view all Packt and licnesed content online, this includes exclusive access to Early Access titles. Depending on the tier chosen you can also earn credits and discounts to use for owning content

How can I cancel my subscription? Chevron down icon Chevron up icon

To cancel your subscription with us simply go to the account page - found in the top right of the page or at https://subscription.packtpub.com/my-account/subscription - From here you will see the ‘cancel subscription’ button in the grey box with your subscription information in.

What are credits? Chevron down icon Chevron up icon

Credits can be earned from reading 40 section of any title within the payment cycle - a month starting from the day of subscription payment. You also earn a Credit every month if you subscribe to our annual or 18 month plans. Credits can be used to buy books DRM free, the same way that you would pay for a book. Your credits can be found in the subscription homepage - subscription.packtpub.com - clicking on ‘the my’ library dropdown and selecting ‘credits’.

What happens if an Early Access Course is cancelled? Chevron down icon Chevron up icon

Projects are rarely cancelled, but sometimes it's unavoidable. If an Early Access course is cancelled or excessively delayed, you can exchange your purchase for another course. For further details, please contact us here.

Where can I send feedback about an Early Access title? Chevron down icon Chevron up icon

If you have any feedback about the product you're reading, or Early Access in general, then please fill out a contact form here and we'll make sure the feedback gets to the right team. 

Can I download the code files for Early Access titles? Chevron down icon Chevron up icon

We try to ensure that all books in Early Access have code available to use, download, and fork on GitHub. This helps us be more agile in the development of the book, and helps keep the often changing code base of new versions and new technologies as up to date as possible. Unfortunately, however, there will be rare cases when it is not possible for us to have downloadable code samples available until publication.

When we publish the book, the code files will also be available to download from the Packt website.

How accurate is the publication date? Chevron down icon Chevron up icon

The publication date is as accurate as we can be at any point in the project. Unfortunately, delays can happen. Often those delays are out of our control, such as changes to the technology code base or delays in the tech release. We do our best to give you an accurate estimate of the publication date at any given time, and as more chapters are delivered, the more accurate the delivery date will become.

How will I know when new chapters are ready? Chevron down icon Chevron up icon

We'll let you know every time there has been an update to a course that you've bought in Early Access. You'll get an email to let you know there has been a new chapter, or a change to a previous chapter. The new chapters are automatically added to your account, so you can also check back there any time you're ready and download or read them online.

I am a Packt subscriber, do I get Early Access? Chevron down icon Chevron up icon

Yes, all Early Access content is fully available through your subscription. You will need to have a paid for or active trial subscription in order to access all titles.

How is Early Access delivered? Chevron down icon Chevron up icon

Early Access is currently only available as a PDF or through our online reader. As we make changes or add new chapters, the files in your Packt account will be updated so you can download them again or view them online immediately.

How do I buy Early Access content? Chevron down icon Chevron up icon

Early Access is a way of us getting our content to you quicker, but the method of buying the Early Access course is still the same. Just find the course you want to buy, go through the check-out steps, and you’ll get a confirmation email from us with information and a link to the relevant Early Access courses.

What is Early Access? Chevron down icon Chevron up icon

Keeping up to date with the latest technology is difficult; new versions, new frameworks, new techniques. This feature gives you a head-start to our content, as it's being created. With Early Access you'll receive each chapter as it's written, and get regular updates throughout the product's development, as well as the final course as soon as it's ready.We created Early Access as a means of giving you the information you need, as soon as it's available. As we go through the process of developing a course, 99% of it can be ready but we can't publish until that last 1% falls in to place. Early Access helps to unlock the potential of our content early, to help you start your learning when you need it most. You not only get access to every chapter as it's delivered, edited, and updated, but you'll also get the finalized, DRM-free product to download in any format you want when it's published. As a member of Packt, you'll also be eligible for our exclusive offers, including a free course every day, and discounts on new and popular titles.