Prosar-Aida - Auto Capture, Classification, Extraction and Index
Every capture process involving paper has the same problem: how to reduce the cost of scanning and data entry.
With Paradatec's Prosar-Aida, users can now compile many types of paper documents without separator sheets, place them into the scanner, and have the software not only classify them but also intelligently extract non-templated OCR data contained within each document.
Better yet, while it classifies and extracts data, it also integrates with the SmartSearch document management system for storage and, at the same time, feeds another line-of-business application for adjudication purposes.

Reference 1: Classification is performed by keyword search on the OCR output. Spatial relationships between keywords, as well as hierarchical classification, allow thousands of types of documents to be captured and interpreted.
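
As a rough illustration of the idea in Reference 1, the sketch below classifies a page by keyword search over OCR words, uses a simple spatial test (two keywords on the same line), and walks a two-level hierarchy. The `Word` structure, the rule tables and the `classify` function are hypothetical and are not Prosar-Aida's configuration format or API.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Word:
    text: str
    x: float  # horizontal position on the page
    y: float  # vertical position on the page

def find(words, keyword):
    """Return all OCR words matching a keyword, case-insensitively."""
    return [w for w in words if w.text.lower() == keyword]

def same_line(a, b, tolerance=5.0):
    """Spatial test: two words are on one line if their y values are close."""
    return abs(a.y - b.y) <= tolerance

def has_pair(words, first, second):
    """True if `first` and `second` share a line, with `first` to the left."""
    return any(same_line(a, b) and a.x < b.x
               for a in find(words, first) for b in find(words, second))

# Hypothetical keywords that select the broad class.
BROAD_KEYWORDS = {
    "financial": {"amount", "payment", "total"},
    "media": {"station", "spot", "commercial"},
}
# Hypothetical hierarchy: broad class -> list of (document type, keyword pair).
HIERARCHY = {
    "financial": [("invoice", ("invoice", "number")),
                  ("eob", ("explanation", "benefits"))],
    "media": [("traffic_instructions", ("traffic", "instructions"))],
}

def classify(words):
    """Return (broad_class, document_type); either part may be None."""
    tokens = {w.text.lower() for w in words}
    for broad, keywords in BROAD_KEYWORDS.items():
        if tokens & keywords:
            for doc_type, (first, second) in HIERARCHY[broad]:
                if has_pair(words, first, second):
                    return broad, doc_type
            return broad, None
    return None, None

page = [Word("Invoice", 10, 20), Word("Number", 60, 21),
        Word("Total", 10, 300), Word("Amount", 50, 300)]
print(classify(page))  # ('financial', 'invoice')
```

A real rule set would be far larger; the point is only that the rules refer to words and their relative positions, not to fixed page zones.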

Reference 2: The classification process behaves like a mail room person: a rough sort first, then a finer sort. “Following Page” detection removes the need for separator sheets between documents, and the rules for detecting “following pages” are configurable.
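
One way to picture “Following Page” detection is as a set of configurable text rules applied to each scanned page. The sketch below is a minimal example under that assumption; the two regular-expression rules and the function names are hypothetical and do not come from the product.

```python
import re

# Hypothetical, configurable rules: a page matching any pattern is treated
# as a continuation of the previous document.
FOLLOWING_PAGE_RULES = [
    # "Page N of M" where N is not 1
    re.compile(r"\bpage\s+(?!1\b)\d+\s+of\s+\d+\b", re.IGNORECASE),
    re.compile(r"\bcontinued\b", re.IGNORECASE),
]

def is_following_page(text, rules=FOLLOWING_PAGE_RULES):
    """True if the OCR text of a page matches any following-page rule."""
    return any(rule.search(text) for rule in rules)

def split_documents(pages):
    """Group a stream of OCR page texts into documents, without separator sheets."""
    documents = []
    for text in pages:
        if documents and is_following_page(text):
            documents[-1].append(text)   # attach to the current document
        else:
            documents.append([text])     # start a new document
    return documents

stream = [
    "Invoice 1001 Page 1 of 2",
    "Page 2 of 2 Total due",
    "Explanation of Benefits",
    "Continued from previous page",
]
print([len(doc) for doc in split_documents(stream)])  # [2, 2]
```
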
What Is It?
Paradatec's Prosar-Aida is a platform for the development of applications for document classification and data extraction leveraging the fastest OCR technology on the market.
¦ Paradatec's Prosar-Aida is an unstructured data capture product that can intelligently classify documents and extract their data using revolutionary artificial intelligence (AI) capabilities. The solution allows between 70% and 96% capture automation, defined as the percentage of documents that flow through Prosar-Aida without needing manual correction.
¦ Paradatec's Prosar-Aida is the solution for the common capture problem that companies face when implementing document management: how to control the cost of capture and data entry.
¦ It can be used to tackle all kinds of input management tasks, regardless of whether the volume of documents is high or low, the workflow is straightforward or complex, the processing is central or distributed, or the documents to be processed are structured or unstructured.

How Does It Work?
¦ Paradatec's Prosar-Aida is text recognition and analysis software based on artificial intelligence that can ascertain document types and classify images for appropriate routing and archival.
¦ The process begins with a high-speed, efficient full-page OCR scan of each image. This first step allows Prosar-Aida to search every word of every document to discover the document content.
¦ To classify the current document, Paradatec's Prosar-Aida searches it for telltale characteristics specified by the administrator, using pattern matching, spatial relationships and synonyms such as "invoice number", "invoice #" and "inv_num". These characteristics tend to be certain keywords or phrases.
¦ This overall process of discovery is unique and makes Prosar-Aida very different from typical template-based solutions. Those solutions expect and assume that the document content always appears in the same areas of the page.
¦ A significant advantage of Prosar-Aida's document content discovery method is its ability to process a virtually unlimited number of versions and formats of a particular document type.
¦ Prosar-Aida is confidence based, meaning that if the engine does not have a high degree of confidence in a data value, the value is flagged and sent for correction within the data validation module.
¦ Since the Prosar-Aida engine is transparent to the user, the only interaction the user has with the system is within the validation module, simply correcting values and routing them back into the process workflow.
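
The confidence-based routing described above can be sketched as follows. This is an illustration only: the `Field` structure, the 0.90 threshold and the function names are assumptions, since the source does not state how confidence is scored or what counts as "high".

```python
from dataclasses import dataclass

@dataclass
class Field:
    name: str
    value: str
    confidence: float  # 0.0 to 1.0, as reported by a hypothetical engine

# Hypothetical threshold; the source does not give a value.
CONFIDENCE_THRESHOLD = 0.90

def route(fields, threshold=CONFIDENCE_THRESHOLD):
    """Split fields into those accepted automatically and those flagged for validation."""
    accepted = [f for f in fields if f.confidence >= threshold]
    flagged = [f for f in fields if f.confidence < threshold]
    return accepted, flagged

def apply_correction(field, corrected_value):
    """Operator fix in the validation step; a corrected value is treated as certain."""
    return Field(field.name, corrected_value, 1.0)

# Made-up example values: the date was misread ("O" instead of "0").
fields = [
    Field("invoice_number", "INV-1001", 0.98),
    Field("invoice_date", "O3/15/2024", 0.62),
]
accepted, flagged = route(fields)
corrected = [apply_correction(f, "03/15/2024") for f in flagged]

# Corrected values are routed back into the workflow and now pass.
accepted_after, flagged_after = route(accepted + corrected)
print([f.name for f in flagged], len(flagged_after))  # ['invoice_date'] 0
```
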
Reference 3: During the extraction process, label-value pair logic is used to find data. Using the extracted OCR text, the solution finds all synonyms of a label type, such as “invoice date”, “inv date” and “date”. The artificial intelligence engine then discovers the date that is spatially in the correct RELATIVE location to the label. Note: no fixed zones are used, anywhere!
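
A minimal sketch of label-value pair logic, assuming OCR output arrives as positioned text tokens. The label synonyms are the ones named in Reference 3; the `Token` structure, the date pattern, the tolerances and the "right of or below the label" rule are hypothetical simplifications, not the engine's actual logic.

```python
import re
from dataclasses import dataclass

@dataclass(frozen=True)
class Token:
    text: str
    x: float  # left edge on the page
    y: float  # top edge on the page

LABEL_SYNONYMS = ("invoice date", "inv date", "date")
# Hypothetical value pattern: numeric dates such as 03/15/2024.
DATE_PATTERN = re.compile(r"^\d{1,2}/\d{1,2}/\d{2,4}$")

def is_candidate(label, value, line_tol=5.0, col_tol=20.0):
    """A value qualifies if it sits to the right of the label or directly below it."""
    right_of = abs(value.y - label.y) <= line_tol and value.x > label.x
    below = abs(value.x - label.x) <= col_tol and value.y > label.y
    return right_of or below

def extract_date(tokens):
    """Return the date nearest to any label synonym, or None. No fixed zones are used."""
    labels = [t for t in tokens if t.text.lower().rstrip(":") in LABEL_SYNONYMS]
    dates = [t for t in tokens if DATE_PATTERN.match(t.text)]
    best = None
    for label in labels:
        for value in dates:
            if is_candidate(label, value):
                dist = abs(value.x - label.x) + abs(value.y - label.y)
                if best is None or dist < best[0]:
                    best = (dist, value.text)
    return best[1] if best else None

# Two made-up layouts: the date is right of the label in one and below it in the other.
layout_a = [Token("Inv Date:", 400, 50), Token("03/15/2024", 480, 51),
            Token("01/01/2020", 50, 700)]
layout_b = [Token("Date", 30, 100), Token("2/1/24", 32, 120),
            Token("12/31/1999", 300, 400)]
print(extract_date(layout_a), extract_date(layout_b))  # 03/15/2024 2/1/24
```

The same function handles both layouts because it reasons about position relative to the label rather than about a fixed region of the page.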

Reference 4: Advanced Capture supports tabular data extraction for any type of document: EOBs, invoices, traffic instructions or sales orders. Each type of document is processed the same way, with no limit to the variety of layouts.
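
To make tabular extraction without fixed layouts more concrete, here is a small sketch that finds a header row by its column names and assigns each later token to the nearest header column. The `Token` structure, the tolerance and both functions are hypothetical, and the sketch does not try to detect where a table ends.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Token:
    text: str
    x: float  # left edge on the page
    y: float  # top edge on the page

def group_rows(tokens, line_tol=5.0):
    """Group tokens into rows by vertical position, top to bottom."""
    rows = []
    for t in sorted(tokens, key=lambda t: (t.y, t.x)):
        if rows and abs(rows[-1][0].y - t.y) <= line_tol:
            rows[-1].append(t)
        else:
            rows.append([t])
    return rows

def extract_table(tokens, header_names):
    """Locate the header row by name, then map each later token to its nearest column."""
    rows = group_rows(tokens)
    wanted = {h.lower() for h in header_names}
    for i, row in enumerate(rows):
        if wanted <= {t.text.lower() for t in row}:
            header = [t for t in row if t.text.lower() in wanted]
            break
    else:
        return []  # no header row found
    table = []
    for row in rows[i + 1:]:
        record = {}
        for t in row:
            column = min(header, key=lambda h: abs(h.x - t.x))
            record[column.text] = t.text
        table.append(record)
    return table

# Made-up OCR tokens for a two-line table.
tokens = [
    Token("Invoice", 10, 50),
    Token("Item", 10, 100), Token("Qty", 200, 100), Token("Price", 300, 100),
    Token("Widget", 12, 120), Token("2", 205, 120), Token("9.99", 298, 120),
    Token("Gadget", 11, 141), Token("1", 204, 141), Token("24.50", 301, 141),
]
for record in extract_table(tokens, ["Item", "Qty", "Price"]):
    print(record)
```
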
Application Use Scenarios
Paradatec's Prosar-Aida can be used in many business applications, but it is typically used in a few specific verticals, namely AP, healthcare, mortgage, media broadcasting and manufacturing.
¦ Automate AP invoice, EOB and sales order entry – capture sales orders from fax, mail, email or print and automate classification, data entry, integration with internal line-of-business systems and document management archiving.
¦ Automate traffic instructions – media outlets such as cable, radio and satellite receive large volumes of traffic instructions, which tell the media provider which commercial to run, for how long, on what station and at what frequency.
¦ Document classification and distribution within a mail system.
¦ Extraction of business-process-relevant data from documents.
¦ Capturing unstructured forms, surveys, assessments and exams.
Use Case
By improving the speed and efficiency with which a company captures and processes documents such as patient Explanation of Benefits (EOB) information, Paradatec's Prosar-Aida can produce measurable benefits at every level of the organization. The benefits resulting from unstructured data capture encompass additional revenues as well as savings from reduced costs. The savings are real: hard-cost dollars from reducing a large data entry staff on both the payables side and the document management application side.
Hard-dollar benefits include:
Labor savings
• Reduced need for data entry operators, document prep workers, sorters, etc.
• Increased accuracy of data extracted from EOBs, invoices and loan documents reduces the time spent on error correction
Improved cash management
• Faster payment postings and access to more accurate payment data mean increased control over billing and accounting processes, which leads to better overall cash flow management
• Faster, more accurate generation of critical payment data expedites the process of Secondary Claim submissions
Rapid return on investment
• In mid- to high-volume organizations, the payback on unstructured data capture is less than a year.
• 15% of an organization's revenues are spent creating, managing and distributing documents
• 60% of employee time is spent working with documents
• 85% of business documents are in paper form
• The average document is printed 5 times
• 90% of a business's information is in documents
• At $30/hr, knowledge workers waste $4,500/year working with paper
Sources: Gartner, ARMA and AIIM