1. Image import

New features of ABBYY FlexiCapture 9.0 Professional compared to FormReader 6.5
Desktop
Types of documents processed
Program architecture
Multipage documents with a complex structure
The program allows the automatic identification and processing of structured, semi-structured, and unstructured
documents with a varying number of pages, recurring pages, and documents with annexes.
Types of documents processed with ABBYY FlexiCapture 9.0
Creating Document Definitions for multipage documents
Creating Document Definitions for documents with annexes
Unstructured documents
Unstructured documents with text or images separated with blank pages, pages with barcode or patch-code are indexed
with the help of simple Document Definitions, processed and exported to PDF searchable files or to graphic files.
Processing unstructured documents
Semi-structured multipage documents with complex tables or tables on several pages
Semi-structured multipage documents with tables continuing from one page to another are identified, assembled into
documents, and processed with the help of a single flexible description.
Creating a Document Definition for flexible document processing
Fixed forms with tables
To simplify processing of documents containing tables, the program now supports tables in fixed forms.
Creating a table
Recurring elements
The program supports recurring field groups in structured and semi-structured documents. This simplifies the process of
configuring the processing of documents containing identical element groups repeating several times, including
processing of complex tables with such groups.
Creating a field group
Operator and Administrator modes
The Operator and Administrator modes are enabled when you select different commands in the Start menu. They can be
installed on different computers. All program functions are available in the Administrator mode. In Operator mode, the
user cannot edit Documents Definitions, import profiles or access test batches.
Administrator and operator modes
Working with projects
A project is a single environment uniting document batches and the settings required to process them, such as
Document Definitions, import profiles, and suchlike.

Time-consuming operations, such as adding images, recognition or export, can be performed in background
mode, which significantly reduces document processing time. The user can always check the status of any
background process, change the priorities of the tasks, and view the processing log.
Document Definition editor
A project can be used by different copies of the program: a project can be exported and imported as a single object, a
project can be updated or transferred from one computer to another. Different workstations can work with the same
projects over a network, making it possible to streamline document processing.
Creating a project
Project properties
Convenient tools for working with background processes
Background mode
Working at document level
Document Definitions are created for the entire document as processing units. A Document Definition determines the
way in which documents of a relevant type are identified and processed. Each Document Definition contains page
images, assembly rules, descriptions of fields and static elements, verification rules, verification settings, descriptions of
data types, and data export settings.
Creating Document Definitions for multipage documents
Switching between image modes
Using the Document Definition editor you can now switch between two image modes: you can view images in the original
color and in black-and-white, regardless of the format in which images are added to the batch. A color image simplifies
the creation of Document Definitions for color forms, since in this mode you can see field marking and explanatory text:
this is necessary in order to draw the field region and assign a name to the field. Viewing black-and-white images is
important to make sure that anchors and static text are visible after the binarization that takes place in the process of
recognition.
Data form editor
The data form editor makes it easy to create a custom presentation of extracted data and to display them in the most
convenient manner for the operator.
Configuring data presentation in the document window
Presentation of fields in a tree form
The document structure is graphically presented as a fields tree. This form of field presentation allows you to quickly edit,
copy, delete, and move fields.
Presentation of the document structure in the form of page outlines
The document structure is conveniently presented as page outlines. This makes it easier to view, edit, and create
Document Definitions for multipage
documents.
Separate viewing of static elements and fields
Static elements and fields are shown separately. Presenting static elements and fields in this way makes it easier to

Fields can be combined into groups to enable clear presentation of document structures and the creation of
recurring field groups.
create Document Definitions: this process can be divided into two stages – static elements are created separately, while
anchors, separators, anchor text and barcodes do not interfere with the creation of field regions.
Creating fields of a Document Definition
Static elements
Grouping fields
Creating a field group
Properties of a field group
Tables in fixed Document Definitions
You can now create a Table field in fixed Document Definitions, which considerably simplifies configuration, data
extraction, and data export from tables. A Table field is set of same-type columns and recurring lines.
Creating a table
Table properties
Recurring elements in fixed Document Definitions
Recurring elements are fields or field groups that occur a number of times in the form and which describe similar objects,
such as the same details of several employees, children, accounts, etc. The structure of such recurring elements is
similar to that of a table: a set of fields is a column, while each repetition is a table row. The Document Definition editor
simplifies the configuration and processing of such fields.
Fields with several instances
Fields located on several pages of a form
If your form has fields whose region is located on several pages (for example long tables), the Document Definition editor
lets you view them as a single field, greatly simplifying configuration and processing.
Fields with several regions
Automated fixed Document Definition creation
The Document Definition editor offers new tools that automate and simplify the creation of fixed Document Definitions:
automatic identification of text fields with a region;
automatic identification of vertical and horizontal separators in tables;
automatic identification of a field with a single click of a mouse button in the field region;
automatic assignment of a field name that matches the nearest explanatory text;
recognition and copying of static text from the Document Definition image to field properties;
automatic adjustment of field boundaries.
Automatic detection of fields and static elements
How to change a field name
Improving the configuration of data types
We have elaborated the settings of data types to simplify the creation of Document Definition, and to improve the
extraction and verification of field data. The most frequently used data types (text, date and time, currency, number) are
included in the set of preconfigured data types. In turn, the most frequently used formats of preconfigured data types are

Image import
included in the set of preconfigured formats and can be selected from the list without the need to create complex rules
for format verification.
Data types of the text entry field
Creating custom data types
Description of verification rules using the JScript
®
language
Verification rules can now be described as script using the JScript
®
language (in addition to the VBScript language). This
allows you to configure any custom rules for verification of data that is extracted.
A rule set with the help of a script
Using scripts to create rules
Index fields
If the option
Index field
is enabled in the field properties, then values of this field can be displayed in the form of an
additional column in the document batch window. Documents can be sorted based on the value of this field. During
export, the values of the index fields can be added to the file names, which makes it easier to find the information you
need.
Index fields
Rules for naming export files
Additional statuses of Document Definitions
When you open a Document Definition in the Document Definition editor, a temporary local version of the Document
Definition is automatically created. This editable version is not included in the processing of documents until it is
published. This functionality allows you to organize work with Document Definitions in projects that are split.
Editing and publishing a Document Definition
Testing Document Definitions on special batches in the Document Definition editor
To simplify and speed up Document Definition creation, editable Document Definitions can be tested during creation
without the need to exit the Document Definition editor. A Document Definition can be tested while it is being created, on
the image that is being used to create the Document Definition. Special test batches can be used for such testing, which
are not used for the actual processing of documents.
Testing Document Definitions
Document batches
Automatic assembly of pages into documents
The incoming flow of images from the scanner or hot folder can be automatically divided into documents, when you use:
multipage image formats, such as TIFF or PDF;
blank pages, pages with barcodes or patch-codes.
Adding page images
Image pre-processing options
Image import profiles

Classification, Document Definition application, recognition
Matrix 25
Industrial 25
PostNet
Image import profiles are used to create sets of preconfigured settings for various document import sources (scanners or
folders). An import profile is saved and assigned a name, which can be then selected by the operator from a list.
Image import profiles
Single-click import
Image import according to the settings of the profile last used can be performed with a single click.
Adding page images
Background import mode
Import can be performed in background mode. The program checks for new images in the hot folder at specific intervals.
Background mode
Full-text recognition for export to a PDF-searchable file
The program supports a full-text document recognition scenario with subsequent export to PDF and PDF\A searchable
files. This is especially useful when archiving unstructured and semi-structured documents
.
New recognition languages
Kirghiz (OCR/ICR)
Kazakh (OCR/ICR)
Thai (OCR)
New barcode types
Location of corrected characters in fixed forms
If a character entered by mistake has been erased, it is presented to the operator during verification to enable him or her
to input the correct character manually.
Ability to remove separator pages
Pages with barcodes, pages with patch codes, and blank pages that are used as separator sheets can be automatically
removed from the batch.
Image pre-processing options
Ability to remove blank pages
Blank pages can be removed during recognition.
Project properties

Data verification
Improvements in the verification interface
Full-screen mode.
The status bar shows the current percentage of verified characters.
Verification of recognized data
New interface configuration possibilities
The location of the window with the field image can be changed.
The window's scale can be changed. There is also a function for automatic selection of the best scale.
The user can change the font size of characters.
Verification of recognized data
Improvements in group verification
A popup tip with the context of characters.
Configurable size and contrast of the area adjacent to the character.
Group verification
Improvements in field verification
Format rules can now be checked during the process of field verification. The user sees an error description and
has a list of proposed changes to the value.
There is now a mode for displaying a field with character cutting.
Multiline fields are supported.
Popup tips for characters that simplify understanding, for example, the letter "O" and number "0".
Field verification
On-the-fly field recognition
During verification in the document window, every change in the boundaries of the field region entails an automatic re-
recognition of the corresponding image region. This functionality simplifies and speeds up data extraction from fields with
inaccurately specified regions.
Improved interface for rule error correction
Every rule in the list of rule errors is accompanied by a reference to the relevant field. These references allow you
to go quickly to fields with possible errors.
The list of error rules can be docked at the bottom, top, right-hand or left-hand side of the document window.
Rule validation
Correction of document assembly errors in page outline mode
A new page outline mode for batch documents is available. This mode is very convenient for correcting document
assembly errors, allowing the user to:

Together with data export, you can export images of the relevant documents.
view page outlines grouped by documents;
see mismatches between values of key fields and descriptions of assembly errors;
correct assembly errors by moving pages from one document to another, dragging and dropping pages with the
help of the mouse;
change the scale of page outlines to view details (by holding down the Ctrl key and using the scroll wheel).
Assembling pages into documents
Password-protected PDF files can now be opened
DjVu files can now be added
Export Rule validation
Export to Microsoft SharePoint
Now you can set up export to Microsoft SharePoint using the program interface.
Exporting data to a SharePoint document library
Single-click export
If export settings have been configured, a document batch can be exported with a single click of a button. This prevents
the operator having to perform extra actions and speeds up the document input process.
Exporting recognized data
Export of document images together with data
Helpful rules for naming resulting files
When exporting data to a file to Microsoft SharePoint, file names are generated automatically based on user-defined
rules. You can append the values of index fields to filenames, the value of the barcodes from separator pages, etc.
Meaningful file names make it much easier to find the information you need.
Rules for naming export files
New document and image formats
PDF, PDF/A searchable
Supported graphic formats
Linking fields when exporting to a database
Configuration of export to a database is accompanied by a convenient interface for linking fields, enabling the user to
quickly match document fields with database fields.
Export to database

Export described with a script
The user can describe any export procedures using the VBScript and JScript scripting languages.
Export described with a script
Using scripts to describe exports
Changing image resolution during export
The resolution of exported images can be modified to suit storage requirements.
Configuring data export
Picture export to file
The image of the entire document or an individual Picture field can be exported to a separate graphic file.
Removing exported documents from the batch
If you do not need to store exported documents in the batch, you can delete them immediately following export.
Exporting recognized data
Additional options

Back to top