02 Advanced OCR - Accessing Higher Ground

advertisement
Advanced OCR
with OmniPage and FineReader
Overview







Optical character recognition
Structural recognition
Options
Loading
Zoning
OCR
Editing
Optical Character
Recognition (OCR)


OCR turns pictures of text into e-text
Does well unless…
– The
– The
– The
– The
– The
picture is fuzzy
contrast is poor
font is unusual
font is too small or too large
material has unusual characters
Structural Recognition

Analyzes the layout of the page
– Columns
– Headings
– Graphics
– Tables

Usually does fairly well, unless the
layout is non-standard
Programs that Run OCR

Programs for consumers
– Kurzweil 1000, 3000
– OpenBook
– Intel Reader
– Many others…

Programs for production
– ABBYY FineReader
– Nuance OmniPage
Consumer Programs



Highly automated
Designed for individuals who have
print disabilities
Are not good production tools
– Do not provide flexibility
– Do not allow much overriding
– Interfaces not designed for editing
Production Programs in
General

A good program for production allows
you to…
– Control the zones (areas or blocks of text
and graphics)

Add, delete, change
– Edit easily
– Improve recognition
Preferred Programs

ABBYY FineReader
– Relatively easy to learn
– Fairly intuitive
– Good structural recognition

Nuance OmniPage
– Less intuitive but more accessible
– Often does better with technical materials
Both Good Tools


If you can afford to have both, it’s
nice, but not absolutely necessary.
If you have both, run a couple test
pages through each to see which is
doing better on a particular job.
Under the Hood


For best results with a program, set up
your options before you begin!
Tools > Options
Lots of Languages


FineReader and OmniPage handle
multiple languages.
For foreign language, turn on all the
languages in the book.
– It will recognize the diacritical marks.
– Turn on what you need, but only what
you need.
Math

If you are running OCR on math, try
turning on Greek.
– Greek will allow the program to recognize
alphas, deltas, sigmas, etc.
Another Decision

Detect page orientation or not?
– Does not always get it right
– Try it if you have many pages turned
Considerations

You may or may not want to keep
headers and footers.
– I generally keep them to pull the page
numbers.

You may want to keep the page
breaks.
– Retaining page breaks helps to maintain
one-to-one page correspondence with the
book.
Fitting Everything


In some cases, you may need to work
with a custom paper size to fit
everything onto one page.
This feature can be helpful when you
are retaining everything on the page
but not the layout.
Loading Files

“Open”
– Opens saved program files

“Load”
– Loads image files to process

Note that this same issue comes up
with saving!
Wizards Are Evil…


Do not rely on the automation
Load the image file and choose the
processes you want
Workspace


The program has three primary areas
Pages Pane
– Either thumbnails or details
– Allows simple navigation of pages

Image Pane
– Your graphic

Text Pane
– Area where the text from OCR will show
More Accessible

Both programs have a detail view.
– Shows text instead of graphics


Detail view is more accessible for
screen readers.
Otherwise, it is personal preference.
Two Ways to Save

To Save the program file to access
later in the OCR program, choose File
> Save
– This saves your work file.

You save your converted file during
the last phase of the processing.
Production Tips

Work with dual monitors
– Check your computer and video card


Stretching an OCR program across two
monitors is a HUGE time-saver!
Learn to use keyboard shortcuts.
– They save tons of time!
Download