Advanced OCR with OmniPage and FineReader Overview Optical character recognition Structural recognition Options Loading Zoning OCR Editing Optical Character Recognition (OCR) OCR turns pictures of text into e-text Does well unless… – The – The – The – The – The picture is fuzzy contrast is poor font is unusual font is too small or too large material has unusual characters Structural Recognition Analyzes the layout of the page – Columns – Headings – Graphics – Tables Usually does fairly well, unless the layout is non-standard Programs that Run OCR Programs for consumers – Kurzweil 1000, 3000 – OpenBook – Intel Reader – Many others… Programs for production – ABBYY FineReader – Nuance OmniPage Consumer Programs Highly automated Designed for individuals who have print disabilities Are not good production tools – Do not provide flexibility – Do not allow much overriding – Interfaces not designed for editing Production Programs in General A good program for production allows you to… – Control the zones (areas or blocks of text and graphics) Add, delete, change – Edit easily – Improve recognition Preferred Programs ABBYY FineReader – Relatively easy to learn – Fairly intuitive – Good structural recognition Nuance OmniPage – Less intuitive but more accessible – Often does better with technical materials Both Good Tools If you can afford to have both, it’s nice, but not absolutely necessary. If you have both, run a couple test pages through each to see which is doing better on a particular job. Under the Hood For best results with a program, set up your options before you begin! Tools > Options Lots of Languages FineReader and OmniPage handle multiple languages. For foreign language, turn on all the languages in the book. – It will recognize the diacritical marks. – Turn on what you need, but only what you need. Math If you are running OCR on math, try turning on Greek. – Greek will allow the program to recognize alphas, deltas, sigmas, etc. Another Decision Detect page orientation or not? – Does not always get it right – Try it if you have many pages turned Considerations You may or may not want to keep headers and footers. – I generally keep them to pull the page numbers. You may want to keep the page breaks. – Retaining page breaks helps to maintain one-to-one page correspondence with the book. Fitting Everything In some cases, you may need to work with a custom paper size to fit everything onto one page. This feature can be helpful when you are retaining everything on the page but not the layout. Loading Files “Open” – Opens saved program files “Load” – Loads image files to process Note that this same issue comes up with saving! Wizards Are Evil… Do not rely on the automation Load the image file and choose the processes you want Workspace The program has three primary areas Pages Pane – Either thumbnails or details – Allows simple navigation of pages Image Pane – Your graphic Text Pane – Area where the text from OCR will show More Accessible Both programs have a detail view. – Shows text instead of graphics Detail view is more accessible for screen readers. Otherwise, it is personal preference. Two Ways to Save To Save the program file to access later in the OCR program, choose File > Save – This saves your work file. You save your converted file during the last phase of the processing. Production Tips Work with dual monitors – Check your computer and video card Stretching an OCR program across two monitors is a HUGE time-saver! Learn to use keyboard shortcuts. – They save tons of time!