Home
Readiris Pro 12
Contents
1. sscscscsessseees 81 Chapter 12 Recognizing multipage documents 006 83 Opening and recognizing multiple image files 83 Scanning and recognizing multipage documents 85 Editing multipage document eee eee eseeeeeeeees 86 Chapter 13 Recognizing handprinted text ssssseseees 89 Chapter 14 Recognizing barcodes ccssccssscccsscccssscesseeees 93 Index E E E 95 Readiris Pro 12 User Guide Copyrights ReadirisPro12 dgi 110209 04 Copyrights 1987 2009 I R I S All Rights Reserved LR LS owns the copyrights to the Readiris software to the online help system and to this publication The information contained in this document is the property of LR IS Its content is subject to change without notice and does not represent a commitment on the part of I R I S The software described in this document is furnished under a license agreement which states the terms of use of this product The software may be used or copied only in accordance with the terms of that agreement No part of this publication may be reproduced transmitted stored in a retrieval system or translated into another language without the prior written consent of LR LS This user guide utilizes fictitious names for purposes of demonstration references to actual persons companies or organizations are strictly coincidental Trademarks The Readiris logo and Readiris are trademarks of Ima
2. The documents will be sent as PDF Image Text by default via your default e mail application See the section Formatting documents to learn more about the other available formats Note that the SmartTasks apply predefined settings but can be configured easily to fit more particular needs To configure the SmartTasks e Right click the SmartTask you want to use e Select Scanner or Image files as image source o When you select Scanner Readiris will start your scanner as soon as you click the SmartTask The scanned document s will be displayed in the interface processed and saved Your scanner must be configured correctly in order for the SmartTasks to work To do so Click the Scanner button on the main toolbar Click Scanner model and select your scanner in the list If your scanner is not in the list select Twain other models Click Configure if applicable to select the Twain source Then click OK to save the settings For more information on the scanner settings and on scanning paper documents see the section Scanning paper documents 21 Chapter 4 The Readiris SmartTasks o When you select Image files and click the SmartTask Readiris opens the Input dialog box in which you can select the image files you want to process For more information on opening image files see the section Opening image files e Click Configure to change the output format and its options Note that the availa
3. A complete and a custom installation are offered Select the required options and click Next each time you are ready to go to the next screen All lexicons and sample images will be installed by default as well as an electronic user guide and online help Click Finish to complete the installation The submenu LR LS Applications Readiris on the Windows Programs menu is created automatically by the installation program The installation program also creates a shortcut to the Readiris application on the Windows desktop Readiris 12 Pro User Guide Readiris Pro 12 e Repeat the installation process to install any additional software from the CD ROM UNINSTALLING THE SOFTWARE There is only one correct way to uninstall Readiris by using the Windows un install wizard You are strongly recommended not to uninstall Readiris or any of its software modules by manually erasing the program files To uninstall Readiris e Close the application e On the Start menu click Control Panel e Under the Programs icon click Uninstall a program e Select Readiris in the list and click the Uninstall button e Follow the on screen instructions SOFTWARE REGISTRATION Remember to register your Readiris license By doing so you will e be kept informed of future product developments and related LR LS products e be entitled to product support 11 Chapter 2 Installing Readiris e be entitled to spe
4. Pro 12 User Guide Despeckling removes small spots from black and white images e Click Apply to preview the results e Ifthe results are satisfactory click OK If not change the settings again e Click Recognize Save to recognize the document 37 Readiris Pro 12 User Guide CHAPTER 7 SAVING DOCUMENTS AS IMAGE FILES Paper documents you scan do not need to be OCRed right away They can be saved as image files To do so e Scan the document e On the File menu click the commands Save Full Page as Image or Save All Pages as Image e Afterwards open the saved image file and perform the recognition Saving graphics only You can also choose to save the graphics windows without the text of the document To do so e Scan or open the document e On the File menu click Save Graphics e All the graphics of the document will be saved in a single file 39 Readiris Pro 12 User Guide CHAPTER 8 WINDOWING DOCUMENTS WINDOWING DOCUMENTS AUTOMATICALLY When scanning or opening documents Readiris will automatically apply Page Analysis to split up the documents in different windows The Page Analysis option is selected by default Click the Options button and disable Page Analysis should you want to avoid automatic page analysis The page analysis results can be modified manually after automatic page analysis For more information see the section Windowing documents manually
5. The part of the page you select will be analyzed automatically You will be prompted whether you want to exclude the same outer zone from page analysis on every page of the document WINDOWING DOCUMENTS MANUALLY Besides windowing documents automatically by means of Page Analysis Readiris allows you to window documents manually Manual windowing comes in handy when having to modify the automatic page analysis results It is also useful to use when creating windowing templates For more information on windowing templates see the section Using windowing templates Note that barcode and handprinted text zones always need to be windowed manually Operation e In order to window a document manually first click the Options button and deselect Page Analysis e Open or scan the document by clicking the Scan or Open button Some e Select the correct window type by clicking the corresponding window type button on the image toolbar Readiris uses five window types text blocks graphic zones tables barcode zones and handprinted zones 43 Chapter 8 Windowing documents m baa e Draw a frame around the text blocks graphics tables barcodes and handprinting zones you want to window For more information on recognizing barcodes and handprinting see the sections Recognizing barcodes and Recognizing handprinted text respectively e When you are done windowing the document click the Recognize Save
6. HN Window types Readiris uses five window types text blocks graphic zones tables barcode zones and handprinted zones 41 Chapter 8 Windowing documents Page analysis detects text graphic and table zones automatically Barcode zones and handprinted zones need to be drawn manually For more information see the section Windowing documents manually Each window type has its own color code text windows are orange graphics are purple and table windows pink Barcode zones are green and handprinted zones blue The windows are sorted top down left to right Numbers indicate the sort order of the windows The sort order and window types can be changed however For more information see the section Windowing documents manually Ignore text zones on page borders When your scanner generates black borders around the actual image page analysis tends to find zones where there s only noise To avoid this click Do Not Detect Windows on Borders on the Settings menu and scan the document again Ignore exterior zone As an alternative to windowing documents automatically the function Ignore exterior zone can be used This function is useful when only one particular area on the document pages needs to be OCRed Select Ignore exterior zone by clicking the corresponding button on the image toolbar Draw a frame around the part of the page you want Readiris to recognize 42 Readiris Pro 12 User Guide
7. Readiris Pro 12 User Guide Readiris Pro 12 User Guide Table of Contents CCG GE ee te asa cases 1 Chapter 1 Introducing Readiris cccsscssssecssssssssssssssesees 3 Save time avoid retyping eee eee ereetreereeaees 3 The Readiris senies angeln neant 6 Chapter 2 Installing Readiris 0000s0000r00002200002000020000000000 9 System requirementS ueessessnessnensnennnennnernnnnnnnnnnnnnnnnnnannn 9 Software installation 0 eee eesseeeeneeceeceeceeeeeeeceeeeesees 10 Uninstalling the software ueeeesserssersnersneennnennnennnennn 11 Software registration uesesnnesnnesnnesnnensnsnnnnnnnnnnnnennnnnne 11 Product supporteren nennen an 12 Chapter 3 Getting started ccscssscccsscccsssscssscccssccssssccsscees 13 Rimming Readinis u 20eear snn serien 13 Using the OCR Wizard eesserssersnersneesnennnnennnennnennn 13 ser interf ce n rs enge eek ons 15 Changing the user interface language ee 17 Chapter 4 The Readiris SmartTasks cccscccsssccssscessscees 19 Using the Readiris SmartTasks n 19 Chapter 5 Scanning documents ccessessssssesssssesssnessesessssensnnnne 23 Selecting the options ueeeessersnersnersnernnernnnennnennnenn nenn 23 Opening mage file Suninen en a e a 24 iii Table of Contents Scanning paper documents uuerseesseessessnnsnnennnennenne 26 Chapter 6 Adjusting scanned docum
8. o Click Undo to correct mistakes Readiris keeps track of the last 32 operations o Click Abort to abort interactive learning All learning results will be deleted Next time you click Recognize Save interactive learning will start again USING FONT DICTIONARIES When scanning many documents of the same type font quality and printing quality you may not want to repeat the learning process every time Therefore it is useful to use font dictionaries Font dictionaries contain font information learned during interactive learning and can substantially increase the recognition results Note that font dictionaries are limited to 500 shapes You are recommended to create separate dictionaries for specific applications To create a new font dictionary e On the Learn menu click the command New Font Dictionary Readiris will open the dictionary Readiris dus by default Change the file name and click Save to save it as a new dictionary e Click Interactive Learning on the Learn menu to activate it 58 Readiris Pro 12 User Guide e Click Recognize Save to recognize the document e Readiris enters the interactive learning phase Use the buttons of the dialog box to save characters in the font dictionary To use an existing font dictionary e On the Learn menu click Font Dictionary e Select the dictionary you want to use and click Open e On the Learn menu click either Append Font Dictionary or Read Font Dictio
9. RTF files created by Readiris can also be opened in the web based office applications AjaxWrite ThinkFree Zoho Writer and Google Writely which opens both RTF and HTML documents When using one of these applications make sure to select the layout option Retain Word and Paragraph Formatting Operation e Click the Format button on the main toolbar to select the output mode Readiris either 61 Chapter 10 Formatting and saving documents o sends documents to an application which will open automatically or o saves documents as an external file The option Send by e mail creates a new e mail message and inserts the recognized document as e mail attachment Output n 2 Send to Microsoft Word 97 2000 2002 2003 RTF External file Rich Text Format rtf r Open after saving V Send by e mail e Click the different tabs to select the settings you want to apply Settings that are unavailable for the selected output format appear dimmed The most commonly used output file formats as well as their options are discussed in the sections below e Click Recognize Save to execute the recognition and save the 62 documents The OCR results can be exported several times without repeating the recognition Click the Format button again and change the text format and formatting options Then click Recognize Save again For searching and sorting reasons Readiris allows you t
10. applications and output formats of your choice Er Scan to Word Pa Scan to OpenOffice PDF Archive as PDF i Archive as XPS E Scan to e mail 19 Chapter 4 The Readiris SmartTasks The various SmartTask buttons allow you to 1 Scan and recognize documents and send them directly to Word for text processing Microsoft Word is the default target application See the section Formatting text documents to learn more about the other available applications 2 Scan and recognize documents and send them directly to OpenOffice for text processing OpenOffice org Writer is the default target application See the section Formatting text documents to learn more about the other available applications 3 Scan and recognize tables and send them directly to Excel and other spreadsheets Microsoft Excel is the default target application See the section Formatting table based documents to learn more 4 Scan and recognize documents and archive them as PDF files Adobe Acrobat PDF Image Text is the default output format See the section Creating PDF documents to learn more about the other available formats 5 Scan and recognize documents and archive them as XPS files XPS Image Text is the default output format See the section Creating XPS documents to learn more about the other available formats 6 Scan and recognize documents and send them directly by e mail 20 Readiris Pro 12 User Guide
11. attention to line skew Line skew over 0 5 increases the risk of OCR errors 31 Readiris Pro 12 User Guide CHAPTER 6 ADJUSTING SCANNED DOCUMENTS When opening or scanning extremely light or extremely dark grayscale and color images it may be necessary to adjust those images before executing the recognition in order to obtain satisfactory OCR results To adjust images e Open or scan a color grayscale document Make sure that the scanner settings are correct Important the scanner settings and adjustment settings appear to be the same but note that both sets of settings are applied at different stages in the scan recognition process e On the Process menu click Adjust image Readiris uses intelligent binarization routines to convert color grayscale images into black and white images which are used to perform OCR on 33 Chapter 6 Adjusting scanned documents o Select Smoothen color image to even out the image This option renders grayscale and color images more homogeneous by smoothening out differences in intensity As a result a stronger contrast is created between the foreground text and background artwork Note sometimes smoothening is the only way to separate text from a colored background Original image Binarized black and white image IN QUEST OF CALYPSO from only 1 650 16 nights Sth 25th Oct 2000 Smoothened image o Use the slider to increase or de
12. the characters have a different width To select the character pitch e On the Settings menu point to Character Pitch e The character pitch is set to Automatic by default 55 Chapter 9 Recognizing documents e Click Fixed if all characters of the typeface have the same width This is often the case in old typewriter documents e Click Proportional if the characters of the typeface have a different width Virtually all fonts in newspapers magazines and books are proportional Important these document characteristics do not apply to Asian Hebrew or Arabic documents USING INTERACTIVE LEARNING Readiris offers an interactive learning function By means of Interactive learning you can train the recognition system on fonts and character shapes and correct the OCR results if necessary During interactive learning any characters the recognition system isn t sure of are displayed in a preview window in combination with their parent word and the proposed solution Interactive learning can substantially enhance the accuracy of the recognition system and is particularly useful when recognizing distorted defaced forms Interactive learning can also be used to train Readiris on special symbols it is unable to recognize initially such as mathematical and scientific symbols and dingbats To enable interactive learning e On the Learn menu click Interactive Learning e Define the necessary settings and click the Recogniz
13. will start your scanner as soon as you click the Scan button and display the scanned document in the interface To scan documents e Click the Scanner button to set the scanner settings Note that several of the options in the Scanner dialog box are also available in the Open dialog box Pa So e Select the correct scanner model If your scanner is not in the list select Twain other models and click OK e Readiris supports almost every flatbed and sheet fed scanner all in one device MFP Multifunctional Peripheral digital camera and scanner standard currently available Readiris is Twain compliant Note that the Configure button is only available when applicable Click it to select the Twain source e Select the scanner settings 27 Chapter 5 Scanning documents Scanner S odel 2 Format u HP Scanjet 5590 TWAIN ad een me Resolution An Contrast 0 300 Landscap Black and white Invert Bri 0 D Greyscale C Inve ane 5 Color Digital camera l j Z Process as 300 dpi darken lighten Bl Smocth a 7 Optimize resolution for OCR u ren color images Scan another page after 10 second s Format and Resolution Readiris supports a wide range of paper formats and resolutions Note that it is recommended to use a scan resolution of 300 dpi Use a resolution of 400 dpi when recognizing business cards Asian text or very small print Color mode Readiris can scan docume
14. window in the middle e the document panel at the bottom 16 Readiris Pro 12 User Guide The document panel displays statistical information about the documents that are open in Readiris such as the scan and OCR time the resolution width and height of the documents etc if Readiris C Users Desktop RI12 doc_type pdf page 1 of 4 File Edit Settings View Process Lear Register Help YB Options rd Scanner m Save English USA D D m e BS p Format CHANGING THE USER INTERFACE LANGUAGE The user interface of Readiris is available in a wide range of languages To change the user interface language e On the Settings menu click User Interface Language e Inthe Language list select the required language then click OK to confirm Note If you selected an incorrect language click Ctrl U The Language dialog box will open and you will be able to select another language in the list 17 Readiris Pro 12 User Guide CHAPTER 4 THE READIRIS SMARTTASKS USING THE READIRIS SMARTTASKS When starting Readiris click anywhere in the Readiris startup screen and click Cancel when the OCR Wizard launches The Readiris SmartTasks will be displayed The SmartTasks are predefined commands that allow you to use the most frequent Readiris functions at the touch of a button Simply click the SmartTasks to scan documents or image files to the target
15. Microsoft XPS files 77 Chapter 10 Formatting and saving documents IHQC COMPRESSING XPS DOCUMENTS Besides four types of regular XPS output Readiris offers IHQC compressed XPS output XPS documents of the types Image Text and Image can be hyper compressed by means of iHQC iHQC stands for intelligent High Quality Compression I R LS proprietary efficient compression technology iHQC is to images what MP3 is to music and what DivX is to movies To generate iHQC compressed XPS output e Click the Format button on the main toolbar and choose between the two output modes e In the Send to or External file list select the PDF type of your choice XPS Image Text or XPS Image e On the XPS Options tab select the required compression level Readiris supports Level I Good size and Level I Good quality compression oi t PDF options XPS options Optio s SpreadsheetML optio s Signat e v Create bookmarks XPS intelligent High Quality Compression iHQC Level I Good size v 78 Readiris Pro 12 User Guide SELECTING THE GRAPHICS OPTIONS Depending on the output format and target application you select advanced graphics options may be available The graphics options can be used to alter the image quality and resolution To access the graphics options e Click the Format button on the main toolbar and select the output format of your choice in the Send to or External file drop down li
16. Place the mouse pointer over a marker on the sides and in the corners of the window Click the marker and drag the mouse to modify the window size Moving windows Select the window you want to move Click inside the window and drag the mouse to modify the position of the window Recognizing a particular window Right click the window you want to recognize and select Copy as Text The results are sent to the clipboard as body text This also works for handprinted text Graphic windows and barcode windows can also be copied to the clipboard Deleting windows Select the window s you want to delete or click the command Select All on the Edit menu to select all windows Select the commands Cut or Clear on the Edit menu to cut or delete the windows 46 Readiris Pro 12 User Guide or Right click the selected windows point to Window then click Delete Deleting small windows Some documents faxes for instance often have stray dots on pages causing Readiris to create superfluous windows that do not contain text To erase all small windows click Delete Small Windows on the Edit menu This option erases all windows smaller than 0 5 and re sorts the remaining zones USING WINDOWING TEMPLATES When OCRing many documents with a similar page layout it may be useful to use windowing templates instead of automatic page analysis That way the same window structure is applied to all scanned or open
17. TXT TIFF etc Arabic and Farsi recognition Hebrew recognition Large volume recognition Automated processing Document indexing Business card recognition Readiris Pro 12 User Guide CHAPTER 2 INSTALLING READIRIS SYSTEM REQUIREMENTS This is the minimal system configuration required to use Readiris e a486 based Intel PC or compatible A Pentium based PC is recommended e 256 MB RAM e 120 MB free disk space 105 MB of disk space suffices when you do not install the sample files e the Windows Vista Windows XP or Windows 2000 operating system Note that some scanner drivers may not work under the latest version s of Windows See the documentation supplied with your scanner to find out which platforms are supported Chapter 2 Installing Readiris SOFTWARE INSTALLATION To install the software 10 Log on to Windows as administrator or make sure you have the necessary administration rights Connect your scanner to your PC and install the corresponding software Test your scanner If you experience any problem contact your scanner manufacturer Insert the Readiris CD ROM in the CD ROM drive and follow the on screen instructions to install the software Click Readiris to start the installation additional software products are offered Copernic Desktop Search Home Edition and Cardiris 4 LE Select the installation language and click OK Accept the terms of the license agreement
18. alyze page button on the image toolbar E 48 Readiris Pro 12 User Guide e Click Recognize Save to execute the OCR 49 Readiris Pro 12 User Guide CHAPTER 9 RECOGNIZING DOCUMENTS INTRODUCTION To recognize documents Readiris applies linguistics during the recognition phase As a result Readiris recognizes text tables graphics barcodes and handprinted text in all kinds of documents Readiris even copes with complex columnized documents low quality documents faxes dot matrix printouts badly scanned and copied documents containing too light or dark font shapes etc Readiris supports 128 languages all American and European languages are supported including the Central European Baltic and Cyrillic languages as well as Greek and Turkish Optionally Readiris can read Arabic Farsi and Hebrew documents and four Asian languages Japanese Simplified and Traditional Chinese and Korean Readiris even copes with mixed alphabets the software detects Western words that occur in Greek Cyrillic Arabic Hebrew and Asian documents many untranscribable proper names brand names etc are written using the Western symbols Readiris is based on the most advanced recognition technologies Font independent text recognition is complemented by self learning techniques The system is able to learn new characters and words through contextual and linguistic analysis This means that the OCR accuracy of th
19. and save it as an iHQC compressed PDF file Readiris Pro 12 User Guide REPURPOSING PDF DOCUMENTS Next to generating PDF documents Readiris can also repurpose PDF files Readiris converts image PDFs into text PDFs or any other supported text format and unlocks read only PDF content Warning Readiris does not open user password protected PDF documents Operation e Click the Open button on the main toolbar and select the PDF file you want Readiris to repurpose er In the Page range area of the dialog box select Pages and indicate which pages you want Readiris to open e Click the Open button in the dialog box to open the PDF file of your choice e Click the Format button on the main toolbar and select the PDF type of your choice For more information on the PDF types see the section Creating PDF documents e Click the Recognize Save button to repurpose the document CREATING XPS DOCUMENTS Readiris generates four types of XPS files Text Text Image Image Text and Image 75 Chapter 10 Formatting and saving documents XPS stands for XML Paper Specification and is a fixed layout format developed by Microsoft To generate XPS output Click the Format button on the main toolbar and select the XPS type of your choice in the Send to or External file drop down list XPS Image When you select XPS Image Readiris generates image only XPS documents it does not execute OCR XPS I
20. and white images 29 Chapter 5 Scanning documents Using a digital camera Select Digital camera when you are using a camera as scan source Readiris uses special recognition routines to process digital camera images Tips for using a digital camera as scan source Calibrate the camera by photographing a white document Always select the highest image resolution Enable the macro mode of the camera to take close ups Only use optical zoom not digital zoom Hold the camera directly above the document Avoid photographing the document at an angle Produce stable images Use a tripod if necessary Disable the flash when capturing glossy paper Avoid opening compressed camera images Adapt the Readiris brightness and contrast settings to the environment day light lamp light neon light Select color or grayscale as color mode Processing as 300 dpi Select Process as 300 dpi when you are processing images of an incorrect or unknown resolution The images will be processed as if they had a 300 dpi resolution The resolution of digital camera images is nearly always unknown Smoothening color images This option is selected by default as image smoothening is needed with some scanners to recognize color and grayscale images successfully 30 Readiris Pro 12 User Guide e When you are done defining all the settings Scanner settings Options click Scan to scan documents Note pay
21. ble output formats and options depend on the selected SmartTask See the chapter Formatting and saving documents to learn more about the available formats and options e When you are done configuring the SmartTasks use the buttons on the main toolbar to specify the language settings and image enhancement options and if still needed the Scanner settings a Options yO Scanner English USA For more information on the above mentioned settings see the sections Selecting the options Scanning paper documents and Selecting the document language e Finally click the SmartTask to use it Readiris will go through the entire recognition process automatically 22 Readiris Pro 12 User Guide CHAPTER 5 SCANNING DOCUMENTS SELECTING THE OPTIONS Before scanning paper documents or opening image files you can select several image enhancement options When enabled these options will be applied during the opening and scanning of documents Operation e Click the Options button on the main toolbar to select several image enhancement options Page Deskewing Rotation v Page Analysis o Click Page Deskewing to straighten pages scanned at an angle o Point to Rotation and determine whether you want Readiris to rotate pages automatically or 90 to the left 90 to the right or 180 Note that these two options slow down the scanning process somewhat Only select them when necessary o Page Analysis
22. button to execute the OCR Sorting windows To change the sort order of windows click the Sort button on the image toolbar and click the windows one by one in the required order When you are done click the Recognize Save button to execute the OCR Windows you do not click will be excluded from recognition Drawing polygons Windowing documents manually is not limited to rectangular shapes You can create polygonal windows by merging rectangular 44 Readiris Pro 12 User Guide ones Whenever two windows of the same type intersect they become a polygon automatically Give the Heute a Break Automatic page analysis Should the current page be too complex to window manually click the Analyze page button on the image toolbar to window the page automatically Note that barcode zones and handprinted zones always need to be drawn manually Changing the window type To change the window type of a window right click the window point to Window then to Type and then click the required window type You can also change the window type of several windows simultaneously e Click the pointer button on the image toolbar e Hold down the Shift key while selecting multiple windows 45 Chapter 8 Windowing documents e Right click any of the selected windows point to Window then to Type and then click the required window type Modifying the window size Click the window you want to modify
23. ce 17 character pitch 55 INDEX color image 26 33 color mode 28 CONILASU ns 28 36 D deskewing ceeenen 23 despeckling 36 digital camera 30 document characteristics 54 document panel 16 document properties 62 dot matrix wo eee eee 55 E editing multipage documents 86 Excel output 61 F factory settings 81 font dictionaries 58 TONE types east 54 95 Index G graphics options 79 grayscale image 26 H handprinting 89 90 Hebrew documents 4 7 52 HTML output 61 I image toolbar 16 installation eee 10 interactive learning 56 interval scanning 85 inverted images 29 J JPEG 2000 compression 80 L language 52 layout files 0 47 layout options 64 line Skew zn sr 31 loading settings 81 96 M main toolbar 16 manual windowing 43 Middle East edition 4 7 52 mixed languages 54 multipage documents 83 85 N DUMENIC Hy rn 53 O OpenDocument output 61 0 0 6 0 1 ee 23 output formats ee 61 P page analysis 23 page deskewing 23 Pages a en 86 dele
24. ce in the drop down list 52 Readiris Pro 12 User Guide Langu ee guage E Numeric N lEnalish USA 2 English USA a Dutch p French Spanish German Afaan Oromo i Afrikaans Albanian Asturian Aymara L Azeri Latin Balinese Basque Bemba Bikol m The 5 most recently selected languages are moved to the top of the language list Important select the document language before executing page analysis when you are dealing with Asian Hebrew and Arabic documents Specific page analysis routines are used for these documents The recognition can also be limited to a numeric character set to optimally recognize tables and figures Readiris then only recognizes the numerals 0 9 and the following series of symbols period opening closing parenthesis parenthesis dollar sign pound sign euro sign To activate numeric mode select Numeric in the Language dialog box 53 Chapter 9 Recognizing documents Recognizing documents with mixed languages Readiris also allows you to enable mixed character sets That way Readiris switches languages in the middle of a sentence automatically and recognizes English words proper names etc that occur in exotic languages Click the globe button on the main toolbar and select the required language combination in the language drop down list Note when processing Asian or Hebrew documents mix
25. cial offers on I R I S products To register Use the Registration wizard on the Register menu Follow the on screen instructions PRODUCT SUPPORT Once you have registered your product you are entitled to product support from I R l S on all basic software functionalities Contact LR LS at Europe support irislink com Tel 32 10 45 13 64 USA support irisusa com Tel 1 800 447 4744 Asia Pacific support irislink com Tel 852 22646133 12 Readiris Pro 12 User Guide CHAPTER 3 GETTING STARTED RUNNING READIRIS To run Readiris e Start Readiris from the Windows Start menu or double click the shortcut on your desktop Readiris Pro 12 e Click anywhere in the startup screen to launch Readiris The OCR Wizard automatically opens USING THE OCR WIZARD The OCR Wizard allows you to define all the settings needed to operate Readiris efficiently When you start Readiris click anywhere in the startup screen to start the OCR Wizard 13 Chapter 3 Getting started Step 1 Select the image source You can capture images using your scanner or open image files Select the rotation and deskewing options you want to use For more information see the section Selecting the options To familiarize yourself with Readiris use the sample images provided with the software They can be found on the Readiris CD ROM and in the subfolder Samples of the Readiris installation fol
26. crease the Brightness The Brightness settings determine the overall brightness of the image Use these settings to darken or lighten the image when the text is illegible 34 Readiris Pro 12 User Guide Example 1 lighten a dark image to eliminate the page background Color image Binarized image The default binarization settings yield a black image Verenigde Staten een antwoord te vi The lightened image yields satisfactory recognition results Example 2 darken an image when the text is so light it doesn t show up in the binarized image wyiscia kazdego Jrawid ze nasze Color image 35 Chapter 6 Adjusting scanned documents wy kazdego Beye Fee FITS FO Binarized image The default brightness settings yield fragmented characters wyjscia kazdego brawia ze nasze The darkened image yields satisfactory recognition results o Use the slider to increase or decrease the Contrast The Contrast settings determine the contrast between darker and lighter zones of an image Use these settings to make character shapes stand out against a colored background Color image A Look at International Planning the Future Default contrast settings yield broken characters A Look at International Planning the Future Increased contrast settings yield satisfactory recognition results o Use the slider to increase or decrease the Despeckle options 36 Readiris
27. der Click Next to go to the next step Step 2 In case you selected a scanner click the Change button to select the scamner settings For more information on the scanner settings see the section Scanning paper documents Click OK to save the settings Click Next to go to the next step Step 3 Click the Change button to change the document language The document language is set to American English by default Select the required language or language combination and secondary languages in the list and click OK Use the slider to set the required Speed Accuracy settings For more information see the section Selecting the document language Click OK to save the settings Click Next to go to the next step Step 4 Click the Change button to change the output format or target application The default target application is Microsoft Word 14 Readiris Pro 12 User Guide Select the required output format or application in the Send to or External file list Click the various tabs and select the options of your choice Options that are unavailable for the chosen format application appear dimmed For more information see the chapter Formatting and saving documents Click OK to save the settings Click Next to go to the next step Step 5 Click GO to open scan and recognize the document USER INTERFACE To explore the Readiris interface click anywhere in the Readiris startup screen and click Cancel w
28. e Save button to recognize the document 56 Readiris Pro 12 User Guide e At the end of the recognition Readiris enters the interactive learning phase The characters the recognition system isn t sure of are displayed New Dictionary C Users Documents Readiris DUS Other procedures that do not necessarily fall under this rubric such as Be angioplasty Se GE Date Ue Lt If the results are correct o Click the Learn button to save the result as sure The learning results are temporarily stored in the computer memory for the duration of the recognition Readiris will no longer display the learned characters when OCRing the rest of the document When a new document is OCRed the learning results are erased To save learning results permanently use a font dictionary For more information see the section Using font dictionaries o Click Finish to save all solutions the software offers If the results are incorrect o Type in the correct characters and click the Learn button or o Click Don t learn to save the result as unsure 57 Chapter 9 Recognizing documents Use this command for damaged characters which could be confused with other characters if learned E g the number 1 and the letter I which have an identical form in many fonts o Click Delete to delete characters from the output Use this button to prevent document noise from appearing in the output file
29. e recognition system will improve as it goes along Besides that Readiris has an optional user verification function When activated the user verification function Interactive learning not only flags the characters the recognition system isn t 51 Chapter 9 Recognizing documents sure of but also allows to increase the system s accuracy All solutions you confirm are memorized temporarily during recognition increasing the system speed and confidence and rendering the system more intelligent as you go along This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts The interactive learning results can also be stored permanently in font dictionaries for future use SELECTING THE DOCUMENT LANGUAGE Readiris offers OCR in 128 languages Readiris supports all American and European languages including the Central European Cyrillic and Baltic languages as well as Greek and Turkish Readiris Pro Asian and Readiris Corporate Asian additionally recognize documents in Japanese Simplified Chinese Traditional Chinese and Korean Readiris Pro Middle East and Readiris Corporate Middle East additionally recognize documents in Arabic Farsi and Hebrew In order for Readiris to recognize a document the document language must be specified To do so Click the globe button on the main toolbar and select the language of your choi
30. eadsheetML RTF HTM XML TXT TIFF etc Traditional and Simplified Chinese recognition Japanese recognition Korean recognition Readiris Pro 12 Middle East Basic features 128 recognition languages Supports PDF DCX DJV DJ VU JPG JPEG J2C J2K JP2 PNG TIF TIFF Readiris Corporate 12 Asian Basic features 128 recognition languages Supports PDF DCX DJV DJ VU JPG JPEG J2C J2K JP2 PNG TIF TIFF BMP PCX Generates four types of PDF files PDF iHQC level 1 111 PDF A four types of XPS XPS iHQC level I DOCX ODT XLS WordML SpreadsheetML RTF HTM XML TXT TIFF etc Traditional and Simplified Chinese recognition Japanese recognition Korean recognition Large volume recognition Automated processing Document indexing Business card recognition Readiris Corporate 12 Middle East Basic features 128 recognition languages Supports PDF DCX DJV DJ VU J PG JPEG J2C J2K JP2 PNG TIF TIFF Chapter I Introducing Readiris BMP PCX Generates four types of PDF files PDF iHQC level I four types of XPS XPS iHQC level I DOCX ODT XLS WordML SpreadsheetML RTF HTM XML TXT TIFF etc Arabic and Farsi recognition Hebrew recognition No Mac version available BMP PCX Generates four types of PDF files PDF iHQC level I I11 PDF A four types of XPS XPS iHQC level I DOCX ODT XLS WordML SpreadsheetML RTF HTM XML
31. ed characters sets are used automatically Speed Accuracy Select the right trade off between OCR speed and OCR accuracy Recognition Speed Accuracy This trade off is available for the Latin Cyrillic and Greek alphabets Tip favor accuracy over speed when the image quality is rather poor DEFINING THE DOCUMENT CHARACTERISTICS Next to the document language other document characteristics such as the Font type and Character pitch play an important role in the recognition process 54 Readiris Pro 12 User Guide Font type Readiris distinguishes between regular and dot matrix printed documents Dot matrix symbols of the type 9 pin are made up of isolated separate dots Special segmentation and recognition techniques are required to recognize dot matrix documents and need to be activated Far out in the uncharted back To select the font type e On the Settings menu point to Font type e The font type is set to Automatic by default That way Readiris recognizes 25 pin or NLQ Near Letter Quality dot matrix or other normal printing e To recognize only dot matrix printed documents click Dot matrix Readiris will recognize so called draft or 9 pin dot matrix printed documents Character pitch The character pitch is the number of characters per inch in a typeface The character pitch can either be fixed in which case all characters have the same width or proportional in which case
32. ed documents which speeds up the process Operation e Window the first page of the document manually by using the image toolbar buttons For more information see the section Windowing documents manually e On the File menu click the command Save Layout e Open or scan the other pages of the document by clicking the Open or Scan button on the main toolbar 47 Chapter 8 Windowing documents e On the File menu click the command Load Layout e Select the layout file you saved e To apply the layout to all opened or scanned pages select Apply Layout to All Pages in the Layout file dialog box e Click Open to load the layout file Note that when you add a document to Readiris the layout file must be loaded again as page analysis is enabled by default Ignore exterior zone As an alternative to windowing templates you can use the option Ignore exterior zone That way you can define one particular area on the page that needs to be OCRed Any data outside the OCR area will be excluded from recognition Operation e Select Ignore exterior zone by clicking the corresponding button on the image toolbar e Draw a frame around the part of the page you want Readiris to recognize The part of the page you select will be analyzed automatically You will be prompted whether you want to ignore the same exterior zone for all pages of the document To cancel this function re execute Page Analysis by clicking the An
33. ent feeder Chapter I Introducing Readiris THE READIRIS SERIES The table below gives an overview of the available versions Readiris Home 12 Limited features 25 recognition languages Supports PDF DCX DJV DJ VU JPG JPEG J2C J2K JP2 PNG TIF TIFF BMP PCX Generates PDF Image Text DOCX ODT WordML SpreadsheetML RTF HTM XML TXT TIFF etc output Readiris Pro 12 Basic features 128 recognition languages Supports PDF DCX DJV DJ VU JPG JPEG J2C J2K JP2 PNG TIF TIFF BMP PCX Generates four types of PDF files PDF iHQC level I four types of XPS XPS iHQC level I DOCX ODT XLS WordML SpreadsheetML RTF HTM XML TXT TIFF etc Readiris Corporate 12 Basic features 128 recognition languages Supports PDF DCX DJV DJ VU JPG JPEG J2C J2K JP2 PNG TIF TIFF BMP PCX Generates four types of PDF files PDF iHQC level 1 111 PDF A four types of XPS XPS iHQC level I DOCX ODT XLS WordML SpreadsheetML RTF HTM XML TXT TIFF etc Large volume recognition Automated processing Document indexing Business card recognition Readiris Pro 12 User Guide Readiris Pro 12 Asian Basic features 128 recognition languages Supports PDF DCX DJV DJ VU JPG JPEG J2C J2K JP2 PNG TIF TIFF BMP PCX Generates four types of PDF files PDF iHQC level I four types of XPS XPS iHQC level I DOCX ODT XLS WordML Spr
34. ents cssccsssscssseees 33 Chapter 7 Saving documents as image files seseseees 39 Chapter 8 Windowing documentts ccsscccsscccssscessscessseees 41 Windowing documents automatically 41 Windowing documents manually eee 43 Using windowing templates 47 Chapter 9 Recognizing documents csscssscccssccsssssessseees 51 Introduction tintei aniisi si 51 Selecting the document language eee 52 Defining the document characteristics 54 Using interactive learning uesseessersseessennnennnennnnnnn 56 Using font dictionaries eeessnesssersnernneennnennnennnennnnnnn 58 Chapter 10 Formatting and saving documentsS escrsseeesoeees 61 Formatting documents 0 eee cee cee cese cess cess enseeeaeees 61 Formatting text document8 uennessesssersnesnnnennnennnnnnnnnn 63 Formatting table based documents uen 67 Creating PDF documents anieri i 71 Selecting the PDF options uusseseesnennnsnnennenne 72 i1HQC compressing PDF documents eee 73 Repurposing PDF documents eee ee eee eeeeeeeee 75 Readiris Pro 12 User Guide Creating XPS documents ucssesssersnerssernnnnnnnennnennnnnn 75 Selecting the XPS options nsessersensnnesnnesnnennenne 77 iHOC compressing XPS documents eee 78 Selecting the graphics options ussesennnnnennn 79 Chapter 11 Saving and loading settings
35. ext maintains the original colors of the text across the recognition e The option Retain colors of background maintains the spot colors of the page background across the recognition A uniform background color is created per paragraph in the output file Paper sizes Depending on the format you selected you can select preferred paper sizes 66 Readiris Pro 12 User Guide e Click the Paper size tab and use the arrow buttons to apply and exclude paper sizes e Readiris will go through the active paper sizes in the indicated order and will use the first paper size that is sufficiently large to hold the scanned document FORMATTING TABLE BASED DOCUMENTS With Readiris you can output tabular data to spreadsheets word processors and web browsers tables are reconstructed cell by cell in worksheets and inserted as table objects in word processor files Readiris recognizes both gridded and non gridded tables Performance test optical media CD ROM Average 123 985 69 31 time n 60 987 745 129 2 Tested on 333 MHz Pentium II 287 410 49 52 58 19 149 91 gridded non gridded To generate table based documents e Click the Format button on the main toolbar and select the output format of your choice in the Send to or External file drop down list e Select the layout options of your choice 67 Chapter 10 Formatting and saving documents Layout PDF options XPS options Option
36. ge Recognition Integrated Systems S A OCR ICR and barcode technology by I R I S AutoFormat and Linguistic technology by I R I S BCR and field analysis technology by I R I S iHQC compression technology by LR LS XML parser developed by Apache This product includes software developed by the Apache Software Foundation All other products mentioned in this user guide are trademarks or registered trademarks of their respective owners Readiris Pro 12 User Guide CHAPTER 1 INTRODUCING READIRIS SAVE TIME AVOID RETYPING Congratulations on acquiring Readiris This software package will undoubtedly be of great help in recapturing your texts tables graphics barcodes and handprinted texts As efficient as computers are you have to key in your information first If you have ever retyped a 15 page report or a large table of figures you know how tedious and time consuming it can be Use this state of the art OCR package to automatically convert paper documents or scanned image files into text searchable and editable documents that can be archived and shared Two recognition modes are available one ensures maximal speed the other guarantees optimal OCR accuracy Scan a printed or typed document indicate the zones you want to recognize with Readiris or have the system detect them for you execute the character recognition and export the document to your word processor Documents composed of many pages are processed from
37. groups of documents which all require different settings it is useful to save separate settings files for each group Operation Select the settings you want to use for a certain document group e On the File menu click the command Save Settings e When scanning or opening a document of the same group at a later time click the command Load Settings on the File menu e Select the correct settings file and click the Open button e Click Recognize Save to recognize the document using the correct settings Note the Info command on the File menu gives an overview of the most important settings you selected 81 Readiris Pro 12 User Guide CHAPTER 12 RECOGNIZING MULTIPAGE DOCUMENTS OPENING AND RECOGNIZING MULTIPLE IMAGE FILES Readiris is designed to process multiple image files at a time To open multiple image files e Click Open on the main toolbar _ a e Select the image files you want to open o Select the first image file and hold down the Ctrl key as you select additional images or o Select a continuous range of image files by clicking the first image and holding down the Shift key as you select the last image Note when you open a single file that consists of multiple pages e g a multipage TIFF file or a PDF document you can indicate the page range In the Page range area of the dialog box select Pages and indicate which pages you want to recognize e Click the Open button to open
38. hen the OCR Wizard launches The empty Readiris interface will be displayed KJ Readiris ot xe Help Pa Scan to Word Pa Scan to OpenOffice File Edit Settings View Process Learn Register WA Options BEPPE ee a F ga Scanner Archive as PDF 2 Archive as XPS English USA B Scan to e mail By Format Be 15 Chapter 3 Getting started The Readiris interface is composed of e the SmartTasks in the middle The SmartTasks are predefined commands that allow you to use the most frequent Readiris functions at the touch of a button Click the SmartTask you want to use to scan recognize and send your documents to the target application or output format of your choice The SmartTasks apply default settings but can be configured easily by right clicking to fit more particular needs e the main toolbar left toolbar Use the main toolbar commands and options to scan and recognize documents manually The order in which you are advised to do so is given in the OCR Wizard e the image toolbar right toolbar Use the image toolbar buttons to edit documents in the Readiris interface Point to the different buttons to display their tooltips When a document has been opened or scanned in Readiris three main zones are added to the interface e the page toolbar right of the main toolbar The page toolbar displays the page thumbnails which provide settings information if pointed to e the image
39. is enabled by default 23 Chapter 5 Scanning documents This way scanned or opened images will be split up in windows automatically You can also use the windowing tools on the image toolbar to modify the page analysis results or to window documents manually For more information see the chapter Windowing documents e When you are done defining all the settings Scanner settings Options click the Scan or Open button to scan documents or open image files Note that the above mentioned options are also available on the Settings menu OPENING IMAGE FILES With Readiris you can either process paper documents you scan with your scanner or process already existing images files of various formats To open existing image files e Click the Open button to search for image files So 24 Readiris Pro 12 User Guide _ Date taken brazilian tif i card jpg cards jpg O28 Pe Ey Size colors2 jpg columns jpg amp czech tif catalan jpg deskew jpg i colorsl jpg i digital jpg lt File name columns jpg Files of type Al image files V Load PDF documents in color F Digital camera V Smoothen color images E Process as 300 dpi Page range All pages Pages Tip you can also drag image files to the Readiris image window to open them Tip Right click any image file you want to open point to Open With and click IOCR applicati
40. izes your texts but can format them for you as well Various levels of formatting are available When you make use of autoformatting Readiris recreates a facsimile copy of the scanned document the word paragraph and page formatting of the original document are retained Similar typefaces are used the point sizes and type styles as used in the source document are maintained across the recognition The placement of columns text blocks and graphics follows your original documents Readiris can even include the background photo of a scanned page in the recognized document And as Readiris supports grayscale and color scanning effortlessly you can recapture any graphics be they line art black and white photos or color illustrations When a document contains tables Readiris reorganizes them in real cells and recreates the cell borders of the original tables In other words Readiris allows you to archive a true copy of your documents be it editable and compact text files instead of scanned images Barcodes that occur on a scanned page can also be read and the same goes for handprinted text provided you write well spaced block letters Readiris is Twain compliant and supports a wide range of flatbed and sheetfed scanners all in one devices or MFPs multifunctional peripherals and digital cameras Interval scanning allows you to scan multipage documents efficiently when your scanner is not equipped with a docum
41. lp a Options ER Yo Scanner i i Save English USA Format Moving a page inside a document e Right click the page you want to move and click Select Page 86 Readiris Pro 12 User Guide e Drag the page to the correct position e Or right click a page and click Move Page Up or Down Deleting a page e Right click the page you want to delete and click Delete page e Or select the page and hit the Delete button on your keyboard Excluding a page from recognition e Right click the page you want to exclude and click Exclude page e Or clear its page number box in the document panel Excluded pages are stricken out in the page toolbar Excluded pages are ignored when you print the scanned images and when you save the scans to multipage image files Page Image source Scan time Oh C Docume 5 68 M2 c Pocume 9 45 Tip the commands Include All Pages and Exclude All Pages on the Edit menu apply to all pages simultaneously 87 Readiris Pro 12 User Guide CHAPTER 13 RECOGNIZING HANDPRINTED TEXT Next to typed text tables graphics and barcodes Readiris recognizes handprinted text Handprinting consists of separated block letters CELL PHONE It takes highly specialized ICR software intelligent character recognition to recognize handprinted characters To recognize handprinting e Click the handprinting button on the image
42. mage Text When you select XPS Image Text Readiris recognizes text and creates searchable XPS files that contain the page image and the recognized text The page image is placed on top of the text With this format you can always see the original document as it was scanned while you are able to search for and copy paste the OCRed text which is hidden beneath the image As a result this format is useful for archiving purposes XPS Text When you select XPS Text Readiris recognizes text and creates searchable XPS files The page image is not contained in these single layered XPS files 76 Readiris Pro 12 User Guide XPS Text Image When you select XPS Text Image Readiris recognizes text and creates searchable XPS documents that contain the page image and the recognized text The page image is contained beneath the text SELECTING THE XPS OPTIONS To select the XPS options e Click the Format button on the main toolbar and select the XPS type of your choice in the Send to or External file drop down list e Depending on the XPS type you select several options are available Click the XPS options tab to access them a out PDF options XPS options Options SpreadsheetML options Signatu e Create bookmarks XPS intelligent High Quality Compression HQC Level I Good size v Create bookmarks The option Create bookmarks creates bookmarks for each text block graphic and table in
43. nary When selecting Append Font Dictionary make sure to enable Interactive Learning Readiris will recognize the character shapes stored in the dictionary and use interactive learning allowing you to store new information in the dictionary When selecting Read Font Dictionary Readiris will recognize the character shapes stored in the dictionary but will not add new content to the dictionary even if Interactive Learning is enabled Note that it is still useful to use Interactive Learning to check and if necessary correct the recognition results which are not saved in the font dictionary Caution do not click Font Dictionary on the Learn menu and open an existing dictionary while the dictionary mode New Dictionary is enabled Otherwise the contents of the existing font dictionary will be erased e Click Recognize Save to recognize the document 59 Readiris Pro 12 User Guide CHAPTER 10 FORMATTING AND SAVING DOCUMENTS FORMATTING DOCUMENTS The documents you OCR in Readiris can be saved in various output formats Readiris saves OCR results as Adobe Acrobat PDF files Microsoft XPS files Word WordML RTF and OpenDocument text files HTML and XML files SpreadsheetML worksheets and Ansi and Unicode text files Besides that Readiris can export results directly to such target applications as Microsoft Word and Excel Adobe Reader Microsoft XPS Viewer the major web browsers and e mail software etc Note
44. nts and open image files in color black and white and grayscale Contrast Brightness Use the slider to determine the appropriate brightness and contrast settings in order to obtain an optimal scan result Optimizing resolution for OCR Select Optimize resolution for OCR to correct the resolution of images scanned with too much detail over 600 dpi Readiris will reduce the resolution 28 Readiris Pro 12 User Guide Note that this option never increases the resolution of images scanned with too little detail Scanning multipage documents When scanning multipage documents and using a scanner equipped with a document feeder select the ADF automatic document feeder option Place the pages you want to scan in the feeder and start scanning Or use interval scanning when using a flatbed scanner select the option Scan another page after and indicate after how many seconds you want Readiris to scan another page For more information see the section Scanning and recognizing multipage documents Important any options that are unavailable for the selected scanner appear dimmed Scanning landscape images Select the Landscape option when scanning landscape oriented images Auto exposure With some scanners the option Auto exposure is selected by default This option adjusts the contrast and brightness settings automatically Scanning inverted images Select the Invert option when scanning inverted black
45. o define document properties of PDF XPS Word RTF WordML SpreadsheetML and HTML output To define the document properties of a document click Document Properties on the File menu Note that the document properties options are also accessible in the Output File dialog box which opens when you click Recognize Save Readiris Pro 12 User Guide Note that when saving a multipage document as external file you can create a separate output file for each page in Readiris or save all pages that belong to the same document to a single output file Simply click the corresponding options in the Output File dialog box Create one file per page and Create one file per document respectively Note however that the options Create one file per page and Create one file per document are only available when saving documents as an external file not when opening documents in a target application FORMATTING TEXT DOCUMENTS With Readiris you can generate several types of text based output formats Readiris offers a o Word WordML RTF txt and OpenDocument Text output To generate text based output files e Click the Format button on the main toolbar and select the output format of your choice in the Send to or External file drop down list e Depending on the text format you selected several formatting options are available Any options that are unavailable for the selected text format appear dimmed 63 Chapter 10 Forma
46. of each column Any text you edit add or remove remains inside its column no text ever flows automatically across a column break Tip disable this option when you have columnized body text You ll ensure the natural flow of the text from one column to the next o The option Add image as page background places the scanned image as page background beneath the recognized text This option increases the file size of the output files substantially however The format PDF Text Image provides the same result for PDF files The option Retain colors of background on the Options tab provides a less drastic more compact alternative General options Click the Options tab to select the general options T 65 Chapter 10 Formatting and saving documents Layout PDF options XPS options Options V Merge lines into paragraphs V Indude graphics Retain colors of text _ Retain colors of background e The option Merge lines into paragraphs enables automatic paragraph detection Readiris wordwraps the recognized text until a new paragraph starts and reglues hyphenated words at the end of a line e The option Include graphics includes the graphics in autoformatted files This is essential to create a true copy of a document Use the graphic options on the Graphics tab to determine the color mode and resolution of the graphics stored inside the output files e The option Retain colors of t
47. ompressed PDF output e Click the Format button on the main toolbar and choose between the two output modes e In the Send to or External file list select the PDF type of your choice PDF Image Text or PDF Image 73 Chapter 10 Formatting and saving documents 74 On the PDF Options tab select the required compression level Readiris Pro supports Level I Good size and Level I Good quality compression Readiris Corporate also supports both Level II and III Good size and Good quality compression as well as Custom compression In Level II compression the option Compress symbols is enabled automatically to compress text compactly In Level III compression also the option Wavelet compression is enabled automatically to compress graphics compactly When you select Custom compression you can enable or disable these options independently of one another You can also use the slider to define the Good size Good quality ratio PDF intelligent High Quality Compression iHQC Level III Good size Acrobat 6 0 and higher HOC Level III Good quality Acrobat 6 0 and higher iHQC Level II Good size Acrobat 6 0 and higher iHQC Level II Good quality Acrobat 6 0 and higher iHQC Level I Good size Acrobat 5 0 and higher iHQC Level I Good quality Acrobat 5 0 and higher iHQC Custom Acrobat 6 0 and higher Disable iHQC compression Click Recognize Save to recognize the document
48. on The Readiris software will open and display the image Tip when loading multipage image files TIFF images and DCX faxes and PDF documents you can define the page range in case you only need a certain chapter of a document for instance To do so click Open on the main toolbar In the Page range area select Pages and enter which pages you want to load See also Opening and recognizing multiple image files Tip to speed up the loading process click the Open button and deselect Load PDF documents in color when processing PDF documents e Readiris supports the following graphic formats 25 Chapter 5 Scanning documents Graphic format Adobe Acrobat PDF DCX fax DjVu images JPEG images JPEG 2000 images images uncompressed and LZW PackBits Group 3 Group 4 and JPEG compressed Windows bitmap ZSoft Paintbrush images e Select the image file of your choice and click Open Note the options of the Input dialog box also apply to document scanning and are discussed in the Scanning paper documents section Note that you can specify other settings before opening or scanning documents For more information see the sections below SCANNING PAPER DOCUMENTS With Readiris you can either process paper documents you scan with your scanner or process already existing images files of various formats 26 Readiris Pro 12 User Guide When you process paper documents Readiris
49. on on the main toolbar and select the PDF type of your choice in the Send to or External file drop down list e Depending on the PDF type you select several options are available Click the PDF options tab to access them out PDF options PS options Options SpreadsheetML options Paper size Graphic 7 Create bookmarks E Create PDF A compliant files Embed fonts PDF1 4 Acrobat 5 0 and higher x PDF intelligent High Quality Compression HQC Level I Good size Acrobat 5 0 and higher 72 Readiris Pro 12 User Guide Create bookmarks The option Create bookmarks creates bookmarks for each text block graphic and table in Adobe Acrobat PDF files Embed fonts Select the option Embed fonts to embed fonts in Adobe Acrobat PDF files Embedding fonts prevents font substitution and ensures that readers regardless of their computer configuration see the text in its original fonts Embedding fonts increases the file size of recognized documents somewhat IHQC COMPRESSING PDF DOCUMENTS Besides four types of regular PDF output Readiris offers IHQC compressed PDF output PDF documents of the types Image Text and Image can be hyper compressed by means of iHQC without loss of image quality iHQC stands for intelligent High Quality Compression I R LS proprietary efficient compression technology iHQC is to images what MP3 is to music and what DivX is to movies To generate iHQC c
50. per size that is sufficiently large to hold the scanned document 70 Readiris Pro 12 User Guide CREATING PDF DOCUMENTS Readiris generates four types of PDF output Text Text Image Image Text and Image To generate PDF output Click the Format button on the main toolbar and select the PDF type of your choice in the Send to or External file drop down list PDF Image When you select PDF Image Readiris generates image only PDF documents it does not execute OCR PDF Image Text When you select PDF Image Text Readiris recognizes text and creates searchable PDF files that contain the page image and the recognized text The page image is placed on top of the text With this format you can always see the original document as it was scanned while you are able to search for and copy paste the OCRed text which is hidden beneath the image As a result this format is useful for archiving purposes PDF Text When you select PDF Text Readiris recognizes text and creates searchable PDF files The page image is not contained in these single layered PDF files 71 Chapter 10 Formatting and saving documents PDF Text Image When you select PDF Text Image Readiris recognizes text and creates searchable PDF documents that contain the page image and the recognized text The page image is contained beneath the text SELECTING THE PDF OPTIONS To select the PDF options e Click the Format butt
51. s Create body text _ Retain word and paragraph formatting Recreate source document C Use columns instead of frames Insert column breaks Add image as page background For more information on formatting options see the section Formatting text documents SpreadsheetML options When selecting Microsoft Excel 2002 2003 as target application specific SpreadsheetML options are available Click the tab SpreadsheetML options to display them Note that the layout option Recreate source document becomes unavailable when this format is selected Layout PDF options XPS options Options SpreadsheetML options _ Ignore all text outside the tables Convert figures into numbers Create one worksheet per page Create one worksheet per table e The option Ignore all text outside the tables saves the tables and ignores all other recognition results All data inside the tables is recaptured any data outside the tables is not 68 Readiris Pro 12 User Guide e The option Convert figures into numbers encodes recognized figures as numbers As a result you can execute arithmetical operations on those cells The text cells in any table remain text Note that only figures inside tables are encoded as numbers Excel exclusively executes mathematical operations on data that is encoded as numbers e The option Create one worksheet per page sees to it that one worksheet is created per
52. scanned page If a page contains tables and text all will be placed on the same worksheet e The option Create one worksheet per table places each table in a separate worksheet and includes the recognized text outside the tables in another worksheet If the recognized document contains several pages you ll see that structure repeated per page 39 5 nNa 1 1 1 2 ABR2 1 2 2 ABR3 1 General options Click the Options tab to select the general options 69 Chapter 10 Formatting and saving documents Layout PDF options XPS options Options V Merge lines into paragraphs Retain colors of text E Retain colors of background e The option Merge lines into paragraphs enables automatic paragraph detection Readiris wordwraps the recognized text until a new paragraph starts and reglues hyphenated words at the end of a line e The option Retain colors of background recreates the background color of each cell Average access CPU igital Versatile Disk time msec utilization 4 CD ROM 24xspeed sa s2 5 CD ROM 32x ed 6 ni d 7 Tested on 333 MHz Pentium Il with 64 MB RAM and 4GB HD Paper sizes Depending on the format you selected you can indicate preferred paper sizes e Click the Paper size tab and use the arrow buttons to apply and exclude paper sizes e Readiris will go through the active paper sizes in the indicated order and will use the first pa
53. st e Click the Graphics tab to display the options Layout PDF options PS options Options Spreadsheet L options Signatu e ass ord Paper size Graphics Black and white graphics Help Retain scan resolution Reduce resolution to 150 dpi JPEG quality least best Color mode Readiris saves graphics in color by default Select Black and white to save graphics in black and white Resolution Readiris retains the scan resolution by default You can also choose to reduce the resolution 79 Chapter 10 Formatting and saving documents Tip When saving documents as HTML files to post on a web site reduce the resolution to 70 dpi screen resolution JPEG quality Graphics stored inside PDF XPS Word and RTF documents are saved in the JPEG format Use the slider to adjust the JPEG quality JPEG 2000 compression When saving files in the PDF or XPS format Readiris can apply JPEG 2000 compression to the color grayscale images stored inside those files JPEG 2000 is the newest more compact version of the JPEG standard Select the option JPEG 2000 compression to apply it 80 Readiris Pro 12 User Guide CHAPTER 11 SAVING AND LOADING SETTINGS Any settings you specify in Readiris are saved automatically for future use after you close the application To restore the factory settings click the command Restore Factory Settings on the File menu When scanning various
54. start to finish in a single effort A few mouse clicks beat long hours of work as Readiris converts your paper documents into editable computer files it s up to 40 times faster than manual retyping The wizard smoothly guides you through the settings required to operate Readiris allowing you to obtain quick and easy results Or use the SmartTasks to speed up the process even more You can send the reading results directly to your word processor or Chapter 1 Introducing Readiris spreadsheet archive them as PDF or XPS files etc To recognize faxes and convert PDF documents drag their image files from Windows Explorer to the Readiris application window Or send an image promptly to Readiris via the context menu Readiris recognizes tabular data and recreates them as worksheets in your spreadsheet software or as table objects inside your word processor your numeric data are immediately ready for further processing Readiris is based on the most advanced recognition technologies Font independent text recognition is complemented by self learning techniques The system is able to learn new characters and words through contextual and linguistic analysis This means that the OCR accuracy of the recognition system will improve as it goes along Readiris supports up to 128 languages all American and European languages are supported including the Central European Baltic and Cyrillic languages as well as Greek and Turkish Optionall
55. terleaved 2 of 5 MSI Pharmaceutical MSI Plessey Kodak patch code PDF 417 PostNet UCC 128 UPC A and UPC E 1 23456 7890 2 C o d 12 8 1 Note that laser printed and inkjet printed barcodes are required in order for Readiris to perform OCR Matrix printed barcodes are not supported as they do not produce sufficient contrast and their resolution is mostly limited to 60 dpi Manual barcode reading e Determine which barcodes you want Readiris to recognize o On the Settings menu click Barcodes 93 Chapter 14 Recognizing barcodes o Select the symbologies you want Readiris to recognize o Determine whether you want Readiris to verify or remove the check digits e Click the barcode button on the image toolbar and draw a frame around the barcodes zones in the document e Click Recognize Save on the main toolbar The entire document including the barcode content will be recognized Note right click a barcode zone and click Copy as Data to copy its content to the clipboard 94 Readiris Pro 12 User Guide A Arabic documents 4 52 Asian documents 6 52 Asian edition 4 7 52 automatic document feeder 85 automatic windowing 41 B background color 65 background color of table cells acta ER EEE TEE 70 barcodes un 93 black and white image 26 brightness 28 34 C changing the user interfa
56. the image s 83 Chapter 12 Recognizing multipage documents Note that you can also drag and drop image files from Windows Explorer to the Readiris image window to open them e The page toolbar will display the opened image files Tip hold the mouse cursor over the page thumbnails to display the settings information per page The page toolbar can be used to edit multipage documents For more information see the section Editing multipage documents Include Page yes Cover page Image source C Documents Scan time 1 07 OCR time 2 03 Resolution 300 Width 2146 Height 2461 Lineskew 7 Rotation e Determine the recognition settings and click Recognize Save to execute the recognition e Should you want to open or scan additional images to the current document click the Scan or Open button on the main toolbar You will be prompted whether you want to delete the current document or not Click yes to delete the current document and start a new one or click no to add the additional scans to the current document 84 Readiris Pro 12 User Guide SCANNING AND RECOGNIZING MULTIPAGE DOCUMENTS Readiris is designed to process documents consisting of multiple pages Readiris Pro processes documents of up to 50 pages Readiris Corporate processes documents of an unlimited number of pages To scan multipage documents in Readiris you can either use the automatic document feeder func
57. ting oo eee 87 excluding 87 MOVING insel 86 selecting 86 paper SIZES oo eects 66 PDF iHQC output 73 Readiris Pro 12 User Guide PDF options 72 PDF output 61 71 product support 12 R recreating source documents 64 registration eee eeeeeeeeeeteeeeee 11 repurposing PDF documents 75 resolution nueneeeseenneeneenn 28 restoring factory settings 81 TOAG ON sekene 23 RTF output eee 61 S saving settings 81 scanner SettingS nen 27 sending documents by e mail62 settings file 81 SmartTasks 0 19 smoothening color images 30 34 speed vs accuracy 54 spreadsheet documents 67 SpreadsheetML output 61 supported image formats 25 system requirements 9 T tables es ale en 67 text documents 63 U Unicode output 61 uninstalling Readiris 11 user interface eee 15 user interface language 17 WwW windowing templates 47 Wizard 2 228088 13 Word output 61 WordML output 61 worksheets 67 X XML output ee 61 XPS iHQC output 78 XPS options 77 XPS output 61 75 97
58. tion when using a sheet fed scanner or use interval scanning function when you are using a flatbed scanner Scanning multipage documents with a document feeder sheet fed scanner e Click the Scanner button on the main toolbar and select the ADF automatic document feeder option e Place the pages in your scanner s document feeder and click Scan to start scanning e Click Recognize Save to recognize the documents Scanning multipage documents with interval scanning flatbed scanner e Click the Scanner button on the main toolbar e Select Scan another page after and indicate the time interval using the arrow buttons 85 Chapter 12 Recognizing multipage documents The scanner will automatically scan another page after the indicated number of seconds without you having to click the Scan button every time Click Abort in the interval scanning dialog box to end the automatic scanning or press ESC on the keyboard Click Pause in the interval scanning dialog box to freeze the scanning interval or press the space bar on the keyboard Click Resume when you re ready to continue EDITING MULTIPAGE DOCUMENTS When multiple documents are opened or scanned in Readiris the page toolbar displays their thumbnails The thumbnails in the page toolbar can be used to edit the multipage documents IEJ Readiris C Program Files Readiris Pro Samples batch tif page 1 of 4 File Edit Settings View Process Learn Register He
59. toolbar e Draw a frame around the handprinted text e Click Recognize Save on the main toolbar The entire document including the handprinted text will be recognized Note right click the handprinted zone and click Copy as Text to recognize only the handprinted zone and send it to the clipboard 89 Chapter 13 Recognizing handprinted text Recognized symbols Handprinting recognition is limited to the Latin alphabet and supports numerals 0 9 uppercase letters A Z and the punctuation symbols comma period plus sign and hyphen Accents umlauts and other special characters are not supported Notes e Readiris supports handprinting not handwriting For more information see the section Handprinting rules e Uppercase characters are replaced by lowercase characters after recognition unless they occur at the beginning of a sentence e The document characteristics language font type and character pitch do not apply to handprinting e Interactive learning does not apply either The ICR technology is based on more than one million writing samples HANDPRINTING RULES Several rules must be taken into account in order for Readiris to recognize handprinting e Write regular well spaced characters ABCDEFGH Jk LMNOP QAR STUVWX Y Z 1234S6F890 Note how the characters A G and Q are written 90 Readiris Pro 12 User Guide Use a sufficiently thick ballpoint Black pens yield better resul
60. ts than blue pens Do not use pencils Don t stylize too much 3443333 s t Excessively stylized characters increase the risk of OCR errors e Don t open loops which should be closed don t close loops which should be open SI3ETVZEFS e Avoid broken characters e Avoid retracing 89942 Retracing reduces the image quality and clarity of handprinted symbols Characters that are entirely stricken out will not be recognized Write ones correctly The number 1 can be written in the anglicized and European style Ones can be underlined or not 91 Chapter 13 Recognizing handprinted text The horizontal underlining bar does not have to touch the rest of the font form Jt I IAAN I 4 2A 17 Tip when less than optimal results are obtained use the I R I S writing form and adapt your writing style The blank I R I S writing form serves as a full page template on which block letters can be filled out correctly and in the right size The form can be found on the Readiris CD ROM and in the Readiris installation folder 92 Readiris Pro 12 User Guide CHAPTER 14 RECOGNIZING BARCODES INTRODUCING BARCODE READING Next to optical character recognition of 128 languages Readiris also offers barcode reading All widespread barcode symbologies are supported Codabar Code 128 Code 39 Code 39 extended Code 39 HIBC Code 93 Datalogic 2 of 5 Discrete 2 of 5 EAN 13 EAN 8 In
61. tting and saving documents Layout options Layout PDF options XPS options Options Create body text Retain word and paragraph formatting Recreate source document Use columns instead of frames Add image as page background e The option Create body text avoids text formatting by Readiris Readiris generates a continuous running text e The option Retain word and paragraph formatting takes an intermediate position between body text and autoformatting The font type size and type style are maintained across the recognition The tabs and the alignment of each block are recreated The text blocks and columns aren t recreated the paragraphs just follow each other The tables are recaptured correctly e The option Recreate source document recreates a facsimile copy of the original document Readiris generates a true copy of the source document no longer a scanned image Readiris also recreates any hyperlinks to e mail addresses and web sites 64 Readiris Pro 12 User Guide o The option Use columns instead of frames creates columnized documents Columnized texts are easier to edit than documents containing multiple frames the text flows naturally from one column to the next Note when the system is unable to detect columns in the source document this formatting mode uses frames as a fallback position o The option Insert column breaks inserts a hard column break at the end
62. y Readiris can read Arabic Farsi and Hebrew documents and four Asian languages Japanese Simplified and Traditional Chinese and Korean Readiris even copes with mixed alphabets the software detects Western words that occur in Greek Cyrillic Arabic Hebrew and Asian documents many untranscribable proper names brand names etc are written using the Western symbols Readiris uses linguistics during the recognition phase not afterwards As a result Readiris recognizes all kinds of documents with top accuracy including low quality documents faxes and dot matrix printouts It copes beautifully with badly scanned and copied documents containing too light or dark font shapes Joined characters are resolved while fragmented characters such as dot matrix symbols are recomposed Besides that Readiris has an optional user verification function When activated the user verification function Interactive learning not only flags the characters the recognition system isn t sure of but also allows to increase the system s accuracy All 4 Readiris Pro 12 User Guide solutions you confirm are memorized increasing the system speed and confidence and rendering the system more intelligent as you go along This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts To increase your productivity further Readiris not only recogn
Download Pdf Manuals
Related Search
Related Contents
Tokai 1406 CK User's Manual HQ W9-SOCLE Un trimestre en images Everest Goggle - Electrocomponents D-Link DCS-6314BS surveillance camera Whirlpool GR9FHMXVQ00 User's Manual Anzeigen - TEAM6 Software KG Dossier de presse - Agence Bretagne Presse Live 802 User Guide Copyright © All rights reserved.
Failed to retrieve file