Introduction to Forms Processing
Session 4 – Introduction to Data Capture
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region
Doha, State of Qatar, 18-22 May 2008
Fred Highland
Census Practice Architect
Lockheed Martin Transportation & Security Solutions
© 2008 Lockheed Martin Corporation. All Rights Reserved.

What is Forms Processing?
• Function
    The collection and extraction of respondent data from paper forms
• Advantages
    Response is handwritten on paper
    Most people can read and write
    Respondent needs no special tools or equipment
    Form becomes an archival record
• Disadvantages
    Forms must be printed, distributed and collected
    Data must be captured from handwriting
    Forms can be lost or damaged
    Forms must be discarded

Process Flow
[Diagram: workflow with steps numbered 1 to 10. Labels: Mail, Paper Forms, Registration, Document Preparation, Paper Trays of Forms, Questionnaire Scanning, Automatic Imaging & Recognition, Key from Image, Quality Control, Edits/Coding, Workflow, Check-Out, Final Storage, Disposition]

Preparation
• Form Design
    Respondent Friendly
        • Question design and layout
        • Person vs. topic structure
    Capture Friendly
        • Dropout color
        • Segmentation
        • Registration and barcodes
    Printer Friendly
        • Page size
        • Number of pages
        • Binding
        • Packaging
• Printing
    Production and distribution of forms
    Addressing/Personalization
• Form Definition
    Defining the form to the processing system

Registration
• Identifying incoming forms
    Respondents vs. non-respondents
    Priority processing
• Issues
    Volume!
    Accuracy of identification
[Chart: Census Composite Daily Receipt. Series: Mail Daily Receipt, Internet Visits, Inbound Calls, Outbound Calls, Scanned Receipts. Key dates: Mailout Forms, Census Day, NRFU Cut-off, Capture Complete. Vertical axis: 0 to 7,000,000. Horizontal axis: weekly dates from 3-Mar to 2-Jun.]

Scanning & Imaging
• Document Preparation for scanning
    Remove from envelope
    Repair
    Acclimatize
• Scanning
    Throughput (Rated vs. Achievable)
    Black & White vs. Color Image Capture
    Image Quality
    Dealing with exceptions

Automated Recognition
• Optical/Intelligent Character Recognition
    Commercial “Engines”
    Languages Supported
    Additional Features
        • Formats/templates
        • Trigrams
        • Dictionaries
• Optical Mark Recognition
    Pixel Counting
    Style Analysis
• Multiple Engines
    Engine Strengths and Weaknesses
    Arbitration Scheme
    Cost vs. Complexity vs. Accuracy

Key Correction
• Purpose
    Correct/Recognize fields that are not automatically captured
• Approaches
    Character Keying
        • Slower and less accurate
    Field Keying
        • Fastest and most accurate
        • Natural to keyers
    Snippets vs. Images
    Keying Rules
        • Better data for methodologies
        • Lower capture productivity
• General Rule
    Simple interfaces
    Let keyers key, not think!

Checkout/Disposition
• Purpose
    Ensure all forms have been processed
    Dispose of paper
• Approach
    Check against processing inventory
    Reprocess if necessary
    Shred or burn paper forms
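The check against the processing inventory can be illustrated with a small sketch. It assumes each form carries a unique ID (for example, from its barcode) and that the system keeps one list of IDs logged at registration and one of IDs whose data was fully captured. The names `check_out`, `CheckoutResult`, and the three ID lists are hypothetical and do not come from any particular capture system.

```python
from dataclasses import dataclass


@dataclass
class CheckoutResult:
    ready_for_disposal: set   # captured; paper can be shredded or burned
    needs_reprocessing: set   # registered but capture is incomplete
    unexpected: set           # in the tray but never registered


def check_out(registered_ids, captured_ids, tray_ids):
    """Compare a tray of paper forms against the processing inventory."""
    registered = set(registered_ids)
    captured = set(captured_ids)
    tray = set(tray_ids)

    unexpected = tray - registered      # no registration record: investigate
    known = tray & registered
    ready = known & captured            # fully processed forms
    reprocess = known - captured        # send back through scanning/keying
    return CheckoutResult(ready, reprocess, unexpected)


result = check_out(
    registered_ids=["F001", "F002", "F003"],
    captured_ids=["F001", "F003"],
    tray_ids=["F001", "F002", "F003", "F999"],
)
print(sorted(result.ready_for_disposal))
print(sorted(result.needs_reprocessing))
print(sorted(result.unexpected))
```

Only forms in `ready_for_disposal` would move on to shredding or burning; everything else stays in the workflow until it is resolved.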
Summary
• Forms Processing
    A series of steps transforming paper responses into digital information
    Can be accurate and efficient
    Requires planning and management
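Supplementary illustration: the slides name an arbitration scheme for multiple recognition engines but do not specify one. One simple possibility, sketched here under that assumption, is a per-field majority vote in which a field is accepted only when enough engines agree and is otherwise routed to key correction. The function `arbitrate`, the engine names, and the `min_votes` threshold are hypothetical.

```python
from collections import Counter


def arbitrate(readings, min_votes=2):
    """Pick a field value by majority vote across recognition engines.

    readings: list of (engine_name, text) pairs; text is None when an
    engine rejected the field. Returns the agreed text, or None when the
    field should be sent to a keyer instead.
    """
    votes = Counter(text for _, text in readings if text is not None)
    if not votes:
        return None                      # every engine rejected the field

    top = votes.most_common(2)
    best_text, best_count = top[0]
    tied = len(top) > 1 and top[1][1] == best_count

    if best_count >= min_votes and not tied:
        return best_text                 # engines agree: accept automatically
    return None                          # disagreement: route to key from image


accepted = arbitrate([("engine_a", "DOHA"), ("engine_b", "DOHA"), ("engine_c", "D0HA")])
rejected = arbitrate([("engine_a", "DOHA"), ("engine_b", "D0HA")])
print(accepted, rejected)
```

Raising `min_votes` trades automation for accuracy: fewer fields are accepted automatically and more go to keyers, which is one way to reason about the cost vs. complexity vs. accuracy balance.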