Data Entry and Processing

advertisement
Multiple Indicator Cluster Surveys
Survey Design Workshop
Data Entry and Processing
MICS4 Survey Design Workshop
MICS4 Data Processing System:
Actors and Roles
– Country data processing manager and country team:
• Customization of data entry programs, data entry, editing, and
production of datasets
• Customization of tabulation syntaxes and tabulation
– Regional Office MICS Coordinator
• Coordination and supervision, organization of the Data Processing
workshop
– Regional Office Data Processing Consultant
• Technical support and review of customized programs and close
work with country teams
– HQ Data processing unit
• Development of standard programs, templates and coordination
of Data Processing workshops
MICS4 Survey Design Workshop
MICS4 Data Processing System
• Designed to deliver the first results of a survey within several weeks after
the end of fieldwork
• Such rapid turnaround time is possible when completed questionnaires
are entered simultaneously with survey fieldwork
• Data for each cluster is stored in a separate data file and is processed as
soon as all the questionnaires from a cluster are returned from the field
• This approach breaks data processing down into discrete segments and
allows it to progress while fieldwork is ongoing
• By the time the last questionnaires are finished and returned to
headquarters, most of the data have already been processed
MICS4 Survey Design Workshop
MICS4 Data Processing System
• The data-processing system can be divided
into following phases:
– Customization of MICS4 data entry program and
tabulation syntaxes [Regional Workshop],
– Establishing the data entry system locally,
– Primary data processing,
– Secondary data processing, and
– Tabulation.
MICS4 Survey Design Workshop
Customization of data entry files
• Adapting MICS4 standard data entry program
files to the country-specific questionnaire
– Second Regional Workshop [Data Processing]
• Adapting MICS4 standard tabulation syntax
files
– During and immediately after the workshop
MICS4 Survey Design Workshop
Establishing the data entry system
• The goal is to be ready to begin data entry shortly (about one
week later) after the fieldwork commences
• The preparation phase involves the following steps:
– Obtaining computer equipment and setting up a data-processing
room
– Identifying and recruiting appropriate personnel so that they
participate to fieldwork personnel training
– Setting up a system for managing the questionnaires and data files
MICS4 Survey Design Workshop
Computers
• Operating system: Windows 98 or later
• Computers adequate to run Windows 98 or better
versions
– Adequate hard disk space
– Adequate memory
• Network recommended
– otherwise use USB memory sticks to backup and transfer
data
• Printer
MICS4 Survey Design Workshop
Software
• CSPro (version 4.0) - available at: http://www.cspro.org for
free download (registration required)
– Data Entry
• Interactive
• Range, skip, consistency checks
– Structure check
• Ensures completeness of data
– Verification
• 100% double entry
– Secondary editing
• Complex consistency checks
– Export to SPSS
MICS4 Survey Design Workshop
Primary Data Processing
• The goal of primary data processing is to produce
clean, edited data files. Primary data processing
involves the following steps:
–
–
–
–
Entering all questionnaires for a cluster onto a data file
Production of field check tables
Checking the structure of the data file
Entering the data a second time and then verifying the
data file
– Backing up the checked and verified data file
– Performing secondary editing on the data file
– Backing up the edited, or final, data file
MICS4 Survey Design Workshop
Data processing requirements
• Computers
– 1 per data entry operator
– Plus 1 for supervisor
• Arrange day/night shifts if workload is heavy
• Staff
– Data processing supervisor (1 or 2)
– Data entry operators (variable)
• Can enter about 1 cluster per day
– Data editors (1 or 2)
• 6-7 hours per day including breaks
MICS4 Survey Design Workshop
Main Data Entry
Data-entry operator 1
Structure Check
Data-entry operator 1
Structure OK?
Correct Main Data File
Data-entry operator 1
No
Investigate Errors
Data-processing supervisor
Yes
Flow
of
data
entry
Verification Data Entry
Data-entry operator 2
Verification
Data-processing supervisor
Differences?
Correct Both Data Files
Data-entry operators 1 & 2
Yes
Determine Correct Values
Data-entry operators 1 & 2
No
Back-up Raw Data File
Data-processing supervisor
Secondary Editing
Data-processing supervisor
Inconsistencies?
Correct Raw Data File
Data-processing supervisor
Yes
No
Back-up Final Data File
Data-processing supervisor
MICS4 Survey Design Workshop
Resolve Inconsistencies
Secondary Editor
MICS4 Survey Design Workshop
Secondary Data Processing
• The goal of secondary data processing is to produce
analysis data files and to create the MICS4 standard
tables. Secondary data processing involves the
following steps:
–
–
–
–
–
–
–
Bringing together all cluster data files into one data file
Exporting the data to the SPSS software
Recoding some variables to be used in analysis
Calculating sample weights and adding to data files
Computing wealth index and adding to data files
Creating the tables required to analyse the data
[Archiving and distributing the data files]
MICS4 Survey Design Workshop
Software
• PASW statistics 18 [formerly SPSS]
– Construction of analysis files
•
•
•
•
HH: Household
HL: Household Listing
WM: Women
CH: Children
– Production of tabulations
– Analysis of sampling errors/confidence intervals
MICS4 Survey Design Workshop
MICS4 Survey Design Workshop
Data sharing and ownership of data
• Data are the property of the country
• UNICEF agreements with countries should permit
UNICEF (CO, RO and HQ) and country partners to:
– Receive datasets and tabulations as soon as data
processing is complete
– Receive reports, including drafts, as soon as available and
prior to publication
– Distribute datasets, tabulations and reports to other
users and researchers after publication of report
MICS4 Survey Design Workshop
Why share data?
• Permits further analysis of data by researchers
• Leads to greater benefit in terms of maximizing the
results of the survey by
– Providing more insights into situation of children and
women in each country
– Providing more valuable information for planning of
programmes
• Leads to greater confidence in survey results
MICS4 Survey Design Workshop
MICS4 Data Processing Workshop
• Goal: modify standard data entry and processing
program files to reflect country questionnaires
• Structure
– Introduction of software
– Lecture: Overview of topic
– Practical: apply topic to your country
• Review of adaptations during (and after) the
workshop
MICS4 Survey Design Workshop
Data Processing Workshop Agenda
• Days 1-9: CSPro
• Days 10-12: SPSS
• Day 13: Archiving
MICS4 Survey Design Workshop
MICS4 Data Processing Workshop
• Who will participate?
– Only staff responsible from data entry and/or data
processing (from implementing agency)
• Experts in data entry and/or processing
• Prior expertise in data entry supervision and/or data
processing
• 2-3 participants per country
– UNICEF staff are not expected to participate unless they
are directly involved in data entry and processing due to
technical nature of the workshop
MICS4 Survey Design Workshop
What they will bring home
after 2nd workshop
– MICS4 Standard programs (CSPRO)
– MICS4 Standard SPSS tabulation programs
– Documentation (Manual, editing guidelines, GPS
tracking form and more)
– Presentations
– A copy of adapted data entry and processing
program files
– A copy of CSPRO 4.0
– A license for SPSS 18.0
MICS4 Survey Design Workshop
Challenges
• Not all countries in the region will be at the
same stage of survey implementation
• Workshop is most productive for countries
that finalized their questionnaires and have
not started data collection
• Option to participate data processing
workshops in other regions [if any]
MICS4 Survey Design Workshop
Agenda – Day 1
• Opening
• Overview of MICS Data Processing System
• Introduction to CSPro
– Overview
– Simple Data Entry Applications
– Data Entry Applications with Logic
MICS4 Survey Design Workshop
Agenda – Days 2-3
•
•
•
•
•
•
MICS dictionary and forms
Practical: adapting the dictionary
Practical: adapting the forms
Data path within entry applications
Functions in data entry applications
Practical: adapting the data entry program
MICS4 Survey Design Workshop
Agenda - Days 4-5
•
•
•
•
•
•
Data entry program: data entry editing
Practical: adapting the data entry program
Data entry menu in CSPro
Practical: data entry menu
Structure checking, verification and field check tables
Practical: structure checking, verification and field check
tables
• Secondary editing: editing methodology
• Practical: identifying and documenting edits
MICS4 Survey Design Workshop
Agenda - Days 6-8
• Secondary editing program in CSPro
• Practical: modifying the editing program
• Day 7 – Off day
MICS4 Survey Design Workshop
Agenda – Day 9
•
•
•
•
Supervisor’s menu in CSPro
Practical: adapting the supervisor’s menu
Review of primary data processing
Set up of a system for managing data
processing and data editing guidelines
• GPS data
MICS4 Survey Design Workshop
Agenda – Day 10
•
•
•
•
Exporting data from CSPro to SPSS
Practical: adapting the export program
Description of MICS4 SPSS data files
Creating analysis files: Description of
preparation steps
• Introduction to SPSS: General commands in
SPSS
MICS4 Survey Design Workshop
Agenda – Day 11
•
•
•
•
•
•
Structural check and frequency tables
Creating an analysis file: recoding variables
Creating an analysis file:sample weights
Practical: adapting analysis files
Background variables: the wealth index
Demonstrations and practices with Sample
Data File
MICS4 Survey Design Workshop
Agenda – Days 12 & 13
• Tables in MICS4
• Production of tables on response rates, background
characteristics
• Practical with sample data sets
• Selected tables from different modules
• File copying and backup
• Archiving and distribution
• Closing
MICS4 Survey Design Workshop
Download