ESCM Chapter 8: Data Quality and Meta Data OG7, Helsinki, Finland

advertisement
ESCM Chapter 8:
Data Quality and Meta Data
United Nations Oslo City Group on Energy Statistics
OG7, Helsinki, Finland
October 2012
1
Introduction
 IRES Chapter 9: deals with Data Quality
Assurance and Meta Data
 Under IRES, countries are encouraged to:
• Develop national quality assurance programs
• Document these programs
• Develop measures of data quality
• Make these available to users
2
Prerequisites of Data Quality
 Institutional and organizational conditions,
including:
 Legal basis for compilation of data
 Adequate data-sharing and coordination between
partners
 Assurance of confidentiality and security of data
 Adequacy of resources – human, financial, technical
 Efficient management of resources
 Quality awareness
3
Promoting Data Quality






4
Make quality a stated goal of the organization
Establish standards for data quality
Track quality indicators
Conduct regular quality assurance reviews
Develop centres of expertise to promote quality
Deliver quality assurance training
What is a Quality Assurance
Framework?
 All planned activities to ensure data produced are
adequate for their intended use
 Includes: standards, practices, measures
 Allows for:
•
•
•
•
Comparisons with other countries
Self-assessment
Technical assistance
Reviews by international and other users
 See Figure 8.1 for examples of quality frameworks
5
Quality Assurance Framework
 Six Dimensions of Data Quality, based on
ensuring “fitness for use”
1.
2.
3.
4.
5.
6.
6
Relevance
Accuracy
Timeliness
Accessibility
Interpretability
Coherence
Quality Measures and Indicators
 Should cover all elements of the Quality
Assurance Framework
 Methodology should be well-established,
credible
 Must be easy to interpret and use
 Should be practical – reasonable, not an overburden
 For Key Indicators, see Chapter 8, Table 8.2
7
Sample Quality Indicators
 From IRES Table 9.2, linked to QA Framework
 Relevance: user feedback on satisfaction, utility of
products and data
 Accuracy: response rate, weighted response rate, number
and size of revisions
 Timeliness: time lag between reference period and release
of data
 Accessibility: number of hits, number of requests
 Interpretability: amount of background info available
 Coherence: validation of data from other sources
8
Quality assurance must be built into
all stages of the survey process
Survey Stages:
Quality Assurance
Framework
9
1.
2.
3.
4.
5.
6.
7.
8.
9.
Specify needs
Design
Build
Collect
Process
Analyze
Disseminate
Archive
Evaluate
1. Specify Needs
Activities:
 Determine needs: define
objectives, uses, users
 Identify concepts,
variables
 Identify data sources and
availability
 Prepare business case
10
Quality Assurance
 Consult with users and
key stakeholders
 Clearly state objectives,
concepts
 Establish quality targets
 Check sources for quality,
comparability, timeliness
 Gather input and support
from respondents
2. Design
Activities:
 Determine outputs
 Define concepts,
variables
 Design data collection
methodology
 Determine frame &
sampling strategy
 Design production
processes
11
Quality Assurance
 Consult users on outputs
 Select, test & maintain frame
 Design & test questionnaire
and instructions
 Use established standards
 Develop processes for error
detection
 Develop & test imputation
3. Build
Activities:
 Build collection
instrument
 Build processing system
 Design workflows
 Finalize production
systems
12
Quality Assurance
 Focus test questionnaire
with respondents
 Test systems for
functionality
 Test workflows; train staff
 Document
 Develop quality measures
4. Collect
Activities:
 Select sample
 Set up collection
 Run collection
 Finalize collection
13
Quality Assurance
 Maintain frame
 Train collection staff
 Use technology with built
in edits
 Implement verification
procedures
 Monitor response rates,
error rates, follow-up
rates, reasons for nonresponse
5. Process
Activities:
 Integrate data from all
sources
 Classify and code data
 Review, validate and edit
 Impute for missing or
problematic data
 Create and apply weights
 Derive variables
14
Quality Assurance
 Monitor edits
 Implement follow-ups
 Focus of most important
respondents
 Analyze and correct
outliers
6. Analyze
Activities:
 Transform data to outputs
 Validate data
 Scrutinize and explain
data
 Apply disclosure controls
 Finalize outputs
15
Quality Assurance
 Track all indicators
 Calculate quality indicators
 Compare data with
previous cycles
 Do coherence analysis
 Validate against
expectations and subject
matter intelligence
 Document all findings
7. Disseminate
Activities:
 Load data into output
systems
 Release products
 Link to meta data
 Provide quality indicators
 Provide user support
16
Quality Assurance
 Format & review outputs
 Verify that tools do not
introduce errors
 Verify disclosure control
 Ensure all meta data and
quality idicators are
available
 Provide contact names
for user support
8. Archive
Activities:
 Create rules and
procedures for archiving
and disposal
 Maintain catalogues,
formats, systems
17
Quality Assurance
 Periodic testing of
processes and systems
 Ensure meta data are
attached
9. Evaluate
Activities:
 Conduct post mortem
reviews to assess
performance, identify
issues
 Take corrective actions or
make new investments,
as required
18
Quality Assurance
 Consult with clients about
needs, concerns
 Monitor key quality
indicators
 Periodic data quality
reviews
 Perform ongoing coherence
analysis
 Compare with best
practices elsewhere
Meta Data
 Important for assessing “fitness for use” and
ensuring interpretability
 Required at every step of the survey process
 Critical for enabling comparisons with other data
 Should include results of data quality reviews
 Figure 8.4: generic set of meta data
requirements
19
Future of Meta Data
 Should become a driver of survey design
 Can be used proactively to prescribe definitions,
concepts, variables and standards
 Can support the harmonization of international
surveys and data
 Efforts are underway to create an integrated
approach for producing and recording meta data
20
Thank you!
Andy Kohut, Director
Manufacturing and Energy Division
Statistics Canada
Section B-8, 11th Floor, Jean Talon Building
Ottawa, Ontario
Canada K1A 0T6
Telephone: 613-951-5858
E-mail: andy.kohut@statcan.gc.ca
www.statcan.gc.ca
21
Download