C Session 1: Assessment of data quality Yearbook of Labour Statistics

advertisement
Conference on Data Quality
for International Organisations
(Rome, Italy, 7-8 July 2008)
Session 1: Assessment of data quality
The example of the Wages part in the
ILO Yearbook of Labour Statistics
(http://laborsta.ilo.org)
Le Anh Hua (hua@ilo.org)
Bureau of Statistics
International Labour Office
Geneva, Switzerland
Overview of the presentation
•
•
•
•
The ILO tools for ensuring quality
Some statistics in Laborsta
Data collection instruments
What we do to check the quality of the
statistics reported
• Forms of dissemination
• Measurements linked to the statistics
• Points of concern for discussion
ILO tools for ensuring quality
• Set of procedures to ensure
consistency, timeliness and coherence
– Standardised questionnaires pre-filled with earlier
statistics, and with instructions and classifications
– Methodological descriptions
– Correspondence and follow up with countries
• Database documentation
Wage statistics in Laborsta
(last observation in 2004 or later)
Table 5A
by economic
activity
91 countries 142
time
series
4000 time
lines
Table 5B
91 countries 135
time
manufacturing
series
4600 time
lines
Data collection instruments
• Questionnaires, incl. instructions and annexes
with classifications
• On the following slides
–
–
–
–
–
1st page with contact information
Overview of concepts, units, coverage, source
Table 5A, by ISIC tabulation categories
Table 5B, by divisions of manufacturing
Table 5C, for agricultural activities
ST-125-63 / Part 5
Country: AAA
INTERNATIONAL LABOUR OFFICE
QUESTIONNAIRE FOR
THE YEARBOOK OF LABOUR STATISTICS,
2008
WAGES
Data provided by your country will be entered in the
Bureau of Statistics' principal database, LABORSTA,
and made available on its Website
(http://laborsta.ilo.org).
In order to be included in the
2008 edition of the Yearbook of Labour Statistics
and in the corresponding CD-ROM ,
data must reach the ILO by 30 June 2008.
>> Instructions
>> 5A - Average wages per hour and per person, by economic activity
>> 5B - Average wages per hour and per person, in manufacturing, and by division
>> 5C - Average wages per hour per person in agricultural activities, by sex
Agency/organization responsible for the statistics:
Person who may be contacted for further information about the statistics
- Name:
- Address:
- Fax number:
- Tel. number:
- E-mail address:
- Web site:
ST-125-63 (5A) AAA - series I
Table 5.A - ISIC-Rev.3 - Average wages per hour and per person, by economic activity
Country: AAA
Currency: AAA
CONCEPT, TIME UNIT AND PERSONS COVERED:
Concept:
- Average earnings
- Average wage rates
- Other (specify)
Time unit:
- Per hour (a)
- Per day
- Per week
- Per month
(a) Please, specify:
- Per hour actually worked
- Per hour paid for
- Per normal hour
Persons covered:
- All employees (wage earners and
salaried employees)
- Wage earners
- Salaried employees
- All persons employed
- Other (specify)
If the coverage is "wage earners" or "salaried employees", please provide the operational
definition used in your country to identify them in practice
SOURCE OF DATA:
- Establishment survey
- Administrative records
- Labour force survey
- Other
TITLE:
REFERENCE PERIOD:
PUBLICATION(S)/WEBSITE(S) IN WHICH
DATA APPEAR:
TOTAL (MEN+WOMEN)
Economic activity
ISIC-Rev.3
Average, all activities (A-Q)
Average, non-agricultural activities
(C-Q)
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
X
1998 1999 2000
.
.
.
.
25.44
26.78
30.05
24.83
.
.
.
.
.
.
.
.
.
.
.
.
.
26.04
27.53
30.71
25.22
.
.
.
.
.
.
.
.
.
.
.
.
.
26.31
27.78
30.83
25.02
.
.
.
.
.
.
.
.
.
.
.
.
Notes: (1) 2001: change in the currency - conversion factor
2001
2002 2003 2004 2005 2006
.
.
.
.
13.96
14.72
16.50
13.39
.
.
.
.
.
.
.
.
.
.
.
.
(1)
13.66
14.42 (1)
16.13 (1)
(1)
13.06
.
.
.
.
.
.
.
.
.
.
.
.
.
.
2007
.
.
.
.
.
.
14.29 14.6 14.76
15.09 15.4 15.60
17.07 17.61 18.16
13.71 13.87 13.90
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
14.81
15.74
18.55
13.79
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
ST-125-63 (5B)
- series I
Table 5.B - ISIC Rev. 3 - D - Average wages per hour and per person, in
in manufacturing, and by division
TOTAL (MEN+WOMEN)
Manufacturing
ISIC Rev. 3 - D
Average, divisions (15-37)
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
1998
.
22.68
27.70
20.57
18.36
18.96
23.27
25.29
29.55
32.09
28.04
23.62
23.80
28.14
24.96
28.01
25.98
25.79
25.25
25.33
33.92
27.68
23.36
19.88
1999
.
23.30
28.28
20.98
18.90
19.48
23.88
26.07
30.53
32.88
28.55
24.16
24.31
28.83
25.49
28.77
26.05
26.51
26.08
25.98
35.21
28.64
23.82
20.18
2000
.
23.27
29.21
20.82
18.88
19.61
23.62
26.75
30.70
33.98
29.37
24.50
24.48
30.05
25.66
29.19
25.18
26.36
26.44
25.80
35.24
29.16
23.82
20.51
Notes: (1) Prior to 2001: DEM; 1 Euro = 1.95583 DEM.
2001
.
12.11(1)
14.08(1)
10.84(1)
9.83(1)
10.05(1)
12.21(1)
13.88(1)
15.95(1)
17.86(1)
15.22(1)
12.69(1)
12.71(1)
15.55(1)
13.28(1)
15.14(1)
13.12(1)
13.65(1)
13.66(1)
13.31(1)
18.08(1)
15.31(1)
12.40(1)
10.85(1)
2002
.
12.38
15.70
11.11
10.09
10.28
12.42
14.18
16.19
18.14
15.54
13.06
12.96
15.93
13.60
15.49
13.49
14.06
13.92
13.59
18.01
15.76
12.67
11.28
2003
.
12.70
16.05
11.25
10.25
10.50
12.56
14.53
16.39
19.00
15.85
13.34
13.29
16.23
13.86
15.92
13.59
14.41
14.24
13.95
18.58
16.13
12.91
11.38
2004
.
12.94
16.53
11.56
10.45
10.57
12.74
14.84
16.49
19.40
15.93
13.52
13.44
16.63
14.05
16.32
13.41
14.73
14.37
14.27
19.24
16.33
13.07
11.67
2005
.
13.12
16.24
11.59
10.55
10.60
12.75
15.00
16.56
19.35
16.13
13.69
13.51
16.85
14.21
16.55
13.40
14.91
14.46
14.50
19.56
16.67
13.18
11.77
2006
.
13.27
16.03
11.69
10.86
10.71
12.78
15.28
16.41
19.82
16.24
13.80
13.66
17.23
14.37
16.73
13.45
15.07
14.55
14.62
19.59
16.91
13.26
11.94
2007
.
Assessment criteria
•
•
•
•
•
Official statistics?
Source, coverage, time unit, currency units
Divergence from main concepts
Upward/downward variation from trend
New series v. old series
• National websites & national publications
• Metadata available facilitate quality control
checks
Variation from trends
• Are the newly submitted values reasonable?
– Change in currency ?
– CPI ?
– Typing mistake in the reported figures ?
• Data identified as possible error
– corrected from original or supplementary
information
– Enquiry with data supplier for correction
– If unresolved: not disseminated.
Old series v. new series
Possible situations
• Old series not updated, but new series
provided
• Old series updated, and new series
provided
• Existing series not updated
• Tables returned empty
Validation tests
• consistency across tables: average for D
(manufacturing) in 5A should be identical to “total”
in 5B
• consistency within tables: average for total (♀+♂) ≤
maximum ♀ & ♂ , and ≥ minimum ♀ & ♂
• implausible values: average for Total (♀+ ♂) ≠
arithmetic mean (♀+♂) (except in the case of an
identical wage rate for men and women)
• If break in series then note shown at the year
• Coherence between source of series and source in
“Sources & Methods”
Forms of dissemination
• http://laborsta.ilo.org
– Statistical series and methodological information
– Display on screen and downloadable files
• ILO Yearbook of Labour statistics
• Sources & methods publications
– Volume 2: Establishment Surveys: Employment, Wages,
Hours of Work and Labour Cost
– Volume 4: Administrative Records and Related
Sources: Employment, Unemployment, Wages and
Hours of Work
Measurements linked to the
statistics
•
•
•
•
•
•
•
No. of hits on Laborsta
No. of requests by email/telephone.
No. of series updated each year
No. of responses processed each year
User satisfaction survey on the web
Problems observed and resolved
Running review of lessons learned and
ways to make the processes more efficient
Points of concern for discussion
• How to « fill the gaps » in the available series ?
• What could be done to improve the response rates ?
• Should we also consider using non-official statistics ?
• What are the main quality concerns for an international
gathering and dissemination of statistics from countries ?
– Country coverage (geographical, persons) ?
– Time series length and coherence?
– Comparability between countries ?
• Can the quality of the series be quantitatively expressed ?
Download