PowerPoint

advertisement
Adapting Credentialing Exams
for Use in Multiple Languages
and Cultures
Ronald K. Hambleton
University of Massachusetts
1
Three Reasons for Translating
and Adapting Exams
1.
Knowledge and skills of interest are
often the same across language
groups--exam adaptation ensures
consistency of content and structure.
2. Often more efficient to translate and
adapt an exam than to build a new exam.
3. Easier to insure exam equivalence and
fairness (than construct new exam in a
second language).
2
Goals of the Presentation
1. To describe several myths
about test adaptation.
2. To review steps in the test
adaptation process.
3. To present the ITC guidelines
for test translation/adaptation.
3
Test Translation vs. Test
Adaptation?
“Test Adaptation” is more
descriptive of the process that
usually takes place—directions,
formats, contexts, etc.
“Test Translation” is sometimes
only a small part of the process.
4
Examples of Test Adaptations
Intelligence, Aptitude, and
Personality Tests
 Quality of Life Surveys
 TIMSS, OECD/PISA, and More
 SAT, GED, NAEP, State
Assessments
 Credentialing Exams

5
Example 1
Out of sight, out of mind
(Back translated from French)
invisible, insane
6
Example 2 (IEA Study in
Reading)
Are these words similar in
meaning?
Pessimistic -- Sanguine
7
Pessimistic -- Sanguine
Adapted to
Pessimistic -- Optimistic
8
Example 3 (1995 TIMMS Pilot)
Alex reads his book for 1 hour and
then used a book mark to keep his
place. How much longer will it take
him to finish the book?
A. ½ hour
B. 2 hours
C. 5 hours
D. 10 hours
9
Example 4
Parker Pen in Mexico: “It won’t
leak in your pocket and
embarrass.”
Became:
“It won’t leak in your pocket and
make you pregnant.”
10
Four Common Myths About
Adapting Credentialing Exams
1. Know two languages and you can
be a translator.
11
Selection and Training of
Translators
Knowledgeable in the languages
 Knowledgeable in the cultures
 Knowledgeable in the subject
matter
 Knowledgeable in the principles of
test development, item writing,
and scoring rubrics, etc.

12
Four Common Myths About
Adapting Credentialing Exams
2. A good translation guarantees
validity.
13
Four Common Myths About
Adapting Credentialing Exams
3. Judgmental reviews are
sufficient to identify
problems in a exam
translation/adaptation.
14
Four Common Myths About
Adapting Credentialing Exams
4. The common strategy of a
back-translations design and the
use of a bilingual design to
compile empirical data is
sufficient to justify exam use.
15
8 Steps for Adapting Exams
1. Checking exam content and
format equivalence.
-content may not be
generalizable
-item formats are not
universally used
16
8 Steps for Adapting Exams
2. Deciding on the desirability of
a translation/adaptation.
-unsuitability of content and/or
the exam format?
-technical expertise to produce
and validate a new exam?
17
8 Steps for Adapting Exams
3. Choosing translators.
-Four qualifications given
earlier are important.
-Some technical expertise may
be especially helpful.
18
8 Steps for Adapting Exams
4. Translating and adapting the
credentialing exam.
-Should often be more than a
single translator.
-Several designs and variations
available.
19
Judgmental Designs
for Assessing Suitability of
a Translation/Adaptation
1. Backward Adaptations
2. Forward Adaptations
20
Backward Translation Design
Advantages
1.
2.
Some problems can be identified.
Researchers do not need to be bilingual.
Disadvantages
1.
No review of target language version!
21
Forward Translation Design
Advantages
1.
Judgments made directly about
language versions of an exam.
Disadvantages
1.
Bilingual judges make assessments
of an exam for monolingual
examinees.
22
8 Steps for Adapting Exams
5. Reviewing the adapted version
of the exam.
-Multiple reviewers are often
very valuable.
-”Smoothing” of language is
possible at this step.
23
8 Steps for Adapting Exams
6. Conducting a small tryout of
the adapted exam
-item analysis, reliability
analysis, candidate review, etc.
24
8 Steps for Adapting Exams
7. Carrying out a more ambitious
study of the adapted exam
(that is, checking for exam
validity).
-construct equivalence,
method equivalence, item
equivalence
25
Statistical Designs
1. Bilingual Designs
2. Monolingual Designs
A. Source Language Examinees
Take Original and Backward
Adapted Versions
B. Monolingual Examinees Take
Versions in Their Own
Languages
26
Checking for Construct Equivalence:
Item
1
2
3
4
5
…
19
20
21
22
Factor 1
Language A Language B
.56
.49
.43
.48
.54
…
-.03
.25
.29
.09
.49
.40
.57
.52
.46
…
.22
.13
.08
.26
Factor 2
Language A Language B
-.07
.59
.11
.63
.38
…
.49
.53
.27
.02
.14
.41
-.04
.68
.45
…
.44
.56
-.13
.22
27
18
16
Delta (ES1)
14
12
10
8
6
4
4
6
8
10
12
14
16
18
Delta (English)
28
Checking for Item Equivalence: Delta
Plots
18
16
Delta (RLO)
14
12
10
8
6
4
4
6
8
10
12
14
16
18
Delta (English)
29
Checking for Item Equivalence:
IRT Methods (change in difficulty)
1.00
Probability of
Correct Response .
1.0
A
B
0.5
0.50
0.0
0.00
0
-3
10
-2
20
-1
30
0
40
1
50
2
60
3
Proficiency
Frequency
Language B
Language A
-0.50
-1.00
0
-3
10
-2
20
-1
30
0
40
1
50
2
60
3
Proficiency
30
IRT Methods (change in
discrimination)
1.00
Probability of
Correct Response .
1.0
A
B
0.5
0.50
0.0
0.00
0
-3
10
-2
20
-1
30
0
40
1
50
2
60
3
Proficiency
Language A
Frequency
Language B
-0.50
-1.00
0
-3
10
-2
20
-1
30
0
40
1
50
2
60
3
Proficiency
31
Figure 1-1 Item characteristic curves for possible DIF patterns
ICCs: No DIF
Figure 1-2
1.0
.9
.9
.8
.8
.7
.7
Probability
Probability
Figure 1-1
1.0
.6
.5
.4
.3
ICCs: DIF in bs
.6
.5
.4
.3
.2
.1
0.0
-3
.2
Ref.
.1
0.0
-3
Foc.
-2
-1
0
1
2
3
Ref.
Foc.
-2
-1
Ability Score
ICCs: DIF in as
Figure 1-4
1.0
1.0
.9
.9
.8
.8
.7
.7
.6
.5
.4
.3
2
3
ICCs: DIF in as, bs, and cs
.6
.5
.4
.3
.2
.1
0.0
-3
1
Ability Score
Probability
Probability
Figure 1-3
0
Ref.
Foc.
-2
-1
0
Ability Score
1
2
3
.2
.1
0.0
-3
Ref.
Foc.
-2
-1
0
1
2
3
Ability Score
32
Checking for Method
Equivalence
1. Any biases due to directions,
exam format, scoring?
2. Preparation of exam
administrators?
3. Equivalence of exam taking
skills?
33
8 Steps for Adapting Exams
8. Documenting the process.
-Steps in exam adaptation,
details, evidence for validity.
34
International Test Commission
Guidelines for Adapting
Educational and Psychological
Instruments
Prepared By
International Test Commission
and
Seven International Organizations
(EAPA, ETP, IACCP, IAAP, IEA, ILTA, IUPsyS)
35
Instrument Adaptation Guideline
A practice which is judged as
important for conducting and
evaluating the adaptation or
parallel development of
psychological and education
instruments for use in different
populations.
36
Organization
of the 22 ITC Guidelines
Context (2)
 Instrument Development and
Adaptation (10)
 Administration (6)
 Documentation/Score
Interpretations (4)

37
Organization of Each of the 22
ITC Guidelines
Description of the Guideline
 Rationale/Explanation
 Steps to Meet the Guideline
 Common Errors
 References for Additional Study

38
D.3
Test developers/publishers
should provide evidence that
the choice of testing
techniques, item formats, test
conventions, and procedures
are familiar to all intended
populations.
39
D.8
Test developers/publishers should
provide information on the
evaluation of validity in all
target populations for whom the
adapted versions are intended.
40
D.9
Test developers/publishers should
provide statistical evidence of
the equivalence of questions for
all intended populations.
41
Three Main Conclusions
1. Has progress been made in
test adaptation methodology?
-Definitely!
-Moved from single unqualified
translator and bilingual design
to considerably more
sophisticated methodologies.
42
Conclusions, cont.
-Emergence of ITC guidelines;
methodological advances in
judgmental designs and
statistical methods (e.g., DIF,
SEM);more resources and time
being allocated.
43
Conclusions, Cont.
2. What lies ahead for the field
of test adaptations?
-I’m very optimistic about the
future.
-Needed is commitment of
resources and time, and
implementation of existing
methods.
44
Conclusions, Cont.
3. What are the most important
areas for improvement?
-choose multiple qualified
translators and give them
sufficient time
-aggressively implement empirical
methods
-build on past experiences and
knowledge base
45
For more information about exam
translation and adaptation,
contact Ron Hambleton at
Rkh@educ.umass.edu
46
Download