A_procedure_for_dimensionality_analyses_of_response_data

advertisement
A procedure for dimensionality
analyses of response data from
various test designs
Jinming Zhang
William Stout
Introduction

Dimension




Dimensional structure of the test (e.g., algebra
and geometry)
statistical dimensional structure of response
data
Incorporate both Judgments about test
content and evidence from statistical
analyses
Missing data: CAT & multistage testing

Missing item pair measurement
DETECT index
Modified DETECT index
Example 3


Stage 1 booklet: 2 dim & Stage 2 booklet 1: 2
dim, Stage 2 booklet 2: 1 dim (additional dim),
Stage 2 booklet 3: 1 dim (additional dim).
Bridge item (in Stage 1)


E(1,2) and E(2,3), so E(1,3)
First-stage booklet should


measure all of the constructs/contents the whole
test aim to measure, though it is unknown.
classify examinees into different proficiency levels
Item 1 and 2 are measuring the same dimension
1. If E(1,2), E(2,3), and E(1,3)


D(P*) is maximized in P* partition if item 1 and 2 are in
the same cluster, other than in other P
2. If only E(1,2) and E(2,3), and if item 2 is a bridge
item

D(P*) is maximized in P* partition if item 1 and 2 are in
the same cluster when item 2 is in the same cluster or
not, other than in other P
Whether a test has an approximate
simple structure or not
Discordance

Resulted from



Not a approximate simple structure
Inaccuracy of conditional covariance estimation
Given a partition of items,

What made Dd(P*) and PropD(P*) large



Unidimensionality
Violation of approximate simple structure
Inaccuracy of conditional covariance estimation
polyDETECT


Obtain the composite theta score:
unidimensinoal approximation & simple
structure approach.
Use percentiles of composite theta scores as
cut-off points in forming AHGs.


Between 25 to 100 in each group
Cross-validation:
polyDETECT

Evidence of multidimensionality
Condition 3 is hold or not?




Each sizable cluster contains at least one
stage-1 item.
Exist at least one sizable cluster that does not
contain any Stage-1 items, and all items in such
cluster belong to the same booklet.
Exist at least one sizable cluster that does not
contain any Stage-1 items, and all items in such
cluster belong to at least two booklet.
Exist at least two sizable cluster that does not
contain any Stage-1 items, items in each such
cluster come from the same booklet but
different clusters belong to different booklets.
Dealing with CAT data




Estimates of item parameters were obtained
before conducting CAT. Why do
dimensionality assessment on CAT data?
Sparse data set of CAT
100,000 responses are required at least
Item selection, item exposure control,
content balance to satisfy the condition 3
Simulation study










M2PL
Each booklet has 30 items
Dimension: 1, 2, 3
Number of examinees: 750, 1500, 3000, 4000
Theta: MVN(0,sigma), correlation = 0.8
Cut-off points for low-, moderate-, and highscoring group: <10, 11~18, >18
About 37.82% unestimable item pairs
Cross-validation
Replication: 100
Composite theta: use unidimensional IRT model
Results
Real data analyses

Missing values are large (55%~71%)
Real data analyses



Weak dimensionality (M value)
High PropD(P*) indicates a large amount of
spurious information to form partition P*
Confirmatory analyses: D3 >D2

But DETECT tends to underestimate, so twocluster partition solution may be preferred.
Real data analyses

High correlation indicates a weak degree of
multidimensionality
Concluding remarks

Moderate violation of approximate simple
structure is still hold
Download