Psychometrics, Dynamics, and Functional Data Analysis

advertisement
Psychometrics, Dynamics, and
Functional Data Analysis
“
T
h
e
v
i
e
w
s
Jim Ramsay
McGill University
Testing as Input/Output Analysis





A test score is actually a derivative with respect
to time.
Consequently a differential equation model for
testing data seems natural.
Dynamic testing data will be more and more
important.
We have some new tools for working with
dynamic data.
So let’s consider how to use time as a covariate.
Learning to Play Golf




We buy some clubs. We play a few games, each
being an 18 item test. It’s harder than it looks.
We take a lesson. We play a few games. Our
score improves to a better level.
We take another lesson, play some games, and
things improve again.
Key question: How quickly is a lesson reflected
in an improvement in score?
Brainergy


Energy is defined as
“the capacity to do
work.”
Kinetic energy E =
Mv2/2, and involves
mass, distance, and
time.
• We are interested in
“the capacity to solve
problems.”
• Problems involve
difficulties (=mass),
number of problems
(=distance) and time.
• Let’s call mental
energy brainergy.
Brain Power





What counts is problem solving per unit
time.
Power = energy expended per unit time.
Brain power = maximum difficulty of
problem solvable per unit time, or
number of lighter problems solved per unit
time.
That is, brain power = d brainergy/dt.
Brain Power and Time Scales
We need the concept of brain power when we
consider intelligence on two time scales:
1.
Long term: How much knowledge is available
over large time intervals, like a school year
2.
Short term: How much new knowledge can be
acquired over a short time interval, like a
single class.
Tests Measure Brain Power



Mental tests and psychological scales are
one of the greatest technological
achievements of the 20th century.
Tests work so well because they are timelimited.
Test scores reflect brain power rather than
brainergy.
Inputs to Brain Power





Information about the structure of the
problems.
A set of tools to solve them.
Training in the use of these tools.
All these require time.
Inputs to acquisition of brain power are
functions of time.
A Differential Equation in Time





Links one or more time-derivatives, dx/dt,
d2x/dt2,…, to the function x(t) itself.
Is a model for system dynamics: change
over time.
Can also include one or more input or
covariate functions.
x(t) is a long-term description.
dx/dt is a short-term description.
A Simple Example
dE
   E (t )   f (t )
dt



E(t) is brainergy, dE/dt is brain
power.
f(t) is an input function of time,
such as education.
α and β are constants, β > 0.


Most differential equations don’t have explicit
solutions, but this one does.
Let E0 be brainergy at time t = 0, and which
will often be 0.
E (t )  e
t
t
[ E0    e f (u)du]
t
0
see what happens when α=1, β varies,
and f(t) is a step function.
Let’s
E (t )  e
t
t
[ E0    e f (u)du]
0
t




The slope of E(t) when f(t) goes positive is
β.
β controls how fast the system responds
to the input f(t).
If the system is a problem solver, then β
indicates how quickly the person learns to
solve a problem.
After about 4/β time units, full capacity is
reached, and the system is ready for more
input.
Fitting Differential Equations



We have noisy discrete-time data, and
want to use them to estimate a differential
equation.
We want a solution E(t) to the equation to
fit the data as well as possible.
We need lots of flexibility in choosing a
differential equation, and we can’t assume
that there is an explicit solution to the
equation.
Functional Data Analysis




A collection of methods for analyzing
curves or functions as data
A common theme is using derivatives in
various ways
See Ramsay and Silverman (1997)
Functional Data Analysis. Springer.
And Ramsay and Silverman (2002) Applied
Functional Data Analysis. Springer.
Two Functional Data Analysis
Techniques


L-spline Smoothing: given noisy data and
a differential equation, find a function E(t)
that will smooth the data and at the same
time be nearly a solution to the differential
equation.
Principal Differential Analysis: given a
function E(t), estimate a linear differential
equation for which E(t) is a solution.
Estimating a DIFE from noisy data



We’ve recently combined these two methods
into a technique for estimating a differential
equation from noisy data.
In our simple example, this amounts to
estimating parameters α and β.
But much more complex DIFE’s can be
estimated as well, including linear or nonlinear,
and single or multiple variable systems.
An Oil Refinery



Here are some data from an oil refinery in
Corpus Christi.
The input f(t) (reflux flow) is negatively
coupled to the output E(t) (tray 47 level).
The smooth curve is a solution to the
differential equation that best represents
this relationship.
Perhaps this oil refinery is not too smart!



Many situations will call for multiple
outputs: Performance with a putter, a
driver, and an iron, for example. Or in
algebra and geometry.
And many situations will involve multiple
inputs: Regular classes, tutoring sessions,
labs and etc.
The technology used in these illustrations
can handle these situations, at least for
linear differential equations. Nonlinear
equations don’t pose any problem in
principle.
Some Simulated Data


Imagine that the data are golf scores over
successive games, and that the input is a
set of three equally-spaced lessons from a
golf pro.
The following slides show three golfers.
Which is a future Tiger Woods?
These lessons are nicely timed.
This person needs to find another sport!
This person should get lessons more often!
Is this Model Good Enough?



Specifying β to be constant is too simple.
Allowing for fatigue, boredom, and other
things requires a function β(t).
A first order equation can’t allow for
sudden transient effects like insight. We
may need a differential equation involving
higher derivatives.
We may need nonlinear equations as well.
A Nonlinear Differential Equation
2
dE1
A( BE2 (t ))
   [ E1 (t ) 
]   f (t )
2
dt
1  C ( BE2 (t ))
2
dE2
A( BE1 (t ))
   [ E2 (t ) 
]   f (t )
2
dt
1  C ( BE1 (t ))
•The summed output from these two equations
will exhibit both the rapid learning and long-term
retention required of human learners.
•See H. R. Wilson (1999) Spikes, Decisions and
Actions, Oxford, for many more examples of
differential equation models in neuroscience.
Control Theory


Engineers who work with input/output
systems have developed ways of
designing feedback loops to optimize
outputs.
We’re working with a team of chemical
engineers at Queen’s University.
Where Would the Data Come From?




Can we design customized learning
situations, like golf, and track how a
learner makes progress as a function of
time and inputs?
Perhaps video and computer games are
nearly what we need.
We already know that people will pay big
money to have these experiences.
Would corporations with deep pockets pay
for this kind of testing?
Conclusions



Dynamic testing would generate
performance data over time that depend
on one or more functional covariates.
New tools are available for these data that
fit them with a differential equation.
Dynamic psychometrics looks promising!
Download