student presentation

advertisement
Structural equation
modeling:
Fit Analysis
Outline of the Presentation
•
•
•
•
Definitions and basics
Theory of implied covariance matrix
Implication of theory in our student evaluation model
R code (Lavaan)
• Degrees of freedom
• Interpretation of results
• Final discussion
Definition
•
Structural equation modeling is a multivariate
regression model which estimates causal relations
between variables.
Courtesy to Kiran Pedada and Jiyoon An
Fit Analysis
• What is fit Analysis?
• Fit refers to the ability of a model to reproduce the data (e.g.
variance-covariance matrix).
• A good-fitting model is one that is reasonably consistent with
the data
Courtesy to David Kenny
URL: http://davidakenny.net/cm/fit.htm
http://courses.ttu.edu/isqs5347-westfall/isqs5347.htm
Fit Analysis
• When data are produced by the specified SEM, the observed and
fitted covariance matrix will differ by chance alone.
• Simulation resolves the mystery!
http://courses.ttu.edu/isqs6348-westfall/images/6348/sim_cfa_r.txt
Courtesy to Dr. Westfall
Fit Analysis
• When data are produced by a model that is close to the specified
SEM(e.g, there are truly additional paths but their coefficients are
small), the observed and fitted covariance matrix may differ more than
can be explained by chance, but should still be close.
Courtesy to Dr. Westfall
General mathematical form
of latent variable model
•
Assumptions of the latent variable
model
•
The measurement model
•
Sample covariance
matrix
•
Definition of implied
covariance matrix
•
How to identify the
implied covariance matrix
•
Implied covariance matrix
•
Implied covariance matrix
in our case
•
Implied covariance matrix
•
Fit Analysis
•
• In this statistical hypothesis testing(Fit analysis) we would like to
see that our null hypothesis supported
• The likelihood ratio test is a test of full model versus restricted
model(null model).
• Full model is MVN with unrestricted covariance matrix
• Restricted model is MVN with restricted covariance matrix
• Restricted models always have lower likelihood.
• Chi square statistic = 2 x (difference between log likelihoods).
Dr. Westfall, ISQS5347 notes
Baseline model Chi square
• Full model is MVN with unrestricted covariance matrix, get a log
likelihood.
• Restricted model is MVN with restricted covariance matrix (this
time, with 0’s off the diagonal), get a log likelihood.
• As we all know, restricted models always have lower
likelihood.. The chi square statistics is 2 x(difference between log
likelihoods).
Dr. Westfall, ISQS5347 notes
• One of the key issues in calculating chi square is degree of freedom:
• In our case df= number of parameters in the unrestricted covariance
matrix (p(p+1)/2) minus number of free parameters to estimate in
the sem.
Dr. Westfall
• If the chi square test insignificant, then the difference between the
log likelihoods is explainable by chance alone. I.e., the difference is
within the realm of differences that are commonly seen when the
data are in fact simulated from a SEM.
Dr. Westfall
Other fit indices
•
The RMSEA Index
•
Discussion
• Why Chi square is so important?
• How degree of freedom affect the fitted model?
Download