Diagrams Representing Relationships between Observed and

advertisement
EFA, CFA, and Amos
(You should have read the EFA chapter in Field, the Structural Equation Modeling chapter, and be reading the Byrne
text.)
Observed Variable
A variable whose values are observable.
Examples: IQ Test scores (Scores are directly observable), GREV, GREQ, GREA, UGPA,
Minnesota Job Satisfaction Scale, Affective Commitment Scale, Gender, Questionnaire items.
Latent Variable
A variable, i.e., characteristic, presumed to exist, but whose values are NOT observable. A Factor
in Factor Analysis literature. A characteristic of people that is not directly observable.
Intelligence, Depression, Job Satisfaction, Affective Commitment, Tendency to display affective
state
No direct observation of values of latent variables is possible. Brain states? Brain chemistry?
Indicator
An observed variable whose values are assumed to be related to the values of a latent variable.
Reflective Indicator
An observed variable whose values are partially determined by, i.e., are influenced by or reflect,
the values of a latent variable. For example, responses to Conscientiousness items are assumed to
reflect a person’s Conscientiousness.
Formative Indicator
An observed variable whose values partially determine, i.e., cause or form, the values of a latent
variable.
Exogenous Variable (Ex = Out)
A variable whose values originate from / are caused by influences outside the model, i.e., are not
explained within the theory with which we’re working. That is, a variable whose variation we don’t
attempt to explain or predict by whatever theory we’re working with. Causes of exogenous variable
originate outside the model. Exogenous variables can be observed or latent.
Endogenous variable (En ~~ In)
A variable whose values are explained within the theory with which we’re working. We account for
all variation in the values of endogenous variables using the constructs of whatever theory we’re
working with. Causes of endogenous variables originate within the model.
CFA, Amos - 1
Printed on 2/7/2016
Basic SEM Path Analytic Notation
Observed variables are symbolized by squares or rectangles.
103
84
121
76
...
97
81
Observed
Variable
Latent Variables are symbolized by Circles or ellipses.
106
78
115
80
...
93
83
Latent
Variable
Values of individuals on
latent variables are not
observable, hence the
dimmed text.
Correlations or covariances between variables are represented by double-headed arrows.
"Cor / Cov"
Arrow
"Cor / Cov"
Arrow
Observed
Variable B
Observed
Variable A
106
78
115
80
...
93
83
101
90
128
72
...
93
80
103
84
121
76
...
97
81
Latent
Variable B
Latent
Variable A
104
79
114
79
...
92
81
"Causal" or "Predictive" or “Regression” relationships between variables are represented by single-headed
arrows
Latent
Variable
Latent
Variable
"Causal"
Arrow
Observed
Variable
Observed
Variable
"Causal"
Arrow
Latent
Variable
Observed
Variable
CFA, Amos - 2
"Causal"
Arrow
"Causal"
Arrow
Latent
Variable
Observed
Variable
Printed on 2/7/2016
Exogenous Observed Variables
"Correlation"
Arrow
"Causal"
Arrow
Observed
Variable
Observed
Variable
Exogenous variable connect to other variables in the model through either a “causal” arrow or a correlation
Exogenous Latent Variables
Latent
Variable
"Correlation"
Arrow
"Causal"
Arrow
Latent
Variable
Exogenous latent variables also connect to other variables in the model through either a “causal” arrow or a correlation
Endogenous Observed Variables "Causal"
Arrow
Observed
Variable
Endogenous Latent Variable
Random
error
Random
error
"Causal"
Arrow
Latent
Variable
Endogenous variables connect to other variables in the model by being on the “receiving” end of one or more “causal”
arrows. Specifically, endogenous variables are typically represented as being “caused” by 1) other variables in the
theory and 2) random error. Thus, 100% of the variation in every endogenous variable is accounted for by either
other variables in the model or random error. This means that random error is an exogenous latent variable in SEM
diagrams. Random error is a catch-all concept representing all “other” things that are affecting the endogenous variable.
Summary statistics associated with symbols
Mean, Variance
Mean, Variance
Observed
Variable
Latent
Variable
Our SEM program, Amos, prints means and variances above and to the right. Typically the mean and variance of latent
variables are fixed at 0 and 1 respectively, although there are exceptions to this in advanced applications.
"Correlation"
Arrow
"Causal"
Arrow
r or Covariance
B or 
CFA, Amos - 3
Printed on 2/7/2016
Path Diagrams of Analyses We’ve Done Previously
Following is how some of the analyses we’ve performed previously would be represented using path
diagrams.
1. Simple correlation between two observed variables.
rVQ
GRE-Q
GRE-V
2. Simple correlations between three observed variables.
rVA
rVQ
GRE-V
rQA
GRE-Q
GRE-A
3. Simple regression of an observed dependent variable onto one observed independent variable.
GRE-Q
B or 
P511G
Note that the
endogenous variable
is caused in part by
catch-all influences.
e
4. Multiple Regression of an observed dependent variable onto three observed independent variables.
GRE-V
GRE-Q
UPGA
BV or V
BQ or Q
P511G
e
Note that the
endogenous variable
is caused in part by
catch-all influences.
BU or U
CFA, Amos - 4
Printed on 2/7/2016
ANOVA in SEM Models
Since ANOVA is simply regression analysis, the representation of ANOVA in SEM is merely as a
regression analysis. The key is to represent the differences between groups with group coding
variables, just as we did in 513 and in the beginning of 595 . . .
1) Independent Groups t-test
The two groups are represented by a single, dichotomous observed group-coding variable. It is the
independent variable in the regression analysis.
Dichotomous
variable representing
the two groups
Dependent
Variable
e
2) One Way ANOVA
The K groups are represented by K-1 group-coding variables created using one of the coding
schemes (although I recommend contrast coding). They are the independent variables in the
regression analysis. If contrast codes are used, the correlations between all the group coding
variables are 0, so no arrows between them need be shown.
1st Group-coding
contrast code
variable
2nd Group-coding
contrast code
variable.
Note: Contrast codes were used so Groupcoding variables are uncorrelated.
Dependent
Variable
e
. . . . .
(K-1)th Groupcoding contrast code
variable.
3) Factorial ANOVA.
Each factor is represented by G-1 group-coding variables created using one of the coding schemes.
The interaction(s) is/are represented by products of the group-coding variables representing the
factors. Again, no correlations between coding variables need be shown if contrast codes are used.
1st Factor
1st Factor
2st Factor
2st Factor
Note: Contrast codes should be used to make sure
the group-coding variables uncorrelated (assuming
equal sample sizes.)
Dependent
Variable
e
Interaction
Interaction
Interaction
Interaction
CFA, Amos - 5
Printed on 2/7/2016
Path Diagrams representing Exploratory Factor Analysis
1) Exploratory Factor Analysis solution with one factor.
The factor is represented by a latent variable with three or more observed indicators. (Three is the
generally recommended minimum no. of indicators for a factor.)
e1
Obs 1
Obs 2
e2
Obs 3
e3
F
Note that factors are exogenous. Indicators are endogenous. Since the indicators are
endogenous, all of their variance must be accounted for by the model. Thus, each indicator must
have an error latent variable to account for the variance in it not accounted for by the factor.
2) Exploratory Factor Analysis solution with two orthogonal factors.
Each factor is represented by a latent variable with three or more indicators. The orthogonality of
the factors is represented by the fact that there is no arrow connecting the factor symbols.
Let’s assume that Obs1, 2, and 3 are thought to be primary indicators of F1 and 4,5,6 of F2.
For exploratory factor analysis, each variable is allowed to load on all factors. Of course, the hope is
that the loadings will be substantial on only some of the factors and will be close to 0 on the others,
but the loadings on all factors are retained, even if they’re close to 0. The loadings that might be
close to 0 in the model are shown in red. These are sometimes called cross loadings.
Obs 1
e1
Obs 2
e2
Obs 3
e3
F1
Obs 4
e4
Obs 5
e5
Obs 6
e6
F2
Orthogonal factors represent uncorrelated aspects of behavior.
Note what is assumed here: There are two independent characteristics of behavior – F1 and F2.
Each one influences responses to all six items, although it is hoped that F1 influences primarily the
first 3 items and that F2 influences primarily the last 3 items.
If Obs 1 thru Obs 3 are one class of behavior and Obs 4 thru Obs 6 are a second class, then if the
loadings “fit” the expected pattern, this would be evidence for the existence of two independent
dispositions – that represented by F1 and that represented by F2.
CFA, Amos - 6
Printed on 2/7/2016
3) Exploratory Factor Analysis solution with two oblique factors.
Each factor is represented by a latent variable with three or more indicators. The obliqueness of the
factors is represented by the fact that there IS an arrow connecting the factors.
Obs 1
e4
Obs 2
e5
Obs 3
e6
F1
Obs 4
e7
Obs 5
e8
Obs 6
e9
F2
Again, in exploratory factor analysis, all indicators load on all factors, even if the loadings are close
to zero.
Exploratory factor analysis (EFA) programs, such as that in SPSS, always report estimates of all
loadings.
This solution is potentially as important as the orthogonal solution, although in general, I think that
researchers are more interested in independent dispositions than they are in correlated dispositions.
But discovering why two dispositions are separate but still correlated is an important and potentially
rewarding task.
CFA, Amos - 7
Printed on 2/7/2016
Path Diagrem of EFA model of NEO-FFI Big Five 60 item questionnaire.
(From Biderman, M. (2014). Against all odds: Bifactors in EFAs of Big Five Data. Part of
symposium: S. McAbee & M. Biderman, Chairs. Theoretical and Practical Advances in Latent
Variable Models of Personality. Conducted at the 29th annual conference of The Society for
Industrial and Organizational Psychology; Honolulu, Hawaii, 2014.
Crossloadings are in red.
CFA, Amos - 8
Printed on 2/7/2016
Path Diagrams vs the Table of Loadings.
(Crossloadings are in red in both representations.)
Pattern Matrixa
Factor
1
2
3
4
ne1
-.025
-.001
-.056
-.067
ne2
.086
.002
.021
-.026
ne3
.126
-.023
.133
.229
ne4
.054
.014
.112
.184
ne5
-.115
-.045
-.069
-.269
ne6
.042
.206
-.200
.085
ne7
.194
-.168
.046
-.203
ne8
.342
-.127
.087
.299
ne9
.270
.046
.092
.419
ne10
-.017
-.138
-.105
.135
ne11
.209
-.270
.009
.102
ne12
.140
-.170
-.103
-.070
na1
-.100
-.193
.081
.518
na2
.347
-.153
-.076
.421
na3
.091
-.123
-.050
.560
na4
-.139
.034
-.011
.504
na5
.298
.073
.033
.335
na6
.335
.157
-.070
.353
na7
.019
-.177
.067
.231
na8
.053
.029
-.114
.543
na9
.163
.115
-.120
.319
na10
-.057
-.123
.167
.594
na11
.028
.012
.055
.471
na12
.027
-.210
-.075
.534
nc1
.092
-.429
-.090
.183
nc2
.019
-.580
-.049
.055
nc3
-.030
-.376
-.037
.011
nc4
-.093
-.406
.052
.156
nc5
.026
-.716
-.025
-.156
nc6
.146
-.476
.100
.241
nc7
-.154
-.694
-.070
-.121
nc8
.092
-.528
.019
-.017
nc9
.040
-.573
.044
-.050
nc10
.021
-.720
.016
-.103
nc11
.035
-.551
-.067
.148
nc12
.065
-.628
.035
-.011
ns1
.501
.072
.145
-.231
ns2
.544
-.119
.132
-.044
ns3
.653
.037
-.026
-.069
ns4
.664
.025
-.141
.006
ns5
.660
.082
-.031
.200
ns6
.677
-.053
-.104
.011
ns7
.658
.035
.027
-.069
ns8
.552
.068
-.078
.265
ns9
.724
-.093
.134
-.021
ns10
.662
-.041
-.068
-.009
ns11
.563
-.266
.035
.030
ns12
.611
-.082
-.177
.075
no1
-.146
.334
.267
-.003
no2
.030
.366
.136
.061
no3
-.014
.044
.661
-.040
no4
.116
.037
.142
-.119
no5
-.010
-.090
.731
.102
no6
.029
.168
.217
.124
no7
-.064
.025
.207
.082
no8
.024
.233
.240
-.101
no9
-.008
-.012
.822
.049
no10
.052
.135
.536
.026
no11
-.026
-.111
.560
-.041
no12
.021
.131
.615
-.174
Extraction Method: Maximum Likelihood.
Rotation Method: Oblimin with Kaiser Normalization.
a. Rotation converged in 14 iterations.
5
.678
.268
.260
.492
.624
.563
.166
.432
.223
.327
.278
.483
.249
-.139
.125
-.099
.188
.031
.346
.292
-.211
.271
-.227
-.022
.008
-.086
-.140
.028
.057
-.052
.109
.110
.094
.044
.005
.018
-.031
.027
-.118
.189
.047
.046
.011
.012
.027
.220
-.132
-.003
-.103
.087
.046
-.008
-.166
.207
-.082
-.197
-.123
-.033
.110
.090
Whew – there are tons of crossloadings, most of them near 0. Can’t they just be assumed to be zero?
This kind of thinking leads to Confirmatory Factor Analysis Models.
CFA, Amos - 9
Printed on 2/7/2016
Confirmatory vs Exploratory Factor Analysis
In Exploratory Factor Analysis, as discussed above, the loading of every item on every factor is
estimated. The analyst hopes that some of those loadings will be large and some will be small. An
EFA two-orthogonal-factor model is represented by the following diagram.
Obs 1
F1
Exploratory
F2
e4
Obs 2
e5
Obs 3
e6
Obs 4
e7
Obs 5
e8
Obs 6
e9
Note that there are arrows (loadings) connecting each variable to each factor. We have no
hypotheses about the loading values – we’re exploring – so we estimate all loadings and let them
lead us. No EFA programs (except that in Mplus) allow you to specify or fix loadings to predetermined values.
In contrast to the exploration implicit in EFA, a factor analysis in which some loadings are fixed at
specific values is called a Confirmatory Factor Analysis. The analysis is confirming one or more
hypotheses about loadings, hypotheses representing by our fixing them at specific (usually 0) values.
Unfortunately, EFA and CFA cannot be done using the same computer program except MPlus.
The problem is that all programs that do just EFA (except Mplus) won’t allow some loadings to be
fixed at predetermined values. And programs designed to do CFAs (except Mplus) canNOT
estimate the above EFA model. Amos and all CFA programs other than MPlus require that some of
the loadings be fixed and are unable to do EFAs such as the above models.
So, in many instances, you will have to employ both SPSS (for EFA) and AMOS (for CFA) in
exploring the interrelations between variables and factors. Often, analysts will use an EFA program
to estimate ALL loadings to all factors, then use an SEM program to perform a confirmatory factor
analysis, fixing those loadings that were close to 0 in the EFA to 0 in the CFA.
Obs 1
F1
Confirmatory
F2
E1
Obs 2
E2
Obs 3
E3
Obs 4
E4
Obs 5
E5
Obs 6
E6
Note that in the above confirmatory model, loadings of indicators 4-6 on F1 are fixed at 0, as are
loadings of indicators 1-3 on F2. (The arrows are missing, therefore assumed to be zero.)
CFA, Amos - 10
Printed on 2/7/2016
The Identification Problem
Mean, Variance
Consider the simple regression model . . .
E
Mean, Variance
Cov(E,Y)
X
Y
Y = a + b*X
Quantities which can be computed from the data . .
Mean of the X variable
Variance of the X variable
Mean of the Y variable.
Variance of the Y variable.
Correlation of Y with X
Quantities in the diagram .
Remember that in SEM path diagrams, all the variance in every endogenous variable must be
accounted for. For that reason, the path diagram includes a latent “Other factors” or “Error of
measurement” variable, labeled “E”.
Mean of X
Mean of E
Variance of X
Variance of E
Intercept of X->Y regression
Slope of X->Y regression
Correlation of E with Y
Note: Mean and variance of Y are
not separately identified in the model
because they are assumed to be
completely determined by Y’s
relationship to X and to E.
Whoops! There are 5 quantities in the data but 7 in the model. There are too few quantities in the
data. The model is underidentified. – not identified enough - there aren't enough quantities from
the data to identify each model value.
CFA, Amos - 11
Printed on 2/7/2016
Dealing with underidentification . . .
Solution 1
0) The mean of E is always assumed to be 0.
1) Fix the variance of E to be 1.
So in this regression model, the path diagram will be
0, 1
E
Mean, Variance
X
Y
Y = a + b*X
In this case, there are 5 quantities in the model that must be estimated – mean of X, variance of X,
intercept of equation, slope of equation, and correlation of E with Y. There are also 5 quantities that
can be estimated from the observed data. The model is said to be “just identified” or “completely
identified”. This means that every estimable quantity in the model corresponds in some way to one
quantity obtained from the data.
Or,
Solution 2
0) The mean of E is always assumed to be 0.
1) Fix covariance of E with Y at 1.
0, Variance
E
1
Mean, Variance
X
Y
Y=a+b*X
Underidentified models: Cannot be estimated.
Just identified models: Every model quantity is a function of some data quantity. But no parsimony.
Overidentified models: There are more data quantities than model quantities. It is said that you then
have “degrees of freedom” in your model. This is good. Relationships are being explained by fewer
model quantities than there are data quantities. This is parsimonious – what science is all about.
CFA, Amos - 12
Printed on 2/7/2016
Identification in CFA models – We’ll have to do this for the reasons outlined above.
Here’s a typical CFA two-factor model.
Identifying the residuals part of the CFA . . .
Solution 11. Fix all residual variances to 1.
0,1
E1
Obs 1
0,1
F1
Obs 2
E2
Obs 3
E3
0,1
0,1
E4
Obs 4
0,1
F2
Obs 5
E5
Obs 6
E6
0,1
or
Solution 2. Fix all E->O covariances to 1 and estimate variances of “E”s.
1
Obs 1
F1
Obs 2
1
E2
Obs 3
1
E3
1
E4
1
E5
1
E6
Obs 4
F2
E1
Obs 5
Obs 6
CFA, Amos - 13
I
recommend
this.
Printed on 2/7/2016
Insuring that the Factors part of the CFA is identified
1. Fix one of the loadings for each factor at 1 and estimate all factor variances
1
F1
1
F2
Obs 1
E1
Obs 2
E2
Obs 3
E3
Obs 4
E4
Obs 5
E5
Obs 6
E6
Or
2. Fix the variance of each factor at 1 and estimate all factor loadings.
Obs 1
1
F1
E1
Obs 2
E2
Obs 3
E3
Obs 4
E4
1
F2
Obs 5
E5
Obs 6
E6
I
recommend
this.
This probably seems quite arcane right now, and it is really the province of the mathematical
statisticians and programmers who discovered the algorithms that allow us to apply the models.
But we’ll use these conventions when actually applying models such as these.
The main point for us.
The above models tell us that the variation in 6 observed variables – Obs1, Obs2, Obs3, Obs4,
Obs5, Obs5 - is due to variation in two internal characteristics – F1 and F2.
So we have explained why there is variation in the observed variables – because of variation in F1
and F2. We have also explained why the variation in Obs1 to Obs3 is unrelated to the variation in
Obs4 to Obs6 – because F1 and F2 are uncorrelated.
CFA, Amos - 14
Printed on 2/7/2016
Examples
1. Fixing all variances.
1
E1
Obs 1
1
F1
1
E2
Obs 2
1
E3
Obs 3
E4
Obs 4
1
1
E5
Obs 5
F2
1
1
E6
Obs 6
2. Fixing residual loadings but Factor variances
1
Obs 1
1
1
Obs 2
F1
E1
E2
1
E3
Obs 3
My
favorite.
1
E4
Obs 4
1
1
E5
Obs 5
F2
1
E6
Obs 6
3. Fixing residual loadings and factor loadings.
1
1
Obs 1
Obs 2
F1
1
E1
E2
1
E3
Obs 3
1
E4
Obs 4
1
1
F2
E5
Obs 5
1
Obs 6
E6
CFA, Amos - 15
Printed on 2/7/2016
Real Life Example of a CFA model
Now consider a Big Five questionnaire – the NEO FFI
questionnaire, for example.
(The path diagram does not have the “residual” latent variables.
To have included them would have made the figure even “busier”
than it is.)
The path diagram explains variation in 60 behaviors in terms of
only 5 internal characteristics. It says that variation in Extrav1
through Extrav12 is due primarily to individual differences in E,
for example.
It also explains why Extrav1 is typically uncorrelated with Agree1,
etc – because E is uncorrelated with A.
Extrav1
Extrav2
Extrav3
Extrav4
Extrav5
Extrav6
Extrav7
Extrav8
Extrav9
Extrav10
Extrav11
Extrav12
Agree1
Agree2
Agree3
Agree4
Agree5
Agree6
Agree7
Agree8
Agree9
Agree10
Agree11
Agree12
Consc1
Consc2
Consc3
Consc4
Consc5
Consc6
Consc7
Consc8
Consc9
Consc10
Consc11
Consc12
Stabil1
Stabil2
Stabil3
Stabil4
Stabil5
Stabil6
Stabil7
Stabil8
Stabil9
Stabil10
Stabil11
Stabil12
Open1
Open2
Open3
Open4
Open5
OIpen6
Open7
Open8
Open9
Open10
Open11
Open12
CFA, Amos - 16
E
A
C
S
O
Printed on 2/7/2016
Programming with path diagrams: Introduction to Amos
Amos is an add-on program to SPSS that performs confirmatory factor analysis and structural
equation modeling.
It is designed to emphasize a visual interface and has been written so that virtually all analyses can
be performed by drawing path diagrams.
It also contains a text-based programming language for those who wish to write programs in the
command language.
The Amos drawing toolkit with functions of the most frequently used tools.
Observed variable tool
Tool to draw latent variables
Tool to draw regression arrows
Tool to draw correlation arrows
Tool to put text on the diagram
Tool to select all objects in diagram
Tool to select a single object
Tool to deselect all objects in diagram
Tool to copy an object
Tool to erase an object
Tool to move an object
Tool to tell Amos to run the diagram.
Other programs
LISREL
EQS
Amos
Mplus – the best
LAVAAN
CFA, Amos - 17
Printed on 2/7/2016
Creating an Amos analysis
1. Open Amos Graphics.
1b. File -> New
2. File -> Data Files . . .
(Because you have to connect the path diagram to a data file.)
3. Specify the name of the file that contains the raw or summary data.
a. Click on the [File Name] button.
b. Navigate to the file and double-click on it.
c. Click on the [OK] button.
In this example, I opened a file called IncentiveData080707.sav
4. Draw the desired path diagram using the appropriate drawing tools.
The example below is a simple
correlation analysis.
5. Name the variables by right-clicking on each object. And
choosing “Object Properties . . .”
6. Save the model. File -> Save As...
7. To run Amos, click on the
button.
8. Click on
to see output.
CFA, Amos - 18
Printed on 2/7/2016
Amos Details
For most of the analyses you’ll perform using Amos, you should get in the habit of doing the
following . . .
View -> Analysis Properties -> Estimation
Check “Estimate means and intercepts”
View -> Analysis Properties -> Output
Check “Standardized estimates”
Check “Squared multiple correlations”
Remember that you must fix some parameter values to make the models identified.
CFA, Amos - 19
Printed on 2/7/2016
Doing old things in a new way: Analyses we’ve done before, now
performed using Amos
The data used for this example are the valdatnm data in the htlm2\5510\datafiles folder. We’ll
simply look at the output here. Later, we’ll focus on the menu sequences needed to get this output.
a. SPSS analysis of the correlation of FORMULA with P511G
Correlations
b. Amos Input Path Diagram - Input Parameter Values
All variables are
exogenous.
(Note, I told Amos to estimate means for this analysis.)
c. Amos Output Path Diagram - Unstandardized (Raw) coefficients
The covariance of
p511g and Formula.
The mean and
variance of p511g.
The mean and variance
of Formula.
c. Amos Path Diagram - Standardized coefficients
The correlation of p511g
and Formula.
Means and variances of
standardized variables are
not displayed, since they
are 0 and 1 respectively.
CFA, Amos - 20
Printed on 2/7/2016
Simple Regression Analysis: SPSS and Amos
The data used here are the VALDAT data.
a. SPSS Version 10 output
GET FILE='E:\MdbT\P595\Amos\valdatnm.sav'.
.
REGRESSION /MISSING LISTWISE /STATISTICS COEFF OUTS R ANOVA
/CRITERIA=PIN(.05) POUT(.10) /NOORIGIN /DEPENDENT p511g
/METHOD=ENTER formula .
Regression
Variables Entered/Removedb
Model
1
Variables
Entered
FORMULAa
Variables
Removed
Method
Enter
.
a. All requested variables entered.
b. Dependent Variable: P511G
Model Summary
Model
1
R
.480a
R Square
.230
Adjusted
R Square
.220
Std. Error of
the Estimate
4.725E-02
a. Predictors: (Constant), FORMULA
ANOVAb
Model
1
Regres sion
Residual
Total
Sum of
Squares
5.005E-02
.167
.217
df
1
75
76
Mean Square
5.005E-02
2.233E-03
F
22.420
Sig.
.000a
t
6.361
4.735
Sig.
.000
.000
a. Predic tors: (Constant), FORMULA
b. Dependent Variable: P511G
Coefficientsa
Model
1
(Constant)
FORMULA
Unstandardized
Coefficients
B
Std. Error
.496
.078
3.004E-04
.000
Standardi
zed
Coefficien
ts
Beta
.480
a. Dependent Variable: P511G
CFA, Amos - 21
Printed on 2/7/2016
b. Amos Input Path Diagram - Input parameter values
1
The model is underidentified
unless you fix the value of one
parameter. Fix either the
variance of the latent error
variable to 1 or the regression
weight to 1. Here, the
variance has been fixed.
error
p511g
formula
c. Amos Output Path Diagram - Unstandardized (Raw) coefficients; Means not estimated
Variance
Mean of X
of
Formula.
The estimated
Variance of X
unstandardized (raw
score) relationship of
p511g .to Formula the slope, to 2 decimal
places.
1.00
error
.05
7203.60
formula
.00
p511g
Note that the fixed
parameter values
were not changed.
For what it's worth,
the estimated
unstandardized (raw
score) relationship of
p511g to the “other
factors” latent
variable.
d. Amos Output Path Diagram - Standardized coefficients
(View/Set -> Analysis Properties -> Output to get Amos to print Standardized estimates what a
pain!!)
Correlation of p511g with latent “other
factors”..= sqrt(1-r2)=sqrt(1-.482) = sqrt(1-.23)
Correlation of p511g
error
= sqrt(.77)=.88
with Formula.
.88
.23
formula
.48
r2 for the model. You may
have to pull down [View/Set] > Analysis Properties ->
Output to ask for this to be
printed.
p511g
Note that .482 + .882 = 1. All of variance of p511g has been accounted for. We say that formula
and error partition the total variance of p511g.
CFA, Amos - 22
Printed on 2/7/2016
Two IV Regression Example - SPSS and Amos
The data here are the VALDATnm data. UGPA and GREQ are predictors of P511G.
a. SPSS output.
GET
FILE='G:\MdbT\P595\P595AL09-Amos\valdatnm.sav'.
DATASET NAME DataSet1 WINDOW=FRONT.
REGRESSION /DESCRIPTIVES MEAN STDDEV CORR SIG N /MISSING LISTWISE
/STATISTICS COEFF OUTS R ANOVA /CRITERIA=PIN(.05) POUT(.10) /NOORIGIN
/DEPENDENT p511g /METHOD=ENTER ugpa greq .
Regression
[DataSet1] G:\MdbT\P595\P595AL09-Amos\valdatnm.sav
Correla tions
Pe arson
Co rrelat ion
p5 11g
1.0 00
ug pa
.22 5
ug pa
.22 5
1.0 00
-.2 62
gre q
.32 2
-.2 62
1.0 00
p5 11g
Sig . (1-t ailed )
p5 11g
N
gre q
.32 2
.
.02 5
.00 2
ug pa
.02 5
.
.01 1
gre q
.00 2
.01 1
.
p5 11g
77
77
77
ug pa
77
77
77
gre q
77
77
77
Va riabl es Entere d/Rem ov e db
Mo del
1
Va riable s
Re move d
Va riable s En tered
gre q, ug pa a
.
Me thod
En ter
a. All requ ested vari ables ente red.
b. De pend ent V ariab le: p 511g
Model S umm ary
Mo del
1
R
.45 5 a
R S quare
.20 7
Std . Erro r of
the Estim ate
.04 828
Ad justed R S quare
.18 5
a. Pre dicto rs: (Consta nt), g req, ugpa
Coeffici ents a
Un stand ardized Co efficients
Mo del
1
B
Std . Erro r
(Co nstan
t)
.57 1
.06 9
ug pa
.04 8
.01 6
gre q
.00 0
.00 0
Sta ndardized
Co effici ents
Be ta
t
Sig .
8.2 28
.00 0
.33 2
3.0 98
.00 3
.41 0
3.8 17
.00 0
a. De pend ent V ariab le: p5 11g
CFA, Amos - 23
Printed on 2/7/2016
b. Amos Input Path Diagram - Input parameters.
1
The variance of the
(unobserved) error latent
variable must be specified at 1.
error
ugpa
Note that if the IVs are
correlated, you must specify
that they are correlated.
Otherwise, Amos will perform
the analysis assuming they're
uncorrelated.
p511g
greq
c. Amos Output Path Diagram - Unstandardized (Raw) coefficients
I forgot to check “Estimate means and intercepts, so no means are printed.)
Variance of ugpa
Raw partial
regression
coefficient
relating p511g
to ugpa
Covariance of ugpa and greq.
1.00
error
.13
ugpa
.05
Raw
Regression
coefficient
relating
p511g to
residual
effects.
.05
-8.54
p511g
7897.62
.00
greq
Raw partial
regression coefficient
relating p511g to
GREQ to 2 decimal
places.
CFA, Amos - 24
Printed on 2/7/2016
d. Amos Output Path Diagram - Standardized coefficients.
Standardized
partial regression
coefficients ,
sometimes called
betas..
Correlation of ugpa
and greq.
ugpa
error
SQRT(1-R2)=sqrt(1-.21)
= sqrt(.79)=.89
.89
.33
-.26
.21
p511g
.41
greq
Multiple R2.
Note that .332 + .412 + .892 = 1.07 > 1.0. This is because r2s partition variance only when
variables are uncorrelated.
e. Amos Text Output - Details of input and minimization
(Early version of Amos without p values)
Chi-square = 0.000
Degrees of freedom = 0
Probability level cannot be computed
Maximum Likelihood Estimates
---------------------------Regression Weights:
------------------p511g <----- ugpa
p511g <---- error
p511g <----- greq
Standardized Regression Weights:
-------------------------------p511g <----- ugpa
p511g <---- error
p511g <----- greq
Covariances:
-----------ugpa <-----> greq
Correlations:
-------------
Estimate
--------
S.E.
-------
0.048
0.047
0.000
0.015
0.004
0.000
C.R.
-------
Label
-------
3.140
12.329
3.869
Estimate
-------0.332
0.891
0.410
Estimate
--------
S.E.
-------
C.R.
-------
-8.537
3.861
-2.211
S.E.
-------
C.R.
-------
Label
-------
Estimate
-------ugpa <-----> greq
Variances:
----------
-0.262
Estimate
--------
error
ugpa
greq
Squared Multiple Correlations:
------------------------------
1.000
0.134
7897.622
Estimate
--------
p511g
0.207
0.022
1281.163
Label
-------
6.164
6.164
Note – No overall test of significance of R2.
This test is available in the ANOVA box in
SPSS.
CFA, Amos - 25
Printed on 2/7/2016
Oneway Analysis of Variance Example - SPSS and Amos
The data for this example follow. They're used to introduce the 595 students to contrast coding. The
dependent variable is Job Satisfaction (JS). The research factor is Job, with three levels. It is
contrast coded by CC1 and CC2.
The data for this example are in ‘MdbT\P595\Amos\ OnewayegData.sav’
ID JS
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
JOB
6
7
8
11
9
7
7
5
7
8
9
10
8
9
4
3
6
5
7
8
2
1
1
1
1
1
1
1
2
2
2
2
2
2
2
3
3
3
3
3
3
3
CC1
CC2
.667
.667
.667
.667
.667
.667
.667
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
-.333
.000
.000
.000
.000
.000
.000
.000
.500
.500
.500
.500
.500
.500
.500
-.500
-.500
-.500
-.500
-.500
-.500
-.500
The rule for forming a contrast variable between two sets
of groups is
1st Value = No. of groups in 2nd set / Total no. of groups.
2nd Value= - No. of groups in 1st set / Total no. of groups.
3rd Value = 0 for all groups to be excluded.
1st Value of CC1 = 2 / 3 = .667.
So,
2nd Value of CC1 = - 1 / 3
1st Value of CC2 = 1 / 2 = .5
2nd Value of CC2 = -1 / 2 = -..5
3rd Value of CC2 = 0 to exclude Job 1.
a. SPSS Oneway output.
Oneway
ANOVA
JS
Su m of
Sq uares
Be tween
Gro ups
Wit hin G roup s
To tal
df
Me an S quare
40. 095
2
20. 048
60. 857
18
3.3 81
100 .952
20
F
5.9 30
CFA, Amos - 26
Sig .
.01 1
Printed on 2/7/2016
b. SPSS Regression Output.
regression variables = js cc1 cc2
/dependent = js /enter.
Regression
Va riable s Entered/Rem ov e db
Mo del
1
Va riable s
En tered
CC2, CC1 a
Va riable s
Re move d
.
Me thod
En ter
a. All requ ested varia bles ente red.
b. De pend ent V ariab le: JS
Model S umm ary
Mo del
1
R
.63 0 a
R S quare
.39 7
Ad justed R
Sq uare
.33 0
Std . Erro r of
the Estim ate
1.8 387
a. Pre dicto rs: (Consta nt), CC2, CC1
ANOVAb
Mo del
1
Su m of
Sq uares
Re gressi o
n
Re sidua l
To tal
df
Me an S quare
F
Sig .
a
40. 095
2
20. 048
60. 857
18
3.3 81
100 .952
20
5.9 30
.01 1
a. Pre dicto rs: (Consta nt), CC2, CC1
b. De pend ent V ariab le: JS
Coeffici ents a
Un stand ardized Co efficients
Mo del
1
B
Std . Erro r
(Co nstan
t)
6.9 52
.40 1
CC1
1.3 57
.85 1
CC2
3.0 00
.98 3
Sta ndardized
Co effici ents
Be ta
t
Sig .
17 .326
.00 0
.29 2
1.5 94
.12 8
.55 9
3.0 52
.00 7
a. De pend ent V ariab le: JS
CFA, Amos - 27
Printed on 2/7/2016
c. Amos Input Path Diagram.
This was prepared using Amos 3.6. I chose the "Estimate
means" option. This was not required, but it caused means to
be displayed.
d. Amos Output Path Diagram - Unstandardized (Raw) Coefficients
Intercept
Mean and variance.
e. Amos Output Path Diagram - Standardized Coefficients
Note that the correlation between
group coding variables must be
estimated. It's zero here because
they're contrast codes, but
estimate it anyway.
Multiple R2
Note that .292 + .562 + .782 = 1.01 ~~ 1.
r2s partition variance since the variables are all
independent.
CFA, Amos - 28
Printed on 2/7/2016
f. Amos Text Output – Results
CFA, Amos - 29
Printed on 2/7/2016
Results continued . . .
Note that AMOS does not provide a test of
the null hypothesis that in the population, the
multiple R = 0. This test is provided in the
ANOVA box in SPSS.
CFA, Amos - 30
Printed on 2/7/2016
Download