SPSS For Windows Lab Manual

advertisement
D:\106752692.doc
Page 1 of 3
SPSS Computing Exercise: Correlation
Correlation Analysis -- QUESTIONS
Introduction
In this week’s exercise, SPSS procedures relevant to correlation analysis are demonstrated, using continuous and
dichotomous variables. Also, you will be introduced to Scatterplots using SPSS. The data come from a general
social survey conducted in the US (file gss.sav). In particular, this exercise uses the following variables:
SATJOB:
a measure of job satisfaction with values from 1 (very satisfied) to 4 (very dissatisfied);
HAPMAR: a measure of happiness of marriage with values from 1 (very happy) to 3 (not too happy);
LIFE:
a response to the question “Is life exciting or dull?” with values from 1 (exciting) to 3 (dull);
DEGREE: highest degree received ranging from 1 (less than high school) to 4 (a postgraduate degree);
INCOME: total family income with values from 1 (under $1,000) to 17 ($50,000 or over);
HEALTH: a measure of condition of health ranging from 1 (excellent) to 4 (poor);
EDUCINDX: a dichotomous variable representing the education level of the respondent:1 = didn’t finish high
school, 2 = completed or went beyond high school;
PAEDUC: the highest year of education that the respondent’s father reached ranging from 1 (first grade) to
20 (postgraduate);
MAEDUC: the highest year of education that the respondent’s mother reached ranging from 1 (first grade) to
20 (postgraduate);
PAEINDEX: a dichotomous variable signifying the education level of the respondents’ fathers:1 = father didn’t
finish high school, 2 = father completed or went beyond high school;
MAEINDEX: a dichotomous variable representing the education level of the respondent’s mother:1 = mother
didn’t finish high school, 2 = mother completed or went beyond high school;
SEX:
respondent’s sex, 1 = male and 2 = female.
Figure 1
D:\106752692.doc
Page 2 of 3
Task 1: Correlations between continuous variables.
Compute bivariate correlations between the following variables: SATJOB, HAPMAR, LIFE, DEGREE,
INCOME and HEALTH. To do this, click on the Analyze pull down menu and then click on the Correlate
option. Now select Bivariate. The window presented in Figure 1 will appear.
On the left-hand side of the window you will see the list of the variables in the data file. Select the variables
SATJOB, HAPMAR, LIFE, DEGREE, INCOME and HEALTH into the Variables: box. Also, select the
Spearman and Pearson options in the Correlation Coefficients frame (the Pearson option should already be
selected). Once you have done this, click on OK.
Questions for Task 1
Q1)
Q2)
Q3)
Q4)
Q5)
Q6)
Q7)
Q8)
What are the correlations between INCOME and each of the following: SATJOB, LIFE and DEGREE
(report both Pearson and Spearman coefficients). On how many cases is each of the correlations based?
Correlation coefficients give information on the magnitude and the direction of the relationship between
two variables. Describe the magnitude and direction of the relationships for the correlations in question
1 (use only Pearson correlations).
What are the significance levels for each of the correlations in question 1? Report p-levels for both
Spearman and Pearson correlations.
In light of the significance levels, how do you interpret the correlations in question 1?
In terms of their magnitudes, which pairs of variables in the correlation matrix are most and least
strongly correlated?
Generally, how do the results of Pearson correlations differ from Spearman correlations?
Why do the significance levels of Pearson correlations differ from Spearman correlations?
For these variables, are Pearson or Spearman correlations most appropriate? Why?
Task 2: The Scatterplot
Produce a scatterplot graph plotting the highest education level of the respondent’s father (PAEDUC) against
that of the respondent’s mother (MAEDUC). To do this, click on the Graphs pull down menu and then on the
Scatter option. Then click on the Simple selection and click on Define. In the window that appears, enter the
variable MAEDUC in the Y-Axis box, and enter the variable PAEDUC in the X Axis box. Click on OK and
your graph will now appear. Enter your name as a footnote. Print this scatterplot.
Questions for Task 2
Q1)
Q2)
Q3)
From looking at this graph, do you think there is a linear relation between PAEDUC and MAEDUC?
If so, make a rough guess as to the direction and magnitude of their correlation.
What are the exact magnitude, direction and significance level of the correlation between PAEDUC and
MAEDUC? (carry out the procedure you followed in Task 1 but only for these two variables)
Task 3: Correlations with Dichotomous Data
3(a) Testing the correlation between EDUCINDX and PAEINDEX.
Here you are going to carry out correlational analysis on two dichotomous variables (such as “male-female”,
“didn’t finish high school - finished high school” and so on). The first two variables you are going to look at are
EDUCINDX (whether or not the respondent finished high school) in relation to PAEINDEX (whether or not the
respondent’s father finished high school). To do this, select the Analyze pull down menu and click on the
Descriptive Statistics option. Then select Crosstabs. In the window that appears, enter the variable
EDUCINDX into the Row(s) box, and PAEINDEX into the Column(s) box. Then click on Cells and make sure
both the Observed option and the Expected option are selected in the Counts frame (Observed should already
be selected). Then click on Continue. Now click on Statistics and in the window that appears,.select the
Correlations option and in the Nominal Data frame select "Phi and Cramer’s V" then click on Continue.
Click on OK and your analysis will run.
D:\106752692.doc
Page 3 of 3
3(b) Testing the correlation between EDUCINDX and PAEINDEX, again.
Using the procedures described under Task 1, generate the Pearson correlation between EDUCINDX and
PAEINDEX.
Questions for Task 3:
Q1)
Q2)
Q3)
Q4)
State the direction, magnitude and significance level for the correlations in 3 (a) and 3(b) (use the
Pearson and Phi coefficients).
What does the correlation in 3 (a) mean? Guess at some reasons for this correlation.
What are the direction, magnitude and significance level for the Pearson correlation in 3(b)?
Compare the Pearson and Phi coefficient generated in 3(a) with the Pearson correlation generated in
3(b). Are they similar or different? What does this suggest about the relationship between these three
statistics?
Download