SPSS Project Regression Name

advertisement
SPSS Project Regression
Name:
(Download and work on this WORD document, and submit it through WebCT assignment
drop box.)
I. Linear Regression
In a research study for the effect of smoking on the infant birth weight, three variables were
recorded, and they are the followings.
 Birth Weight in grams
 Gestation period in weeks
 Mother Smoke or Not
 Mother’s Age
The data is in the file stored in the following address:
http://people.ysu.edu/~gchang/stat/reg_weightgestage_07.sav
Use the linear regression modeling technique to answer the following questions:
1. Make a scatter plot for birth weight versus length of gestation period and a scatter plot
for birth weight versus mother’s age, and describe the relation between each pair of
variables.
[Paste the graphs here!]
[Describe here!]
2. Make a scatter plot for birth weight versus length of gestation period with mother
smoking status as the other categorical factor variable (markers variable). Does the
smoking variable appear to be a significant factor on birth weight?
[Paste the graph here!]
[Describe here!]
3. Run the regression analysis and check the multicollinearity condition using VIF. Are
gestation period in weeks, mother’s age, and mother’s smoking status significant
factors in predicting infant’s birth weight? Does smoking increase the risk of having
low birth weight baby? Is there a multicollinearity problem? (Show the SPSS
coefficient table and interpret.)
[Paste SPSS output table here!]
[Describe here!]
4. Estimate the average weight of infants from mothers aged 35 who smoke and have a
gestation period of 37 weeks with a 95% confidence level. [Use a model with all
predictor variables mentioned in 3.]
5. Estimate the weight of infant from mother aged 35 who smoke and has a gestation
period of 37 weeks with a 95% confidence level. [Use a model with all predictor
variables mentioned in 3.]
6. Find the linear regression equation for predicting the average infant birth weight using
only the significant predictor variables.
[Write the linear regression equation here!]
7. Perform a two independent samples t-test to see if there is significant difference
between the average infant birth weights for mothers who smoked versus not smoked.
Does the result contradict with the result in 3? If yes, why?
[Paste the SPSS output on two independent samples t-test here!]
II. Logistic Regression on Risk Behavior Study
In a Risk Behavior Survey study on college students the following variables were observed:
Risk Behavior Variable: Wear seatbelt while last time driving a car
Risk Factor Variables:
 Sex (0=Female, 1=Male)
 Race (1=White, 2=Black, 3=Other)
 Live on campus (0=Off Campus, 1=On Campus)
The data is in the following address:
http://people.ysu.edu/~gchang/stat/log_YRB_07.sav
Use the logistic regression technique to answer the following questions: (Do not use stepwise
regression.)
1. Report the frequency and percentage distribution for each of the four variables
observed in this study.
2. Find the significant factor(s) that affects whether students wearing seatbelt or not.
[Paste the SPSS output containing parameter estimation information here!]
3. Use the odds ratios to explain how each of the significant factors affecting the risk
behavior.
4. Use the logistic regression model to estimate the probability of a randomly selected
white male student and living off campus that will not wear a seatbelt.
Download