Stat 203, Statistics for Social Sciences – Assignment 5, Due... Questions 2,3, and 5 will be marked fully.

advertisement
Stat 203, Statistics for Social Sciences – Assignment 5, Due Wednesday August 1, 4:30pm.
Questions 2,3, and 5 will be marked fully.
Questions 1,4, and 6 will be marked for completion only.
Student Name _______________________________
Student Number ______________________________________
Question 1: Consider the following crosstab from two samples, one randomly selected from those born
prematurely, and one from those not born prematurely. They were asked if they had chronic oral
infections.
Oral Infection * Born Premature Crosstabulation
Count
Born Premature
No
Oral Infection
Total
Yes
Total
No
57
8
65
Yes
6
44
50
63
52
115
a) What is the probability of a prematurely born person having chronic oral infections?
b) What are the odds of a prematurely born person having chronic oral infections?
c) What are the odds of a non-prematurely born person having chronic oral infections?
d) How many times are large are the odds of chronic oral infection if a child is born
prematurely than if they were born at a normal time. Use your answers from part B and
C to find the odds ratio.
e) How many times are large are the odds of chronic oral infection if a child is born
prematurely than if they were born at a normal time. Use the quick odds ratio formula.
Question 2: Consider the sample of court cases where the gender of the offender and whether or
not the offender was abused as a child. The dataset is A5CrimeAbuse.sav
a) Get the observed frequencies table from SPSS and include it. How many male offenders
were no victims of abuse?
b) Get the odds ratio table from SPSS and include it.
c) Interpret the odds ratio in real world terms.
d) The null hypothesis of a chi-squared test is that being a victim of child abuse and
offender’s gender are independent. What would the frequencies be on average if this were
true?
e) What is the 95% confidence interval of the odds ratio? Can you reject the null hypothesis
a two-tailed alpha=0.05 test with this information?
f) Use SPSS to get the score of the chi-square test. Do we reject the null at alpha = 0.05
with this test? (You can use the sig. here)
Question 3: Consider this survey of students at United States college, which includes their
participation in extra activities at the school, and the region of the US that they’re from. The
data can be found in A5College.sav
a) Build a crosstab using SPSS. Find by hand how many degrees of freedom does this chisquared test have in this situation?
b) State the null hypothesis of the chi-squared test in this situation.
c) Run a chi-squared test in SPSS and test the null hypothesis.
d) Confirm your conclusion from 3C by using the chi-squared value from SPSS and the
Table E on page 523. If you don’t have a textbook, use the Chi-squared table provided in
chisqtable.csv or chisqtable.xls
Question 4: Complete the table
Probability
Odds
0
0.10
0.25
1/3
0.50
0.75
0.99
Question 5: Consider the data set of a sample of families with three different types of father
figures and three difference parenting styles. Found in A5Father.sav
a) Find the degrees of freedom of a chi-squared test by hand.
b) Do the chi-squared test in SPSS. Included the table.
c) Get the observed values from the cross tab. The null hypothesis is that father figure type
and parenting style are independent. Under this assumption how many moderate
stepfathers would we expect?
d) How many authoritarian fathers would we expect?
e) If the chi-squared value were twice as large as they are (from largest deviations from
independence), would you reject the null?
Question 6: Consider the following crosstab of data from the students of a university which
includes the transportation to school and their primary faculty (arts or science).
Arts
Science
Total
Transit
44
23
67
Drives/Rides Car
16
6
22
Drives/Rides Truck
1
2
3
Total
65
31
96
a) What potential problem is there to performing a chi-squared test on this data? (Don’t
actually do the chi-squared test).
b) How could the problem in 6A be fixed? Insert a table of the improved crosstab.
c) Get the odds-ratio of the fixed data set.
Odds Ratio from 2x2 contingency plot: (Quick odds-ratio formula)
Odds Ratio = AD / BC
(AD means A times D)
Expected frequencies formula (as found on page 317 of textbook, examples pp. 317-323)
Expected freq. = (row total) x (column total) / (grand total)
Degrees of freedom for crosstabs (as found on page 316)
Df = (rows – 1) x (columns – 1)
SPSS Tips:
Use
Analyze  Descriptive Statistics  Crosstabs.
For expected frequencies click on the “Cells” button, put a check on “Expected”.
For odds ratio click on the “Statistics” button, and put a check on “Risk”.
Download