Statistics 101 Fall 2010 Introduction to Quantitative Methods for

Statistics 101
Fall 2010
Introduction to Quantitative Methods
for Psychology and the Behavioral Sciences
Course Objective: Statistics 101 introduces the basic concepts of statistics and the statistical methods
that are most commonly used in the social and behavioral sciences. There is no prerequisite. The first half of
the course introduces descriptive statistics and the inferential statistical methods of confidence intervals and
significance tests, applied to means and proportions. The second half introduces bivariate and multivariate
methods, emphasizing contingency table analysis, regression, and analysis of variance.
Instructor: Alan Agresti, Science Center 300E, e-mail AA@STAT.UFL.EDU, phone (617) 495-5325
Office Hours: Tuesday 2:30-4:30 pm, Thursday 11:30-12:30, and by appointment.
Teaching Fellows: Jonathan Hennessy (jhenness@fas.harvard.edu), SC 908, and Roee Gutman
(rgutman@fas.harvard.edu), SC 610. TF sections will meet in SC 705 Wednesday 10-11 am (Jon) and 2-3
pm (Roee) and Thursday 12-1 pm (Jon) and 2-3 pm (Roee) at which you can ask questions about course
material, homework exercises, and software, and they may present extra examples to supplement the ones
presented in class lectures. The TF’s will also hold office hours for individual help.
Course web page: You can access this at my.harvard.edu, and also at www.stat.ufl.edu/∼aa/harvard/
Text: Statistical Methods for the Social Sciences, by A. Agresti and B. Finlay, Pearson Prentice-Hall
4th edition, 2009
Exams and grades: Two exams during the term and the final exam each contribute to 25% of the final
course grade. Make-ups will not be given unless arrangements have been made prior to the exam, and then
only for illness or family emergencies. The remaining 25% is based on homework assignments. A project
to present during the reading period (details later) is optional but recommended; for those who do it, the
course grade will be based on the best 4 of the grades for the three exams, the homework, and the project.
Exam Dates (tentative):
Exam 1
Exam 2
Final Exam
October 5
November 9
December final exam period
Homework: Textbook exercises are designed to help you to improve your understanding of the course
material and to help you prepare for exams. By the Thursday lecture each week I will assign a set of
exercises, which will be listed at the course webpage. You should give your solutions to your TF by 4 pm,
at the latest, on Friday of the following week, unless otherwise specified. (You can leave this in the course
mailbox in the Statistics Department office on the 7th floor of the Science Center.) Late solutions will not
be accepted. You are welcome to discuss the exercises with other students and your TF, but you must write
your final answer yourself, in your own words. Any software output that is part of your solutions must
come from work you have done yourself. Selected solutions will be graded by the TF and will count for
25% of your course grade. Solutions will be posted at the course home page. No homework extensions will
be given because of excuses such as illness or family emergencies. However, since many students will have
such an occasion sometime during the term, we will drop the poorest homework grade and base the overall
homework grade on the average of the remaining homework grades.
Software: Some examples in class will use SPSS statistical software, especially for the more computationally
complex methods in the second half of the course. Students are encouraged to become familiar with the
use of SPSS or another software package (such as Stata, SAS, or R) and use it in homework exercises where
the use of software is appropriate. SPSS is available through FAS computing (www.fas-it.fas.harvard.edu).
Outline: The class lectures will cover topics presented in the following text material. On exams you are
responsible for all material presented in lectures plus any extra material that you are assigned to read.
Chapter and Topic
Text Pages
Homework Problems (optional parenthesized)
1. Introduction to Statistics
1-7
2, 3, (5, 17)
2. Sampling and Measurement
11-21
1-3, 7, 13, 14, 22, 27, 34, 36, 37 (5, 24, 38)
3. Descriptive Statistics
Univariate description
31-55
Bivariate description
55-61
6, 10ab, 11, 19, 22, 24, 25, 28, 30, 35 (8, 33, 34ab)
39, 41, 43, 59, 69, 70, 72, 73, 74, 78, (33, 43, 64, 66, 68)
48, 49, 50, (47, 51, 52)
4. Probability & Sampling Distributions
Probability, normal distributions
85-99
8ab, 9ab, 10abc, 11ab, 12ab, 17, 21, 23,
(3, 8c, 9cd, 10de, 11cde, 12cd, 19, 47)
27, 29, 33, 41, 42, 46, 50-53, 57, (36, 54, 55)
5. Inference: Estimation
Confidence interval for proportions
Confidence interval for means
Sample size determination
107-116
116-123
123-129
4, 7, 12, 13, 47, 48, 66-68, 77, (8, 18, 76)
21, 24, 32, 49a, 69-71, 73, (22, 25, 28, 54)
35, 41, 44, 62, (34, 38, 39)
6. Inference: Significance Tests
Mean: Steps of significance test
Proportion: Steps of significance test
Decisions and types of errors
143-155
156-159
159-166
166-169
169-173
1-3, 9, 11, 37a, 39, 52-55, (6)
15, 40, 42, (14, 16, 21, 65)
17, 18, 24, 25, 45, 50, 51, 59, 61,
(22, 23, 26, 27, 46, 63)
29, 30, 56, (28, 31, 57)
33, 34
7. Comparison of Two Groups
Comparing proportions
Comparing means
Matched samples
183-190
191-193, 197-201
193-197
1, 9, 11, 16, 17, 59, 62, (8, 12, 14, 17, 47, 65)
21, 23, 32b, 41, 49, 50, 60, 61, 63, (5, 6, 22, 52, 55)
26, (27, 28, 31)
8. Association for Categorical Variables
Contingency tables, Chi-squared
Residuals
Summarizing association
221-229
229-233
233-239
1, 3, 5, 9, 10, 29a, (2, 4, 40, 44)
11, 16, (14)
18, 20, 22, (17)
9. Linear Regression and Correlation
Regression model, least squares
Correlation and r2
Inference, assumption, influence
255-268
269-276
276-289
1, 3, 7, 10, (25, 63, 66)
11-12, 18, 20, 38, 58-61, (13, 67, 68)
29, 32, 42, 50-55
10. Intro. to Multivariate Relationships
Association/causality, statistical control
Relationships, statistical interaction
301-307
307-313
3, 5, 7b, 33, 34, 40, (7a, 39)
11, 14, 16, 32, 42-44, (41, 45)
11. Multiple Regression
Multiple regression and correlation
Inference, interaction, comparing models
321-335
335-346
1, 4-7, 19, 46, 49, 66, (67)
25, 44, 48(a-f), (48(h-k,m,n))
12. Comparing Groups: Anal. of Variance
Comparing means: F test
Multiple comparisons
ANOVA using regression
Two-way and factorial ANOVA
369-376
376-378
378-381
382-391
2, 5, 8c, 38, 51, 52, (1, 3, 6, 24, 36, 57)
10, 14, 42, (12, 40, 53)
15, (16)
17, 22, (23, 41, 44, 45, 48)
13-15. Advanced topics (brief intro.)
Quantitative and categorical predictors
Modelling nonlinearity
Logistic regression
413-422
462-473
483-493
Sampling distributions
Finding power and P(Type II error)
Binomial test for small n
73-85