Uploaded by Camron Williams

Skinny 2021-22 Final

Unit 1
Chapter 1
Data Analysis
1
AP Statistics Handout: Lesson 1.1
Topics: quantitative and categorical data, misleading graphs
Lesson 1.1 Guided Notes
Quantitative vs. Categorical Data
Quantitative data: Data that is _________________ (think ‘quantities’). Usually a __________________.
You can line the values up in order.
Ex: weight, # of AP classes, SAT score, blood pressure, income, yards per catch, etc.
Categorical data: Data that fits an individual into one of several categories that don’t _______________.
Usually represented by counts or percentages, and isn’t a number (but it can be).
Ex: eye color, race, gender, social security number, and zip code.
Graphing Quantitative Data
Histogram
Boxplot
Graphing Categorical Data
Pie Chart
Bar Plot
Pie Chart
2
Misleading Graphs
How to spot a misleading graphic:
1. It may not have axis labels or _____________________.
2. It may ________________________ the x or y axis, or start at a weird place.
3. It may use _________________ for bar graphs (or a ‘pictograph’).
Example 1:
Why is the pictograph to the left misleading?
Lesson 1.1 Discussion
Discussion Question: What story does this graphic tell?
Is that story misleading? Explain.
Graphic from 2015 Planned Parenthood congressional hearing
3
Lesson 1.1 Practice
Misleading Data Graphics
A
Graphic ‘A,’ which we analyzed earlier, was
prepared by the organization: Americans
United For Life. It uses data from Planned
Parenthood’s annual reports.
Graphic ‘B’ was published on the White
House’s official blog during the Obama
Administration. It uses national school data
prepared by the Department of Education.
Graphic ‘C’ was presented at summit of
climate change skeptics. It uses global landocean temperature data from NASA’s
Goddard Institute for Space Studies.
B
For graphics ‘B’ and ‘C,’ answer these
questions:
-Is the visual misleading? Why or why not?
Source link: https://obamawhitehouse.archives.gov/blog/2016/10/17/graduation-rate-reaches-new-high-one-student-shares-his-story
C
4
A
A (same data, new graphic): In what
way(s) is this graphic more transparent
than the graphic on the previous page of
the handout?
Data courtesy of Politifact: https://www.politifact.com/factchecks/2015/oct/01/jason-chaffetz/chart-shown-planned-parenthood-hearing-misleading-/
B
B (same data, new graphic): In what
way(s) is this graphic more transparent
than the graphic on the previous page
of the handout?
C
C (including more data): In what way(s)
is this graphic more transparent than
the graphic on the previous page of the
handout?
5
Can Joy Smell Parkinson’s Disease?
Joy Milne participated in a study where she was given 12 t-shirts, half of which were worn by
Parkinson’s patients, and half of which were worn by a control group. Joy correctly identified 11 out of
the 12 shirts. Does this provide convincing evidence that Joy can smell Parkinson’s?
1. Why would it be important to know that someone can smell Parkinson’s disease?
2. How many correct decisions would you expect Joy to get out of 12 if she really couldn’t smell
Parkinson’s (she was just guessing)? Explain.
3. Do we have some evidence that Joy can smell Parkinson’s? Why?
4. How many correct decisions out of 12 would it take to convince you that Joy really could smell
Parkinson’s?
Thank you to AP Stats teacher Doug Tyson for this awesome lesson!
6
Let’s investigate whether Joy’s result could have happened purely by chance, just by guessing.
Working in pairs, you will simulate this study. One person will be the experimenter and one person will
be Joy and then you will switch.
Important: the experimenter should not reveal the truth for each shirt. They should simply record
whether the guess was correct or incorrect.
4. As the experimenter, keep track of the results:
Correct
Incorrect
5. Count up the number of correct decisions. Write the number on a sticker dot and bring it to the
poster at the front of the room. Copy the dotplot here.
‘
0
2
4
8
6
10
12
6. What does each dot represent?
7. Based on the class simulation, what proportion of the simulations resulted in 11 or more correct
identifications?
8. Based on these results, do we have convincing evidence that Joy can smell Parkinson’s? Explain.
Thank you to AP Stats teacher Doug Tyson for this awesome lesson!
7
AP Statistics Handout: Lesson 1.2
Topics: marginal & conditional distributions, barplots, segmented barplots, associations
Lesson 1.2 Guided Notes
Stop and Frisk Data
This 2011 data is a random sample of police stops from
New York’s “Stop and Frisk” program. The program
allowed police officers to stop people on the street and
search them for weapons or contraband. The program
was controversial. Critics alleged that it led to
heightened police discrimination of people of color.
Force Level
No Force Used
Hands Used
Higher Force Level*
Race of Suspect
Black
Hispanic White
1371
853
260
263
188
30
109
72
18
*Includes push to wall/ground, handcuffs, draw/point weapon, pepper spray, baton
Data source: NYC.GOV, https://www1.nyc.gov/site/nypd/stats/reports-analysis/stopfrisk.page
Two-Way Table: A table of counts describing two ________________________. Each _______________
represents one variable.
Ex: -Force Level (categorical)
-Race (categorical)
Marginal & Conditional Distributions
Marginal distributions
i) What number is always used when calculating marginal distributions?
Force level
ii) Find the marginal distribution for force level:
Proportion
No Force Used
Hands Used
Higher Force Level
Race
iii) Find the marginal distribution for race:
Black
Hispanic
White
8
Proportion
Conditional distributions
i) Find the conditional distribution for each race:
Black
White
No Force Used
Hands Used
Higher Force Level
Is this a good take? A television commentator says, “Police had ‘no force’ interactions with only 260
white suspects. Meanwhile, a much higher number of black suspects—1,371—didn’t experience force.
Clearly, black people experience ‘force-free’ interactions with police more frequently than white
people.”
Was that a good analysis of this data? Why or why not?
Barplots
What slogan should we always
remember when graphing?
9
Segmented Barplots
Please draw your representation of the segmented barplot below:
Associations
i) What three components do you need when writing an answer for an association question?
ii) Does the data suggest there is an association between race of suspect and level of force used?
There appears to be _______________________ between race and level of force used. For example,
white suspects receive no force from police officers at ____________________ (84.4%) than black
(78.7%) and Hispanic (76.6%) suspects. So, level of force is associated with race of the suspect.
Lesson 1.2 Discussion
Force level
Black
Hispanic
White
No Force
78.7%
76.6%
84.4%
Hands Used
15.1%
16.9%
9.7%
Higher Force
6.3%
6.5%
5.8%
Discussion Question: Does our data provide enough
evidence to prove that New York police are racially
biased (in terms of use of force)? Why or why not?
10
1.1 PRACTICE!
1. What variables are measured? Identify each as categorical or quantitative. In what units were the quantitative
variables measured?
State
Number of Family Members
Age
Gender
Kentucky
Florida
Wisconsin
California
Michigan
Virginia
Pennsylvania
Virginia
California
New York
2
6
2
4
3
3
4
4
1
4
61
27
27
33
49
26
44
22
30
34
Female
Female
Male
Female
Female
Female
Male
Male
Male
Female
Marital
Status
Married
Married
Married
Married
Married
Married
Married
Never married/ single
Never married/ single
Separated
Total Income
Travel time to work
21000
21300
30000
26000
15100
25000
43000
3000
40000
30000
20
20
5
10
25
15
10
0
15
40
2. A sample of 200 children from the United Kingdom ages 9-17 was selected from the CensusAtSchool website
(www.censusatschool.com). The gender of each student was recorded along with which super power they would most
like to have: invisibility, super strength, telepathy (ability to read minds), ability to fly, or ability to freeze time. Here are
the results:
Invisibility
Super Strength
Telepathy
Fly
Freeze Time
Total
Female Male Total
17
13
30
3
17
20
39
5
44
36
18
54
20
32
52
115
85
200
a. What proportion of males want the power of invisibility?
b. What proportion of females want the power of freeze time?
c. What proportion of children that want the power of telepathy are male?
d. What proportion of children that want the power of fly are female?
11
Female Male Total
Invisibility
17
13
30
Super Strength
3
17
20
Telepathy
39
5
44
Fly
36
18
54
Freeze Time
20
32
52
Total
115
85
200
3. Create a well labeled segmented bar graph of the distributions of power preference and gender. Be sure
to include a key.
Female:
Males:
Key:
4. Based on the graphs above, can we conclude that boys and girls differ in their preference of superpower? Give
appropriate evidence to support your answer.
12
How are your favorite classes related?
Is your favorite elective class associated with your favorite core class? Collect class
data to see if there is a relationship.
1. Which of the following is your favorite elective class? You must choose only one
and mark your choice on the board.
Art
Music
Physical
Education
Foreign
Language
Technology
2. Identify the individuals and variable?
3. Is the variable categorical or quantitative?
4. Go to stapplet.com to enter the class data. Make a bar graph and a pie chart.
Sketch them below.
5. Sometimes it is helpful to investigate more than one variable. Come to the board
and put a tally mark where you belong.
Find each of the following:
Core Class
% of all students who chose P.E.:
Math
English
Art
Music
Elective
% of all students who chose Math and
chose Art:
P.E.
Foreign Lang.
Tech.
% of the students who prefer math that
chose Tech.
13
6. How many variables does the table have? Are the variables categorical or
quantitative?
7. Which variable would best explain or predict the other variable?
8. Go to stapplet.com and enter the data. Make a side-by-side bar graph and a
segmented bar graph. Sketch them below.
9. How do the bars in the side-by-side-bar graph relate to the bars in the
segmented bar graph?
10. Is there an association between favorite core subject and favorite elective? If so,
describe it.
11. If there was not an association between favorite core subject and favorite
elective, what would the graphs look like? Explain.
14
Analyzing Categorical Data
Important Ideas:
Check Your Understanding:
1. The following graph was displayed by a
national news organization. Explain why
the graph may be misleading, and
sketch a corrected version of the graph.
2. A real estate agent is collecting data on the number of houses built in his town’s
three neighborhoods during three different decades. The table below gives
information.
1960s
1970s
1980s
Shady Lane
40
30
10
Oakcrest
60
15
5
Pinewood Estates
0
45
15
a. What proportion of the houses shown were built in Pinewood Estates?
b. Find the distribution of Decade Built for the houses in this town using relative
frequencies.
c. What percent of homes were built in Oakcrest and in the 1960s?
15
What will be the mascot?
When the high schools were built , the schools needed to pick a mascot. The
principal decided to have the students and teachers vote between three choices:
thoroughbreds, bumble bees, or bull dogs. His random sample results were:
Teachers
Students
Thoroughbreds Bees
80%
10%
30%
60%
Bull Dogs
10%
10%
1. Create two bar graphs below to display the results. Use three different colors for the bars.
2. Complete the third graph by taking each bar from the teacher sample and stacking them.
Use the colors to mark each section. Do the same for the student sample.
Teachers
Students
3. According to your displays, which mascot appears to have the most support? Explain.
4. Upon hearing the results of the surveys, the students argued that the decision was
incorrect because 100 teachers had been surveyed and 500 students had been
surveyed. Use this information to fill in the table below with the number of responses.
Rams
Falcons
Prairie Dogs
Teachers
Students
5. How many times more students were sampled than teachers? _____. How can you
update the third graph in #1 to take into account the sample size? Adjust your graph.
6. What should they make the mascot? Explain.
16
Representing Two Categorical Variables
Important Ideas:
Check Your Understanding:
The following table gives the result of a random sample of upper level students at Rocky Vista
University (the Fighting Prairie Dogs!), along with a mosaic plot.
Employment Status
Currently working
Not working but have had a job
Never had a job
Grade Level
Junior
14
22
15
Senior
30
40
10
a. Calculate the proportion of Juniors that are currently working, not working but have had a
job, and never had a job.
b. Calculate the proportion of Seniors that are currently working, not working but have had a
job, and never had a job.
c. Write a few sentences summarizing what the display in part (a) reveals about the
association between grade level and job experience for the students in the sample.
17
AP Statistics Handout: Lesson 1.3
Topics: relative frequency, dot plots, stemplots, histograms, CSOCS
Lesson 1.3 Guided Notes
Relative Frequency
Salaries
(thousands of $)
Salary
Frequency
Relative Frequency
29
1
8.3%
32
1
8.3%
34
4
33.3%
35
2
16.7%
39
1
8.3%
43
1
8.3%
Frequency Table
Raw Data
39
34
34
35
34
32
43
34
185
35
29
67
67
1
8.3%
185
1
8.3%
Total
12
100%
Dot Plots
What are some advantages
to using a dot plot? What
are some disadvantages?
Title
Tick, Tick
Label, Label, Label
Stemplots
Stems
Title
Tick, Tick
Label, Label, Label
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
9
24444559
3
2
3
4
5
6
…
18
7
Leaves
Key:
3|2 represents a
worker with a salary
of $32,000
5
18
9
24444559
3
7
5
Key:
3|2 represents a
worker with a salary
of $32,000
What is wrong about the above stemplot?
Why is this problem important?
Salary
29
32
34
35
39
43
67
185
Total
Frequency
1
1
4
2
1
1
1
1
12
Relative Frequency
8.3%
8.3%
33.3%
16.7%
8.3%
8.3%
8.3%
8.3%
100%
Grouped Data Values
Individual Data Values
Histograms
Salary
20-29
30-39
40-49
50-59
60-69
…
180-189
Total
Frequency
1
8
1
0
1
Relative Frequency
8.3%
66.7%
8.3%
0%
8.3%
1
12
8.3%
100%
When is it useful to group data values?
Relative Frequency Histogram
Histogram
Title
Tick, Tick
Label, Label, Label
CSOCS
Shape
“Describe the distribution…”
Context – What ________________ is being measured?
Unimodal
Shape – ___________ skew, symmetric, modes
Outliers – _____________ points
___________
_
Center – Mean, median, _____________________
Spread – ____________, IQR, standard deviation
__________
19
Symmetric
_________
i) Describe the distribution in the space below.
Label each part of CSOCS:
Describing the shape of stemplots
20
1.2 PRACTICE!
Smart Phone Battery Life
Smart Phone
Battery Life (minutes)
Apple iPhone
300
Motorola Droid
385
Palm Pre
300
Blackberry Bold
360
Blackberry Storm
330
Motorola Cliq
360
Samsung Moment
330
Blackberry Tour
300
HTC Droid
460
Here is a dotplot
Collection
1 of the data:
300
Dot Plot
340 380 420 460
BatteryLife (minutes)
1. Describe the shape, center, and spread of the distribution. Are there any (potential) outliers?
Dotplot of EnergyCost vs Type
Type
Top vs. Bottom Freezers
How do the annual energy costs (in dollars) compare for refrigerators with top freezers and refrigerators with
bottom freezers? The data below is from the May 2015 issue of Consumer Reports.
bottom
top
56
70
84
98
112
EnergyCost
126
140
2. Compare the distributions of annual energy costs for these two types of refrigerators.
21
Who’s Taller?
Which gender is taller? A sample of 14-year-olds from the United Kingdom was randomly selected using the
CensusAtSchool website.
Here are the heights of the students (in cm):
Male: 154, 157, 187, 163, 167, 159, 169, 162, 176, 177, 151, 175, 174, 165, 165, 183, 180
Female: 160, 169, 152, 167, 164, 163, 160, 163, 169, 157, 158, 153, 161, 165, 165, 159, 168, 153, 166, 158, 158,
166
Here is a back-to-back stemplot comparing male and female heights:
Female
332
98887
433100
99876655
Male
15 14
15 79
16 23
16 5579
17 4
17 567
18 03
18 7
Key: 15|1 represents a
student who is 151 cm tall.
3. Compare the distributions of height for females and males.
22
How many pairs of shoes do you own?
1. How many pairs of shoes do you own? Record your answer on the board.
2. Is “Number of pairs of shoes” a categorical or quantitative variable?
3. Enter the data at www.stapplet.com. Make a dotplot, stemplot, and histogram and
sketch each below.
4. List the mean and median of the distribution. Which value do you think is a more
appropriate measure of center? Explain.
5. Describe the distribution of the number of pairs of shoes for your class.
Shape:
Outliers:
Center:
Variability (spread):
6. Which of the three types of display do you prefer? Why?
23
Displaying Quantitative Data
Important Ideas:
Check Your Understanding:
1. Mr. Wilcox is a huge fan of University of Michigan football. His favorite season
was the 1997 season (a perfect season!). The dotplot shows the number of
points scored by the U of M team in the 12 games that season.
(a) Use the dotplot to create a stemplot of the distribution.
(b) Describe the shape of the distribution.
(c) Are there any potential outliers? Why?
(d) What measure of center is most appropriate to describe the distribution? Explain.
24
AP Statistics Handout: Lesson 1.4
Topics: measures of center, measures of spread, using technology to find summary stats
Lesson 1.4 Guided Notes
Measures of Center
Mean:
ð‘ĨĖ… =
ð‘ĨĖ… =
=
𝑛
∑ ð‘Ĩ𝑖
𝑛
39 + 34 + 34 + 35 + 34 + 32 + 43 + 34 + 185 + 35 + 29 + 67 601
=
= _______
12
12
Median:
Show the steps to finding the median:
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
Salaries
(thousands of $)
39
34
34
35
34
32
43
34
185
35
29
67
Question Preview (for later discussion): The boss is trying to hire you to work at this company. She says,
“Our typical salary is $50,100.” Is this misleading? Why or why not?
Approximating a median in a histogram: The graph below describes 55 ACT scores.
1. Divide the sample size (the total number of
data points) by 2. This is the ______________
of the ____________ value of the dataset –
the median.
2. From the lowest value datapoints, count
the ________________ until you reach the
central position from step #1.
3. Report the _____________ of possible
values for the median.
25
Measures of Spread
Range:
Range = Max – Min
Range = 185 – 29 = _____
Standard Deviation:
𝒔𝒙 = 𝟒𝟑. 𝟔𝟏
The salaries in the dataset are typically
$43,610 _______________ the mean.
𝑠ð‘Ĩ = √
∑(ð‘Ĩ𝑖 − ð‘ĨĖ… )2
𝑛−1
Interquartile Range (IQR):
Show the steps to finding the interquartile range (IQR):
Formula: IQR = Q3 – Q1
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
Question Preview (for later discussion): Which measure of spread (range, standard dev., or IQR) best
represents the “typical” distance between salaries? Why?
26
Technology: Summary Statistics
1. Put data into List 1
(STAT → EDIT)
2. Find 1-Var Stats
(STAT → CALC → 2)
3. Select Data List
(→ Calculate)
4. Scroll through
the summary stats
(use Sx for stdev.)
Lesson 1.4 Discussion
1. The boss is trying to hire you to work at this company. She says,
“Our typical salary is $50,100.” Is this misleading? Why or why not?
Measures of Center
Mean: 50.1 ($50,100)
Median: 34.5 ($34,500)
2. Which measure of spread (range, standard dev., or IQR) best
represents the “typical” distance between salaries? Why?
Measures of Spread
Range: $156,000
𝑆ð‘Ĩ : $43,610
IQR: $7,000
27
Salaries
(thousands of $)
39
34
34
35
34
32
43
34
185
35
29
67
“Resistance is futile”
The median is ________________ (not seriously affected by) skew and outliers. The mean
_______________________ to skew and outliers. The mean follows skew/outliers.
The interquartile range (IQR) is resistant to skew and outliers. The range and ______________________
are not resistant to skew and outliers.
Why are the median and IQR resistant to outliers? Let’s explore with the salary data:
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
For the mean: The outlier salary – $185,000 – drags up the mean because its high value is given _______
___________ in the calculation
For the median: The ______________ matters more than the __________. Because $185,000 is the
highest data point, it’s crossed off right away. The outlier is _____ given a large weight in the calculation.
For the IQR: Like the median, the position matters more than the value for the IQR.
Applet: Use the (very cool) simulation linked here to explore these properties of the mean and median.
Link: http://digitalfirst.bfwpub.com/stats_applet/stats_applet_6_meanmed.html
Source: Digital First project from Bedford, Freeman, & Worth publishers
Right Skew
Symmetric
Left Skew
Med. ____ Mean
Med. ____ Mean
Med. ____ Mean
28
How many colleges are you applying to?
How many different colleges is your group of 4 applying to? Find the total number of
colleges for your whole group.
1. Record the data for the class here.
2. Calculate the mean and median for the set of data. Compare them.
3. What is the range of the data?
Finding Standard Deviation
4. Finding range is helpful but it
does not tell us how spread out the
data is between the minimum and
maximum. How can we find the
average distance of the values
from the mean?
Value
Distance from mean
(Distance from mean)2
a. Complete the table.
b. The average you calculated is the
average of the squared distances
from the mean. How do we use this
to find the average distance from
the mean? Find it.
Total:
Average (Distance from mean)2:
5. Go to stapplet.com. Enter the classroom data and find the summary statistics. Verify
our work. How does it compare?
6. We forgot to add one group that applied to 40 colleges! Add this group to the data
set. Calculate the new mean, median and standard deviation using the applet. How
does it compare to the original measures? Why do you think this is?
29
Describing Quantitative Data
Important Ideas:
Check Your Understanding:
A researcher is interested in how much annual rainfall is typical in the United States. She
takes a random sample of 9 cities in the U.S. and records the annual rainfall, in inches.
8.2
10.3
33.5
39.1
40.5
41.9
42.4
44.9
53.7
1. Calculate the mean annual rainfall for these cities.
2. Find the median annual rainfall for these cities.
3. Would you use the mean or the median to summarize the typical annual rainfall
for a U.S. city? Explain.
4. The standard deviation of the annual rainfall for these 9 cities is 15.52 inches.
Interpret this value.
30
AP Statistics Handout: Lesson 1.5
Topics: five-number summary, determining outliers, boxplots, comparing distributions
Lesson 1.5 Guided Notes
Five-Number Summary
Determining Outliers
Formulas:
Outliers are __________
high or low data values.
Upper Limit: Q3 + 1.5 x IQR
Lower Limit: Q1 – 1.5 x IQR
Example:
Upper Limit: 41 + 1.5 x 7 = 51.5
Lower Limit: 34 – 1.5 x 7 = 23.5
Which of the following salaries are outliers?
Data values ______
this are outliers.
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
Data values _______
this are outliers.
Boxplots
31
_________
Right Skew
a) What percent of the data is below Q1?
b) What percent of the data is below Q3?
c) What percent of the data is above Q1?
d) What percent of the data is above the median?
e) What percent of the data is within the IQR?
Describe the distribution…
Context: Subject of data
Shape: Skew (not modes)
Outlier: _________  for boxplots
Center: _____________  for boxplots
Spread: ______ for boxplots
Data was collected on ___________ at a company. The distribution
appears ______________ (high outliers). There are two high outliers, at
____________________. The ______________ of the distribution is
$34,500. The ______ is $7,000.
32
Comparing Distributions
Two tax plans (A & B), each with average tax cuts of $4,000
1. If comparing dotplots, stemplots, histograms, or boxplots…
- use _________
2. Use ________________ language for each feature…
- “less than,” “greater than,” “similar,” etc.
C (Context) – tax cuts ($) under two plans
S (Shape) – A is left skew, B is right skew
O (Outlier) – A has no outliers, B has high outliers
C (Center) – Median A is 4.75, Median B is 0.20
S (Spread) – IQR A is 2.75, IQR B is 0.60
Household ____________ under Plan A are left skew,
whereas they are severely right skew under Plan B. There
are no outliers under Plan A, while there are ________
_____________ higher than $12,000 in Plan B. The median
tax cut under Plan A (about $4,750) is __________ than
under Plan B (about $200). Plan A also has a _________ IQR
(about $2,750) than Plan B (about $600).
Lesson 1.5 Discussion
Discussion Question: Answer the Press Secretary’s
question – what reason could a congressperson give
for opposing a tax plan that produces an average
tax cut of $4,000?
From official White House Press Secretary Twitter account:
https://twitter.com/presssec/status/922245672409198597?lang=en
33
Critics:
Low-Income: no tax cut
Middle Income: no tax cut
High Income: large tax cut
Averages ________ skew and outliers – this is
sometimes called the “flaw of averages”
= Large ___________ tax cut
Critics of the tax plan argue that, once we get reliable data, the Trump tax cut will look more like Plan B
than Plan A. The median tax cut will be much lower than $4,000, with tax cuts well below $1,000 for the
majority of households (look at the IQR).
This is why we describe the _____________________ of data – a one number summary can be
deceiving.
Image courtesy of Sam Savage and Jeff Danziger from The Flaw of Averages
34
1.3 PRACTICE!
McDonald’s Beef Sandwiches
Here are data for the amount of fat (in grams) for McDonald’s beef
sandwiches:
Sandwich
Hamburger
Cheeseburger
Double Cheeseburger
McDouble
Quarter Pounder®
Quarter Pounder® with Cheese
Double Quarter Pounder® with Cheese
Big Mac®
Big N' Tasty®
Big N' Tasty® with Cheese
Angus Bacon & Cheese
Angus Deluxe
Angus Mushroom & Swiss
McRib ®
Mac Snack Wrap
Fat (g)
9g
12 g
23 g
19 g
19 g
26 g
42 g
29 g
24 g
28 g
39 g
39 g
40 g
26 g
19 g
Use your graphing calculator to find the following:
Mean
Median
5 Number Summary
IQR
Are there any outlier/s
using the IQR*1.5 Rule?
The Previous Home Run King
Using your graphing calculator, create a box plot using the data below. Be sure to identify each number in a five number
summary and any outliers using the IQR*1.5 Rule.
Number of home runs that Hank Aaron hit in each of his 23 seasons:
13 27 26 44 30 39 40 34 45 44 24 32 44 39 29 44 38 47 34 40 20 12 10
35
Who Has More Contacts—Males or Females?
The following data show the number of contacts that a sample of high school students had in their cell phones. Do the
data give convincing statistical evidence that one gender has more contacts than the other? You need both graphical
and numerical evidence.
Male: 124 41 29 27 44 87 85 260 290 31 168 169 167 214 135 114 105 103 96 144
Female: 30 83 116 22 173 155 134 180 124 33 213 218 183 110
36
Where Do I Stand?
How does my height compare with the other AP Stats students in my class? In order
to answer this question, Ashmita, a student in 4th hour AP Stats, recorded the
heights of everyone in her class. The heights (in inches) were:
68 72 61 62 63 63 64 64 59 62 61 60 65 62 57 77 62 71 65 62 70
1. Create a dotplot to display the class distribution of heights.
2. What is the median height? Describe how you found it.
3. What is Q1 and Q3? Describe how you found them.
4. Record the following values and then use them to make a boxplot.
Minimum:
Q1:
Median:
Q3 :
Maximum:
4. The interquartile range (or IQR) is defined as Q3 − Q1. Find the IQR. Where do you see
the IQR in the boxplot?
5. An outlier is a data value that is way too small or way too big (using the rules below). Are
there any outliers? Show your work.
Way too small < Q1 −1.5IQR
Way too big > Q3 +1.5IQR
6. Ashmita is 63 inches tall. How does her height compare with the other AP Stats students
in her class?
37
Describing Quantitative Data
Important Ideas:
Check Your Understanding:
Mr. Wilcox is a huge fan of University of Michigan football. His favorite season was the 1997
season (a perfect season!). Here is a back-to-back stemplot of the points scored by the 1997
University of Michigan football team and the archrival Michigan State University football
team. Write a few sentences comparing the distributions.
38
Who is Baseball’s Greatest Home Run Hitter?
Barry Bonds broke Mark McGwire’s record when he hit 73 home runs in the 2001 season. How
does this accomplishment fit with the rest of Bond’s career? Here are Bond’s home run counts
for the years 1986 to 2007.
16 25 24 19 33 25 34 46 37 34 49 73 46 45 45 5 26 28
1. What display did your group get assigned? _____________________
2. Create the display on the whiteboard and bring it to the front of the room.
3. Describe the distribution.
4. What are the advantages and disadvantages of your type of display?
5. Below is this distribution of the number of home runs per season for Mark McGwire.
Compare this distribution to the one for Barry Bonds.
39
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 1 hour and 5 minutes on this part of the exam.
Percent of Section II score—75
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. The sizes, in square feet, of the 20 rooms in a student residence hall at a certain university are summarized in the
following histogram.
(a) Based on the histogram, write a few sentences describing the distribution of room size in the residence hall.
(b) Summary statistics for the sizes are given in the following table.
Mean
Standard
Deviation
Min
Q1
Median
Q3
Max
231.4
68.12
134
174
253.5
292
315
-640
GO ON TO THE NEXT PAGE.
Determine whether there are potential outliers in the data. Then use the following grid to sketch a boxplot of
room size.
(c) What characteristic of the shape of the distribution of room size is apparent from the histogram but not from
the boxplot?
41
®
2015 AP STATISTICS FREE-RESPONSE QUESTIONS
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 65 minutes on this part of the exam.
Percent of Section II score—75
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. Two large corporations, A and B, hire many new college graduates as accountants at entry-level positions. In
2009 the starting salary for an entry-level accountant position was $36,000 a year at both corporations. At each
corporation, data were collected from 30 employees who were hired in 2009 as entry-level accountants and were
still employed at the corporation five years later. The yearly salaries of the 60 employees in 2014 are
summarized in the boxplots below.
(a) Write a few sentences comparing the distributions of the yearly salaries at the two corporations.
(b) Suppose both corporations offered you a job for $36,000 a year as an entry-level accountant.
(i) Based on the boxplots, give one reason why you might choose to accept the job at corporation A.
(ii) Based on the boxplots, give one reason why you might choose to accept the job at corporation B.
42
Unit 1
Chapter 2
Modeling
Distributions of
Quantitative Data
43
AP Statistics Handout: Lesson 2.1
Topics: percentiles, cumulative relative frequency, standardized scores (z-scores)
Lesson 2.1 Guided Notes
Percentiles
Percentile: the percent of data _____________ a certain data value.
At what percentile is the person who makes a salary of $43,000?
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
1) What percentile is Q1?
2) What percentile is the median?
3) What percentile is Q3?
Whose score is more impressive? Why?
44
Cumulative Relative Frequency
1) Is an ACT score of 18 a good score?
2) You are applying for an elite college and want to
score in the top quartile of test takers. What score
do you need?
Standardized Scores (Z-Scores)
Z-Scores (also called standardized scores): measures how many ______________________ a data point
is __________________ the mean.
𝑧=
ð‘‘ð‘Žð‘Ąð‘Ž ð‘ð‘œð‘–ð‘›ð‘Ą − 𝑚𝑒𝑎𝑛
ð‘ ð‘Ąð‘Žð‘›ð‘‘ð‘Žð‘Ÿð‘‘ 𝑑𝑒ð‘Ģð‘–ð‘Žð‘Ąð‘–ð‘œð‘›
|
z=
ð‘Ĩ𝑖 − μ
𝜎
Variation Matters:
14 PPG
In which league (A or B) is this player relatively
“better?” Why?
Standardization: A point’s location in the distribution depends on both distance from the ____________
and the distribution’s _________ or _________________.
45
Standardized: Who Was the Best?
Show all z-score calculations below:
Who was the G.O.A.T?
30.1 ppg
30.1 ppg
27.1 ppg
Mean (𝜇) ppg
in their era:
10.8 ppg
8.7 ppg
8.4 ppg
Stdv (𝜎) ppg
in their era:
7.0 ppg
5.9 ppg
5.5 ppg
MJ’s PPG was ____ standard
deviations ______ the mean
for his era, making him the
most _____________ scorer
of these three legends.
Z-Score:
(show work)
Standardized: Players who were not the best…
Photo: opencourt-basketball.com
Calculate Adam Morrison’s z-score for PPG…
While with the Lakers, he averaged ______ PPG.
(League: µ = 8.4 ppg, σ = 5.5 ppg)
z=
Adam Morrison
______ − 8.4
= _________
5.5
Adam Morrison’s scoring rate was 1.1 standard deviations
_____________ the league _______________ in his era.
Positive and Negative Z-Scores
data value > mean → ______________
data value < mean → ______________
𝑧=
ð‘‘ð‘Žð‘Ąð‘Ž ð‘ð‘œð‘–ð‘›ð‘Ą − 𝑚𝑒𝑎𝑛
ð‘ ð‘Ąð‘Žð‘›ð‘‘ð‘Žð‘Ÿð‘‘ 𝑑𝑒ð‘Ģð‘–ð‘Žð‘Ąð‘–ð‘œð‘›
46
Positive Z-Score: The number of standard
deviations _________ the mean.
Negative Z-Score: The number of standard
deviations _________ the mean.
2.1 PRACTICE!
1. The stemplot below shows the number of wins for each of the 30 Major League Baseball teams in 2009.
5
6
7
8
9
10
9
2455
00455589
0345667778
123557
3
Key: 5|9 represents a
team with 59 wins.
Find the percentiles for the following teams:
(a) The Colorado Rockies, who won 92 games.
(b) The New York Yankees, who won 103 games.
(c) The Kansas City Royals and Cleveland Indians, who both won 65 games.
2. Here is a table showing the distribution of median household incomes for the 50 states and the District of Columbia.
Calculate the relative frequency and cumulative relative frequency.
Median
Income
($1000s)
Frequency
35 to < 40
1
1
40 to < 45
10
11
45 to < 50
14
25
50 to < 55
12
37
55 to < 60
5
42
60 to < 65
6
48
65 to < 70
3
51
Relative
Frequency
47
Cumulative
Frequency
Cumulative
Relative
Frequency
3. Use the cumulative relative frequency graph for the state income data to answer each question.
(a) At what percentile is California, with a median income of $57,445?
(b) Estimate and interpret the first quartile of this solution.
4. Miami-Dade County Public Schools employs teachers at salaries between $40,000 and $71,000. The teachers’ union
and the school board are negotiating the form of next year’s increase in the salary schedule.
(a) If every teacher is given a flat $1000 raise, what will this do to the mean salary? To the median salary? Explain your
answers.
(b) What would a flat $1000 raise do to the extremes and quartiles of the salary distribution? To the standard deviation
of teachers’ salaries? Explain your answers.
48
Where do I stand?
How does my height compare with other AP Stats students?
The dotplot below represents a random sample of the heights of 20 AP Stats students
to the nearest inch.
1.
Describe the distribution.
2.
a. Arianna is 65 inches tall. What percent of the heights are less than or
equal to 65?
b. What is your height?
than or equal to your height?
3.
Complete the table.
What percent of the heights are less
Height Frequency
Relative
Cumulative
Frequency
Relative Freq.
56-60
61-65
66-70
71-75
76-80
4.
Use the info in the table to create a cumulative relative frequency graph.
a. Mrs. Gallas is 66 inches tall.
Estimate and interpret the percentile
she is at using the graph.
Cumulative
Relative
Frequency
b. Estimate and interpret the 80th
percentile.
Height (in)
49
Describing Location in a Distribution
Important Ideas:
Check Your Understanding:
1. According to a 2019 article at Insider.com, the state of Pennsylvania was at the 82nd percentile
for Pre-K to 12th grade education and was at the 0th percentile for higher education. Explain
what these values mean.
2. The graph displays the cumulative relative frequency of the cost of in-state public college
education for each of the 50 states.
Cumulative relative frequency (%)
120
100
80
60
40
20
0
0
2000
4000
6000
8000
10000
12000
14000
16000
In-State Public College Tuition ($)
a. About what percent of states have in-state public college tuition less than or equal to
$8000? More than $8000?
b. Estimate Q1, Q3, and the IQR of the distribution of phone in-state public tuition.
50
AP Statistics Handout: Lesson 2.2
Topics: transforming data (adding, subtracting, multiplying, and dividing data by a constant)
Lesson 2.2 Guided Notes
Effects of adding/subtracting to a distribution
Adding $6000 to each salary below (in thousands of $):
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
+6 +6 +6 +6 +6 +6 +6 +6 +6 +6 +6 +6
Original dataset
Median = 34.5
Range = 156
35, 38, 40, 40, 40, 40, 41, 41, 45, 49, 73, 191
Transformed dataset
Median = 40.5
Range = 156
Shifted up by _____
Stayed __________
Subtract 6 from each datapoint
Add 6 to each datapoint
When adding or subtracting every data value by a constant…
1. Center: adds/subtracts by that constant amount.
2. Spread: remains the ____________.
3. Shape: remains the _____________.
51
Effects of multiplying/dividing a distribution by a constant
Simple Example: Multiply the following dataset by 3…
x3
1, 2 → 3, 6
Median: 1.5
Range: 1
x3
Median: 4.5
Range: 3
Multiplication and Spread
When multiplying, big numbers __________
______ than small numbers. Because the
numbers change at different rates, the
spread ___________________ grows larger.
6% salary increasing (multiply by 1.06):
29, 32, 34, 34, 34, 34, 35, 35, 39, 43, 67, 185
Original dataset
Median = 34.5
Range = 156
x1.06 x1.06 x1.06 x1.06 x1.06 x1.06 x1.06 x1.06 x1.06 x1.06 x1.06 x1.06
30.7, 33.9, 36.0, 36.0, 36.0, 36.0, 37.1, 37.1, 41.34, 45.6, 71.0, 196.1
Transformed dataset
Median = 36.6
Range = 165.4
Grew by a factor of 1.06
Grew by a factor of 1.06
Multiply each datapoint by 2
When multiplying or dividing every data value by a constant…
1. Center: multiplies/divides by that constant amount.
2. Spread: _____________________ by that constant amount.
3. Shape: remains the ____________.
52
Divide each datapoint by 2
Lesson 2.2 Discussion
Transformation (Multiplication) of Money: Stocks
Stock: a small ____________________ share of a company.
• If you buy a stock of Coca-Cola, you are a part owner (a tiny
part) of Coca-Cola. If the company makes profits and gains
value, your stock ___________ in value. If the company
suffers losses, your stock ___________ value.
S&P 500: an _____________ that tracks the value of stock from 500 large American companies.
• It’s a good indicator of how the stock market is performing overall.
Stocks (and other wealth assets) tend to multiply in value over time.
Example: Let’s say we invested $185 (in today’s dollars) in an S&P 500 index stock fund in 1950. How
much money would we have in 2020?
Year
1950
1951
1952
1953
1954
Value
$185.00
$192.40
$200.10
$208.10
$216.42
×1.04
×1.04
×1.04
×1.04
2016
2017
2018
2019
2020
$2,462.48
$2,560.98
$2,663.41
$2,769.95
$2,880.75
×1.04
×1.04
×1.04
×1.04
Assumptions:
-Gains 4% in value per year (inflation adjusted)
-No dividends
You’d have $2,880.75!
Inflation adjusted using CPI for March 2020 dollars.
Data from: multpl.com/inflation-adjusted-s-p-500/table/by-year
53
Wealth Inequality in the U.S.
____________ = all assets (stocks, home
value, bonds, etc.)
For the bottom 90% of families, wealth
has _____________. That’s good!
U.S. Family Real Wealth Over Time
Discussion Questions:
1. Why is wealth inequality increasing?
Hint: think about multiplication.
2. Is wealth inequality a problem? If so, what
should we do about it? If not, why not?
U.S. Family Real Wealth Over Time
Monetary values expressed in 2010 dollars.
Data Source: Appendix Table B3 of Emmanuel Saez, Gabriel Zucman, “Wealth Inequality in the United States since 1913: Evidence
from Capitalized Income Tax Data,” The Quarterly Journal of Economics, Volume 131, Issue 2, May 2016, Pages 519–578,
https://doi.org/10.1093/qje/qjw004
54
Multiplication increases ___________
Year
Value
(Low Start)
Value
(High Start)
1950
$185.00
$100,000
1951
$192.40
$104,000
1952
$200.10
$108,160
2018
$2,663.41
$1,439,683.65
2019
$2,769.95
$1,497,271.00
2020
$2,880.75
$1,557,161.84
Initial Difference:
____________
×1.04
×1.04
×1.04
×1.04
55
Final Difference:
____________
Multiplication increases spread. So
______________ multiplication
really increases spread.
How Did I Do?
How well did you do on the Chapter 1 Test? How well did you do relative to your
classmates?
Here are the results of a random sample of 20 of the Chapter 1 Tests, along with a dotplot
and summary statistics.
Test
Scores
61 65 65 73 75 77 78 78 79 80 80 80 80 81 81 88 89 93 98 99
n mean SD min Q1 med Q3 max
20 80 10 61 76 80 84.5 99
1. Biff scored a 65. What is Biff’s percentile?
2. Was Biff above or below the mean? By how many points? By how many standard
deviations?
3. Marty scored an 88. What is Marty’s percentile?
4. Was Marty above or below the mean? By how many points? By how many standard
deviations?
A z-score is defined as the number of standard deviations above or below the mean.
z=
5. Write a formula for calculating a z-score.
6. Goldie scored a 98 on the Chapter 1 Test. Find and interpret the z-score.
Bonus: Goldie was aspiring for what job?
56
7. There are two mathematical operations used when calculating a z-score:
a. First, we take each score, and ____________________ the mean (remember the mean
was 80). Fill in the table and then make a dotplot for each.
SCORE
SCORE - MEAN
61 65 65 73 75 77 78 78 79 80 80 80 80 81 81 88 89 93 98 99
Dotplot for SCORE
Dotplot for SCORE - MEAN
What happens to the shape, center, and variability when you subtract the mean from each
score?
b. Second, we take the SCORE – MEAN and ____________________by the standard
deviation (remember the standard deviation is 10). Fill in the table and then make a dotplot
for each.
SCORE - MEAN
SCORE – MEAN
SD
-19 -15 -15 -7 -5 -3 -2 -2 -1 0
Dotplot for
0
0
0
1
1
8
9 13 18 19
SCORE − MEAN
SD
What happens to the shape, center, and variability when you divide by the standard deviation for
each value?
8. Summarize: What happens to the shape, center and variability of a distribution when you
…add or subtract the same value a from each value?
…multiply or divide by the same value b from each value?
9. What is the mean and standard deviation of the distribution of z-scores? Will this be true
for any distribution of z-scores? Explain.
57
Describing Location in a Distribution
Important Ideas:
Check Your Understanding:
1. According to an article on Yahoo!news, you should change your sheets every 7 days…at
minimum. To investigate the sheet changing habits of adults, a random sample of 20 adults
reported how often they change their sheets using an anonymous survey. Here is a dotplot
and summary statistics of the results.
a. Suppose you convert the time before changing sheets from days to weeks. Describe the
shape, mean, and standard deviation of the distribution of time before changing sheets in
weeks.
b. The adults in the study are given an article explaining the health benefits that would arise
from changing their sheets more often. After reading the article each person agrees to
change their sheets one week sooner than they used to. How does the shape, center, and
variability of this distribution compare with the distribution of time in part (a)?
c. Now suppose you convert the time before changing sheets from part (b) to z-scores. What
would be the shape, mean, and standard deviation of this distribution? Explain your
answers.
58
AP Statistics Handout: Lesson 2.3
Topics: normal curves, the empirical rule (68-95-99.7 rule), race and “intelligence”
Lesson 2.3 Guided Notes
What is a normal curve?
The Normal Curve
Frequency
1. _______________ and “bell shaped”
2. The mean = ____________, both located at
the exact center
3.
Most data points are __________ the center
Data Values
Example: IQ scores are commonly thought to be normally distributed, with mean 100 pts. and standard
deviation 15 pts.
𝝁 = 𝟏𝟎𝟎
55
70
85 100 115 130 145
𝝈 = 𝟏𝟓
The Empirical Rule (a.k.a the 68-95-99.7 rule)
68% of the data is within
_____ of the mean
95% of the data is within
_____ of the mean
68%
𝝈
55
70
𝝈
99.7% of the data is within
_____ of the mean
95%
85 100 115 130 145
𝟐𝝈
55
70
𝟐𝝈
99.7%
85 100 115 130 145
𝟑𝝈
55
59
70
𝟑𝝈
85 100 115 130 145
Strategy: Normal Curve Problems
1. ______________________ curve (include mean, standard deviation, tick marks, shade
area/point in question)
2. Perform _________________ (show work)
3. Answer the question _____________________
Examples:
a) You take an IQ test and get a score of 130. Assume IQ is normally distributed with mean 100 pts.
and standard deviation 15 pts. What percentile are you at?
b) What score would you have to earn on an IQ test to be at least at the 84th percentile?
60
Lesson 2.3 Discussion
Race and “intelligence”
-In The Bell Curve, the authors
state that black people score
about 15 points lower than
white people on IQ tests, on
average.
An argument the book makes…
On average:
• Black test takers showed below average IQ
• White test takers showed above average IQ
-Using this evidence, some
make the argument that
white people tend to be more
intelligent than black people.
Herrnstein, Richard J., and Charles
A. Murray. The Bell Curve:
Intelligence and Class Structure in
American Life. Free Press, 1994.
15
Average black
IQ scores
Average white
IQ scores
Simulated questions* from the AFQT (the test used by The Bell Curve to obtain IQ data):
Word Knowledge
1. “Solitary” most nearly means
a) sunny
b) being alone
c) playing games
d) soulful
Math Knowledge
2. In the drawing below, JK is the median of the
trapezoid. All of the following are true EXCEPT
a) LJ = JN
b) a = b
c) JL = KM
d) a ≠ c
Discussion Question: Do you believe, based on these sample questions, the AFQT accurately measures
IQ? Why or why not?
*Source: Bock, R. Darrel, and Elsie G.J. Moore. Advantage and Disadvantage: A Profile of American Youth. Lawrence Erlbaum
Associates, 1986. pg. 28-30
61
2.2 PRACTICE!
Wins in Major League Baseball
1. In 2009, the mean number of wins was 81 with a standard deviation of 11.4 wins.
Find and interpret the z-scores for the following teams.
(a) The New York Yankees, with 103 wins.
(b) The New York Mets, with 70 wins.
Batting Averages
2. In the previous alternate example about batting averages for Major League Baseball players in 2009, the mean of the
432 batting averages was 0.261 with a standard deviation of 0.034. Suppose that the distribution is exactly Normal with
 = 0.261 and ïģ = 0.034.
(a) Sketch a Normal density curve for this distribution of batting averages. Label the points that are 1, 2, and 3 standard
deviations from the mean.
(b) What percent of the batting averages are above 0.329? Show your work.
(c) What percent of the batting averages are between 0.227 and .295? Show your work.
62
Exploring Density Curves
Complete each of the following experiments and submit your answers using the google
form. Resubmit your answers for a total of 3 submissions. Predict (sketch) what the
graphs of the class data from each experiment will look like if we did this many many
times. Draw and label lines where you predict the mean and median will be.
Experiment 1: Roll a die and record the value it lands on. 1st roll:
Prediction:
2nd roll:
3rd roll:
Actual:
Experiment 2: Try to toss a penny and make it land on the target. Measure the
distance of the penny from the target in cm. Round to the nearest tenth.
2nd Attempt:
3rd Attempt:
1st Attempt:
Prediction:
Actual:
Experiment 3: Try to stop your stopwatch at exactly 5 seconds. Record what the
stopwatch reads below. Record to the hundredths place.
2nd Attempt:
3rd Attempt:
1st Attempt:
Prediction:
Actual:
Normal Curves: Label the values 1, 2, and 3 standard deviations above and below the
mean using the stopwatch data.
What percentage of the data is within two standard
deviations of the mean?
What percentage of the data is further than two standard
deviations from the mean?
What percentage of the data is greater than 1 standard
deviation above the mean?
What percentage of the data is between z = –1 and z = 2?
63
Density Curves and Normal Distributions
Important Ideas:
Check Your Understanding:
1.
A game that is sometimes played at baby showers asks the guests to cut a length of yarn
that they believe will best measure the distance around the mom’s baby bump. At a particular
baby shower the length of string cut by the guests was uniformly distributed over the interval
40 to 50 inches.
a. What height must the density curve have? Justify your
answer.
b. About what percent of the guests cut their yarn longer
than 48 inches?
c. Calculate and interpret the 25th percentile of this distribution.
2.
The distribution of waist circumference for women who are 8 months pregnant is approximately
Normal with mean µ = 44 inches and standard deviation σ = 4 inches.
a. Sketch the Normal curve that approximates the distribution of waist circumference for
women who are 8 months pregnant. Label the mean and the points that are 1, 2, and 3
standard deviations from the mean.
b. About what percent of women who are 8 months pregnant have a waist circumference
that is less than 40 inches? Show your work.
c. A waist circumference greater than 52 inches may indicate excess amniotic fluid. What
percent of women who are 8 months pregnant may have excess amniotic fluid? Justify
your answer.
64
AP Statistics Handout: Lesson 2.4
Topics: normal calculations – finding percentages from values and values from percentages
Lesson 2.4 Guided Notes
Finding percentages from values
When between standard deviations, you must use a ____________________:
1. When finding percentages from values, use __________________ calculator command.
2. When finding values from percentages, use ___________________ calculator command.
a) You take an IQ test and get a score of 124. Assume IQ scores are normally distributed with a mean of
100 points and a standard deviation of 15 points. What percentile are you at?
Calculator Steps
2nd→VARS→ 2:normalcdf
Finding values from percentages
b) What score would you have to earn on an IQ test to be below 88% of test-takers? Assume IQ scores
are normally distributed with a mean of 100 points and a standard deviation of 15 points.
Calculator Steps
2nd→VARS→ 3:invNorm
65
3. According to the CDC, the heights of 3 year old females are approximately Normally distributed with a mean of 94.5
cm and a standard deviation of 4 cm. Be sure to draw curves for each calculation!
(a) What is the third quartile of this distribution?
(b) What is the median of this distribution?
(c) If a 3 year old female was 91.7 cm tall, what percentile would she be in?
(d) If a mother knew her daughter was at the 91st percentile in height, how tall is her daughter?
(e) If a 3 year old female was 96.4 cm tall, what percentile would she be in?
4. Scores on the Wechsler Adult Intelligence Scale (a standard IQ test) for the 20 to 34 age group are approximately
Normally distributed with μ = 110 and σ = 25. For each part, follow the four-step process.
(a) At what percentile is an IQ score of 150?
(b) What percent of people aged 20 to 34 have IQs between 125 and 150?
(c) MENSA is an elite organization that admits as members people who score in the top 2% on IQ tests. What score on
the Wechsler Adult Intelligence Scale would an individual have to earn to qualify for MENSA membership?
66
Key points and the “genius” example
Key Points
1. Three steps: Draw+_________, Calculate, Answer
2. Percentile is the percent ___________ a data value.
3. “At least” means the data value __________________.
4. invNorm’s input of “area” is always the area _____________ the data value.
5. To approximate an upper bound of ∞, use a very large ______________ number
(such as: 1000000000)
6. To approximate lower bound of -∞, use high magnitude _______________ number
(such as: -1000000000)
Mensa International is an organization that “accepts individuals who
score in the top 2%, ie, two SDs or more above the average” on IQ
tests. Source: https://www.mensa.org/iq/what-iq, for Mensa
international, qualifications may vary by country.
a) For an IQ test with mean of 100 and stdev. of 15, what score would
you have to get to qualify for Mensa International?
b) There are about 328 million people in the United States. How many Mensa International “geniuses”
live in the United States?
Lesson 2.4 Discussion
Discussion Question: Can we identify “intelligence” using data? Is it possible to get an accurate, singlenumber summary of someone’s intellect?
67
Will Marty Make it Back to the Future?
After accelerating for 20 seconds, a DeLorean sports car has a wide range of speeds that it
can achieve, depending on traction. The distribution of speed follows an approximately
Normal distribution with a mean of 80 mph and a standard deviation of 7.7 mph.
1. Label the appropriate values
on the normal distribution
2. What percentage of the runs will give the Delorean a speed greater than 87.7 mph?
3. What percentage of the runs will give the Delorean a speed between 64.6 mph and 87.7
mph?
4. What percentage of the runs will give the Delorean a speed less than 64.6 mph?
5. What percentage of the runs will give the Delorean a speed less than 68.45 mph?
68
6. What percentage of the runs will give the Delorean a speed greater than 85 mph? Show
work.
7. What percentage of the runs will give the Delorean a speed between 70 and 95 mph?
Show work.
8. Marty wants his last run to be in the top 15% of all the possible speeds. What speed does
he need to achieve to be in the top 15%?
69
Density Curves and Normal Distributions
Important Ideas:
Check Your Understanding:
According to an article at africageographic.com there are many unexpected uses for elephant
waste. But how much waste do elephants produce per day? Studies show that the distribution of
amount of waste produced by adult elephants can be modeled by a Normal distribution with mean
250 pounds and standard deviation 21 pounds.
a.
What percent of adult elephants produce at least 300 pounds of waste in a day?
b.
A zoo has an agreement with the local farmers. They sell the daily waste from their elephant
to the farmers for $5.00 per pound. If the amount of waste produced by the elephant on a
given day is at the 10th percentile of the distribution of waste, how much money would the
zoo make selling the waste that day?
70
Do we have Normal test scores?
Here are the Chapter 1 Test scores for 50 of the current AP Statistics students
89
89
78
85
83
96
63
78
94
78
89
87
72
74
89
76
72
81
81
85
75
85
72
100
74
83
61
81
74
55
76
91
76
80
79
67
57
76
78
93
91
83
72
67
96
85
93
70
76
83
Is the distribution of Chapter 1 Test scores approximately normal? Justify your answer using
several different approaches. The group with the most convincing argument will win a prize.
71
Density Curves and Normal Distributions
Important Ideas:
Check Your Understanding:
The following figure is a Normal probability plot of the difference between the desired thickness of
a cell phone screen protector and the actual thickness of the cell phone screen protector for a
random sample of 300 screen protectors, in micrometers. Use the graph to determine if this
distribution of difference in thickness is approximately Normal.
Difference in thickness (micrometers)
72
2011 AP® STATISTICS FREE-RESPONSE QUESTIONS
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 65 minutes on this part of the exam.
Percent of Section II score—75
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. A professional sports team evaluates potential players for a certain position based on two main characteristics,
speed and strength.
(a) Speed is measured by the time required to run a distance of 40 yards, with smaller times indicating
more desirable (faster) speeds. From previous speed data for all players in this position, the times to run
40 yards have a mean of 4.60 seconds and a standard deviation of 0.15 seconds, with a minimum time of
4.40 seconds, as shown in the table below.
Mean
Time to run 40 yards 4.60 seconds
Standard Deviation
Minimum
0.15 seconds
4.40 seconds
Based on the relationship between the mean, standard deviation, and minimum time, is it reasonable to
believe that the distribution of 40-yard running times is approximately normal? Explain.
(b) Strength is measured by the amount of weight lifted, with more weight indicating more desirable (greater)
strength. From previous strength data for all players in this position, the amount of weight lifted has a mean
of 310 pounds and a standard deviation of 25 pounds, as shown in the table below.
Mean
Amount of weight lifted 310 pounds
Standard Deviation
25 pounds
Calculate and interpret the z-score for a player in this position who can lift a weight of 370 pounds.
(c) The characteristics of speed and strength are considered to be of equal importance to the team in selecting a
player for the position. Based on the information about the means and standard deviations of the speed and
strength data for all players and the measurements listed in the table below for Players A and B, which
player should the team select if the team can only select one of the two players? Justify your answer.
Player A
Time to run 40 yards
Player B
4.42 seconds 4.57 seconds
Amount of weight lifted 370 pounds
73
375 pounds
2009 AP® STATISTICS FREE-RESPONSE QUESTIONS (Form B)
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 65 minutes on this part of the exam.
Percent of Section II score—75
Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. As gasoline prices have increased in recent years, many drivers have expressed concern about the taxes they pay
on gasoline for their cars. In the United States, gasoline taxes are imposed by both the federal government and
by individual states. The boxplot below shows the distribution of the state gasoline taxes, in cents per gallon,
for all 50 states on January 1, 2006.
(a) Based on the boxplot, what are the approximate values of the median and the interquartile range of the
distribution of state gasoline taxes, in cents per gallon? Mark and label the boxplot to indicate how you
found the approximated values.
(b) The federal tax imposed on gasoline was 18.4 cents per gallon at the time the state taxes were in effect.
The federal gasoline tax was added to the state gasoline tax for each state to create a new distribution of
combined gasoline taxes. What are approximate values, in cents per gallon, of the median and interquartile
range of the new distribution of combined gasoline taxes? Justify your answer.
© 2009 The College Board. All rights reserved.
Visit the College Board on the Web: www.collegeboard.com.
GO ON TO THE NEXT PAGE.
-674
Unit 2
Chapter 3
Exploring TwoVariable Quantitative
Data
75
AP Statistics Handout
Topics: explanatory/response, describing scatterplots, correlation coefficient (r), causation
Lesson 3.1 Guided Notes
Explanatory and response variables
Bivariate data: data with _________ variables. Two quantitative variables are visualized in a ______________
Income and Food Access Example (H.E.B Grocery Stores) *
Zip Code
Grocery Store
Location
Average
Household
Income (x)
Organic
Vegetables
Offered (y)
78204
78207
78204
78201
78212
78202
78237
78228
78227
78240
78230
78251
78238
78223
78221
78224
78220
78209
78216
78223
78218
78213
78227
78244
78231
78239
78217
78251
78250
78230
78247
78247
78251
78247
78248
78232
78249
South Flores
N. Rosillo st
Nogalitos st
Frederickburg rd
Olmos
New Braunfels
Castroville
Culebra rd
Marbach rd
Babcock rd
Wurzbach rd
W Loop 1604 N
Bandera rd
S.New Braunfels
SW Military
S Zarzamora
W.W. White rd
East basse rd
San pedro
S.E Military dr
Austiin hwy
West Avenue
Valley Hi dr
Foster dr
N.W Military
Montogomery
Perrinbeiter rd
FM 471 west
Guilbeau rd
De Zavala
Thousand oaks
O’Connor rd
Potranco rd
Bulverde rd
NW Loop 1604
18140 San Pedro
9238 Loop 1004
$71,186
$34,234
$71,186
$48,760
$78,096
$40,506
$38,166
$50,398
$49,437
$66,073
$86,566
$78,176
$59,154
$50,252
$48,364
$56,274
$41,318
$125,145
$65,911
$50,252
$53,945
$59,072
$49,437
$72,080
$108,486
$70,530
$57,199
$78,176
$78,288
$86,566
$84,181
$84,181
$78,176
$84,181
$135,547
$92,946
$77,894
36
4
28
31
78
14
12
18
38
84
61
56
62
44
26
29
15
95
18
65
50
35
36
28
95
46
29
73
53
86
68
56
85
86
93
82
96
Using this data from San Antonio, TX, we will explore
whether there is a relationship between neighborhood
income and access to organic items at local grocery stores.
Explanatory (independent) variable: the variable that
predicts, explains, or influences a trend in the response
variable. This is the _______________.
Response (dependent) variable: the measured outcome.
Responds to trends in the explanatory variable. This is the
_________________.
In this example, which variable is the explanatory variable?
Why?
In this example, which variable is the response variable?
Why?
*Dataset compiled by student Linda Saucedo, Fall 2019
76
Describing scatterplots
Graphic inspired by mathisfun.com
Correlation: measures how two variables are _________________.
Positive correlations: as the x values increase, the y values also tend to ___________________.
Negative correlations: as the x values increase, the y values tend to ____________________.
Graphic inspired by mathisfun.com
Least Squares Regression Line (LSRL): a straight line that roughly puts half of your data ____________ it
and half ______________ it.
• More formal definition coming next lesson.
Strong correlations: data is __________ to the LSRL
• The LSRL is a __________________ for the data
• If you used the LSRL to predict new data, you would make ________________________.
Weak correlations: data is ______ from the LSRL
• The LSRL is a __________________ for the data
• If you used the LSRL to predict new data, you would may be _________________________.
Direction:
Strength:
77
C – Context
D – Direction (positive/negative)
O – Outliers
F – Form (linear/non-linear)
S – Strength (strong/moderate/weak)
Put it all together: Describe the relationship between average
household income in a zip code and the number of organic
vegetables offered at the local grocery store…
Correlation coefficient (r)
Graphic inspired by mathisfun.com
𝑟=1
𝑟 = 0.91
𝑟 = 0.48
𝑟=0
𝑟 = −0.48
𝑟 = −0.91
𝑟 = −1
Correlation Coefficient (r): A number between _______________ that tells you the strength and
direction of a correlation.
Strength:
r close to 0 → _________ correlation
r close to -1, 1 → __________ correlation
Direction:
Negative r value → ______________ correlation
Positive r value → ______________ correlation
78
Lesson 3.1 Discussion
From 1994-2020, a statistician collected three pieces of data each summer at a beach:
1. The average temperature
2. The amount of ice cream sold at the beach shop
3. The amount of drownings
A
A) Discussion Question: Describe the
relationship between temperature and ice
cream sales. Does this relationship make sense?
Why or why not?
B
B) Discussion Question: Describe the
relationship between ice cream sales and
drownings. Does this relationship make sense?
Why or why not?
79
Does ice cream really cause drownings? No.
What’s really going on here? There is an ________________ third variable, temperature, that causes ice
cream sales and drownings to _____________.
Ice cream sales and drownings are correlated but do ________ cause one another.
Correlation ___ Causation
Sometimes, correlations…
1. Have an ___________________________ (3rd variable)
2. Cause one another, but not in assumed _________________
3. Are completely ______________, with no apparent cause
Completely Random Correlations, courtesy of Tyler Vigen: https://www.tylervigen.com/spurious-correlations
80
3.1 PRACTICE!
1. The table below shows data for 13 students in a statistics class. Each member of the class ran a 40-yard sprint and
then did a long jump (with a running start).
Sprint Time (s)
5.41 5.05 9.49 8.09 7.01 7.17 6.83 6.73 8.01 5.68 5.78 6.31 6.04
Long Jump Distance (in) 171 184 48
151 90
65
94
78
71
130 173 143 141
A. Create and label a scatterplot of the data:
B. Describe and interpret the scatterplot above.
C. What is the correlation coefficient? What does it mean?
81
2. A student wonders if tall women tend to date taller men than do short women. She measures herself, her dormitory
roommate, and the women in the adjoining rooms. Then she measures the next man each woman dates. Here are the
data (heights in inches):
A. How would r change if all the men were 6 inches shorter than the heights given in the table? Does the correlation tell
us if women tend to date men taller than themselves?
B. If heights were measured in centimeters rather than inches, how would the correlation change? (There are 2.54
centimeters in an inch.)
3. Consider each of the following relationships:
A. the heights of fathers and the heights of their adult sons
B. the heights of husbands and the heights of their wives
C. the heights of women at age 4 and their heights at age 18.
Rank the correlations between these pairs of variables from highest to lowest. Explain your reasoning.
82
Lesson 3.1: Day 1: How many rubber bands does Barbie need?
How many rubber bands should we attach to Barbie so that she has the absolute most
fun without smashing her head if she were to jump from the top of the bleachers in the
gym? Here’s the catch: You many only use 7 rubber bands to figure this out.
Complete the table:
# Rubber
0
bands
Distance
traveled
1
2
3
4
5
6
Use your group’s data to complete the following:
1. Identify the explanatory and response variables.
2. How many variables do we have? Are they categorical or quantitative?
3. Make a scatterplot.
4. Describe the relationship displayed in the scatterplot.
83
7
Lesson 3.1 – Displaying Relationships: Scatterplots
Big Ideas:
Check Your Understanding:
1. Is there a relationship between the amount of sugar (in grams) and the number of
calories in movie-theater candy? Here are the data from a sample of 12 types of
candy.
a. Identify the explanatory and response variables. Explain your reasoning.
b. Make a scatterplot to display the relationship between amount of sugar and the
number of calories in movie-theater candy.
c. Describe the relationship shown in the scatterplot.
84
AP Statistics Handout: Lesson 3.2
Topics: least squares regression line, slope, y-intercept, predictions using LSRL, extrapolation
Lesson 3.2 Guided Notes
Least squares regression line (LSRL)
Closing the school achievement gap with attendance: Low-income students tend to have lower
attendance rates and lower math test scores than their middle/upper income peers. Would raising their
attendance close the achievement gap? To explore this possibility, a random sample was collected of
students in Texas. For each student, data was collected on their attendance rate (percent of school days
attended) and raw test scores on the Algebra 1 state exam.
STAAR
Percent
Algebra 1
Attendance
Raw Score
(x)
(y)
95
89
67
98
99
76
92
91
76
85
82
45
42
31
51
49
38
46
41
35
39
37
Variables:
Explanatory:
________________
Response:
________________
Scatterplot:
How do we find
the exact equation
of the LSRL?
85
How can we use the information provided by the
squared residuals to determine which model (A or B)
better fits the data?
A
B
Least Squares Regression Line (LSRL): a linear model that minimizes the sum of the _________________
between the data and the model.
• Also called “line of best fit”
The LSRL model for our
data is the line that
minimizes the sum of the
residuals squared.
Slope and y-intercept
y-value
______________ y-value
Algebra 1: Linear Equation
Stats: Linear Regression
ð‘Ķ = 𝑚ð‘Ĩ + 𝑏
ð‘Ķā·œ = 𝑎 + 𝑏ð‘Ĩ
ð‘Ķā·œ = −7.69 + 0.57ð‘Ĩ
_______
Linear Regression
ð‘Ķā·œ = 𝑎 + 𝑏ð‘Ĩ
_____________
ð‘Ķā·œ = −7.69 + 0.57ð‘Ĩ
ð‘Ķā·œ: Predicted test score
ð‘Ĩ: Percent attendance
86
ð‘Ķā·œ = −7.69 + 0.57ð‘Ĩ
1) Interpret the slope value:
Stem: For every 1 unit increase in explanatory variable, our model predicts an average
increase/decrease of slope in response variable.
Your answer:
2) Interpret the y-intercept:
Stem: When the explanatory variable is zero units, our model predicts that the response variable would
be y-intercept.
Your answer:
3) Is the y-intercept meaningful in this context? Explain…
Predictions using the LSRL
1) The superintendent of the district asks: “If a student meets our minimum attendance goal (87%),
what would their predicted test score be?” Answer his question and show your work (including drawing
your process on the scatterplot):
ð‘Ķā·œ = −7.69 + 0.57ð‘Ĩ
87
Lesson 3.2 Discussion
Dangers of prediction
In the past several years, superintendents have piloted large-scale (and sometimes quite expensive)
initiatives to improve student attendance. These included:
• Call programs for chronically absent students
• Hiring attendance case managers and coordinators
• Using Uber/Lyft for students with transportation issues
The result:
Attendance
Test Scores
Discussion Question: Why didn’t test scores grow when attendance rose?
Attendance research: Pyne, Grodsky, et al., (2018). What Happens When Children Miss School? Unpacking
Elementary School Absences in MMSD. Madison, WI: Madison Education Partnership.
88
Lesson 3.2 Practice
1) Dr. Youfa Wang at University of North Carolina published a study* on obesity in America. Using linear
regression, the study concluded that by 2048, if trends continue, 100% of Americans would be
overweight. Using the graphs** below, do you believe this conclusion is correct? Why or why not?
Example inspired by Ellenberg, J. How Not to be Wrong, pg. 50-61.
Data and Model
Model Projection
*Wang, Beydoun, et al., “Will all Americans become overweight or obese? estimating the progression and cost of the US obesity epidemic.”
Obesity (Silver Spring). 2008;16(10):2323‐2330. doi:10.1038/oby.2008.351
**Graphs provided are representative approximations of analyses from the paper
89
3.2 PRACTICE!
1. The following data shows the number of miles driven and advertised price for 11 used Honda CR-Vs from the 20052009 model years (prices found at www.carmax.com). The scatterplot below shows a strong, negative linear association
between number of miles and advertised cost. The line on the plot is the regression line for predicting advertised price
based on number of miles.
Thousand
Miles Driven
Cost
(dollars)
22
29
35
39
45
49
55
56
69
70
86
17998
16450
14998
13998
14599
14988
13599
14599
11998
14450
10998
A. Calculate the correlation. What does this value mean in plain English? What is the relative strength of the association?
B. What is the least squares regression equation for this association? Define any variables used.
C. Determine the y-intercept of the regression equation and interpret the value in context. Does the value have any realworld implications?
D. Determine the slope of the regression equation and interpret the value in context. Does the value have any realworld implications?
90
2. For a project, two AP Statistics students decided to investigate the effect of sugar on the life of cut flowers. They
went to the local grocery store and randomly selected 12 carnations. All the carnations seemed equally healthy when
they were selected. When they got home, the students prepared 12 identical vases with exactly the same amount of
water in each vase. They put 1 tablespoon of sugar in three vases, 2 tablespoons of sugar in three vases, and 3
tablespoons of sugar in three vases. In the remaining 3 vases, they put no sugar. After the vases were prepared and
placed in the same location, the students randomly assigned one flower to each vase and observed how many hours
each flower continued to look fresh. A scatterplot, residual plot, and computer output from the regression are shown.
Only 10 points appear on the scatterplot and residual plot since there were two observations at (1, 204) and two
observations at (2, 210).
Predictor
Constant
Sugar
Coef
181.200
15.200
S = 7.52596
SE Coef
3.635
1.943
R-Sq = 86.0%
T
49.84
7.82
P
0.000
0.000
R-Sq(adj) = 84.5%
A. What is the equation of the least-squares line? Be sure to define any variables you use.
B. Is a line an appropriate model for these data? Justify your answer. (You need at least two sentences.)
C. Interpret the value of “s” in the context of this problem.
D. Interpret the value of r2 in the context of this problem.
91
How safe is Barbie?
How can we be sure that the bungee cord we make Barbie will keep her safe?
Should you be worried if you used the wrong units, chose the wrong axes, or
measured wrong?
Below is the data for one group’s Barbie bungee.
Number of rubber bands
0
1
2
3
4
5
6
7
Distance traveled (in)
12
14
17
18
22
23
26
30
1. Go to stapplet.com and
create a scatterplot. Sketch
it to the right. (Leave extra
room on the right.)
2. Find the correlation. r = ______
3. How safe do you feel Barbie’s bungee
jump would be if we use this data? Use
the correlation to justify.
4. One of the group members snuck some extra rubber bands to collect more data.
Add the point (15 rubber bands, 49 in) to your scatterplot. How do you think this
outlier will affect the correlation? Verify in the applet. What is the new r?
5. One group member accidently left off a digit. Add the point (6 rubber bands, 6 in)
to the scatterplot. How do you think this outlier will affect the correlation? Verify
in the applet. What is the new r?
Unfortunately, the group had measured the lowest point of Barbie’s head in inches
instead of centimeters. To fix this they multiplied the inches by 2.54 (1 in. = 2.54 cm).
The new data is below.
Number of rubber bands
0
1
2
3
4
5
6
7
Distance traveled (cm)
30.48
35.56
43.18
45.72
55.88
58.42
66.04
76.20
6. How do you think these changes will affect the correlation? Verify by calculating
the correlation in the applet.
92
Displaying Relationships: Correlation
Important Ideas:
Check Your Understanding:
Fueleconomy.gov gives the city and highway fuel economy for all makes and models of
vehicles back to 1984. The scatterplot displays the city and highway fuel economy (mpg) for a
random sample of ten 2021 vehicles.
City fuel economy
(mpg)
Highway fuel
economy (mpg)
14.4
24.3
27.2
29.9
20.4
28.8
20.9
23.2
28.6
25.4
25.5
37.4
36.5
45.5
28.7
46.1
33.6
38.3
41.3
35.3
a. The correlation between city fuel economy and highway fuel economy for these 10 vehicles
is r = 0.917. Interpret this value.
b. If fuel economy was measured in feet per
gallon, rather than miles per gallon how would
the value of the correlation be affected?
Explain.
c. The Rolls-Royce Ghost EWB gets 14.4 city mpg and 25.5 highway mpg. What affect does
this point have on the correlation? Explain.
93
AP Statistics Handout: Lesson 3.3
Topics: residuals, residual plots, s, r2, outliers
Lesson 3.3 Guided Notes
Residuals
Residuals: The error (vertical distance) between a linear model’s _______________ and the __________
data point.
Residual = observed y – predicted y
Ė‚
Residual = 𝒚 − 𝒚
To explore the relationship between attendance and test scores, a random sample of students’
attendance at school and their math test scores was collected. A linear regression was performed,
resulting in the linear equation: ð‘ĶĖ‚ = −7.69 + 0.57ð‘Ĩ, where ð‘ĶĖ‚ = predicted raw score and x = percent of
school days attended.
a) Prediction: A new student comes to the
school. If his attendance rate is 80%, what
is his predicted test score?
ð‘ĶĖ‚ = −7.69 + 0.57ð‘Ĩ
b) Error: The student gets 44 questions
correct on his exam. Find and interpret our
model’s prediction error (residual) for this
student.
Stems for interpreting residuals:
• ð‘Ķ − ð‘ĶĖ‚ > 0 (positive residual): The actual response variable is greater than predicted by residual
units.
• ð‘Ķ − ð‘ĶĖ‚ < 0 (negative residual): The actual response variable is less than predicted by residual units.
• Example: The actual pulse rate is greater than predicted by 3.65 beats per minute.
94
Residual Plots
Residual Plots – What a “Good Fit” Looks Like…
Residuals ______________ scattered around 0, without
a pattern → linear model is a good fit for this data!
Residual Plots – What a Non-Linear Fit Looks Like…
Residuals show a _____________
pattern → linear model is ______
a good fit for this data!
Residual Plots – Heteroskedastic
More _____________ in residuals
as x values increase → linear model
is ______ a good fit for this data!
95
Standard Deviation of the Residuals (s)
Standard deviation: Typical distance between data points and their mean
Standard deviation of the residuals (s): ______________ error between data points and their LSRL
(typical _______________)
s: The “typical” residual length
s:
Stronger correlation
• _________________
s:
Weaker correlation
• _________________
1) The standard deviation of the residuals for the LSRL between attendance and test scores is s = 1.99.
Interpret this value.
Stem for interpreting s:
• When using the LSRL with explanatory variable to predict response variable, we will typically be off
by about value of s with units of the response variable (y).
96
The Data – Mass Shootings
Which variable most strongly predicted the annual frequency of U.S. mass shootings? Which model had
the smallest residuals?
Year
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
U.S.
Public
Mass
Shootings
1
1
1
2
3
5
1
1
0
1
1
2
3
4
3
4
1
3
7
5
4
7
6
11
12
10
Rifles
manufactured
in U.S.
1,316,607
1,411,120
1,424,315
1,251,341
1,535,690
1,569,685
1,583,042
1,284,554
1,515,286
1,430,324
1,325,138
1,431,372
1,496,505
1,610,923
1,734,536
2,248,851
1,830,556
2,318,088
3,168,206
3,979,570
3,379,549
3,691,799
4,239,335
2,504,092
U.S.
residents
w/ mental
health
disorders
38,705,887
39,220,805
39,949,663
40,703,543
41,470,048
42,210,935
42,979,125
43,446,520
43,873,630
44,268,083
44,676,753
45,100,679
45,380,842
45,643,428
45,887,405
46,131,343
46,360,267
46,664,296
46,955,985
47,208,307
47,445,735
47,775,445
48,106,294
Revenue
from U.S.
Video Game
Sales (in
billions of
2020 dollars)
14.705
13.272
11.41
12
13.973
16.94
16.39
17.255
17.732
16.541
14.96
15.065
17.145
23.436
26.18
23.64
21.948
18.924
19.152
19.14
24.192
25.38
26.215
30.555
36.516
37.9
Notes about the data:
•
Public mass shootings are defined as having 4+ victims,
indiscriminate targeting. Excludes shootings in private
residences or stemming from other crimes such as robbery.
•
Mass shooting data from Washington Post and Mother Jones: https://www.washingtonpost.com/graphics/2018/national/mass-shootings-in-america/
•
Rifle data from Bureau of Alcohol, Tobacco, Firearms and Explosives 2019 Statistical Update: https://www.atf.gov/resource-center/data-statistics
•
Mental health data from Institute for Health Metrics and Evaluation via Our World in Data: https://ourworldindata.org/mental-health
•
Video game data from The NPD Group: https://www.npd.com/wps/portal/npd/us/industry-expertise/video-games/
Rifle production as explanatory variable:
s = 1.31
Mental health patients as explanatory variable:
s = 1.69
Compared to mental health, rifle production is the __________________ predictor of mass shooting
frequency
97
The coefficient of determination (r2)
Graphic inspired by mathisfun.com
𝑟=1
𝑟 = 0.91
𝑟 = 0.48
𝑟2 = 1
𝑟 2 = _____
𝑟 2 = _____
𝑟=0
𝑟 = −0.48
𝑟 2 = _____
Squaring the values:
1) Gets rid of _____________
2) Emphasizes differences in ______________
𝑟 2 = _____
𝑟 = −1
𝑟 = −0.91
𝑟 2 = _____
𝑟 2 = _____
_____ ≤ 𝑟 2 ≤ _____
𝑟 2 close to 0 → __________ correlation
𝑟 2 close to 1 → ______________ correlation
Stem for interpreting r2 :
𝑟 2 % of the variation in response variable can be explained by the linear relationship with explanatory
variable
100%. The linear model _____________
explains the data’s pattern.
𝑟 2 = 1.00 = 100%
72%. The linear model explains _________
of the data’s pattern, but not all of it.
𝑟 2 = 0.72 = 72%
1) The linear relationship between U.S. rifle
production and the annual frequency of mass
shootings has an 𝑟 2 value of 0.62. Interpret
this value.
98
r2 = 0.36
r2 = 0.62
r2 = 0.69
Video game sales were the _______________
_______________ of the annual frequency of
public mass shootings in the United States.
The relationship between s and r2
Strong:
• Low s
• High r2
Weak:
• High s
• Low r2
The effect of outliers
Which of the following models have an outlier? Explain your reasoning.
A
C
N
B
N
99
2017 was an outlier: A data value
_______________ far away from the
_____ (unusually large residual)
Without Outlier
With Outlier
r = 0.79
r2 = 0.62
s = 1.31
r = _____
r2 = _____
s = _____
r, r2, and s are _____ resistant to outliers. Outliers ___________ the strength of a relationship, making…
o r ____________ (magnitude)
o r2 ___________
o s ___________
Lesson 3.3 Discussion
Video game sales were the strongest predictor of the annual frequency of public mass shootings in the
United States. Almost 70% of the variation in yearly public mass shootings can be explained by the linear
relationship with video game sales (r2 = 0.69).
Discussion Question: Would banning violent
video games prevent more mass shootings
than banning assault weapons or improving
mental health services? Why or why not?
100
Remind you of anything?
101
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
II. More Practice with Linear Regression and Residual Plots
Fast food is often considered unhealthy because much fast food is high in fat and calories. The fat and calorie content for
a sample of 5 fast-food burgers is provided below.
Fat(g)
Calories
31
580
35
590
39
640
39
680
43
660
a) Identify the explanatory and the response variables:
b) Use the calculator to make a scatter plot of these ordered pairs. Sketch the scatter plot here.
c) What information does the scatter plot provide? That is, use the scatter plot to describe the
relationship between the fat grams and calories in a fast food burger.
d) Find the following summary statistics for this data:
x, y, sx , s y
e) Now use your calculator to record the following statistics and to find the equation of the least squares
line. Record the equation and use it for the remaining computations.
a, b, r 2 , r,
102
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
f)
Examine a graph of the least squares line superimposed on your scatter plot.
Stat > calc > 8:linreg(a+bx) >L1, L2, Y
To get the Y to show up: Vars > Y Vars> 1:Function > 1: Y1
This will graph the LSRL along with your scatterplot. If you go to the Y= screen, you will now see
the equation for the LSRL
g) Does the line appear to be good model for the data?
h) What is the value of your slope? What information does it provide? Be specific.
i)
How many calories would you predict a burger with 20 fat grams has?
j)
Calculate the residual for 35 fat grams.
k) Calculate the value of r 2 . What information does it provide? Be specific.
l)
What is the value of r ? What does it tell you in this situation?
m) Make a residual plot on your calculator. Be sure to label both axes with words and a “friendly” scale.
n) Based on this residual plot, do you think the least squares line is a good model for this data
103
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
Becky’s parents have kept records of her height since she was born. The data set consists of Becky’s age in
months and her height in centimeters. The summary statistics for the data are provided below:
std. dev. age: 8.5 months
Mean age: 44 months
Mean ht: 82 cm
std. dev. ht: 4.1 cm
The correlation between age and height is .86.
(a) Find the equation of the least squares line that you would use to predict Becky’s height from her age. Show
all work.
(b) What real-world information does the slope provide? Be specific!
(c) Suppose height had been measured in inches rather than in centimeters. What would be the
correlation between age and height in inches? Note: 1 inch = 2.54 cm
104
How many iPhones will be sold?
Here is the data of all iPhone sales during their opening weekends:
iPhone
Year
Original
3G
3Gs
4
4S
5
5C, 5S
6, 6 Plus
6S, 6S Plus
2007
2008
2009
2010
2011
2012
2013
2014
2015
Units Sold
(millions)
0.5
1
1
1.7
4
5
9
10
13
1. Use stapplet.com to create a scatterplot of the data with year as the explanatory
variable and units sold as the response. Sketch the scatterplot in the space above.
2. Describe the form of the distribution.
3. Use the applet to find the least squares regression line. Write the equation below and
graph it on your scatterplot above.
4. Use the least squares regression line to calculate the residual for 2007. Interpret the
residual.
5. Complete the table below.
Year
2007
2008
2009
2010
2011
2012
2013
2014
2015
Actual
Units Sold
(millions)
0.5
1
1
1.7
4
5
9
10
13
Predicted
Units Sold
(millions)
Residual
6. Graph the residuals on the axes
below. This is called a residual plot.
7. For which points was the actual greater than the predicted? Which were less than
predicted? Identify these on the graph.
8. Do you think the regression line is a good fit for the data? Why or why not? Explain using
the residual plot.
105
LSRL and Residual Plots
Important Ideas:
Check Your Understanding:
Fueleconomy.gov gives the city and highway fuel economy for all makes and models of vehicles
back to 1984. The table gives the city and highway fuel economy (mpg) for a random sample of
ten 2021 vehicles.
City fuel economy
(mpg)
Highway fuel
economy (mpg)
14.4
24.3
27.2
29.9
20.4
28.8
20.9
23.2
28.6
25.4
25.5
37.4
36.5
45.5
28.7
46.1
33.6
38.3
41.3
35.3
a. Calculate the equation of the least-squares regression line.
b. Make a residual plot for the linear model in Question 1.
c. What does the residual plot indicate about the appropriateness of the linear model? Explain
your answer.
106
AP Statistics Handout: Lesson 3.4
Topics: leverage and influential points, regression from calculators and computers
Lesson 3.4 Guided Notes
Leverage and Influential Points
The LSRL goes through the point: __________
Ė… = $𝟔𝟗, 𝟏𝟗𝟖
𝒙
Ė… = 𝟓𝟐 items
𝒚
Dataset compiled by student Linda Saucedo, Fall 2019
Removing low leverage points
Removing high leverage points
Low leverage points are close to ________
Low leverage points ___________ affect LSRL much
High leverage points are far from ________
High leverage points have a _________ effect
on an LSRL’s slope.
Influential points: points that, if removed, change the slope, y-intercept, or correlation substantially.
• Three types: Outliers (change ____________), High-Leverage (change _______________), Both
High-Leverage
Outlier
107
Both
Using your Calculator for Regression
1. Turn ON Stat Diagnostics
(MODE)
2. Input data into L1, L2
(STAT → Edit…)
3. Find LinReg
(STAT → Calc → 8:LinReg)
4. Click “Calculate”
Reading Computer Regression Tables
Identify the equation of the LSRL and the important elements from this computer regression output:
Predictor
Coef
SE Coef
T
P
Constant
-14.7
9.30
-1.58
0.122
Income
0.001
0.0001
7.51
0.000
S = 17.46
R-Sq = 61.7%
R-Sq (adj) = 60.6%
108
ð‘ĶĖ‚ = 𝑎 + 𝑏ð‘Ĩ
Prediction as Discrimination?
1) The grocery store chain (H-E-B) wants to open two new locations in San Antonio - one in a lower
income zip code (mean income: $35k) and one in a higher income zip code (mean income: $115,000).
Based on our LSRL, how many more organic items are predicted to be offered at the location in the
higher income neighborhood?
Lesson 3.4 Discussion
Discussion Question: An investigative journalist finds that a different supermarket (not H-E-B) actually
uses a model to estimate how many organic items it should put on shelves purely based on
neighborhood income. The model calls for fewer organic items in low-income areas.
• Name one reason this practice may be unethical.
• If you were the company’s CEO, how would you justify using such a model?
• In your view, is this an unethical use of data? Why or why not?
109
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
I. Minitab / Computer Printouts
Below is a computer output. You will be expected to use and interpret computer output on the
AP Exam. This output is from Minitab, however most computer output looks very similar. We will
discuss which numbers you need to know, what they mean, and how to interpret them.
SE Coef, = 0.3839 represents the standard deviation of the slope
S = 13.40 represents standard deviation of residuals
Constant ----- -87.12
This is the y-intercept. This is the value of the response variable when the explanatory variable is 0.
Check the context of the situation. Often, there can be no such value.
In this case, it is not possible to have a volume that is negative nor is it possible to have a height of zero.
Height/Slope --- 1.5433
This is the coefficient of the explanatory variable, thus it is the slope.
This entire line of numbers deals with regression for slope.
For each increase in height of one unit, the volume is expected to increase by approximately 1.5433
units. (Actual units were not provided)
Prediction equation----- y
87.1 1.54x (ie. Least Squares Regression Line)
This is an equation used to make predictions and is based on only one sample.
SE Coef --------- 0.3839
This is the standard deviation of the slope. Remember, this data came from only one sample. We would
expect the slope to vary a little from sample to sample. Thus,
If we gathered repeated samples, we would expect the slope of the volumes of the trees to vary
by approximately 0.3839 units.
S --------- 13.40
This is the standard deviation of the residuals. The average amount that the observed values differ from
the predicted values is 13.40. The average amount that the observed volumes of trees differ from the
predicted volumes is approximately 13.40 units.
110
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
r 2 ------------- 35.8%
This is the correlation of determination, which is the fraction or proportion of variation in the y values
that is explained by the least squares regression of y on x.
About 35.8% in the variation in volume can be explained by the least squares regression of y( volume)
on x ( height).
r
r2
.358 .598
This is the correlation coefficient. It tells you strength and direction of the relationship.
With an r value of .598, there is a weak, positive relationship between height and volume of trees.
T ------------ 4.02
This is the test statistic which = test statistic
statistic parameter
std .dev of statistic
b
SE
P--------- 0
This is the p-value of a Linear Regression t test.
With a p-value of approx.. 0 less than any alpha level (.05, .01), reject the null. There is evidence that
there is a relationship between the volume of a tree and its height.
Example 2: Minitab / Computer Printouts
Regression Analysis: Height versus Mother Height
The regression equation is
Height = 24.7 + 0.640 Mother Height
(dependent
variable, y)
(intercept,
b0 )
(slope,
b 1)
(estimates)
Predictor
Constant
Mother H
Coef
24.690
0.6405
the estimated regression
equation: yĖ‚ = b0 + b1x
(independent
variable, x)
(sd of ests.)
(test
statistics)
SE Coef
8.978
0.1394
T
2.75
4.59
intercept, b0
slope, b1
S = 2.973
(standard error
of estimate, se)
R-Sq = 35.7%
(coefficient of linear
determination, r2)
(p-values)
P
0.009
0.000
IGNORE these values
tests H0: β1 = 0, vs. two-tailed altern.
(the latter is equivalent to testing for
linear correlation between x and y)
R-Sq(adj) = 34.0%
(adjusted r 2, used for multiple regression)
* (the coefficient of linear correlation is the square root
of r2, with the same sign as the slope, b1)
for a simple linear regression
minitab models only conduct
two-tailed alternatives
S – 2.973 standard error of the estimate, or the standard deviation of the residuals - actual values
versus the predicted values
SE Coef – 0.1394 – standard deviation of the slope
111
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
Practice Minitab / Computer Printouts
1. A sample of men agreed to participate in a study to determine the relationship between several variables
including height, weight, waste size, and percent body fat. A scatterplot with percent body fat on the y-axis and
waist size (in inches) on the horizontal axis revealed a positive linear association between these variables.
Computer output for the regression analysis is given below:
Dependent variable is: %BF
R-squared = 67.8%
S = 4.713 with 250-2 = 248 degrees of freedom
Variable
Constant
Waist
Coefficient
-42.734
1.70
se of coeff
2.717
0.0743
t-ratio
-15.7
22.9
prob
<.0001
<.0001
(a) Write the equation of the regression line (be sure to use correct notation and define your variables):
(b) Explain/interpret the information provided by R-squared in the context of this problem. Be specific.
(c) Calculate and interpret the correlation coefficient (r).
(d) One of the men who participated in the study had waist size 35 inches and 10% body fat. Calculate the
residual associated with the point for this individual.
112
AP Statistics – AP Exam Review
Linear Regression & Computer Output: Interpreting Important Variables
Determine the LSRL, Standard deviation for slope, correlation coefficient and the standard error of
the residuals for each:
#2.
#3.
113
How good are the predictions for Barbie?
Here is the data from one of the groups. The group forgot to record their measurement
for 5 rubber bands.
Number of rubber bands
0
1
2
3
4
5
6
7
Distance traveled (cm)
25
32
41
49
55
?
69
78
1. Go to stapplet.com to make a scatterplot. Then click “Calculate least-squares
regression line”. This is the line that best models the data. Write the equation below.
2. Use the regression line to predict the distance Barbie travels for 5 rubber bands.
Show work.
3. One of the group members later found the measurement for 5 rubber bands was 64
cm. Was the prediction from #2 too high or too low? How far off?
4. Predict the distance that Barbie would travel if the group used 20 rubber bands.
Would you trust this prediction more or less than the prediction you made in #2?
5. What is the y-intercept of the equation of the regression line? What does it mean?
6. What is the slope of the equation of the regression line? What does it mean?
114
Prediction, Residuals, Interpreting a Regression Line
Important Ideas:
Check Your Understanding:
Michael is a runner. He uses his Apple watch to keep track of his distance and the number of
calories he burns for 20 runs. A scatterplot of y = calories burned and x = distance (in miles)
shows a fairly strong, positive linear relationship. The regression equation ð‘Ķ" = 20 + 160ð‘Ĩ
models the data fairly well.
a. Interpret the slope of the regression line.
b. Does the value of the y intercept have meaning in this context? If so, interpret the y
intercept. If not, explain why.
c. Predict the number of calories Michael burns if he runs 5 miles.
d. Calculate and interpret the residual if his Apple watch said that he burned 910 calories
on a 5-mile run.
e. Michael is thinking about signing up for his first marathon. So far, his longest run has
only been 10 miles. Should he use the regression equation to predict how many
calories he would burn if he runs a marathon (26.2 miles)? Explain.
115
How many iPhones will be sold?
Here is the data of all iPhone sales during their opening weekends:
iPhone
Original
3G
3Gs
4
4S
5
5C, 5S
6, 6 Plus
6S, 6S Plus
Year
(after
2000)
7
8
9
10
11
12
13
14
15
Units Sold
(millions)
0.5
1
1
1.7
4
5
9
10
13
1. Use stapplet.com to create a scatterplot of the data with year as the explanatory
variable and units sold as the response. Sketch the scatterplot in the space above.
2. Would you use a linear regression to model the data? Sketch the residual plot below to
support your explanation.
3. Since we expect that the data is nonlinear, we cannot make a linear regression.
However, we can transform the data to make it more linear. First we need to decide
what type of function we think the data would best fit so that we can transform it.
a. What type of model do you think best fits the data?
b. What is the general form of this model?
c. Algebraically, what is the inverse of that function?
d. How can we transform our data using this inverse?
116
4. Complete the table below.
Year
7
8
9
10
11
12
13
14
15
Actual
Units Sold
(millions)
0.5
1
1
1.7
4
5
9
10
13
log(Units
Sold
(millions)
5. Use stapplet.com to create a scatterplot of the data with year as the explanatory
variable and log(units sold) as the response. Sketch the scatterplot in the space above.
6. Calculate the LRSL for the transformed data and write it below.
7. Do you think the regression line is a good fit for the transformed data? Why or why not?
Explain using the residual plot and sketch it below.
8. According to this model, how many iPhones should be sold in 2015?
9. Calculate and interpret the residual for the actual number of iPhones sold in 2015.
117
Transforming Nonlinear Data
Important Ideas:
Check Your Understanding:
A party company specializes in creating balloon boxes, which are
boxes that are completely filled with small balloons. The number of
balloons needed to create a balloon box is a function of the box
height. Here is a scatterplot showing the number of balloons needed
to fill boxes of various heights. Note the clear curved form.
Because height is one-dimensional and number of balloons to fill the
box (volume) is three-dimensional, a power model of the form
number of balloons = a(height)3, should describe the relationship.
Here is a scatterplot of number of balloons versus height3. Because
the transformation made the association roughly linear, we used
computer software to perform a linear regression analysis of y =
number of balloons versus x = height3.
Regression Analysis
Predictor
Coef
SE Coef
Constant
45.008
29.87
Height^3
10.809
0.197
S = 76.938
R-Sq = 99.6%
T
P
1.507
0.15
54.818
0.000
R-Sq(adj) = 99.5%
a. Give the equation of the least-squares regression line. Define any variables you use.
b. Suppose you want to fill a box with a height of 8 feet. Use the model from part (a) to predict
the number of balloons that would be needed.
118
Can You Guess My IQ?
As part of a new transcript at our school, the counselors have decided to include an IQ
score in addition to GPA. Will knowing the GPA help to predict IQ?
Five students requested that the counselors update their transcripts for them: Adam, Bernard,
Christie, Deja, and Eldin. Their IQ scores are 110, 85, 120, 95, 105 but they have been all
mixed up and the counselors don’t know which IQ score goes with which student.
The guidance counselors are forced to predict the IQ for each student. Each counselor takes
a different approach.
Counselor #1: The New Guy
The New Guy is so nervous about being wrong, so he wants to play it safe with his
predictions and minimize his error. He decides to find the average IQ and use it as his
prediction for all five of the students:
Student
Predicted IQ
Adam
Bernard
Christie
Deja
Eldin
Counselor #2: The Veteran
The Veteran noticed an equation written on the board in the AP Statistics room:
IQ = 16•GPA +57.3 . She realized that GPA can help her to make better predictions. She
looks up the GPA of each student:
Adam
Bernard
Christie
Deja
Eldin
GPA = 1.8
GPA = 2.4
GPA = 2.9
GPA = 3.4
GPA = 3.8
Then she used the line of best fit to make her predictions. IQ = 16•GPA +57.3
Student
GPA
Predicted IQ
Adam
1.8
Bernard
2.4
Christie
2.9
Deja
3.4
Eldin
3.8
Counselor #3: The Truth Seeker
Guidance counselor #3 pulled the five students out of class and found the truth.
Student
GPA
Actual IQ
Adam
1.8
85
Bernard
2.4
95
Christie
2.9
110
Deja
3.4
105
119
Eldin
3.8
120
Who made the better predictions?
Now let’s see which counselor made better predictions:
Counselor #1: The New Guy (used the mean IQ for every prediction)
Student
Adam
Bernard
Christie
Deja
85
95
110
105
Actual IQ
Predicted IQ
Error (Actual - Predicted)
Squared error
Eldin
120
Sum of the squared errors:
Counselor #2: The Veteran (used the line of best fit for every prediction)
Student
Adam
Bernard
Christie
Deja
Eldin
GPA
1.8
2.4
2.9
3.4
3.8
Actual IQ
85
95
110
105
120
Predicted IQ
Error (Actual - Predicted)
Squared error
Sum of the squared errors:
Who did better? Why?
Counselor #1 sum of squared errors: ______ Counselor #2 sum of squared errors: _______
Calculate the percentage improvement of the sum of squared errors from Guidance
Counselor #1 to Guidance Counselor #2.
We also could find each counselor’s typical error. Use algebra to turn the sum of the squared
errors into the “average” error for Guidance Counselor #2.
Find the correlation (r), the coefficient of determination (r2) and the standard deviation of the
residuals (s) for the data using the calculator.
GPA
1.8
2.4
2.9
3.4
3.8
Actual IQ
85
95
110
105
120
r=
r2 =
s=
120
s and r2
Important Ideas:
Check Your Understanding:
Fueleconomy.gov gives the city and highway fuel economy for all makes and models of vehicles
back to 1984. The table gives the city and highway fuel economy (mpg) for a random sample of ten
2021 vehicles.
City fuel economy (mpg)
Highway fuel economy
(mpg)
14.4 24.3 27.2 29.9 20.4 28.8 20.9 23.2 28.6 25.4
25.5 37.4 36.5 45.5 28.7 46.1 33.6 38.3 41.3 35.3
Analyze the data using the two-quantitative variable option on stapplet.com.
a. Find the equation of the LSRL. Write it below.
b. Find and interpret s.
c. Find and interpret the value of 𝑟 ! .
121
How do outliers affect the LSRL?
1. Use the Correlation and Regression applet at www.tinyurl.com/regressionapplet
• Click on the graphing area to add 10 points in the lower-left corner so that the
correlation is about r = 0.50.
• Check the boxes to show the LSRL and the mean X and Y lines.
• Sketch it below.
2. For each of the following situations add the point to the scatterplot and decide if the slope,
y-intercept and correlation will increase or decrease.
a. If a point is added on the far right side of the graph on the horizontal line for the
mean of Y.
Slope:
y-intercept:
Correlation:
b. If a point is added on the far left side of the graph on the horizontal line for the mean
of Y.
Slope:
y-intercept:
Correlation:
c. If a point is added below the LSRL on the vertical line for the mean of X.
Slope:
y-intercept:
Correlation:
d. If a point is added above the LSRL on the vertical line for the mean of X.
Slope:
y-intercept:
Correlation:
3. Which outliers had the greatest impact on the LSRL, vertical or horizontal outliers?
122
Outliers and the LSRL
Important Ideas:
Check Your Understanding:
You’ve probably heard the saying “Practice makes perfect!”, but does practice also help you
complete a task faster? A study was conducted to find out. A random sample of 15 high school
students were taught how to solve a Rubik’s cube. Then they were each randomly assigned a
number of times to practice this new skill. After they completed their assigned number of practices
they were timed solving the Rubik’s cube. Here is a scatterplot of the results along with the leastsquares regression line.
a. Describe the influence the student who was assigned
to practice following the steps to solve a Rubik’s cube
14 times has on the equation of the least-squares
regression line.
b. Describe the influence the student who was assigned to practice following the steps to solve a
Rubik’s cube 14 times has on the standard deviation of the residuals and r2.
c. The mean and standard deviation of the number of practices are ð‘ĨĖ… = 8 practices and sx =
4.47 practices. The mean and standard deviation of time are ð‘Ķ$ = 7.71 minutes and sy =
1.20 minutes. The correlation between number of practices and time to solve the Rubik’s
cube is r = –0.793. Find the equation of the least-squares regression line for predicting time
to solve the Rubik’s cube from the number of practices.
123
How close can you get to the finish line?
The Goal: Get your team’s car to reach the finish line without going over.
The Catch: The distance will not be revealed until later.
Test drive your car for pull-backs of 2, 4, 6, and 8 inches. Measure the distance the car
travels. Repeat this process 3 times. Fill in the table below.
x = Pull-back Distance (in.)
2
2
2
4
4
4
6
6
6
8
y = Distance Traveled (in.)
You will create three different regression models for the data and will decide which is
best. For each of the three models below:
a. Create a scatterplot
b. Calculate an LSRL
c. Analyze linearity using any strategies you choose. Show your work.
Linear: Plot (x, y):
Quadratic: Plot (x2, y):
Exponential: Plot (x, log y)
1. Which regression is the best choice for your data? Why?
2. Calculate how many inches you will have to pull back for a finish line of 80 inches.
124
8
8
Choosing the Best Regression
Important Ideas:
Check Your Understanding:
Gapminder.org is an organization dedicated to studying global health. Here is a scatterplot
showing average annual income (in thousands of U.S. dollars) and life expectancy (years) for a
random sample of 15 countries selected from the Gapminder data base.
The output shows three possible models for predicting life expectancy from income. Model (a) is
based on the original data, while Model (b) and (c) involve transformations of the original data.
Each set of output includes a scatterplot with a least-squares regression line added and a residual
plot. The regression equations are given below.
Model (a)
𝑙ðšĪ𝑓𝑒 𝑒ð‘Ĩð‘ð‘’ð‘ð‘Ąð‘Žð‘›ð‘ð‘Ķ
= 69.9352 + 0.1530(𝑖𝑛𝑐𝑜𝑚𝑒)
Model (b)
Model (c)
ln (𝑙ðšĪ𝑓𝑒 𝑒ð‘Ĩð‘ð‘’ð‘ð‘Ąð‘Žð‘›ð‘ð‘Ķ)
= 4.2436 + 0.0021(𝑖𝑛𝑐𝑜𝑚𝑒)
ln (𝑙ðšĪ𝑓𝑒 𝑒ð‘Ĩð‘ð‘’ð‘ð‘Ąð‘Žð‘›ð‘ð‘Ķ)
= 4.1393 + 0.0603𝑙𝑛(𝑖𝑛𝑐𝑜𝑚𝑒)
1. Use each model to predict the life expectancy of residents of a country for which the average
annual income is $80,000.
2. Which model does the best job summarizing the relationship between income and life
expectancy? Explain your answer.
125
Chapter 3 Review Sheet
In my meeting yesterday, I noticed that the number of people paying attention to
the training was declining throughout the day. I collected the following data:
Time (hours)
# people paying attention
0
80
1
68
2
62
3
50
4
48
5
32
6
21
1. Make a scatterplot of the data on your calculator and sketch it below.
2. Describe the data
3. Find the line of best fit using the equations for a and b. Confirm that it
matched with the calculator’s line of best fit.
4. Make a prediction for the # of people paying attention at t = 2.5 hours.
5. What is r? Interpret.
6. What is r2? Interpret.
7. What is the slope of the LSRL? Interpret in the context of the problem.
8. What is the y-intercept of the LSRL? Interpret in the context of the problem.
126
9. Identify the values a and b in the Minitab output:
Predictor
Constant
Time
Coef
79.75
-9.393
SE Coeff
5.34
4.23
S = 3.143
R-Sq = 98.0%
T
3.41
-15.81
P
0
0
R-Sq (adj) = 97.2%
10. Interpret s = 3.143 in the context of the problem.
11. Find the residual for a time of 4 hours.
12. Make a residual plot on your calculator. Sketch it below.
13. Mr. Bakri walked into the meeting at t = 7 hours and all of a sudden 82
people were paying attention. Would this point be influential? Prove it.
127
Barbie Bungee – The Finale
It’s finally time to jump Barbie! At the end of the hour we will be dropping Barbie from
the staircase in the foyer which is 17 ft. (5.2 m). Before we drop her, we will use
everything we’ve learned this chapter to calculate the best possible length of bungee
cord.
Write in your group’s data in the table below.
Number of rubber bands
0
1
2
3
4
5
6
7
Lowest point head reaches (cm)
1. Identify which variable is the explanatory variable and which is the response
variable?
2. Use the Applet to create a scatterplot.
3. Describe your distribution (DUFS).
4. Estimate the r value of your distribution.
5. What would happen to the correlation (r) if you graphed the scatterplot with the
lowest point on the horizontal axis and # rubber bands on the vertical axis?
128
6. Calculate the correlation using SPA applets. Write it below. What is the unit of
the correlation?
7. Use the Applet to find the least squares regression line for your data. Write the
equation below.
8. What is the slope of your LSRL? Interpret the slope.
9. What is the y-intercept of your line? Interpret.
10. Use the LSRL to calculate and interpret the residual for 4 rubber bands.
11. Sketch the residual plot for your LSRL.
12. Find the r2 value and interpret it.
13. Find the standard deviation of the residuals and interpret it.
14. Is the linear regression an appropriate model? Explain.
129
15. Transform the data to y vs. x2 to create a quadratic model. Write the equation of
the LSRL of the transformed variables below.
16. Calculate and interpret the residual using the quadratic regression for 4 rubber
bands.
17. Sketch the residual plot and record the r2 value for the quadratic regression.
r2 = __________
1. Transform the data to log y vs. x to create a exponential model. Write the
equation of the LSRL of the transformed variables below.
2. Sketch the residual plot and record the r2 value for the exponential regression.
r2 = __________
3. Which model is best?
4. Use your model to predict the number of rubber bands Barbie will need in order
to have the most exciting yet safe bungee jump from 17 ft. (518 cm)
130
2017 AP® STATISTICS FREE-RESPONSE QUESTIONS
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 65 minutes on this part of the exam.
Percent of Section II score—75
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. Researchers studying a pack of gray wolves in North America collected data on the length x, in meters, from
nose to tip of tail, and the weight y, in kilograms, of the wolves. A scatterplot of weight versus length revealed
a relationship between the two variables described as positive, linear, and strong.
(a) For the situation described above, explain what is meant by each of the following words.
(i) Positive:
(ii) Linear:
(iii) Strong:
The data collected from the wolves were used to create the least-squares equation yˆ = -16.46 + 35.02 x.
(b) Interpret the meaning of the slope of the least-squares regression line in context.
(c) One wolf in the pack with a length of 1.4 meters had a residual of -9.67 kilograms. What was the weight
of the wolf?
131
STATISTICS
SECTION II
Part B
Question 6
Spend about 25 minutes on this part of the exam.
Percent of Section II score—25
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
6. A newspaper in Germany reported that the more semesters needed to complete an academic program at the
university, the greater the starting salary in the first year of a job. The report was based on a study that used a
random sample of 24 people who had recently completed an academic program. Information was collected on
the number of semesters each person in the sample needed to complete the program and the starting salary, in
thousands of euros, for the first year of a job. The data are shown in the scatterplot below.
(a) Does the scatterplot support the newspaper report about number of semesters and starting salary? Justify
your answer.
The table below shows computer output from a linear regression analysis on the data.
Predictor
Constant
Semesters
S = 7.37702
Coef
34.018
1.1594
SE Coef
4.455
0.3482
R-Sq = 33.5%
T
7.64
3.33
P
0.000
0.003
R-Sq(adj) = 30.5%
(b) Identify the slope of the least-squares regression line, and interpret the slope in context.
132
An independent researcher received the data from the newspaper and conducted a new analysis by separating the
data into three groups based on the major of each person. A revised scatterplot identifying the major of each
person is shown below.
(c) Based on the people in the sample, describe the association between starting salary and number of semesters
for the business majors.
(d) Based on the people in the sample, compare the median starting salaries for the three majors.
(e) Based on the analysis conducted by the independent researcher, how could the newspaper report be modified
to give a better description of the relationship between the number of semesters and the starting salary for
the people in the sample?
133
134
Unit 3
Chapter 4
Collecting Data
135
AP Statistics Handout: Lesson 4.1
Topics: sample vs. population, biased sampling, simple random samples (SRS)
Lesson 4.1 Guided Notes
Sample vs. Population
Census: when you collect data on every individual in the _______________________.
Sample: a _____________ of individuals from a population.
Population
Sample
It’s hard to measure everyone (population), so we take a sample
and make ____________________ about the population.
Graphic courtesy of the National Center for Education Statistics
Biased Sampling
Example: Investigating College Advertisement Statistics
SW Tennessee Community College: The homepage of their website boasts: “Our overall graduation
placement rate is ______, with 91% working in their field of study.” - www.southwest.tn.edu (6/9/2020)
Population
Sample → _________
Among full-time, first-time degree or certificate-seeking students
who entered in 2010/2011, Source: IPEDS (2020)
Bias: a study flaw that leads to _______________________ and/or inaccurate estimates.
Undercoverage: When part of the population has a __________________ of being included in a sample.
-Leads to bias.
-Ex: excluding the students who didn’t graduate.
136
Rogers State University (Oklahoma): In a recent report*, the University found that about 75% of
graduates were pursuing another degree or had found full-time employment by their final semester.
The same report shows that the response rate to the University’s questions/surveys was only 20%.
Nonresponse: When individuals chosen for a sample don’t respond.
-Leads to bias if these individuals ___________ from respondents.
Population
Sample → _______________
Grads without jobs probably
respond _______________.
*Employment and Continuing Education for Graduating Students 2017-2019 AY 3-Year Aggregation
(downloaded 6/9/2020 from https://www.rsu.edu/about/accountability-academics/student-outcomes/)
Question: How could bias in the sampling method have affected the graduate study/employment rate
estimate from Rogers State University?
When writing about sampling bias…
1. Identify the population and the sample
2. Explain how the sampled individuals might __________________ the general population
3. Explain how this leads to an _________________________________.
Model Response:
“Graduates who didn’t find post-grad employment may be ashamed, making them ______________ to
respond to the survey. Therefore, this sampling method may include a lower proportion of unemployed
graduates than in the full population. This produces an _________________ of the true percentage of
_____ graduates who are actually starting full-time work.”
Important: These categories of bias can overlap. On an FRQ, if you’re unsure,
Types of selection bias
________ try to use one of these vocab terms. Instead, just describe the bias,
-Undercoverage bias
how it arises, and whether it leads to an under or overestimate.
-Nonresponse bias
-Voluntary response bias: Occurs when a sample is composed of _______________, who may differ
from individuals who don’t choose to volunteer.
Ex: You want to study heart rate during exercise. You recruit volunteers to run a mile and then
measure their pulse. The few insane people in our society who actually like to run are the ones
who volunteer, so they’re healthier on average than the population → bias
Types of survey bias
-Question wording bias: When survey questions are confusing or leading.
Ex: “Which show do you prefer: Diners, Drive-In, and Dives, hosted by the incredibly talented,
funny, and interminable mayor of Flavortown / chef Guy Fieri or Iron Chef hosted by the boring
Alton Brown?
-Self-reported response bias: When individuals inaccurately report their own traits.
Ex: I report being able to bench-press 350 lbs.
137
Simple Random Samples (SRS)
In order to avoid bias, you must _____________ sample.
Simple Random Sample (SRS): a sampling method in which every possible group of individuals in the
population has an __________________ of being selected.
Example: COVID-19 and Sampling
When COVID-19 spread to NYC, the city only provided tests to people who ____________________.
Some infected people don’t show symptoms. So, the sampling method led to an ________________ of
the number of people infected. It was __________.
Instead: We could have ______________________ the NYC population, tested those who were
sampled, and gotten an __________________ of the number of people infected.
Question: Describe how you would implement a simple random sample (SRS) of 1,000 NYC residents to
test for COVID.
When describing how to perform an SRS:
1. Assign each individual in the population a number 1 – N (population size).
2. Use a random number generator to obtain n (sample size) numbers, skipping repeats.
3. Sample the individuals whose numbers were generated
Model Response:
“Assign every individual in NYC an integer _______ (where N is the population size of NYC). Use a
random number generator to obtain 1,000 integers between 1 – N, __________________________.
Administer the COVID test to the 1,000 individuals whose numbers were selected.”
Lesson 4.1 Discussion
Discussion Question: During World War II, a statistician by the name of Abraham Wald was asked to
help the British air force decide where to put extra armor on their planes. They gave him charts of the
bullet holes in planes that were wounded in fighting but made it back safely to England. An example is
shown below, with each dot representing places hit by bullets.
Using the chart, on what part of a new plane would you
recommend they put extra armor? Choose from the
options below and give a statistical reason for your choice.
Options: A)Nose B)Wings C)Body D)Engine E)Tail
Image courtesy of Professor Joseph Blitzstein (i.e. the best stats prof in the country).
See his “Harvard Thinks Big” talk on this problem: https://youtu.be/dzFf3r1yph8
138
Does Beyoncé write her own lyrics?
1. Quickly circle a random sample of 5 words. Write them below. How many letters in
each word?
2. What is the average word length of your sample? ________.
3. Put your average on the dotplot on the white board at the front of the room. Copy the
class dotplot below.
4. Find a new sample of 5 words using a random number generator. Put your average on
the dotplot on the white board at the front of the room. Copy the class dotplot below.
5. How is the dotplot from #4 different than the dotplot for #3? Which do you think is a
better estimator of the true mean word length?
6. What do you think the true mean word length is for “Crazy in Love”?
7. It is known that Beyonce wrote the lyrics for all of the Destiny’s child songs. The
average word length for these songs is 3.64 letters. Based on your samples, do you
have good evidence that Beyonce did not write the lyrics for “Crazy in Love”. Explain.
139
Sampling Methods
Important ideas:
Check Your Understanding
1. A sign at a local business says, “Rate us with 5-stars on Google and get a $10 gift card!” The
store manager reads the ratings to learn how people in the community feel about the business.
a. What type of sample did the manager obtain?
b. Explain why this sampling method is biased.
c. The manager finds that 92% of customers rate the business with 5 stars. Is 92% likely to be
greater than or less than the percentage of all people in the community who truly believe the
business deserves 5-stars?
2. To help reduce bias, the manager decides to survey the next 10 customers that make a
purchase.
a. What type of sample did the manager obtain?
b. Explain why this sampling method is biased.
3. How could the manager avoid the bias described in Question 2?
140
Crazy in Love
I look and stare so deep in your eyes
I touch on you more and more every time
When you leave I'm begging you not to go
Call your name two or three times in a row
Such a funny thing for me to try to explain
How I'm feeling and my pride is the one to blame
'Cuz I know I don't understand
Just how your love can do what no one else can
Got me looking so crazy right now, your love's
Got me looking so crazy right now (in love)
Got me looking so crazy right now, your touch
Got me looking so crazy right now (your touch)
Got me hoping you'll page me right now, your kiss
Got me hoping you'll save me right now
Looking so crazy in love's
Got me looking, got me looking so crazy in love
When I talk to my friends so quietly
Who he think he is? Look at what you did to me
Tennis shoes, don't even need to buy a new dress
If you ain't there ain't nobody else to impress
The way that you know what I thought I knew
It's the beat my heart skips when I'm with you
But I still don't understand
Just how the love your doing no one else can
I'm Looking so crazy in love's
Got me looking, got me looking so crazy in love
Got me looking, so crazy, my baby
I'm not myself, lately I'm foolish, I don't do this
I've been playing myself, baby I don't care
'Cuz your love's got the best of me
And baby you're making a fool of me
You got me sprung and I don't care who sees
'Cuz baby you got me, you got me, so crazy baby
HEY!
141
1I
2 look
3 and
4 stare
5 so
6 deep
7 in
8 your
9 eyes
10 I
11 touch
12 on
13 you
14 more
15 and
16 more
17 every
18 time
19 When
20 you
21 leave
22 I'm
23 begging
24 you
25 not
26 to
27 go
28 Call
29 your
30 name
31 two
32 or
33 three
34 times
35 in
36 a
37 row
38 Such
39 a
40 funny
41 thing
42 for
43 me
44 to
45 try
46 to
47 explain
48 How
49 I'm
50 feeling
51 and
52 my
53 pride
54 is
55 the
56 one
57 to
58 blame
59 'Cuz
60 I
61 know
62 I
63 don't
64 understand
65 Just
66 how
67 your
68 love
69 can
70 do
71 what
72 no
73 one
74 else
75 can
76 Got
77 me
78 looking
79 so
80 crazy
81 right
82 now,
83 your
84 love's
85 Got
86 me
87 looking
88 so
89 crazy
90 right
91 now
92 (in
93 love)
94 Got
95 me
96 looking
97 so
98 crazy
99 right
100 now,
101 your
102 touch
103 Got
104 me
105 looking
106 so
107 crazy
108 right
109 now
110 (your
111 touch)
112 Got
113 me
114 hoping
115 you'll
116 page
117 me
118 right
119 now,
120 your
121 kiss
122 Got
123 me
124 hoping
125 you'll
126 save
127 me
128 right
129 now
130 Looking
131 so
132 crazy
133 in
134 love's
135 Got
136 me
137 looking,
138 got
139 me
140 looking
141 so
142 crazy
143 in
144 love
145 When
146 I
147 talk
148 to
149 my
150 friends
151 so
152 quietly
153 Who
154 he
155 think
156 he
157 is
158 Look
159 at
160 what
161 you
162 did
163 to
164 me
165 Tennis
166 shoes
167 don't
168 even
169 need
170 to
171 buy
172 a
173 new
174 dress
175 If
176 you
177 ain't
178 there
179 ain't
180 nobody
181 else
182 to
183 impress
184 The
185 way
186 that
187 you
188 know
189 what
190 I
191 thought
192 I
193 knew
194 It's
195 the
196 beat
197 my
198 heart
199 skips
200 when
142
201 I'm
202 with
203 you
204 But
205 I
206 still
207 don't
208 understand
209 Just
210 how
211 the
212 love
213 your
214 doing
215 no
216 one
217 else
218 can
219 I'm
220 Looking
221 so
222 crazy
223 in
224 love's
225 Got
226 me
227 looking,
228 got
229 me
230 looking
231 so
232 crazy
233 in
234 love
235 Got
236 me
237 looking,
238 so
239 crazy,
240 my
241 baby
242 I'm
243 not
244 myself,
245 lately
246 I'm
247 foolish,
248 I
249 don't
250 do
251 this
252 I've
253 been
254 playing
255 myself
256 baby
257 I
258 don't
259 care
260 'Cuz
261 your
262 love's
263 got
264 the
265 best
266 of
267 me
268 And
269 baby
270 you're
271 making
272 a
273 fool
274 of
275 me
276 You
277 got
278 me
279 sprung
280 and
281 I
282 don't
283 care
284 who
285 sees
286 'Cuz
287 baby
288 you
289 got
290 me,
291 you
292 got
293 me,
294 so
295 crazy
296 baby
297 HEY
Credit: Gabe
Yonker
4.1 PRACTICE!
1. In May 2015, the Los Angeles City Council voted to ban most travel and contracts with the state of Arizona to protest
Arizona’s new immigration enforcement law. The Los Angeles Times conducted an online poll that asked if the City
Council was right to pass a boycott of Arizona. The results showed that 96% of the 41,068 people in the sample said
“No.” Does this result represent the opinions of all Los Angeles residents? Explain.
2. The manager of a beach-front hotel wants to survey guests in the hotel to estimate overall customer satisfaction. The
hotel has two towers, an older one to the south and a newer one to the north. Each tower has 10 floors of standard
rooms (40 rooms per floor) and 2 floors of suites (20 suites per floor). Half of the rooms in each tower face the beach,
while the other half of the rooms face the street. This means there are (2 towers)(10 floors)(40 rooms) + (2 towers)(2
floors)(20 suites) = 880 total rooms.
A. Explain how to select a simple random sample of 88 rooms.
143
B. Explain how to select a stratified random sample of 88 rooms.
C. Explain why selecting 2 of the 24 different floors would not be a good way to obtain a cluster sample.
3. Which of the following are sources of sampling error and which are sources of nonsampling error? Explain your
answers.
(a) The subject lies about past drug use.
(b) A typing error is made in recording the data.
(c) Data are gathered by asking people to mail in a coupon printed in a newspaper.
144
4.2 Random Sampling Activity: Sampling Segregation
Most American cities have severe economic segregation. This means that households in the same
neighborhood have similar incomes, but incomes across neighborhoods tend to be quite different.
Multiple reports have found that San Antonio is the most economically segregated city in the United
States. The following activity investigates the challenges of sampling incomes in San Antonio.
Judgement Sample: Select 5 homes that you think are representative of the whole population
of San Antonio. Label the home numbers on the bottom lines and their incomes on the top
lines. Calculate the mean of these incomes and record your answer in the space provided.
Incomes:
________
___
Home Number:
________
___
_________
___
________
___
________
___
Mean: _______
Simple Random Sample (SRS): Use a random number generator to select 5 homes numbered
between 1 and 100. Label the home numbers on the bottom lines and their incomes on the top
lines. Calculate the mean of these incomes and record your answer in the space provided.
Incomes:
________
___
Home Number:
________
___
_________
___
________
___
________
___
Mean: _______
Cluster Random Sample: Use a random number generator to select 1 home numbered
between 1 and 100. Find the home on the map as well as the four homes located closest to it.
Label the home numbers on the bottom lines and their incomes on the top lines. Calculate the
mean of these incomes and record your answer in the space provided.
Incomes:
________
___
Home Number:
________
___
_________
___
________
___
________
___
Mean: _______
Stratified Random Sample: Find the page with the title “Strata by Race-Ethnicity”, in which the
homes are divided by predominate race. Use a random number generator to select 1 home
within each of the 5 strata. For example, I would use RandInt(67,71,1) to select a Native
American home, as the range for that strata is 67-71. Label the home numbers on the bottom
lines and their incomes on the top lines. Calculate the mean of these incomes.
Incomes:
________
___
Home Number:
________
___
_________
___
________
___
________
___
Mean: _______
Systematic Random Sample: Use a random number generator to select a number between 1
and 20. This is the first home. Then add 20 to each previous number to find the other four
homes. Label the home numbers on the bottom lines and their incomes on the top lines.
Calculate the mean of these incomes.
Incomes:
________
___
Home Number:
________
___
_________
___
145
________
___
________
___
Mean: _______
94
93
42
49
91
41
87
50
88
90
89
92
86
40
95
57
82
43
81
45
48
98
56
46
83
60
79
58
55
75
78
76
33
18
74
35
44
59
84
80
77
96
85
54
47
34
11
17
99
8
71
32
97
63
9
13
5
72
6
61
1
7
2
19
20
53
52
25
70
26
23
21
68
27
22
29
51
15
64
66
4
67
100
24
3
65
14
62
10
16
12
73
28
69
31
30
Key (Predominate Race in Home):
37
38
Asian
Black
Hispanic
Native American
White
39
36
Home Income
Home Income
Home Income
Home Income
Home Income
$25,000
Home Income
Home Income
1
16
$23,500
31
$64,000
46
$65,000
61
$66,500
76
$56,000
91
$108,000
2
$26,500
17
$23,500
32
$35,000
47
$62,500
62
$26,000
77
$76,000
92
$157,000
3
$42,000
18
$53,000
33
$55,000
48
$78,000
63
$24,500
78
$44,500
93
$186,000
4
$56,000
19
$36,000
34
$56,500
49
$85,500
64
$22,000
79
$212,500
94
$166,000
5
$23,000
20
$28,500
35
$57,000
50
$61,500
65
$20,000
80
$119,000
95
$174,000
6
$24,500
21
$27,500
36
$42,500
51
$48,000
66
$28,500
81
$67,000
96
$80,000
7
$44,500
22
$28,000
37
$44,500
$23,000
82
$124,000
97
$42,000
8
$25,500
23
$30,500
38
$41,000
$20,000
83
$109,000
98
$48,500
9
$25,500
24
$27,000
39
$43,000
$27,500
84
$111,500
99
$28,000
10
$54,000
25
$31,500
40
$109,000
55
$44,500
70
$36,000
85
$48,000
100
$27,000
11
$26,000
26
$29,500
41
$214,500
56
$102,000
71
$25,000
86
$43,500
12
$25,000
27
$45,500
42
$92,500
57
$139,000
72
$57,000
87
$47,000
13
$35,500
28
$61,000
43
$95,000
58
$68,000
73
$110,000
88
$39,000
14
$25,000
29
$61,000
44
$68,500
59
$107,500
74
$28,000
89
$58,500
15
$27,500
30
$58,500
45
$76,000
60
$85,000
75
$58,000
90
$62,000
52
53
54
$44,500
$47,000
$108,500
146
67
68
69
Strata: By Race-Ethnicity
Asian
57
56
58
Black
61
59
65
62
66
63
60
64
Hispanic
1
6
11
2
7
12
3
16
21
17
22
18
23
13
8
48
28
14
9
19
15
33
54
45
24
20
49
41
37
38
50
34
30
25
10
5
53
44
36
32
29
4
52
35
31
27
47
43
40
42
39
55
46
51
26
Native American
67
68
71
69
70
White
72
82
79
76
90
85
96
93
86
91
97
83
73
77
99
80
87
98
92
74
75
78
84
81
100
94
88
95
89
147
Sampling Distributions for Each Sample Type
Judgement Sample
---------|------------|------------|------------|------------|------------|------------|------------|------------|-------$20k
$30k
$40k
$50k
$60k
$70k
$80k
$90k
$100k
Simple Random Sample (SRS)
---------|------------|------------|------------|------------|------------|------------|------------|------------|-------$20k
$30k
$40k
$50k
$60k
$70k
$80k
$90k
$100k
Cluster Random Sample
---------|------------|------------|------------|------------|------------|------------|------------|------------|-------$20k
$30k
$40k
$50k
$60k
$70k
$80k
$90k
$100k
Stratified Random Sample
---------|------------|------------|------------|------------|------------|------------|------------|------------|-------$20k
$30k
$40k
$50k
$60k
$70k
$80k
$90k
$100k
Systematic Random Sample
---------|------------|------------|------------|------------|------------|------------|------------|------------|------$20k
$30k
$40k
$50k
$60k
148
$70k
$80k
$90k
$100k
AP Statistics: Lesson 4.2 Post-Activity Guided Notes & Discussion
Topics: random sampling methods
High accuracy → unbiased
Low accuracy → biased
High precision → low variability
Low precision → high variability
Judgement Sample
Biased (not accurate) / Unbiased (accurate)
Precise (low variability) / Imprecise (high variability)
Pros:
Cons:
Question: Why was the judgement sample biased?
Simple Random Sample (SRS)
Biased (not accurate) / Unbiased (accurate)
Precise (low variability) / Imprecise (high variability)
Pros:
Cons:
Cluster Random Sample
Biased (not accurate) / Unbiased (accurate)
Precise (low variability) / Imprecise (high variability)
Pros:
Cons:
Question: Why did the cluster random sample have high variability?
Stratified Random Sample
Biased (not accurate) / Unbiased (accurate)
Precise (low variability) / Imprecise (high variability)
Pros:
Cons:
Question: Why did the stratified random sample have low variability?
Systematic Random Sample
Biased (not accurate) / Unbiased (accurate)
Precise (low variability) / Imprecise (high variability)
Pros:
Cons:
149
Lesson 4.2 Discussion – Property Taxes and Financing Schools
Alamo Heights High School (Alamo Heights ISD)
•
•
•
Property tax rate: $1.06 per $100 in home value
Average home value: $560,500 (source: Zillow)
Calculate tax revenue from average home:
Burbank High School (San Antonio ISD)
•
•
•
Property tax rate: $1.17 per $100 in home value
Average home value: $125,000 (source: Zillow)
Calculate tax revenue from average home:
Discussion Question: Funding for schools varies by state, county, and township. If you were mayor of
San Antonio, based on the activity and your analysis above, which of the following school funding
schemes would you choose? Why?
Option A: Fund schools based only on property taxes collected from the homes around the school.
Option B: Fund all schools equally (the same amount of money per student) by taking tax money from
property-rich areas and giving it to schools in property-poor areas.
Option C: Give more funds per student to schools that serve lower-income students. You do this by
taking more tax money from property rich areas and giving it to schools in property-poor areas.
150
4.2 PRACTICE!
1. A study published in the New England Journal of Medicine compared two medicines to treat head lice: an oral
medication called ivermectin and a topical lotion containing malathion. Researchers studied 812 people in 376
households in seven areas around the world. Of the 185 randomly assigned to ivermectin, 171 were free from head lice
after two weeks compared to only 151 of the 191 households randomly assigned to malathion.
Identify the experimental units, explanatory and response variables, and the treatments in this experiment.
2. Does adding fertilizer affect the productivity of tomato plants? How about the amount of water given to the plants?
To answer these questions, a gardener plants 24 similar tomato plants in identical pots in his greenhouse. He will add
fertilizer to the soil in half of the pots. Also, he will water 8 of the plants with 0.5 gallons of water per day, 8 of the
plants with 1 gallon of water per day and the remaining 8 plants with 1.5 gallons of water per day. At the end of three
months he will record the total weight of tomatoes produced on each plant.
Identify the explanatory and response variables, experimental units, and list all the treatments.
151
3. Suppose you have a class of 30 students who volunteer to be subjects in an experiment involving caffeine. Explain
how you would randomly assign 15 students to each of the two treatments.
4. A cell phone company is considering two different keyboard designs (A and B) for its new line of cell phones.
Researchers would like to conduct an experiment using subjects who are frequent texters and subjects who are not
frequent texters. The subjects will be asked to text several different messages in 5 minutes. The response variable will
be the number of correctly typed words.
A. Explain why a randomized block design might be preferable to a completely randomized design for this experiment.
B. Outline a randomized block experiment using 100 frequent texters and 200 novice texters.
152
How Much Do Fans Love Justin Timberlake? Day 1
Justin Timberlake’s concert promoter wants to find out how much fans enjoy the concerts. He
will ask fans, “From 1 to 100, where 100 is the most, how much did you enjoy the concert?” The
section he wants to survey has 50 seats (5 rows x 10 columns). The stage runs along the
northern edge of the venue (where Justin is pictured). He wants to take a sample of 10 seats.
1. Method #1:
Take a simple random sample (SRS) of 10
fans. Explain below the steps you used to
obtain an SRS.
2. Method #2:
Randomly choose 2 fans from each
horizontal row.
3. Method #3:
Randomly choose 1 fan from each
vertical column.
4. Which method do you think is best? Why?
153
5. Now, it’s time for the actual data. For each of your samples on the previous page,
calculate the average enjoyment. Add your average to the dotplots on the board.
Sample #1:
Sample #2:
Sample #3:
Method #1: SRS
average enjoyment
Method #2: Stratify by Row
average enjoyment
Method #3: Stratify by Column
average enjoyment
154
Other Random Sampling Methods Day 1
Important Ideas:
Check Your Understanding:
To score the AP Statistics Exams ETS hires Exam Readers, Table Leaders, and Other
Leadership. Each reading room consists of 16 Exam Readers and 2 Table Leaders. There are
100 reading rooms. The 18 members of Other Leadership work together in a room.
a. Describe how to select a stratified random sample of 36 people hired by ETS to score
the AP Statistics Exams. Explain your choice of strata.
b. Describe how to select a cluster sample of 36 people hired by ETS to score the AP
Statistics Exams. Explain your choice of clusters.
c. Explain a benefit of using a stratified random sample and a benefit of using a cluster
random sample in this context.
155
AP Statistics Handout: Lesson 4.3
Topics: observation vs. experiment, components and principles of experiments, completely
randomized design
Lesson 4.3 Guided Notes
Observational Studies vs. Experiments
Sources:
-Huang, J. et al. “Historical comparison
of gender inequality in scientific
careers across countries and
disciplines.” Proceedings of the
National Academy of Sciences, Mar
2020, 117 (9) 4609-4616; DOI:
10.1073/pnas.1914221117
-Boston Consulting Group, “What’s
Keeping Women out of Data Science?”
bcg.com/publications/2020/whatkeeps-women-out-data-science.aspx
1. One possible cause of gender gaps is hiring bias. Provide an explanation for how hiring biases might
cause these trends:
From this ____________ data alone, we cannot prove that hiring discrimination is the cause. Why?
Confounding variables: Provide ___________________ explanations for trends between explanatory
(gender) and response (hiring rates) variables.
2. Name and discuss one confounding variable that could also explain the gender gap in STEM:
Observational study: a study in which data is collected ___________________________ any treatments.
• Retrospective study: examines ________________________ on individuals
• Prospective study: follows individuals to gather ____________________
• Both ______________ show cause and effect because they do not control for confounding!
156
Experiments: a study in which treatment is _______________ on subjects.
• If well designed, experiments _____________ cause-effect relationships by _________________
for confounding variables.
Components of Experiments
The Jenn/John Study - Example inspired by Advanced High School Statistics
In this study*, experimenters printed up copies of fake application materials for a science lab manager
job. All copies of the application materials were completely identical, except for one thing: the name. On
about half of the copies, the application listed the name “John.” The other half had the name “Jennifer.”
Experimenters found 127 science lab faculty members and randomly sent them either a “John” (n = 63)
or “Jennifer” (n = 64) application. Each faculty member was told that the application was for a real
position at their University. The faculty members independently rated the applicant’s “hireability” (1-7
scale) and estimated their starting salary. Experimenters compared these results across the two groups.
*Moss-Racusin, C., Dovidio, J., et al. “Science faculty’s subtle gender biases favor male students.”
PNAS October 9, 2012 109 (41) 16474-16479; https://doi.org/10.1073/pnas.1211286109
Experimental units: the ________________________ (person, animal, plant, virus, particle, etc.) that are
assigned to different treatments.
In hiring study: _______________________
Explanatory variable: the variable that is purposefully __________________. This is also known as the
factor.
In hiring study: ____________________________
Treatments: the different _____________ of the explanatory variable in the experiment.
In hiring study: __________________
Response variable: the measured experiment ___________________ that is compared between
treatment groups.
In hiring study: ___________________
157
Principles of Experimental Design
•
•
•
•
________________ of at least two treatment groups
_____________________________ of experimental units to treatment
_____________________– many experimental units in each treatment group
________________ of confounding variables
1. Comparison
a) Explain how comparison is implemented in the Jenn/John study:
2. Random Assignment
a) Why would it be a problem if I assigned Tier 1 University labs to get “John” and Tier 2
labs to get “Jennifer” application materials?
b) Random assignment tends to ________________ confounding factors, so inferences can be made
about the explanatory variable.
c)
Sampling
Random ___________
→ Reduces ________
Experiments
Random ____________
→ Reduces _______________
3. Replication
a) Why would it be a problem if I only assigned one faculty member to the “John” group and one
faculty member to the “Jennifer” group?
b) Larger treatment group size reduces the likelihood of differences arising due to chance alone. In
other words, it makes our estimates of treatment effect _______________________.
4. Control
a) One way they control for confounding factors is by making the application materials ____________
_______________, except for the explanatory variable (gender).
158
Describing a Completely Randomized Design
Completely randomized design: An experimental design in which experimental units are assigned to
treatments _______________________________________.
• This is the “SRS” of experiments – the simplest (but still effective) randomized experiment.
Question: Describe how you would implement a completely randomized design of the Jennifer/John
experiment, with 127 science faculty members.
When describing how to perform a completely randomized design experiment:
1. Assign each experimental unit a number 1 – n (sample size).
2. Write all the numbers on identical slips of paper, put into a hat, and mix well.
3. Draw out nt (treatment group size) slips of paper, without replacement. The corresponding units are
assigned treatment 1. Draw out another nt slips of paper, assign to treatment 2, etc.
4. Compare response among treatment groups
Model Response:
“Assign each faculty member an integer, __________. Write integers 1-127 on identical slips of paper,
put them into a hat, and _______________. Draw out 63 slips (without replacement). The
corresponding faculty members will receive ‘John’ application materials. The __________________
faculty members will receive ‘Jennifer’ application materials. At the end of the experiment, record
faculty members’ rating of applicants’ ‘hireability’ and starting salary estimates. Finally,
_______________ these results across the two groups.”
Lesson 4.3 Discussion
Here’s an estimated summary of the Jenn/John study results:
Estimates based on
publicly available data
from the study.
The differences were found to be statistically significant:
so extreme that they were unlikely to happen by
__________________.
Hireability
Salary
John
3.78
$30,238
Jennifer
2.93
$26,508
Since this was a well-designed experiment, we can infer that these average differences are
_________________ by gender bias.
159
Difference
-0.85
-$3,730
How Much Do Fans Love Justin Timberlake? Day 2
In the next city, Justin Timberlake’s concert promoter again wants to find out how much fans
enjoy his concerts. He will ask fans, “From 1 to 100, where 100 is the most, how much did you
enjoy the concert?” Again, he wants to take a sample of 10 fans. He also would like to try out a
couple of new methods for sampling.
1. Method #1:
Take a simple random sample (SRS) of 10
fans.
2. Method #2:
To make it easier to distribute the surveys, the
promoter decides to pick one row and just
sample every fan in that row.
a.
Use this method to select a sample of 10
fans.
b.
Do you think this method will produce
good estimates? Why or why not?
3. Method #3:
Justin’s manager thinks it is important to
sample fans that have different views of the
stage. He wants to sample every 8th fan.
a.
First, we need to figure out the starting
fan. Randomly select a fan and mark with
an X.
b.
Begin marking every 8th seat until you get a
sample of 10 seats (start back at the
beginning if you need to).
4. Which method do you think is best? Why?
160
5. Now, it’s time for the actual data. For each of your samples on the previous page,
calculate the average enjoyment. Add your average to the dotplots on the board.
Sample #1:
Sample #2:
Sample #3:
Method #1: SRS
average enjoyment
Method #2: Cluster Sample
average enjoyment
Method #3: Systematic Random Sample
average enjoyment
161
More Sampling Methods Day 2
Important Ideas:
Check Your Understanding:
The manager of a large ocean front hotel would like to survey their guests to determine their
satisfaction with the view from their room. The hotel has 10 floors. Half of the rooms overlook the
ocean and the other half overlook the street. There are 50 rooms on each floor, for a total of 500
rooms. The hotel manager would like to select a sample of 50 rooms.
a. Describe how to select a stratified random sample of 50 rooms.
b. Describe how to select a cluster sample of 50 rooms.
c. Describe how to select a systematic random sample of 50 rooms.
d. Explain a benefit of using each of the three types of sampling methods in this context.
162
AP Statistics Handout: Lesson 4.4
Topics: blocking, matched pairs, placebo, blinding, generalizing study results
Lesson 4.4 Guided Notes
Blocking
The Jenn/John Study - Example inspired by Advanced High School Statistics
In this study*, experimenters printed up copies of fake application materials for a science lab manager
job. All copies of the application materials were completely identical, except for one thing: the name. On
about half of the copies, the application listed the name “John.” The other half had the name “Jennifer.”
Experimenters found 127 science lab faculty members and randomly sent them either a “John” (n = 63)
or “Jennifer” (n = 64) application. Each faculty member was told that the application was for a real
position at their University. The faculty members independently rated the applicant’s “hireability” (1-7
scale) and estimated their starting salary. Experimenters compared these results across the two groups.
*Moss-Racusin, C., Dovidio, J., et al. “Science faculty’s subtle gender biases favor male students.”
PNAS October 9, 2012 109 (41) 16474-16479; https://doi.org/10.1073/pnas.1211286109
The experiment described above is a…
Completely randomized design: An experimental design in which experimental units are assigned to
treatments completely at random.
• This is the ________ of experiments – the simplest (but still effective) randomized
experiment.
What if the faculty members were from both Tier 1 and Tier 2 Universities?
There is some difference but also
some overlap. The differences may
be due to _____________________.
163
When broken down by Tier, there is
little overlap. Schools ____________
each tier rated John more ‘hireable.’
How to reduce the variability due to school tier –
Randomized Complete Block Design: experimental units are first blocked (_________________) by a
similar trait that may affect response. Then, units from each block are randomly assigned to treatment.
• This is the _______________________________ of experiments.
• It reduces variation between treatment groups at the start of the experiment. This makes it
easier to show that differences in response are really _______________________________,
rather than chance variation in the random assignment.
Question: Describe how you would implement a randomized complete block design for the Jenn/John
study, which takes into account University tier.
Model Response:
“Block the science lab faculty members by University tier. In each block, assign each faculty member
____________________________. For the Tier 1 block, write all the numbers on identical slips of paper,
put into a hat, and mix well. Draw out 3 slips of paper, _____________________________________.
The corresponding faculty are assigned ‘Jennifer’ application materials. The remaining are assigned
‘John’ materials. _______________ the same random assignment process for the Tier 2 block. Compare
‘hireability’ ratings between the Jenn/John groups ____________ each block. ____________________
the results, after accounting for the average difference in each block.”
164
Matched Pairs Design
Matched Pairs Design: a type of randomized blocked experiment in which each block is composed of
_______ similar experimental units (a “matched pair”).
• Often, the “matched pair” is simply the same experimental unit receiving both treatments. The
____________ of the treatments is randomized.
The 2015 Journal of the American Medical Association Depression Study
In this study*, researchers wanted to test if taking a “fake pill” would actually alleviate depression
symptoms, even though the pill had no active ingredients. 35 people enrolled in the study. All had major
depression and none were taking any medications. Because depression varies greatly between
individuals, the researchers implemented a matched pairs design.
*Simplified version of: Association Between Placebo-Activated Neural Systems and Antidepressant Responses: Neurochemistry
of Placebo Effects in Major Depression. Peciña M, Bohnert AS, Sikora M, Avery ET, Langenecker SA, Mickey BJ, Zubieta JK. JAMA
Psychiatry. 2015 Sep 30:1-8. doi: 10.1001/jamapsychiatry.2015.1335. [Epub ahead of print]. PMID: 26421634.
Question: Describe how you would implement a matched pairs design for the depression study.
“For each subject, flip a coin. Heads indicates they get the
fake pill for the first week, no pill for the second week. Tails
indicates the _____________________________________.
Have each subject fill out a depression questionnaire and
undergo a PET brain scan at the end of each week. Compare
measurements _________________________ and compile
the results.”
Depression study results
1. On average, participants reported less severe depressive symptoms after their week with
the fake pill.
2. On average, participants showed “increased µ-opioid receptor brain activity in regions of the
brain associated with emotion and stress regulation.”
o Belief in the pill caused a ____________________________
“Fake pills” have also have also shown significant beneficial effects for….
• Migraines
• Blood Pressure
Called the:
• Asthma
_______________________
• Arthritis
• Many other ‘physical’ illnesses
Source: https://www.ncbi.nlm.nih.gov/books/NBK513296/
165
Blinding and Placebo
Placebo: An ________________ treatment (e.g. sugar pill or salt water
IV drip)
Placebo effect: when subjects’ belief of receiving an active treatment
leads to a measured response, even though the treatment is
___________________________.
Single-blind study: _______________ the subjects or the researchers are unaware of who receives
active treatment or placebo.
Double-blind study: ________ the subjects and the researchers are ___________________ of who
receives active treatment or placebo.
Why blind subjects? _____________________________
Why blind researchers? ___________________________
• Researchers trying to prove that a treatment works may favor treatment group in their
measurements
Lesson 4.4 Discussion
Here’s an estimated summary of the Jenn/John study results:
Hireability
Salary
John Jennifer Difference
3.78
2.93
-0.85
$30,238 $26,508 -$3,730
Estimates based on publicly
available data from the study.
Generalization: using study results to make inferences about a _________________________.
Discussion Question: In the Jenn/John study, all faculty participants were from one of three science
departments: biology, chemistry, or physics. Can we generalize the result of gender hiring discrimination
to all scientific subjects? Explain your reasoning.
166
Were subjects randomly sampled?
Generalization and Scope of Inferences
Were subjects randomly assigned to treatment?
Subjects were
randomly
sampled from
the population
Subjects were
not randomly
sampled from
the population
Subjects were randomly assigned to
groups (experiment)
Subjects were not randomly assigned
to groups (observational study)
Generalize to the population: ___
Determine cause and effect: ___
Generalize to the population: ___
Determine cause and effect: ___
Generalize to the population: ___
Determine cause and effect: ___
Generalize to the population: ___
Determine cause and effect: ___
167
What is wrong with these surveys?
Identify what is wrong in each of these surveys. Be sure to explain.
1. The mayor of Springfield is interested in finding out the average age of people in the city.
He obtains a list of all of the landline telephones in the city, and then contacts a simple
random sample of 300 people. He uses the data from the sample to estimate the average
age of all the people in the city.
a. What is wrong with this survey?
b. Do you think the Mayor will over or underestimate the true mean age of people in
Springfield? Why?
2. The administration at a school wants to know the proportion of students that did all of
their homework last night. They select a simple random sample of 100 students and send
an email to each of them asking if they did all of their homework last night. Of the 40
responses, 36 of the students said that they did all of their homework last night (90%).
a. What is wrong with this survey?
b. Do you think the administration will over or underestimate the true proportion of
students who did all of their homework last night? Why?
3. Boy Scout Peter M. wants to know the proportion of people in his neighborhood who
support the Boy Scouts. He takes a random sample of 30 homes and visits them dressed
in his uniform.
a. What is wrong with this survey?
b. Do you think Peter will over or underestimate the true proportion of his neighbors
who support the Boy Scouts? Why?
168
Sample Surveys: What else can go wrong?
Important Ideas:
Check Your Understanding:
1. The principal of a large high school wants to learn about student opinion regarding remote
learning versus in person learning. Each of the following methods introduces a possible
source of bias. Name the type of bias.
a. The principal selects a random sample of students from those who signed up for remote
learning.
b. The principal emails a random sample of 100 students, but only 28 students respond.
c. The principal surveys the first 50 students that submit their learning preference for the
upcoming school year.
2. The following question was on the Pennsylvania General Election Ballot in the 2016 election.
“Shall the Pennsylvania Constitution be amended to require that justices of the
Supreme Court, judges, and magisterial district judges be retired on the last day of
the calendar year in which they attain the age of 75 years?”
The wording seems to ask if there should be mandatory retirement. However, what was not
revealed is that mandatory retirement already existed at age 70. Due to the wording of the
question is the percentage of people who voted “Yes” likely to be less than, greater than, or
about equal to the percent of all voters who would vote “Yes” if the question were not
misleading? Explain your reasoning.
169
Does SAT prep improve scores?
Last year EKHS offered an after school SAT prep class that students could volunteer to take.
44 students took the course and then took the SAT. The average SAT score for this group was
1220. The average SAT score for all students who did not take the prep class was 1050.
1. Is the situation described an observational study or an experiment?
2. Identify the explanatory variable and the response variable.
3. Can you conclude that taking the prep course will cause a student’s SAT score to
increase? Why or why not?
4. Identify as many other possible variables that you can that may explain why the
SAT scores are higher for those who took the prep course than for those who did
not.
5. Design an experiment that would allow us to determine if the SAT prep causes
an increase in SAT scores. Be as thorough as possible.
170
Observational Studies and Experiments
Important Ideas:
Check Your Understanding:
1. Do homes with metal roofs get worse cell service than homes with shingled roofs? Ben and
Jerry are neighbors and own an ice cream business together. Ben has a metal roof on his
home and Jerry has a shingled roof. Ben and Jerry select a random sample of 50 employees.
They randomly assign them to come to a party at either Ben’s house or Jerry’s house. When
they arrive Ben and Jerry ask them about their signal strength. Was this an observational study
or an experiment? Justify your answer.
2. A study showed that one factor that strongly correlates with student academic success is the
number of books that the family has in their home. Researchers randomly selected 100
students and measured their academic success as well as counted the number of books their
family has in their home.
a. Is this an observational study or an experiment? Justify your answer.
b. What are the explanatory and response variables?
c. Explain clearly why such a study cannot establish a cause-and-effect relationship. Suggest
a variable that may be confounded with number of books that the family have in their home.
171
Would you fall for that?
Would you fall for the placebo effect? Watch this video, then complete the rest of the questions.
1. Why do you think the people in the video got stronger?
Similar to the video, Mrs. Gallas wants to use a beverage to test the affect that caffeine can have on
heart rate. Here is an initial plan:
• measure initial pulse rate
• give each student some caffeine (Coca-Cola)
• wait for a specified time
• measure final pulse rate
• compare final and initial rates
2. What are some problems with this plan? What other variables will be sources of variability in
pulse rates?
3. Go back up to your list in #2 and propose a solution to each problem.
4. Design an experiment to test the effect that caffeine has on heart rate.
172
Designing Experiments
Important Ideas:
Check Your Understanding:
A group of researchers in Africa find a creative way to protect cattle from lion attacks – they paint eyes on
the cows’ rears. To determine if this treatment is effective, they randomly assign the cattle to one of three
treatments: Eyes on their rear, cross-marks on their rear, or nothing on their rear. After 4 years of roaming
the plains, the cows with eyes saw no deaths, the cows with a cross suffered 4 deaths, and the cows with
no marks suffered 15 deaths.
a. Explain why it was important to have a control group that didn’t get a mark on their rear.
b. Suppose the researchers had 1200 cows. Describe how to randomly assign the cows to the treatments.
c. What is the purpose of randomly assigning treatments in this context?
d. Create an outline showing a completely randomized design for the experiment.
173
4.2 PRACTICE!
1. A study published in the New England Journal of Medicine compared two medicines to treat head lice: an oral
medication called ivermectin and a topical lotion containing malathion. Researchers studied 812 people in 376
households in seven areas around the world. Of the 185 randomly assigned to ivermectin, 171 were free from head lice
after two weeks compared to only 151 of the 191 households randomly assigned to malathion.
Identify the experimental units, explanatory and response variables, and the treatments in this experiment.
2. Does adding fertilizer affect the productivity of tomato plants? How about the amount of water given to the plants?
To answer these questions, a gardener plants 24 similar tomato plants in identical pots in his greenhouse. He will add
fertilizer to the soil in half of the pots. Also, he will water 8 of the plants with 0.5 gallons of water per day, 8 of the
plants with 1 gallon of water per day and the remaining 8 plants with 1.5 gallons of water per day. At the end of three
months he will record the total weight of tomatoes produced on each plant.
Identify the explanatory and response variables, experimental units, and list all the treatments.
174
3. Suppose you have a class of 30 students who volunteer to be subjects in an experiment involving caffeine. Explain
how you would randomly assign 15 students to each of the two treatments.
4. A cell phone company is considering two different keyboard designs (A and B) for its new line of cell phones.
Researchers would like to conduct an experiment using subjects who are frequent texters and subjects who are not
frequent texters. The subjects will be asked to text several different messages in 5 minutes. The response variable will
be the number of correctly typed words.
A. Explain why a randomized block design might be preferable to a completely randomized design for this experiment.
B. Outline a randomized block experiment using 100 frequent texters and 200 novice texters.
175
Does type of SAT prep matter?
EKHS has decided to offer an SAT prep class again this year. It will be offered in two different
formats: online or classroom teacher. The counselors want to know which teaching method will
yield higher SAT scores so they have allowed us to set up an experiment. 50 students have
signed up to take some form of the SAT prep class. (20 seniors and 30 juniors)
1. Outline a completely randomized design to compare the two treatments.
2. The counselors at EKHS hypothesize that the online vs. classroom results could
be greatly affected by the grade level of students that were put into each
treatment group. They know that seniors generally score better on the SAT than
juniors. How could we adjust our experiment to ensure that there is even split of
seniors and juniors in each class? Draw an outline of the experiment with your
modifications.
3. The counselors are now worried that a student’s GPA is certainly going to affect
their SAT score. Let’s look only at the Juniors. We want to be sure that the
different GPAs are being evenly distributed into the two treatment groups.
How could we be sure the GPAs are evenly distributed?
176
Randomized Block Designs
Important Ideas:
Check Your Understanding:
A political strategist would like to design an experiment to compare the effectiveness of
three different YouTube advertisements promoting a specific presidential candidate.
She will use 300 randomly selected YouTube users for the experiment.
1. Describe a completely randomized design to compare the effectiveness of the
three advertisements.
2. Describe a randomized block design for this experiment. Justify your choice of
blocks.
3. Why might a randomized block design be preferable in this context?
177
Lesson 4.3: Day 1: Does caffeine increase pulse rate?
Mrs. Gallas and her students decided to perform the caffeine experiment. In their experiment, 10
student volunteers were randomly assigned to drink cola with caffeine and the remaining 10
students were assigned to drink caffeine-free cola. Were their findings statistically significant?
The table shows the change in pulse rate for each student (Final pulse rate – Initial pulse rate),
along with the mean change for each group.
1. Find the difference in mean pulse rate for the groups. Does your initial reaction lead you to
believe that they found evidence that caffeine does or does not increase heart rate? Explain.
2. What are two possible explanations for the difference in mean pulse rate?
To try to decide if the difference in pulse rate is big enough to be convincing we will do a simulation
with the data.
Simulation:
Step 1: Gather 20 index cards to represent the 20 students in this experiment. On each card, write
one of the 20 outcomes listed in the table. For example, write “8” on the first card, “3” on the second
card, and so on.
Step 2: Shuffle the cards and deal two piles of 10 cards each. This represents randomly assigning
the 20 students to the two treatments, assuming that the treatment received doesn’t affect the
change in pulse rate. The first pile of 10 cards represents the caffeine group, and the second pile of
10 cards represents the no-caffeine group.
Step 3: Fill in the table below with your simulated data.
Caffeine
No
Caffeine
178
3. Find the mean change for each group in your simulation and subtract the means (Caffeine –
No caffeine).
4. Add your difference in means to the dotplot on the board. Sketch the dotplot below.
What does each dot represent?
5. What percentage of the dots are greater than or equal to the difference in means of 1.2 found
in Mrs. Gallas’ experiment?
Interpret this percentage:
6. Do you think the difference in means we found from our experiment is due to the caffeine or
has it occurred purely by chance? Explain.
179
Lesson 4.3: Day 1: Inference for Experiments
Big Ideas:
Check Your Understanding:
1. How much do National Football League (NFL) players weigh, on average? In a random
sample of 50 NFL players, the average weight is 244.4 pounds.
(a) Do you think that 244.4 pounds is the true average weight of all NFL players? Explain your
answer.
(b) If another random sample of 50 NFL players was taken, would you expect to an average
weight of exactly 244.4 pounds?
(c) Estimates are usually given with a margin of error. The margin of error for the estimate of
244.4 pounds is 14.2 pounds. Based on this, would you be surprised if the true average
weight of NFL players was 260 pounds? Explain.
(d) Which would be more likely to give an estimate close to the true average weight of all NFL
players: a random sample of 50 players or a random sample of 100 players? Explain your
answer.
180
What’s in a name?
Your class just performed a modified version of the famous “John/Jenn” study* conducted by
researchers at Yale University. You will now perform calculations with the resulting data and
investigate if your findings are statistically significant.
*Moss-Racusin, C., Dovidio, J., et al. “Science faculty’s subtle gender biases favor male students.” PNAS October 9,
2012 109 (41) 16474-16479; https://doi.org/10.1073/pnas.1211286109
Surprise! The resumé you evaluated was fake. You just took part in an experiment. Each resumé the
class evaluated was completely identical except for one component: the first name of the applicant.
Half of the class received a resumé with the name “John Miller,” and the other half received one for
“Jennifer Miller.” Let’s see if the name, alone, was enough to influence your class’s ‘hireability’
estimates.
Ratings for John
Ratings for Jennifer
John mean rating:
Jennifer mean rating:
Difference (John – Jenn):
Is the difference between these mean ratings statistically significant? To decide this, we will do a
simulation with the data.
Simulation:
Step 1: We will use index cards to represent the different ratings in this experiment. On each card,
write one of the ratings listed in the above tables (for both John and Jennifer). For example, if there
were eight ratings of ‘3’ above, then eight index cards should have a ‘3’ on them.
Step 2: Shuffle the cards and deal two even piles. This represents randomly assigning the ratings to
the two name groups (assuming gender doesn’t affect the rating). The first pile of cards represents
the “John” group, and the second pile cards represents the “Jennifer” group.
Step 3: Fill in the table below with your simulated data.
John
Jennifer
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
181
1. Find the mean for each group in your simulation and subtract the means (John - Jenn).
2. Add your difference between means to the dotplot on the board. Sketch the dotplot below.
What does each dot represent?
3. What percentage of the dots are greater than or equal to the difference in means we found in
our experiment?
Interpret this percentage:
4. Do you think the difference between mean ‘hireability’ ratings we found from our experiment
is due to the name or has it occurred purely by chance? Explain.
5. In the original Yale study, applications were sent to STEM faculty at various Universities for a
lab manager position. Among the 127 faculty that were randomly assigned either Jenn/John
materials, the mean ‘hireability’ rating for John was 3.78 and for Jenn was 2.93. Is this
evidence more convincing or less convincing of gender bias than the data from our class
experiment? Explain.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
182
Inference for Experiments
Important Ideas:
Check Your Understanding:
The following is an example of A/B testing, which is a common data science practice among tech and
marketing companies. The marketing team at Yelp came up with two new formats for displaying ads
on its site - format A and format B. Before deciding on which format to use across its entire site, Yelp
wants to test which format would get a higher rate of ad clicks from users. To test this, the company
randomly assigns 50 user accounts to see format A and 50 user accounts to see format B. After an
hour, format A had 21 users click on an ad (𝑝Ė‚ =
"%
#$
!"
#$
= 42%). Format B had 19 users click on an ad (𝑝Ė‚ =
= 38%).
(1) Calculate the difference (A – B) in proportion of users who clicked on an ad.
(2) In your own words, explain what ‘statistically significant’ means. Before doing further calculations,
do you believe the difference you calculated in part (a) is statistically significant? Explain.
The following simulation was performed assuming format A and format B are equally effective: the
individual outcomes from this experiment (‘click’ or ‘not clicked’) were randomly sorted into two
groups (representing formats A and B). The proportion who ‘clicked’ were calculated in each group
and then the difference was taken between the two groups’ proportions. This was repeated 100
times. The resulting differences in proportions who clicked are displayed here:
(3) Using the results from the simulation, do you believe the difference in proportions of users who
clicked between formats A and B is statistically significant? Explain.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
183
Does SAT prep improve scores? Part 2.
Some students at your school claim that taking an SAT prep course will help improve their SAT
score. Design a study to help discover if this claim is true.
Here are six proposed studies for investigating the question of the day. Suppose we found that
the mean SAT score of students who took an SAT prep course is significantly higher than the
mean SAT score of students who didn’t take an SAT prep course. What conclusions could we
make? Can we generalize and can we determine causation?
1. Get all the students in a senior-only class to participate in a study. Ask them whether or not they took
an SAT prep course and divide them into two groups based on their answer to this question.
Conclusion:
2. Select a random sample of Seniors from your school to participate in a study. Ask them whether or not
they took an SAT prep course and divide them into two groups based on their answer to this question.
Conclusion:
3. Select a random sample of students who have taken the SAT from your school to participate in a study.
Ask them whether or not they took an SAT prep course and divide them into two groups based on their
answer to this question.
Conclusion:
4. Get all the students in a senior-only class to participate in a study. Randomly assign half of the
students to take an SAT prep course and have the remaining half not take an SAT prep class. Then have
both groups take the SAT again.
Conclusion:
5. Select a random sample of Seniors from your school to participate in a study. Randomly assign half of
the students to take an SAT prep course and have the remaining half not take an SAT prep course. Then
have both groups take the SAT again.
Conclusion:
6. Select a random sample of students who have taken the SAT from your school to participate in a study.
Randomly assign half of the students to take an SAT prep course and have the remaining half not take an
SAT prep course. Then have both groups take the SAT again.
Conclusion:
184
Scope of Inference
Important Ideas:
Check Your Understanding:
1. Zach works at the Verizon store and wonders if iPhones last longer if the screen brightness is
set to low. He selects a random sample of 20 brand new iPhones from this store and
randomly splits them into two groups of 10. For the first group of 10 iPhones, he sets the
screen brightness to low and then starts a movie. For the second group of 10 iPhones, he
sets the screen brightness to high and then starts a movie. For each iPhone, he measures
the amount of time until the battery is all the way dead. He finds that the low brightness
iPhones lasted longer, on average, than the high brightness iPhones.
(a) Was this an observational study or an experiment? Explain your reasoning.
(b) What is the explanatory variable and the response variable?
(c) Was a random sample used to collect the data?
(d) Was random assignment used to set up an experiment?
(e) What conclusion can we make from this study?
185
Does caffeine increase heart rate? The Finale.
Today we will be testing to see if caffeine increases heart rate.
1. Begin by taking your heart rate (beats per minute).
2. Mrs. Gallas took a random sample of 5 students’ heart rates. The sample mean
was 64.5 beats per minute with a margin of error of 5.6 beats. Can she expect
that the sample mean is the same as the population mean? Explain.
3. Would you be surprised to learn that the true population mean was 68 beats per
minute? Explain.
4. How could Mrs. Gallas find a sample mean that is closer to the true population?
5. Mrs. Gallas will give you a beverage to drink. Drink the entire beverage. Is this
an observational study or an experiment? Explain the difference.
6. Was blinding used for this experiment? Why is that important?
7. Describe what a confounding variable is and list a few that might affect our
results.
8. If we really just want to see if caffeine increase heart rate, why can’t we just have
one group who all drinks caffeinated coke?
186
9. Identify each of the following:
Explanatory variable:
Response variable:
Experimental units:
Treatments:
10. Describe how we could randomly assign the students in the class into two groups
(Caffeine and Caffeine-free).
11. Why do we use random assignment to create the two groups? What is its
purpose?
12. Draw an outline of a completely randomized design.
13. What are the four key principles of experimental design? Explain how your
experiment designed in #12 meets all of them.
14. Mrs. Gallas thinks that gender may play a role in the effect that caffeine has on
heart rate. How could we adjust our experimental design to account for that?
Explain.
15. Outline a randomized block design.
187
16. Could we use a matched pairs design to test the effects of caffeine? If yes,
describe the process of pairing. If no, explain why not.
17. We will be using the completely randomized design. It’s time to take your heart
rate again. What is your heart rate now?
How much did your heart rate
increase? (New – Initial)
Add your change in heart rate to the table on
the board.
Increase in heart rate
Caffeine
Caffeinefree
18. Go to stapplet.com and click One Quantitative Variable. Change the Number of
Groups to 2 and add the data for both groups. Record the mean of each and find
the difference.
Mean with Caffeine:___________
Mean with No Caffeine:__________
Difference (Caffeine – No Caffeine):____________
19. Under Perform Inference, choose “Simulate difference in two means” from the
drop down menu. Add 50 samples. Sketch the dot plot below.
20. What percent of the dots are larger than the difference in means we got from our
experiment? Is the difference statistically significant? Explain.
188
21. Can we generalize the results? Explain.
22. Can we determine cause and effect? Explain.
23. What conclusion can we make?
24. Was this experiment conducted in an ethical manner? Explain.
189
2. Researchers are investigating the effectiveness of using a fungus to control the spread of an insect that destroys
trees. The researchers will create four different concentrations of fungus mixtures: 0 milliliters per liter (ml/L),
1.25 ml/L, 2.5 ml/L, and 3.75 ml/L. An equal number of the insects will be placed into 20 individual containers.
The group of insects in each container will be sprayed with one of the four mixtures, and the researchers will
record the number of insects that are still alive in each container one week after spraying.
(a) Identify the treatments, experimental units, and response variable of the experiment.
Treatments:
Experimental units:
Response variable:
(b) Does the experiment have a control group? Explain your answer.
(c) Describe how the treatments can be randomly assigned to the experimental units so that each treatment
has the same number of units.
190
®
2014 AP STATISTICS FREE-RESPONSE QUESTIONS
4. As part of its twenty-fifth reunion celebration, the class of 1988 (students who graduated in 1988) at a state
university held a reception on campus. In an informal survey, the director of alumni development asked 50 of
the attendees about their incomes. The director computed the mean income of the 50 attendees to be $189,952.
In a news release, the director announced, “The members of our class of 1988 enjoyed resounding success.
Last year’s mean income of its members was $189,952!”
(a) What would be a statistical advantage of using the median of the reported incomes, rather than the mean,
as the estimate of the typical income?
(b) The director felt the members who attended the reception may be different from the class as a whole.
A more detailed survey of the class was planned to find a better estimate of the income as well as other
facts about the alumni. The staff developed two methods based on the available funds to carry out the
survey.
Method 1: Send out an e-mail to all 6,826 members of the class asking them to complete an online form.
The staff estimates that at least 600 members will respond.
Method 2: Select a simple random sample of members of the class and contact the selected members
directly by phone. Follow up to ensure that all responses are obtained. Because method 2 will
require more time than method 1, the staff estimates that only 100 members of the class could be
contacted using method 2.
Which of the two methods would you select for estimating the average yearly income of all 6,826 members
of the class of 1988 ? Explain your reasoning by comparing the two methods and the effect of each method
on the estimate.
191
192
Unit 4
Chapter 5
Probability
193
AP Statistics Handout: Lesson 5.1
Topics: law of large numbers, definition and properties of probability, probability models
Lesson 5.1 Guided Notes
Probability and Statistics
estimate
Probabilities
Statistics
generate
The Law of Large Numbers
Law of large numbers: After many ______________, the relative frequency of outcomes will approach
their ________________________.
Coin flips applet: http://digfir-published.macmillanusa.com/stats_applet/stats_applet_10_prob.html
-Source: Digital First project from Bedford, Freeman, & Worth publishers
In your own words, describe how this
simulation demonstrates the law of large
numbers:
The “due for one” myth
You have a fair coin. You flip it 10 times. In which situation is getting tails (T) on your next flip more
likely?
Situation 1: HTTHHTHTHT
Situation 2: HTTHHHHHHH
194
A tale of two Khris/Chris Davis’s
a) How many at-bats did Chris Davis have in a row without
recording a hit?
b) What does Chris Davis’s story tell us about the “due for one”
myth?
Chris Davis
c) Honestly, the Khris Davis batting average streak isn’t really related
to the “due for one” myth. However, because Khris Davis’s streak is
so incredible and because his name is so similar to the other Davis, I
just HAD to include it. So, there’s no question here that requires a
response. I hope you enjoyed this part of the lesson!
Khris Davis
The Simple Definition of Probability
Probability =
𝑁ð‘Ē𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜ð‘Ēð‘Ąð‘ð‘œð‘šð‘’ð‘  𝑖𝑛 𝑎𝑛 𝑒ð‘Ģð‘’ð‘›ð‘Ą
ð‘‡ð‘œð‘Ąð‘Žð‘™ 𝑛ð‘Ē𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑜𝑠𝑠𝑖𝑏𝑙𝑒 𝑜ð‘Ēð‘Ąð‘ð‘œð‘šð‘’ð‘ 
The above equation is the simple definition of probability
• Only used when all outcomes are _________________________
Example: There are 8 red marbles and 12 blue marbles in a jar. What is the probability of selecting a red
marble from the jar? Let R = the event of selecting a red marble. P(R) = probability of event R occurring.
Properties of Probabilities
1. Probabilities are always between _________ (0% - 100%).
2. The probabilities of all possible events ________________ (100%).
3. Complement rule: the probability of event A ______ happening, P(AC), is equal to 1 – P(A).
195
Probability Models
Probability models: List a set of _________________ and their probabilities.
Properties:
1. All probabilities are between 0 – 1.
2. All probabilities ____________________.
Example: I have a bag of 20 beads. They are a mix of clear/solid and red/green. There are 6 clear red
beads in the bag and 8 red solid beads. There are 2 clear green beads and 4 green solid beads.
a) Fill out the probability model
Bead Type
clear red
red solid
clear green
green solid
Probability
b) Verify that this is a valid probability model (check the probability model properties):
Let: R = Select a red bead, Q = Select a red solid bead
c) Find these probabilities, showing your work:
1. P(Q) =
2. P(R) =
3. P(QC ) =
4. P(RC ) =
Lesson 5.1 Discussion
2011 NYC “Stop and Frisk” police stops:
Race-Ethnicity
Black
Hispanic
White
Proportion of Stops
55.1%
35.2%
9.7%
Proportions calculated just among these three groups (the predominant groups in the dataset)
Data source: NYC.GOV, https://www1.nyc.gov/site/nypd/stats/reports-analysis/stopfrisk.page
Population of NYC in 2011:
Race-Ethnicity
Black
Hispanic
White
Population
2,054,101
2,373,304
2,731,173
Data source: 2011 American Community Survey (data.census.gov)
196
Discussion Question: If stops were done
completely at random, which groups would be
stopped proportionally more often? Less often?
What conclusions can you draw from this?
How good is Mrs. Gallas at free throws?
Mrs. Gallas thinks she is a pretty good free throw shooter. How many free throws would you
like to see Mrs. Gallas shoot before you could be confident guessing her free throw
percentage? We’ll watch Mrs. Gallas shoot free throws, when you are confident make a guess
at her free throw percentage.
1. As each shot is attempted, keep track of the number of made free throws and the total
number of shots attempted in the table below. When you think you know Mrs. Gallas’
true free throw percentage, stop recording the shots.
Shot #
1
2
3
4
5
10
15
20
30
40
50
60
70
Result
(Make or Miss)
Proportion of
Makes
2. What do you think Mrs. Gallas’ true free throw percentage is?
3. Sketch the graph displaying the proportion of made free throws.
4. How could you make your guess more accurate?
5. Mrs. Gallas has a ____% probability of making a free throw. Interpret this probability.
197
80
The Idea of Probability
Important ideas:
Check Your Understanding
1. Use 4 of the following probabilities to complete the middle column of the table, then give an
example of an event may have this probability. Probabilities: 0, 0.001, 0.3, 0.6, 0.99, 1
Explanation
This outcome is impossible. It can never
occur.
This outcome is certain. It will occur on every
trial.
This outcome is very unlikely, but it will occur
once in a while in a long sequence of trials.
This outcome will occur more often than not,
but doesn’t occur almost every time.
Probability
Example
2. In the Wheel of Fortune, there is a 1/9 = 11.1% probability of spinning “Bankrupt” on any given
spin. Interpret the probability.
3. In 9 consecutive spins of the Wheel of Fortune, none of the spins land on “Bankrupt”. The next
contestant worries that he is more likely to land on Bankrupt now because Bankrupt is due.
Explain why this thinking is wrong.
198
Are Soda Contests True?
Pepsi ran a promo contest for their 20 oz. bottles of soda. Some of the caps said, “Please try
again!” while others said, “You’re a winner!” Pepsi advertised the promotion with the slogan “1
in 6 wins a prize.” Mrs. Gallas’ statistics class wonders if the company’s claim is true. To find
out, all 30 students in the class go to the store, and each buys one 20-ounce bottle of the
soda. Two of the 30 students get caps that say “You’re a winner!”
1. How many winners would you expect to get out of a class of 30? Is it guaranteed?
Does this result give convincing evidence that the company’s 1-in-6 claim is inaccurate? We
will perform a simulation to help answer this question. We will assume Pepsi is telling the
truth. If they are telling the truth, what is the probability of getting 2 or fewer winners in a class
of 30 purely by chance? Let’s find out.
2. What could we use to model a 1/6 probability?
to “Losers” and “Winners”. List them below.
Assign certain outcomes
3. Roll your die 30 times to imitate the process of the students in Mrs. Gallas’ statistics
class buying their sodas. How many of them won a prize?
4. Repeat steps 1 and 2. How many won a prize this time?
5. Plot the number of prize winners for each trial of 30 to the dot plot on the board. (2 dots)
6. Sketch the class dot plot below.
7. What percent of the time did Mrs. Gallas’ statistics class get two or fewer prizes, just by
chance?
8. Does it seem plausible that the company is telling the truth but that the class just got
unlucky? Or do we have convincing evidence that Pepsi is lying?
199
Simulation
Important ideas:
Check Your Understanding
Some people have wondered if the Wheel of Fortune wheel is fair. If it is
fair, the “Bankrupt” result should occur with probability 1/9. To test the
wheel, Pat Sajak observed a random sample of 50 spins of the wheel and
found that “Bankrupt” was spun 10 times. Is this convincing evidence that
Bankrupt comes up more often than it should?
a. Describe how you would carry out a simulation to estimate the
probability that in a fair wheel, Bankrupt comes up 10 or more times.
The dotplot displays the results of 100
simulated trials of 50 spins of a fair wheel.
b. Explain what the one dot above 12 indicates.
c. What conclusion would you draw about 10 or more Bankrupt spins occurring in 50 spins?
Explain your answer.
200
5.1 PRACTICE!
The Golden Ticket
At a local high school, 95 students have permission to park on campus. Each month, the student council holds a “golden
ticket parking lottery” at a school assembly. The two lucky winners are given reserved parking spots next to the school’s
main entrance. Last month, the winning tickets were drawn by a student council member from the AP Statistics class.
When both golden tickets went to members of that same AP Stats class, some people thought the lottery had been
rigged. There are 28 students in the AP Statistics class, all of whom are eligible to park on campus. Design and carry out
a simulation using the Table of Random Digits (beginning with line 139) to decide whether it’s plausible that the lottery
was carried out fairly.
Do:
Trial #
Numbers
Selected
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
Both in AP
Stats?
Trial #
Numbers
Selected
Both in AP
Stats?
Conclude:
201
NASCAR
In an attempt to increase sales, Cheerios decides to offer a NASCAR promotion. Each box of cereal will contain a
collectible card featuring one of these NASCAR drivers: Jeff Gordon, Dale Earnhardt, Jr., Tony Stewart, Danica Patrick, or
Jimmie Johnson.
The company says that each of the 5 cards is equally likely to appear in any box of cereal. A NASCAR fan decides to keep
buying boxes of the cereal until she has all 5 drivers’ cards. She is surprised when it takes her 23 boxes to get the full set
of cards. Should she be surprised? Design and carry out a simulation using your graphing calculator to help answer this
question.
Trial #
Numbers
Selected
Total # of
Boxes
Trial #
Numbers
Selected
Total # of
Boxes
1
2
3
4
5
6
7
8
9
10
Trial #
Numbers
Selected
Total # of
Boxes
Trial #
Numbers
Selected
11
12
13
14
15
16
17
18
19
20
Total # of
Boxes
Conclude:
202
AP Statistics Handout: Lesson 5.2
Topics: unions & intersections, the addition rule in multiple formats, mutually exclusive events
Lesson 5.2 Guided Notes
Unions & Intersections
For each of the following probability expressions, take notes on what the notation means and shade the
Venn Diagram:
1. P(A):
3. P(AC):
2. P(B):
A
B
A
4. P(A∩B):
B
A
6. P((A∪B)C):
5. P(A∪B):
A
B
B
A
B
A
B
The Addition Rule in Probability Models
The OKCupid Dataset - https://github.com/wetchler/okcupid
This dataset is composed of public online dating profiles from 2012 on OKCupid. We filtered for people
who identified as age 36, male, and heterosexual, living in the San Francisco area. After filtering, we
collected data for each profile that reported their height and yearly earnings.
We’ll categorize the 192 men in the OKCupid dataset using the following convention:
Shorter than median
Taller than median
Earns less than median ($)
Earns more than median ($)
Short Low-Earner
Short High-Earner
Tall Low-Earner
Tall High-Earner
Socially de-valued
Socially valued
Estimated 2012 median yearly earnings for individual men in San Francisco county: $59,397. Male median height (U.S.): 69.2 in. (5’9”). 1. Earnings
median is an upper-bound estimate based on median earnings for workers in San Francisco county per the 2012 American Community Survey
(data.census.gov). 2. Height median is from the CDC: National Health Statistics Reports, “Mean Body Weight, Height, Waist Circumference, and Body
Mass Index Among Adults: United States, 1999–2000 Through 2015–2016.” Center for Disease Control and Prevention, 2018.
203
In the OkCupid data, we find 21 short low-earners, 22 tall low-earners, 48 short high-earners, and 101
tall high-earners.
Category
short low-earner
tall low-earner
short high-earner
tall high-earner
Probability
21
= 0.109
192
22
= 0.115
192
48
= 0.250
20
101
= 0.526
192
a) Check that this is a valid probability model:
Let: T = event of selecting a tall man
H = event of selecting a high-income earner
1. P(T) =
2. P(T∩H) =
3. P(T∪H) =
Simple Addition Rule: “Or” means ________
• The ___________ P(A∪B) brings together A, B, and A∩B. So
you add the probabilities together.
• Think unions → _______________________
A
B
The Addition Rule in Two-Way Tables
low-earner
high-earner
total
short
21
48
69
tall
22
101
123
total
43
149
192
Formal Addition Rule: P(A∪B) = P(A) + P(B) – P(A∩B)
204
1. P(T∩HC) =
2. P(H) =
3. P(T∪H) =
_______________ Addition Rule:
• “Or” means add
• Avoid _________________________
The Addition Rule in Venn Diagrams
T
H
22
101
48
low-earner
high-earner
total
short
21
48
69
tall
22
101
123
total
43
149
192
21
1. P(H) =
2. P(H∩T) =
4. P(HC) =
5. P(T∪H) =
3. P(H∩TC) =
Mutually Exclusive Events
Mutually exclusive: when events that have no __________________ (i.e. they cannot _________ occur)
A
B
For mutually exclusive events only:
P(A∪B) = P(A) + P(B)
•
Don’t have to worry about _____________________
Lesson 5.2 Discussion
Discussion Question: Do you believe many of the men on
OKCupid are lying on their profiles? Provide a statistical
explanation for your answer.
205
AP Statistics Handout: Lesson 5.3
Topics: conditional probability, independence, the multiplication rule, tree diagrams
Lesson 5.3 Guided Notes
Conditional Probability
Condition: a “____________” in a problem
• P(A|B) = Probability of A occurring given that B _____________________________
The OKCupid Dataset - https://github.com/wetchler/okcupid
This dataset is composed of public online dating profiles from 2012 on OKCupid. We filtered for people
who identified as age 36, male, and heterosexual, living in the San Francisco area. After filtering, we
collected data for each profile that reported their height and yearly earnings.
We’ll categorize the 192 men in the OKCupid dataset using the following convention:
Shorter than median
Taller than median
Earns less than median ($)
Earns more than median ($)
Short Low-Earner
Short High-Earner
Tall Low-Earner
Tall High-Earner
Socially de-valued
Socially valued
Estimated 2012 median yearly earnings for individual men in San Francisco county: $59,397. Male median height (U.S.): 69.2 in. (5’9”). 1. Earnings
median is an upper-bound estimate based on median earnings for workers in San Francisco county per the 2012 American Community Survey
(data.census.gov). 2. Height median is from the CDC: National Health Statistics Reports, “Mean Body Weight, Height, Waist Circumference, and Body
Mass Index Among Adults: United States, 1999–2000 Through 2015–2016.” Center for Disease Control and Prevention, 2018.
In the OkCupid data, we find 21 short low-earners, 22 tall low-earners, 48 short high-earners, and 101
tall high-earners. Let: T = event of selecting a tall man, H = event of selecting a high-income earner.
T
H
22
101
1. P(T) =
48
2. P(T|H) =
21
Conditional Probability Formula:
P(A|B) =
𝑃(ðī∩ðĩ)
𝑃(ðĩ)
Conditional Probability Intuition:
• “Given” means ____________ by the given
206
Conditional Probability in Two-Way Tables
low-earner
high-earner
total
short
21
48
69
tall
22
101
123
total
43
149
192
1. P(T) =
2. P(T|HC) =
Independence
Independence: Two events (A & B) are independent if knowing the outcome of one event __________
_______ affect the probability that the other event will occur.
•
P(A|B) = _______
Example: Are the events “selecting a tall man” and “selecting a man who is a low-earner” independent?
Use the two probabilities you calculated for the two-way table above to support your answer.
Answer:
When we knew the person was a low-earner (when HC was “given”), the probability of selecting a tall
person ___________. So, the events of selecting a low earner and someone who is tall are ____
___________________.
In other words, their earnings ______________________________ about their height. So, earnings and
height are not independent.
Independent vs. Mutually Exclusive
B
A
0.80
0.20
Mutually exclusive: when events that have no intersection
(i.e. they cannot both occur)
P(A) =
P(A|B) =
Mutually exclusive events are _____________________________. Knowing that one event occurs
greatly affects the probability of the other event (_____________________________).
207
Multiplication Rule
Formal Addition Rule:
For all events: P(A∩B) = P(A) × P(B|A)
_______________ Multiplication Rule:
• “And” means _______________
• Account for _________________ events
For independent events: P(A∩B) = P(A) × P(B)
Tree Diagrams
Tree diagrams: Models multiple __________________ or successive events (events that depend on each
other or that happen right after each other).
• Each ______________ represents a different event
Dream School Example:
1. You have a 0.65 probability of getting higher than average GPA and ACT scores.
2. If you have a higher than average GPA/ACT, you have a 0.83 chance of being admitted.
3. If you have a below average GPA/ACT score, you have a 0.39 chance of being admitted.
Draw the tree diagram here:
Let:
H = Event of getting higher than average ACT/GPA , A = Event of being admitted to your dream school
1. P(H∩A) =
2. P(A) =
3. P(H|A) =
208
5 Intuitive Probability Rules for a 5
1. Probabilities are between 0 – 1 (inclusive)
2. Complement rule: P(AC) = 1 – P(A)
3. “Or” means add, beware of double-counts
4. “Given” means divide by the given
5. “And” means multiply, adjust for dependence
If you depend on formulas
Lesson 5.3 Discussion
low-earner
high-earner
total
short
21
48
69
tall
22
101
total
43
149
low-earner
high-earner
total
short
48.8%
48
69
123
tall
51.2%
101
123
192
total
100%
149
192
2. P(T|HC) = 51.2%
1. P(T) = 64.1%
If we find the conditional distribution (“given” low-earner), we’re close to the 50-50 split we’d expect
around the median height!
Discussion Question: Do you believe that filtering results for just the men who reported below-median
earnings would return more honest matches? Explain your reasoning.
209
Who will win the Last Banana?
Suppose that you’re on a desert island playing dice with another castaway. The winner’s prize
will be the last banana. Here are the rules of the game:
• Each player rolls a die
• If the largest value shown is a 1, 2, 3, or 4, then Player 1 wins
• If the largest value shown is a 5 or 6 then Player 2 wins
1. Who do you think has advantage in this game: Player 1, Player 2, or neither? Make your best
guess and explain your choice.
2. Play the game 20 times with your partner and record the winner of each game by tallying in the
table below.
Player
1
2
Tally/Count of Wins
Percentage of Wins
a. How many times did Player 1 win?
Write this as a proportion.
b. How many times did Player 2 win?
Write this as a proportion.
3. Who won more often? Maybe this was only true for your group. Let’s see how the rest of the class
did. Write the number of wins for Player 1 in the table on the board.
a. Find the total proportion of wins for Player 1 for the whole class.
b. Find the total proportion of wins for Player 2 for the whole class.
4. To determine the true probability of Player 1 winning,
we should list out all possible rolls that we could get.
Complete the table below to show all possible rolls.
a. Use your table to find the probability of
Player 1 winning.
1
1
2
3
4
b. Which was closer to the probability you
found in #4a, your group data or the
classroom data? Why do you think that is?
210
5
6
1,1
2
3
4
5
6
Basic Probability Rules
Important ideas:
Check Your Understanding
Pew Research Center reported that among mothers, family size is shrinking. Suppose we are to
randomly select one mother (age 40 to 44) and record the number of children she has. Here is a
probability model.
Number of Children
Probability
1
0.22
2
0.41
3
0.24
4+
0.13
(a) Explain why this is a valid probability model.
(b) Explain why events “have 1 child” and “have 2 children” are mutually exclusive
For each of the following write the event using proper notation and find the probability:
(c) Find the probability that a randomly selected mother has less than 2 children.
(d) Find the probability that a randomly selected mother has 1 or 2 children.
(e) Find the probability that a randomly selected mother does not have 4 or more children.
211
Can You Taco Tongue and Evil Eyebrow?
Some people believe that the ability to taco tongue and evil eyebrow is something that you
are born with. Is this true? Are the two abilities somehow related?
1. Collect class data to fill in the following two-way table and Venn Diagram.
Yes
Evil Eyebrow
No
Evil Eyebrow
Total
Yes Taco Tongue
____
_
No Taco Tongue
Yes
Evil Eyebrow
Yes
Taco Tongue
Total
_____
_____
_____
2. Suppose that we randomly choose a student from class. Find the following probabilities.
P(Yes Taco Tongue) =
P(Yes Evil Eyebrow) =
P(No Taco Tongue) =
P(No Evil Eyebrow) =
P(Yes Taco Tongue AND Yes Evil Eyebrow) =
P(Yes Evil Eyebrow AND No Taco Tongue) =
P(Yes Taco Tongue AND No Evil Eyebrow) =
P(No Taco Tongue AND No Evil Eyebrow) =
3. Suppose that we randomly choose a student from class. Find the following probabilities.
P(Yes Evil Eyebrow) =
P(No Evil Eyebrow) =
P(Yes Evil Eyebrow OR No Evil Eyebrow) =
4. Suppose that we randomly choose a student from class. Find the following probabilities.
P(Yes Taco Tongue) =
P(Yes Evil Eyebrow) =
P(Yes Taco Tongue OR Yes Evil Eyebrow) =
212
Venn Diagrams, General Addition Rule
Important Ideas:
Check Your Understanding:
A random sample of high school students were surveyed regarding their toilet paper habits. Should
the toilet paper pull over the roll? Do they replace the roll when it is empty? The two-way table
displays the data. Suppose we choose a member of the sample at random. Define events O: Over
and R: Replace.
Prefer the toilet paper to pull over the roll
Yes
No
Replaces the roll
Yes
58
24
when it is empty
No
12
22
a. Explain in plain language what P(OC) means and find the probability.
b. Explain why P(O or R) ≠ P(O) + P(R). Then find P(O or R).
c. Make a Venn diagram to display the sample space of this chance process.
d. Write the event “Does not replace the roll and prefers the toilet paper pulls over the roll” in
symbolic form and find the probability.
213
5.2 PRACTICE!
Languages in Canada
Canada has two official languages, English and French. Choose a Canadian at random and ask, “What is your mother
tongue?” Here is the distribution of responses, combining many separate languages from the broad Asia/Pacific region:
(a) What probability should replace “?” in the distribution? Why?
(b) What is the probability that a Canadian’s mother tongue is not English?
(c) What is the probability that a Canadian’s mother tongue is a language other than English or French?
U.S. Senators
The two-way table below describes the members of the U.S Senate in a recent year.
(a) Who are the individuals? What variables are being measured?
(b) If we select a U.S. senator at random, what’s the probability that we choose
i.
a Democrat?
ii. a female?
iii. a female Democrat?
iv. a female or a Democrat?
214
Phone Usage
According to the National Center for Health Statistics, in December 2008, 78% of US households had a traditional
landline telephone, 80% of households had cell phones, and 60% had both. Suppose we randomly selected a household
in December 2008.
We will define events A: has a landline and B: has a cell phone.
Landline
No Landline
Total
Cell Phone No Cell Phone Total
0.60
0.18
0.78
0.20
0.02
0.22
0.80
0.20
1.00
(a) Construct a Venn diagram to represent the outcomes of this chance process.
(b) Find the probability that the household has at least one of the two types of phones.
(c) Find the probability the household has a cell phone only.
215
Can You Taco Tongue and Evil Eyebrow? Day 2
Are the events “Yes Taco Tongue” and “Yes Evil Eyebrow” independent?
1. Find class data from the previous lesson and fill in the following two-way table.
Yes
Evil Eyebrow
No
Evil Eyebrow
Total
Yes Taco Tongue
No Taco Tongue
Total
Suppose we randomly choose a student from class.
2. Find P(Yes Taco Tongue OR Yes Evil Eyebrow)
3. Given that the person selected is a Yes Evil Eyebrow, what is the probability that they are
a Yes Taco Tongue? Write as a fraction, a decimal, and a percent.
4. Given that the person selected is a No Evil Eyebrow, what is the probability that they are
a Yes Taco Tongue? Write as a fraction, a decimal, and a percent.
Definition: Two events are independent if knowing whether or not one event has occurred
does not change the probability that the other event will occur.
5. Are the events “Yes Taco Tongue” and “Yes Evil Eyebrow” independent? Explain.
216
Consider the data for all Seniors at EKHS.
Yes
Evil Eyebrow
No
Evil Eyebrow
Total
Yes Taco Tongue
180
300
480
No Taco Tongue
20
100
120
Total
200
400
600
6. Find each of the following using the data in the table. Write as a fraction, a decimal, and a
percent.
a. P(Yes Taco Tongue) =
b. P(Yes Taco Tongue | Yes Evil Eyebrow) =
c. P(Yes Taco Tongue | No Evil Eyebrow) =
d. Are “Yes Taco Tongue” and “Yes Evil Eyebrow” independent?
7. Fill in the table as if the events were INDEPENDENT.
Yes
Evil Eyebrow
No
Evil Eyebrow
Total
Yes Taco Tongue
480
No Taco Tongue
120
Total
200
400
600
8. Find each of the following using the INDEPENDENT table. Write as a fraction, a decimal,
and a percent.
a. P(Yes Taco Tongue) =
b. P(Yes Taco Tongue | Yes Evil Eyebrow) =
c. P(Yes Taco Tongue | No Evil Eyebrow) =
9. What do you notice about your answers in #6 and #8?
10. Generalize: Complete the following statement using a formula.
Let A à Yes Taco Tongue and B à Yes Evil Eyebrow
If events A and B are INDEPENDENT then…
217
Conditional Probability and Independence
Important Ideas:
Check Your Understanding:
The Pew Research center randomly selected 100 mothers age 40 to 44 in 1976, 1994, and 2014 and
asked each mother how many children they have. The two-way table summarizes the responses.
Number
of
children
1 child
2 children
3 children
4+ children
Total
1976
11
24
25
40
100
Year
1994
21
43
23
13
100
2014
22
41
24
13
100
Total
54
108
72
66
300
Suppose we randomly select one of the survey respondents. Define events C: have 4 or more
children, S: 1976, N: 1994, and F: 2014.
1. Find P(C | S). Interpret this value in context.
2. Given that the chosen mother was not surveyed in 1976, what’s the probability that she has 4 or
more children? Write your answer as a probability statement using correct symbols for the events.
3. Are the events “Surveyed in 1976” and “4 or more children” independent? Explain.
218
AP Statistics Handout: Lesson 5.4
Topics: successive independent events, “at least one,” sampling w/o replacement
Lesson 5.4 Guided Notes
Successive Independent Events
Successive: Events are successive if one happens after the other (i.e. ____________________________)
Independent: Events are independent if knowing the outcome of one event _______________________
the probability of another event.
Example 1: What is the probability of getting “heads” 5x in a row?
Since the probabilities _______________________, you don’t need a tree diagram!
Example 2: A free throw is a penalty shot in basketball. One of the NBA’s best free throw shooters is
Stephen Curry. He makes 90.6% of the free throws he attempts. Assume each free throw attempt is
independent. What is the probability he makes 6 free throws in a row?
Example 3: Stephen Curry is fouled in many 3-point situations, so he often takes 3
free throws at a time. When he takes 3 free throws, what is the probability he makes
the first two but misses the last one?
219
“At Least One” Scenarios
Example: Stephen Curry makes 90.6% of the free throws he attempts. Assume each of his free throw
attempts are independent. What is the probability he makes at least one of his next 4 free throws?
“At least one” scenarios: When asked to find the probability of “at least one” occurrence of a successive
independent event, take the ______________________ of the event that ___________ occur.
P(at least one) = ______________________
Sampling Without Replacement
Example: You have a jar with 12 blue marbles and 8 red marbles. Imagine you sample marbles without
replacement. What is the probability of drawing the following: blue, then red, then blue.
Lesson 5.4 Discussion
Testing Kobe’s “Hot Hand”
The “hot hand” theory: When a player starts to make many shots in a row, the have a “hot hand” – their
probability of making shots is higher than normal.
• Your shots are _________________________. Your
current shot probability ________________ on your
previous shots.
a) Let’s assume the “hot hand” doesn’t exist. Each shot has an
equal and independent probability of going in (44.7%). What is
the probability of Kobe making an insane 8 shots in a row?
Kobe’s 3rd quarter during his 81-point game
b) In his career, Kobe took 30,697 shots. Given this information and the probability of an 8-shot streak
(calculated above), what is the expected number of times Kobe would get an 8-shot streak by chance?
220
Can you get a pair of Aces or a pair of Kings?
Rules of the game. Five cards total: two aces and three Kings. The player chooses their
first card and records the results, and then chooses their second card (without replacement)
and records the result. The player wins if they get a pair of Aces or a pair of Kings.
1. Choose one person who is the dealer and one who is the player. Play the game 10 times.
First card
Second card
Winner?
Based on your 10 games, what is the probability of winning this game? __________
2. Go to the front of room to record the number of wins in 10 games.
Based on the whole class data, what is the probability of winning this game? __________
3. Let’s try to use a Tree Diagram to calculate the theoretical probability. Fill in the blank boxes
with the correct probabilities.
st
1 card
is Ace
2nd card
is Ace
= P(1st is Ace AND 2nd is Ace)
2nd card
is King
= P(1st is Ace AND 2nd is King)
2nd card
is Ace
= P(1st is King AND 2nd is Ace)
2nd card
is King
= P(1st is King AND 2nd is King)
Ace Game
1st card
is King
4. Find the theoretical probability of winning the game. ____________________________
5. What is the probability that the 1st card was a King, given that the person won the game?
221
Conditional Probability and Independence
Important Ideas:
Check Your Understanding:
In the 2016 election, 30 states went to the Republican candidate and 20 states went to the
Democratic candidate. Of the 30 states that went Republican, 29 were in the continental United
States. Of the 20 states that went Democratic, 19 are in the continental United States. One state is
selected at random.
a. Construct a tree diagram to model this chance process.
b. Find the probability that a randomly selected state is in the continental U.S. and went Republican.
c. If we select 4 states at random (with replacement) what is the probability that at least 1 of the
states is in the continental U.S. and went Republican?
d. Given that a randomly selected state is not in the continental U.S., what is the probability that it
went Republican?
222
5.3 PRACTICE!
The Probability of a Flush
1. A poker player holds a flush when all 5 cards in the hand belong to the same suit. Remember that a deck contains 52
cards, 4 suits (hearts, diamonds, spades and clubs) with 13 cards of each suit. When the deck is well shuffled, each card
dealt is equally likely to be any of those that remain in the deck.
(a) We will concentrate on spades. What is the probability that the first card dealt is a spade? What is the conditional
probability that the second card is a spade given that the first is a spade (partial tree diagram may help)?
(b) Continue to count the remaining cards to find the conditional probabilities of a spade on the third, the fourth, and
the fifth card given in each case that all previous cards are spades (consider continuing with your tree diagram from
above).
(c) The probability of being dealt 5 spades is the product of the five probabilities you have found. Why? What is this
probability?
(d) The probability of being dealt 5 hearts or 5 diamonds or 5 clubs is the same as the probability of being dealt 5 spades.
What is the probability of being dealt a flush?
223
2. Internet sites often vanish or move, so that references to them can’t be followed. In fact, 13% of Internet sites
referenced in major scientific journals are lost within two years after publication. If a paper contains seven Internet
references, what is the probability that at least one of them doesn’t work two years later? Show your work!
3. A recent census at Emory University revealed that 40% of its students mainly used Macintosh computers (Macs). The
rest mainly used PCs. At the time of the census, 67% of the school’s students were undergraduates. The rest were
graduate students. In the census, 23% of the respondents were graduate students who said that they used PCs as their
primary computers. Suppose we select a student at random from among those who were part of the census and learn
that the student mainly uses a Mac. Find the probability that this person is a graduate student. Show your work... (Hint:
a tree diagram may be too difficult to construct, so consider a two-way table)
4. Many employers require prospective employees to take a drug test. A positive result on this test indicates that the
prospective employee uses illegal drugs. However, not all people who test positive actually use drugs. Suppose that 4%
of prospective employees use drugs, the false positive rate is 5% and the false negative rate is 10%. What percent of
people who test positive actually use illegal drugs?
(Hint: look at the rule on p. 324 first, then study a similar example on p. 326 ... this is tough, but you can do this!!)
224
5.3 PRACTICE! #2
Picking Two Sneezers
This is a two-way table that classified 40 students according to their gender and whether they had allergies.
Allergies
No Allergies
Total
Female Male Total
10
8
18
13
9
22
23
17
40
Suppose we chose 2 students at random.
a. Draw a tree diagram or chart that shows the sample space for this chance process.
b. Find the probability that both students selected suffer from allergies.
c. Find the probability that neither student selected suffers from allergies.
d. Find the probability that at least one student selected suffers from allergies.
e. Find the probability that only one student selected suffers from allergies.
225
Media Usage and Good Grades
In January 2010, the Kaiser Family Foundation released a study about the influence of media in the lives of young people
ages 8-18 (http://www.kff.org/entmedia/mh012010pkg.cfm). In the study, 17% of the youth were classified as light
media users, 62% were classified as moderate media users and 21% were classified as heavy media users. Of the light
users who responded, 74% described their grades as good (A’s and B’s), while only 68% of the moderate users and 52%
of the heavy users described their grades as good.
a. According to this study, what percent of young people ages 8-18 described their grades as good? Use a tree
diagram or chart to calculate the probability.
b. According to the tree diagram you constructed above, what percent of students with good grades are heavy
users of media?
c. What percent of students with good grades are moderate users?
d. What percent of light users are students with bad grades?
e. What percent of moderate users are students with bad grades?
226
2017 AP® STATISTICS FREE-RESPONSE QUESTIONS
3. A grocery store purchases melons from two distributors, J and K. Distributor J provides melons from organic
farms. The distribution of the diameters of the melons from Distributor J is approximately normal with mean
133 millimeters (mm) and standard deviation 5 mm.
(a) For a melon selected at random from Distributor J, what is the probability that the melon will have a
diameter greater than 137 mm?
Distributor K provides melons from nonorganic farms. The probability is 0.8413 that a melon selected at random
from Distributor K will have a diameter greater than 137 mm. For all the melons at the grocery store, 70 percent
of the melons are provided by Distributor J and 30 percent are provided by Distributor K.
(b) For a melon selected at random from the grocery store, what is the probability that the melon will have a
diameter greater than 137 mm?
(c) Given that a melon selected at random from the grocery store has a diameter greater than 137 mm, what is
the probability that the melon will be from Distributor J?
227
®
2014 AP STATISTICS FREE-RESPONSE QUESTIONS
2. Nine sales representatives, 6 men and 3 women, at a small company wanted to attend a national convention.
There were only enough travel funds to send 3 people. The manager selected 3 people to attend and stated that
the people were selected at random. The 3 people selected were women. There were concerns that no men were
selected to attend the convention.
(a) Calculate the probability that randomly selecting 3 people from a group of 6 men and 3 women will result
in selecting 3 women.
(b) Based on your answer to part (a), is there reason to doubt the manager’s claim that the 3 people were
selected at random? Explain.
(c) An alternative to calculating the exact probability is to conduct a simulation to estimate the probability.
A proposed simulation process is described below.
Each trial in the simulation consists of rolling three fair, six-sided dice, one die for each
of the convention attendees. For each die, rolling a 1, 2, 3, or 4 represents selecting a
man; rolling a 5 or 6 represents selecting a woman. After 1,000 trials, the number of
times the dice indicate selecting 3 women is recorded.
Does the proposed process correctly simulate the random selection of 3 women from a group of 9 people
consisting of 6 men and 3 women? Explain why or why not.
228
Unit 4
Chapter 6
Random Variables
and Probability
Distributions
229
AP Statistics Handout: Lesson 6.1
Topics: discrete random variables, expected frequency, expected value
Lesson 6.1 Guided Notes
Discrete Random Variables
Roulette: Let X = winnings from a $100 bet on black
X=
+$100
-$100
18 Black Spaces
18 Red Spaces
1 Green Space
P(X)
In your own words, describe the difference between a regular variable and a random variable:
Expected Frequency
If I were to play roulette 1000 times, how many times would black be expected to win? Show your work.
Expected Value
If I were to play roulette 1000 times, what would my average winnings be per play? Show your work.
230
Expected Value: the _____________ value of a chance process, after repeating ___________________.
Use the space below to interpret the expected value of playing roulette (-$2.80):
Note: Losing $2.80 is _______ a possible outcome for ____________________ (+/- $100 are the only
outcomes). The expected values tells you the average over many trials, not the result of a single trial.
Calculating expected values: ________________ the outcomes (multiply) by their probabilities, then
add together to find the expected value.
Example 1: You play a coin toss game. Heads means you get $100, tails means you lose $100. Find the
expected value.
Example 2: You play a coin toss game. Heads means you get $300, tails means you lose $100. Find the
expected value.
Higher positive ___________ leads
to higher expected value!
Example 3: You play a coin toss game. The coin is weighted so that there is a 75% chance of tails. Heads
means you get $100, tails means you lose $100. Find the expected value.
Higher ___________________ of
negative value leads to a lower
expected value!
Example 4: You play a coin toss game. The coin is weighted so that there is a 75% chance of tails. Heads
means you get $300, tails means you lose $100. Find the expected value.
Expected values are weighted
averages: in this case, it
________________________.
231
Expected Value - Roulette
X=
+$100
-$100
P(X)
Expected loss for 1,000 plays:
Expected casino profit for 1,000 plays:
Individuals don’t have many trials, but casinos do. They profit through ___________________ expected
values.
Lesson 6.1 Discussion
Imagine the casinos didn’t take AP Stats. They come up with this game for high-rollers: You roll a 6-sided
dice. If you roll 5-6, you get $1,000,000. If you roll 1-4, you have to pay $300,000.
X=
+$1,000,000
-$300,000
Discussion Question: If you could play this game only
once, would you play? Why or why not? Mention the
expected value in your answer.
P(X)
232
How many children are in your family?
Count up the number of children in your family (including yourself). Be sure to include all
your stepbrothers/stepsisters and half-brothers/half-sisters.
Let X = the number of children. Suppose we choose someone from the class at random.
X
1
2
3
4
5
6
7
8
Probability
1. Is this a valid probability model? Explain.
2. Is 5.7167 a possible value for X? Explain.
3. Make a histogram to display information with X on the horizontal axis, and
describe its shape.
4. Describe in words what P ( X ³ 3) and then find P ( X ³ 3).
5. Describe in words what P( X > 3) and then find P( X > 3) .
6. Find the average of the X values.
7. Does this value tell us the average number of children in the families of students
in this class? If yes, explain. If no, why not?
233
Discrete Random Variables
Important ideas:
Check Your Understanding
Home Alone is a series of American Christmas family comedy films created by John Hughes.
There are 5 Home Alone movies. Below is a probability distribution of the number of Home Alone
movies watched by a very large sample of high school students.
Number of Home Alone
Movies Watched
Probability
0
1
2
3
4
5
0.15
0.42
0.32
?
0.02
0.01
a. Write the event “the student has seen 3 Home Alone movies” using probability notation.
Then find this probability.
b. Explain in words what P(X ≥ 3) means. What is this probability?
c. Make a histogram of the probability distribution. Describe its shape.
d. Calculate and interpret the expected value of X.
234
How much do you get paid?
Suppose you got a new job and each day your boss (Mrs. Gallas) draws a slip of paper
from a bag to determine your wage for the day. Let the random variable X = daily wage ($
per hour).
1. What is your wage for the day?
and complete the table below.
X
1
5
Add your data to the table on the board
7
10
15
25
Probability
2. Calculate and interpret the expected value of X.
Value
3. Recall from chapter 1 that standard
deviation tells us the typical distance from
the mean. Complete the table to calculate
the standard deviation for the probability
distribution.
Distance
from mean
(Distance
from mean)2
1
5
7
4. Interpret the standard deviation.
10
15
25
5. Mrs. Gallas decides she would rather
assign wages so that employees could get
any amount from $10 to $20 and all are
equally likely. Draw a graph to represent
this probability distribution.
Total =
SD =
6. What is the probability that an employee makes between $12 and $12.50?
235
Weighted
(Distance
from the
mean)2
Probability and Continuous Random Variables
Important ideas:
Check Your Understanding
Among those who play Minecraft, the amount of time they spend playing per day is
approximately Normally distributed with mean μ = 150 minutes and standard deviation σ = 42.7
minutes. Suppose we choose a Minecraft player at random and let Y = the amount of time they
spend playing Minecraft (in minutes).
a. What type of variable is Y, discrete or continuous? Explain.
b. Interpret the standard deviation.
c. Find "($ ≤ 90). Interpret this value.
d. Find "(45 ≤ $ ≤ 90). Interpret this value.
236
6.1 PRACTICE!
NHL Goals
In 2010, there were 1319 games played in the National Hockey League’s regular season. Imagine selecting one of these
games at random and then randomly selecting one of the two teams that played in the game. Define the random
variable X = number of goals scored by a randomly selected team in a randomly selected game. The table below gives
the probability distribution of X:
Goals:
0
1
2
3
4
5
6
7
8
9
Probability: 0.061 0.154 0.228 0.229 0.173 0.094 0.041 0.015 0.004 0.001
(a) Show that the probability distribution for X is legitimate.
(b) Make a histogram of the probability distribution. Describe what you see.
(c) What is the probability that the number of goals scored by a randomly selected team in a randomly selected game is
at least 6?
237
NHL Goals, continued.
Previously, we defined the random variable X to be the number of goals scored by a randomly selected team in a
randomly selected game. The table below gives the probability distribution of X:
Goals:
0
1
2
3
4
5
6
7
8
9
Probability: 0.061 0.154 0.228 0.229 0.173 0.094 0.041 0.015 0.004 0.001
(d): Compute the mean of the random variable X and interpret this value in context.
(e): Compute and interpret the standard deviation of the random variable X.
Weights of Three-Year-Old Females
2. The weights of three-year-old females closely follow a Normal distribution with a mean of  = 30.7 pounds and a
standard deviation of 3.6 pounds. Randomly choose one three-year-old female and call her weight X.
A. Find the probability that the randomly selected three-year-old female weighs at least 30 pounds.
B. Should the pediatrician be concerned if a 3 year old girl only weighs 25 pounds?
238
AP Statistics Handout: Lesson 6.2
Topics: variance & standard deviation, transforming & combining random variables
Lesson 6.2 Guided Notes
Variance and Standard Deviation of Random Variables
Standard Deviation vs. Variance
Standard Deviation:
______________________
from mean
∑(ð‘Ĩ𝑖 − ð‘ĨĖ… )2
20922.92
𝝈=√
=√
= √1902.08 = 43.61
𝑛−1
11
Variance:
______________________________
measure of spread
∑(ð‘Ĩ𝑖 − ð‘ĨĖ… )2 20922.92
𝝈𝟐 =
=
= 1902.08
𝑛−1
11
Mathematical Relationship
𝝈𝟐 = ð‘―ð’‚ð’“
𝝈 = √ð‘―ð’‚ð’“
Variance for random variables
Roulette: Let X = winnings from a $100 bet on black
X=
+$100
-$100
P(X)
E(X) = 𝜇𝑋 = −2.80
Find the variance and the standard deviation of X:
Var(X) = ∑(𝑋𝑖 − 𝜇𝑋 )2 𝑝𝑖
239
18 Black Spaces
18 Red Spaces
1 Green Space
“A Good Game”
Imagine the casinos didn’t take AP Stats. They come up with this game for high-rollers: You roll a 6-sided
dice. If you roll 5-6, you get $1,000,000. If you roll 1-4, you have to pay $300,000.
X=
+$1,000,000
If you could play this game only once, would you play?
Why or why not?
-$300,000
P(X)
𝜇𝑋 ≈ $132,900
𝜎𝑋 ≈ $612,826
Transforming Random Variables
Transforming Roulette: Let X - 5 = winnings from a $100 bet on black, with $5 fee to play.
X-5=
P(X)
18
37
= .486
19
37
= .514
18 Black Spaces
18 Red Spaces
1 Green Space
a) Find the expected value for playing this game.
E(X ± constant) = _____________________
b) Find the variance and standard deviation for this game.
Adding/subtracting all outcomes by a constant ______________________________ the variation
between outcomes!
When adding or subtracting random variables by a constant…
1. Center: adds/subtracts by that constant amount.
2. Spread: remains the ___________.
3. Shape: remains the ___________.
240
Multiplying a Random Variable by a Constant
E(2X) = 2𝜇𝑋
Standard Deivation(2X) = 2𝜎𝑋
Var(2X) =
When multiplying or dividing random variables by a constant…
1. µ: multiplies/divides by that constant.
2. 𝝈: multiplies/divides by that constant.
𝝈𝟐 : multiplies/divides by that constant ___________________.
3. Shape: remains the ____________.
Combining Random Variables
Adding Random Variables
Usually, people who play in the casinos play more than just roulette. Say that you play the slots and you
put in $20. The slots are programmed so that you have a 20% chance of doubling your money and an
80% chance of losing your money. Here’s is the probability model for this random variable:
Y=
+$20
-$20
P(Y)
0.20
0.80
Y = winnings from slot machine.
Find the expected value and standard deviation of playing both roulette ($100 bet) and the slot machine
($20 bet). Let Z = X+ Y (winnings/losses from both games).
What if we just added the standard deviations instead of the variances?
• You __________________ standard deviations! You must use variances and square root at end.
241
Subtracting Random Variables
Let X be a random variable with 𝜇𝑋 = 2 and 𝜎𝑋 2 = 4. Let Y be a random variable with 𝜇𝑌 = 3 and 𝜎𝑌 2 = 25
Let Z = X-Y.
a) What is 𝛍𝐙 ?
When subtracting random variables, ___________________________!
b) What is 𝝈𝒁 ?
When subtracting random variables, ____________________________________!
Lesson 6.2 Discussion
You are at the casino and decide to ignore your stats teacher and play high-stakes roulette. You have
one of two options:
1. Place a $200,000 bet on red for one turn
2. Place a $100,000 bet on red for each of two separate turns
Discussion Question: Which strategy would you take? Does it matter? Justify your reasoning.
242
Is it time for a raise?
Mrs. Gallas’ employees have been working very hard and it’s time she gives them a raise.
She is trying to decide if she should give everyone a $10 raise (add $10 per hour) or
double everyone’s wage (multiply by 2).
1. Copy the data collected from yesterday’s lesson below.
X
1
5
7
10
15
25
Probability
Mean:
Standard Deviation:
2. To make a decision about what raise should be given, complete the tables below and
calculate the new mean and standard deviation using an applet or your calculator.
a. Option 1: Add $10 per hour to all employees
Xà Old Wage
1
5
7
10
15
25
Y à New Wage
Probability
Mean:
Standard Deviation:
How did adding a constant affect the mean and standard deviation?
b. Option 2: Double the original wage of all employees
X à Old Wage
1
5
7
10
15
Z à New Wage
Probability
Mean:
Standard Deviation:
How did multiplying by a constant affect the mean and standard deviation?
3. Which option would you prefer? Why?
243
25
Transforming Probability Distributions
Important ideas:
Check Your Understanding
Let X = the number billionaires in a randomly selected state. Based on current records, the
probability distribution of X is as follows:
Number of
0
1
2
3
4
5
6
8
9
10 11 12
13 17 56 58 118 165
Billionaires
Probability 0.10 0.24 0.06 0.06 0.04 0.08 0.04 0.04 0.06 0.08 0.02 0.02 0.02 0.06 0.02 0.02 0.02 0.02
The random variable X has mean 𝜇! = 12.68 and standard deviation 𝜎! = 29.02. Suppose a law
is passed requiring each billionaire to pay $1,000,000 to their state. Let Y = the money received
by a randomly selected state.
1. Consider the graph of the probability distribution of X and a separate graph of the
probability distribution of Y. How would their shapes compare?
2. Find the mean of Y.
3. Calculate and interpret the standard deviation of Y.
4. Each state agrees to invest $500,000 to improve roads. Therefore, the net amount (N) of
money a randomly selected state has after the billionaire tax is N = Y – $500,000.
Describe the shape, mean, and standard deviation of the probability distribution of N.
244
How much will you make next year?
After much thought Mrs. Gallas has finally decided on permanent employee wages which
are randomly assigned using the probability distribution X given below. Additionally, at the
end of every year she gives her employees an hourly raise. The bonuses are assigned
randomly according to the probability distribution Y given below. Assume X and Y are
independent.
1. Find the mean, variance and standard deviation of the probability distribution of X, the
hourly wages.
9
12
15
X
Probability
Mean:
0.30
0.45
Variance:
0.25
Standard Deviation:
2. Find the mean, variance and standard deviation of the probability distribution of Y, the
annual hourly raise.
Y
Probability
Mean:
$1
$3
0.70
0.30
Variance:
Standard Deviation:
3. Let N = the new hourly wage for the upcoming year (X + Y).
a. What are all the possible new hourly wages for the new year?
b. What is the probability of an employee being assigned a $9 wage AND a $1 raise?
Show your work.
c. Complete the table below for the probability distribution of N = X + Y and find the mean
and standard deviation.
N
Probability
Mean:
Variance:
Standard Deviation:
d. If N = X + Y, complete the following in terms of X and Y:
𝜇! =
𝜎! =
245
Combining Probability Distributions
Important ideas:
Check Your Understanding
Mrs. Chauvet recently had twins. Let X = the number of diaper changes per day for Alyse and Y =
the number of diaper changes per day for Jocelyn. Based on a few weeks of careful records, the
probability distributions of X and Y are as follows:
Number of diapers
changed xi
Probability pi
Mean: µ X = 4.75
3
4
5
0.05 0.25 0.60
SD: s X = 0.698
6
0.10
Number of diapers
changed yi
Probability pi
Mean: µY = 4.9
3
4
5
6
0.05 0.20 0.55 0.20
SD: s Y = 0.768
Define T = X + Y. Assume that X and Y are independent.
a. Find and interpret 𝜇 " .
b. Calculate and interpret 𝜎" .
c. Alyse wears Diaper size 1, which cost $0.238 per diaper and Jocelyn wears Diaper size 2
which cost $0.2975 per diaper. Find the mean and standard deviation of Mrs. Chauvet’s total
diaper cost per day.
246
6.2 PRACTICE!
Study Habits
The academic motivation and study habits of female students as a group are better than those of males. The Survey of
Study Habits and Attitudes (SSHA) is a psychological test that measures these factors. The distribution of SSHA scores
among the women at a college has mean 120 and standard deviation 28, and the distribution of scores among male
students has mean 105 and standard deviation 35. You select a single male student and a single female student at
random and give them the SSHA test.
(a) Explain why it is reasonable to assume that the scores of the two students are independent.
(b) What are the expected value and standard deviation of the difference (female minus male) between their scores?
(c) From the information given, can you find the probability that the woman chosen scores higher than the man? If so,
find this probability. If not, explain why you cannot.
247
Swim Team
Alonzo & Tracy Mourning Sr. High School has one of the best women’s swimming team in the region. The 400-meter
freestyle relay team is undefeated this year. In the 400-meter freestyle relay, each swimmer swims 100 meters. The
times, in seconds, for the four swimmers this season are approximately Normally distributed with means and standard
deviations as shown. Assuming that the swimmer’s individual times are independent, find the probability that the total
team time in the 400-meter freestyle relay is less than 220 seconds. Show your work!
248
AP Statistics Handout: Lesson 5.7
Topics: geometric and binomial random variables
Lesson 5.7 Guided Notes
Free Throw Accuracy
49.7%
Free Throw Accuracy
61.3%
Wilt Chamberlain: traditional
(all seasons except ’61-’62)
Wilt Chamberlain: granny-style
(1961-1962 season)
Today’s Analysis: Why did players stop shooting “granny-style” free throws?
The Geometric Distribution
Story of Geometric:
• ________________ trials (success/fail → make/miss)
• Looking for ______________________
• Each trial is _____________________
When Wilt Chamberlain shot “granny-style,” he made 61.3% of his free-throws. Say that Wilt shoots
granny-style until he makes one free-throw. Assuming each shot is independent, what is the probability
that his first make comes on his 5th shot? Show calculations here:
249
Geometric PDF formula: What is the probability that first success in on xth attempt? (let p = probability
of success)
𝑋~𝐚𝑒𝑜𝑚(𝑝)
𝑃(𝑋 = ð‘Ĩ) = (1 − 𝑝)
Called the “probability density function” or _______
ð‘Ĩ−1
(𝑝)
PDF vs. CDF
• PDF situation: What is the probability that Wilt’s first made shot is on his 3rd attempt?
“Probability” means
_____________ probability
•
Cumulative Distribution Function (CDF) situation: What is the probability that his first made
shot is on his 3rd attempt or earlier?
“Cumulative” means
_____________________
Using the calculator: Find the probability that his first made shot is on the 100th attempt or earlier.
2nd → DISTR → E: GeometPDF
2nd → DISTR → F: GeometCDF
The Binomial Distribution
When Wilt Chamberlain shot “granny-style,” he made 61.3% of his free-throws. Say that Wilt shoots 8
consecutive granny-style free-throws. Assuming each shot is independent, what is the probability that
he makes exactly 3 of them?
Story of Binomial:
• __________________ trials (success/fail → make/miss)
• Each trial is ____________________________
• ______________ number of trials (8)
• ____________ # of successes (3)
250
Binomial PDF formula: What is the probability of making x out of n shots? (let p = probability of success)
𝑋~ðĩ𝑖𝑛𝑜𝑚(𝑛, 𝑝)
𝑛
𝑃(𝑋 = ð‘Ĩ) = ( ) 𝑝 ð‘Ĩ (1 − 𝑝)𝑛−ð‘Ĩ
ð‘Ĩ
a) What is the probability of making 3 out of 8 shots?
b) What is the probability of making at most 3 out of 8 shots?
2nd → DISTR → A: BinomPDF
2nd → DISTR → B: BinomCDF
c) What is the probability of making at least 4 out of 8 shots?
Lesson 5.7 Discussion
Discussion Question: Why do you believe shorter players tend to be better at free throws? Give a
statistical (not physics-based) reason for your answer.
251
Is it smart to foul at the end of the game?
In the 2005 Conference USA basketball tournament, Memphis trailed Louisville by two
points. At the buzzer, Memphis’s Darius Washington attempted a 3-pointer; he missed but
was fouled, and went to the line for three free throws. Each made free throw is worth 1
point. Was it smart to foul?
1. What are all the possible ways the shots could fall (e.g. make-miss-miss, etc.)?
2. Darius Washington was a 72% free-throw shooter. Find the probability that
Memphis will win, lose or go to overtime. When you have found the probabilities
put them in the table in #3.
3. Prior to watching each shot, calculate the probability that Memphis wins the
game in regulation, loses the game in regulation, or sends the game into
overtime.
4. Washington is a 40% 3-point shooter. Do you think Louisville was smart to foul?
Why or why not?
252
Binomial Random Variables
Important ideas:
Check Your Understanding
1. For each of the following situations, determine whether or not the given random variable has
a binomial distribution. Justify your answer.
a. You play a game of Whack-a-Mole. From playing this game in the past, you know that
you have an 80% probability of whacking the mole before it drops back into its hole. The
moles pop up randomly and your ability to whack any particular mole is not affected by
whether or not you whacked the previous mole. There are 20 moles to be whacked
during one round of the game. Let X = the number of moles you are able to whack.
b. Next you play Skee Ball. You know that you have a 10% probability of getting any
given ball in the 10,000-point hole. Let Y = the number of balls you must roll until you
get one in the 10,000-point hole.
2. Now you play a game called Tsunami Duck Pond. In Tsunami Duck Pond there are 100
ducks that get pummeled by tidal waves. You have to reach your hand into the tsunami
and select a duck. If there is a star on the bottom of the duck, you win. The game
claims to have 20 ducks with stars among the 100 ducks. After each round you must
place the duck back in the tumultuous water. Let W = the number of times you win if you
play this game 10 times.
a. Explain why W is a binomial random variable.
b. Find the probability that you win exactly 3 times.
253
Lesson 6.3: Day 2: Will the EKHS girls’ soccer team win?
When the time runs out in a soccer game and the score is tied, the game will go to a
shootout. Each team gets to choose 5 players to kick penalty kicks. Whichever team
makes the most penalty kicks wins. If the SAHS girls’ soccer team makes 60% of
their penalty kicks, what are the chances they will win the game?
1. Is this a binomial setting? Explain.
2. Fill in the table below showing the probability of making X penalty kicks.
Goals (X)
0
1
2
3
4
5
Probability
3. Find and interpret the mean of the probability distribution. Show your work.
4. Find and interpret the standard deviation of the distribution.
5. What is the probability that the team scores at least one goal?
6. If the other team is expected to make 3 goals, what is the probability that the SAHS
girls’ team wins?
254
Lesson 6.3 Day 2– Describing Binomial Distributions
Important ideas:
Check Your Understanding
Mrs. Mason’s class is very difficult. It’s so hard that when she gave a pop quiz recently, the
students just guessed on every question! Each student in the class guesses an answer from A
through E on each of the 10 multiple-choice questions. Emily is one of the students in this
class. Let Y = the number of questions that Emily answers correctly.
1. Does this setting represent a binomial distribution? Explain.
2. Use technology to make a probability distribution of Y. Describe its shape if it was a histogram.
3. Calculate and interpret the mean of Y.
4. Calculate and interpret the standard deviation of Y.
255
Where are all the Orange Skittles?
Mrs. Mason loves orange Skittles, but the last time she got a fun size bag there were no
orange Skittles! The bag was manufactured at the Skittles factory where 10,000 Skittles
are made each hour. 20% of the Skittles are orange. To make a fun size bag, 10 Skittles
are chosen for each bag. What are the chances of getting a fun size bag with no orange
Skittles?
1. Is this a binomial setting? Explain.
2. Find the probability of getting 0 orange Skittles in a group of 10 from the factory. Write
the binomial formula from the formula sheet.
3. Find the probability of getting at most 5 orange Skittles in a group of 10 from the factory.
To ensure that she gets more orange Skittles, Mrs. Mason buys a jumbo bag of Skittles
which contains 900 Skittles. Let X = number of orange Skittles in a bag of 900 Skittles. Use
a binomial distribution to model the situation.
4. Find the mean and standard deviation of X.
5. What is the probability of getting at most 150 orange Skittles?
6. If we were to make a histogram of X, what do you think the shape would be?
256
Binomial Distributions
Big ideas:
Check Your Understanding
In a survey of 500 U.S. teenagers aged 14 to 18, subjects were asked a variety of questions
about personal finance. One question asked whether teens had a debit card. Suppose that
exactly 12% of teens aged 14 to 18 have debit cards. Let X = the number of teens in a random
sample of size 500 who have a debit card.
1. Is this binomial? Estimate the probability that 50 or fewer teens in the sample have debit cards.
2. Find and interpret in context: P(x>100).
3. Find and interpret in context: p(60<x<100):
3. Find and interpret the mean and standard deviation::
257
6.3 PRACTICE!
1. Patients receiving artificial knees often experience pain after surgery. The pain is measured on a subjective scale with
possible values of 1 to 5. Assume that X is a random variable representing the pain score for a randomly elected patient.
The following table gives part of the probability distribution for X.
(a) Find P(X = 5).
(b) Find the probability that the pain score is less than 3.
(c) Find the probability that the pain score is greater than 3.
(d) Find the mean μ for this distribution.
2. Amarillo Slim, a professional dart player, has an 80% chance of hitting the bull’s-eye on a dartboard with any throw.
Suppose that he throws 10 darts, one at a time, at the dartboard.
(a) Find the probability that Slim hits the bull’s-eye exactly six times.
(b) Find the probability that he hits the bull’s-eye at least four times.
(c) Compute the expected number of bull’s-eyes in 10 throws.
We will add more to this problem set LATER :)
258
(d) Find the probability that Slim’s first bull’s-eye occurs on the fourth throw.
(e) Find the probability that it takes Amarillo more than 2 throws to hit the bullseye.
3. Harlan comes to class one day, totally unprepared for a pop quiz consisting of ten multiple-choice questions. Each
question has five answer choices, and Harlan answers each question randomly.
(a) Find the probability that Harlan’s gets more than 5 questions right out of 10.
(b) Find the probability that Harlan’s first correct answer occurs after the fourth question.
(c) Find the expected number of questions required for Harlan to get his first correct answer.
(d) Find the probability that Harlan guesses more answers correctly than would be expected by chance.
259
6.3 PRACTICE! #2
For each problem, be sure that the situation fits the criteria for binomial distributions. If so, answer the questions (show
the formula) and then find the mean and standard deviation of the distribution.
1) 80% of the graduates of Northeast High who apply to Penn State University are admitted. Last year, there were 6
graduates from Northeast who applied to Penn State. What is the probability that
a) 4 were admitted
b) more than 4 were admitted
Mean of distribution: ____________
Standard deviation of distribution: ____________
2) Tires from the Apex Tire Corp. are traditionally 5% defective. A truck carries 10 tires, 8 in use and 2 spares. If 10 tires
are chosen from Apex, what is the probability that not more than two defective tires are chosen.
Mean of distribution: ____________
Standard deviation of distribution: ____________
3) Studies indicate that in 70% of the families of Blue Bell, both the husband and wife work. If 7 families are randomly
selected from Blue Bell, what is the probability that
a) exactly 4 of them work.
b) more than 4 work
Mean of distribution: ____________
Standard deviation of distribution: ____________
260
4) According to the National Institute of Health, 32% of all women will suffer at least one hip fracture because of
osteoporosis by the age of 90. If 10 women aged 90 are selected at random, find the probability that
a) 2 or more of them suffer/will suffer at least one hip fracture
b) none of them suffer/will suffer a hip fracture
Mean of distribution: ____________
Standard deviation of distribution: ____________
5) According to FBI statistics, only 52% of all rape cases are reported to the police. If 10 rape cases are randomly
selected, what is the probability that at least one is reported to the police?
Mean of distribution: ____________
6) In a school, typically only
1
of
10
Standard deviation of distribution: ____________
the student body returns surveys. 20 students are chosen randomly to receive a
survey. What is the probability that
a) they get no surveys back.
b) they get more 4 or more surveys back.
Mean of distribution: ____________
Standard deviation of distribution: ____________
7) The probability that a driver making a gas purchase will pay by credit card is 0.60. If 50 cars pull up to the station to
buy gas, what is the probability that at least half of the drivers will pay by credit card?
Mean of distribution: ____________
Standard deviation of distribution: ____________
261
262
Strategy:
Example problem where this
strategy would be useful
PROBABILITY STRATEGIES
What this strategy looks like:
Mutually Exclusive versus Independent
Definition: MUTUALLY EXCLUSIVE EVENTS (Disjoint):
Definition: INDEPENDENT EVENTS:
Create an example of two events A and B where:
1. Events A and B are MUTUALLY EXCLUSIVE (Disjoint)
2. Events A and B are INDEPENDENT.
3. Events A and B are MUTUALLY EXCLUSIVE and INDEPENDENT.
4. Events A and B are MUTUALLY EXCLUSIVE and DEPENDENT.
5. Events A and B are NOT MUTUALLY EXCLUSIVE and INDEPENDENT.
6. Events A and B are NOT MUTUALLY EXCLUSIVE and DEPENDENT.
263
How Many Bottle Flips to Go Viral?
Scene: The senior talent show is scheduled to take place this afternoon. Michael Senatore is in
stats class practicing his act - bottle flipping. He tells his friends that the probability of successfully
landing the water bottle right side up is 20%. Assume that each water bottle flip is independent.
1. If he flips the bottle ten times, how many do you expect to land right side up? Justify.
2. What is the probability that he makes exactly two flips?
3. What is the probability that he makes less than two flips?
4. What is the probability that he makes at most two flips?
After the first ten flips, Michael wants to do ten more, but his teacher is getting annoyed with all the noise
and tells him he must stop flipping once the bottle lands right side up.
5. Find the probability that he lands it on the first try.
6. Find the probability that he fails on the first try and lands it on the second try.
7. Find the probability that he fails the first two and lands it on the third try.
8. Find the probability that he fails the first three and lands it on the fourth try.
Let X = the number of flips it takes for Michael to land the bottle right side it up. Fill in the table below.
X
1
2
3
4
5
6
P(X)
9. Find the probability that he lands it on the tenth try. Write a generic rule for finding the probability
that he first lands the bottle on the kth flip.
10. How many flips do you expect it to take for the bottle to land right side up? Why?
264
The Geometric Distribution
Important ideas:
Check Your Understanding
Mason never has a pencil when test day rolls around. Because the classmates are tired of having to
supply pencils for Mason, only 15% of students will give Mason a pencil when asked. Today is test
day and Mason begins asking randomly selected students for a pencil. Let Y = the number of
students Mason asks until he finds someone who will give him a pencil.
a. Describe this probability distribution. Be sure to check the appropriate conditions.
b. What is the probability that the third person asked is the first person who gives him a pencil?
c. What is the probability that Mason gets a pencil by the third person he would ask?
d. How many people should Mason expect to ask before getting a pencil?
e. Should Mason be surprised if Mason didn’t receive a pencil until he asked at least 10 people?
Calculate a probability to justify your answer.
f. Find the standard deviation for the distribution of the number of people he asks for a pencil before
finding someone who will give him one.
Thank you to AP Stats teachers Kelly Pendleton and Alana Braland for this lesson
265
3. A medical researcher surveyed a large group of men and women about whether they take medicine as prescribed.
The responses were categorized as never, sometimes, or always. The relative frequency of each category is
shown in the table.
Men
Women
Total
Never
0.0564
0.0636
0.1200
Sometimes
0.2016
0.1384
0.3400
Always
0.2120
0.3280
0.5400
Total
0.4700
0.5300
1.0000
(a) One person from those surveyed will be selected at random.
(i) What is the probability that the person selected will be someone whose response is never and who is a
woman?
(ii) What is the probability that the person selected will be someone whose response is never or who is a
woman?
(iii) What is the probability that the person selected will be someone whose response is never given that the
person is a woman?
(b) For the people surveyed, are the events of being a person whose response is never and being a woman
independent? Justify your answer.
(c) Assume that, in a large population, the probability that a person will always take medicine as prescribed
is 0.54. If 5 people are selected at random from the population, what is the probability that at least 4 of
the people selected will always take medicine as prescribed? Support your answer.
-9266
GO ON TO THE NEXT PAGE.
5. A company that manufactures smartphones developed a new battery that has a longer life span than that of a
traditional battery. From the date of purchase of a smartphone, the distribution of the life span of the new battery
is approximately normal with mean 30 months and standard deviation 8 months. For the price of $50, the
company offers a two-year warranty on the new battery for customers who purchase a smartphone. The warranty
guarantees that the smartphone will be replaced at no cost to the customer if the battery no longer works within
24 months from the date of purchase.
(a) In how many months from the date of purchase is it expected that 25 percent of the batteries will no longer
work? Justify your answer.
(b) Suppose one customer who purchases the warranty is selected at random. What is the probability that the
customer selected will require a replacement within 24 months from the date of purchase because the battery
no longer works?
(c) The company has a gain of $50 for each customer who purchases a warranty but does not require
a replacement. The company has a loss (negative gain) of $150 for each customer who purchases a warranty
and does require a replacement. What is the expected value of the gain for the company for each warranty
purchased?
-11267
GO ON TO THE NEXT PAGE.
268
Unit 5
Chapter 7
Collecting Data
269
AP Statistics Handout: Lesson 7.1
Topics: parameters vs. statistics, sampling distributions, accuracy vs. precision
Lesson 7.1 Guided Notes
Parameters vs. Statistics
Inference: ________________________ a characteristic of a population using sample statistics.
Population
Sample
Sample: a ____________________ of a larger population.
A ______________________ is a number that describes some characteristic of the population.
A ____________________ is a number that describes some characteristic of a sample.
Example: The Business Owner Problem
A business owner wants to know the average household income in her business’s zip code. She
randomly samples 80 households in the zip code and finds their mean income to be $46,144.
Population:
Parameter:
Sample:
Statistic:
Sampling Distributions
Sampling distribution: the ________________________________ from all possible samples (of the same
size) from a population.
•
Sampling distributions are made of statistics (means, proportions, Q3’s, etc.), ___________
_____________________.
270
____________________
Data
(Data – incomes)
Data
____________________
(Data – incomes)
Means
āīĨ
𝒙
āīĨ𝒙
āīĨ
𝒙
āīĨ𝒙
āīĨ
𝒙
āīĨ𝒙
āīĨ𝒙
āīĨ𝒙
āīĨ
𝒙
āīĨ
𝒙āīĨ
𝒙āīĨ
𝒙āīĨ
𝒙
āīĨ𝒙
āīĨ𝒙
āīĨ𝒙
āīĨ𝒙
āīĨ𝒙
āīĨ
𝒙
____________________
(Statistics – means of incomes)
Accuracy (bias) vs. Precision (variation)
How do you avoid bias?
(how do you make it accurate?)
Sample __________________
How do you reduce variability? (how do
you make it more precise?)
_____________________ sample size (n)
Sample size: n = 5
Sample size: n = 25
_______________
(centered at true mean)
____________
(more precise)
Extension question: Why do higher
sample sizes lead to more precise
estimates?
271
Lesson 7.1 Discussion
The German Tank Problem
Let’s say German tanks with the following July serial numbers were captured
during combat. Assume it’s a random sample of the tanks produced:
038, 072, 121, 158, 206
a) Identify the following:
Population:
Parameter:
Sample:
Statistic:
b) Discussion Question: Calculate a statistic, using this sample, to estimate the total number of tanks
produced. Report your result and justify your process.
272
What was the average for the Chapter 6 test?
How did the Chapter 6 test go? Today, we will be taking a sample from a population. We
will use the average from the sample to estimate the average for the population.
Let’s start with a very simple example. My 5th hour is very small. There were only 4 students
who took the chapter 6 test. Their scores were: 60 70 80 90.
1. Make a dotplot of the population distribution.
2. Take a sample of any 2 of the scores. Find the mean of your sample.
3. Figure out all of the possible samples of size 2. Calculate a sample mean for each
sample of 2.
4. Make a dotplot using each of the means you found in #3.
5. What is the mean of the population? Label this on the dotplot above.
273
What is a Sampling Distribution? Day 1
Important ideas:
Check Your Understanding
The James family has five children: Jocelyn (age 8), Alyse (age 8), Michael (age 14), Erica (age
16), and Sarah (age 18).
a. Complete the table by listing the 10 possible samples of size n = 2 from this population and
calculate the sample mean age for each sample. The first column is completed for you.
Sample
J, A
n=2
8
x
b. Create a sampling distribution of the sample mean age of samples of size 2.
c. What is the mean of the sampling distribution of the sample mean? What is the mean of the
population?
d. Is the sample mean an unbiased estimator of the population mean? Justify your answer.
e. Suppose we had taken samples of size 3 instead of size 2. Would the variability of the sampling
distribution of the sample mean be larger, smaller, or about the same? Justify your answer.
274
What was the real average for the Chapter 6 test?
How did the Chapter 6 test go? Today, we will be taking a sample from a population. We
will use the average from the sample to estimate the average for the population.
Yesterday we looked at a very small class of students as the population. In reality there were
many students who took the test. Take a random sample of 5 students and record their
scores. Then find the mean. Repeat this for a total of 4 times.
Scores:______________ Mean:______
Scores:______________ Mean:______
Scores:______________ Mean:______
Scores:______________ Mean:______
1. Write each mean on a different sticker and put the stickers in the appropriate location
on the poster at the front of the room. Copy down the dotplot that is created on the
poster.
2. What does each dot on the poster represent?
3. What do you think the true Chapter 6 test average is?
4. A sampling distribution shows the means calculated from all of the possible samples
of size 5 from the population. Is the above dotplot a sampling distribution? Explain.
5. We took a random sample of 5 test scores at Rockford high school and got a mean of
68. Is this convincing evidence that Rockford students did worse than students at our
school or is it possible the Rockford has the same average?
275
What is a Sampling Distribution? Day 2
Important ideas:
Check Your Understanding
Pennies made prior to 1982 were made of 95% copper. Because of their copper content, these pennies are
worth about $0.023 each. Pennies made after 1982 are only 2.5% copper. Jenna reads online that 13.2% of
pennies in circulation are pre-1982 copper pennies. Jenna has a large container of pennies at home. She
selects a random sample of 50 pennies from the container and finds that 11 are pre-1982 copper pennies.
Does this provide convincing evidence that the proportion of pennies in her container that are pre-1982
copper pennies is greater than 0.132?
1. Identify the population, parameter, sample and statistic.
Population:
Sample:
Parameter:
Statistic:
2. Does Jenna have some evidence that more than 13.2% of her pennies are pre-1982 copper pennies?
3. Provide two explanations for the evidence described in #2.
We used technology to simulate selecting 100 SRSs of size
n = 50 from a population of pennies in which 13.2% are pre1982 copper pennies. The dotplot shows 𝑝Ė‚ = the sample
proportion of copper pennies for each of the 100 samples.
4. There is one dot on the graph at 0.22 (or 22%). Explain
what this dot represents.
5. Assuming that 13.2% of pennies in circulation are pre-1982 copper pennies, is it surprising to randomly
select 50 pennies for which 𝑝Ė‚ = 11/50 = 22% or greater? Justify your answer.
6. Based on your previous answers, is there convincing evidence that more than 13.2% of pennies in
Jenna’s container are pre-1982 copper pennies? Explain your reasoning.
276
7.1 PRACTICE!
Tall Girls
According to the National Center for Health Statistics, the distribution of heights for 16-year-old females is modeled well
by a Normal density curve with mean μ = 64 inches and standard deviation σ = 2.5 inches. To see if this distribution
applies at their high school, an AP Statistics class takes an SRS of 20 of the 300 16-year-old females at the school and
measures their heights. What values of the sample mean X would be consistent with the population distribution being
N(64, 2.5)? To find out, we used Fathom software to simulate choosing 250 SRSs of size n = 20 students from a
population that is N(64, 2.5). The figure below is a dotplot of the sample mean height X of the students in the sample.
(a) Is this the sampling distribution of X? Justify your answer.
(b) Describe the distribution. Are there any obvious outliers?
(c) Suppose that the average height of the 20 girls in the class’s actual sample is X = 64.7. What would you conclude
about the population mean height μ for the 16-year-old females at the school? Explain.
277
Choosing Cards
We used Fathom (statistical computer software) to simulate choosing 500 SRSs of size 5 from the deck of cards
described in the Alternate Activity on the previous page. The graph below shows the distribution of the sample
median for these 500 samples.
(a) Is this the sampling distribution of the sample median? Justify your answer.
(b) Suppose that another student prepared a different deck of cards and claimed that it was exactly the same as the one
used in the activity. However, when you took an SRS of size 5, the median was 4. Does this provide convincing evidence
that the student’s deck is different?
278
AP Statistics Handout: Lesson 7.2
Topics: p vs. 𝑝Ė‚, sampling distribution for a proportion, spread and sample size
Lesson 7.2 Guided Notes
Why you care about proportions
What is a proportion?
Proportion: the _________ of a binary occurrence
• Written as fraction, decimal, or %
p vs. 𝑝Ė‚
In your own words, describe why it’s difficult to obtain unbiased estimates about immigration in the
United States:
The EU Migration Debate and Gender of Immigrants
Vital International
AP Photo/Michael Svarnias
Sympathy for Refugees
Depictions of migrants in the EU
range from sympathetic portrayals
of families escaping violence to
fear-inducing portrayals of groups
of single adult men illegally
crossing borders.
Fear for Jobs & Terrorism
“As you can see from this picture, most of the people coming are young males and,
yes, they may be coming from countries that are not in a very happy state, they may
be coming from places that are poorer than us, but the EU has made a fundamental
error that risks the security of everybody.”
– Nigel Farage, 2016 (U.K.)
Speaking about EU migration: “I said to myself, ‘Wow. They’re all men.’ You look at
it. There are so few women and there are so few children. And not only are they
men, they’re young men. And they’re strong as can be—they’re tough looking
cookies. I say, what’s going on here?”
– Trump, during 2015 campaign speech
279
The EU Migration Study – Spain
Trump/Farage statistical claim: 60% of migrants are male
To test this claim, we will use data from Spain, which has the best records on EU migrants (both
documented and undocumented migrants). According to a report* in Comparative Migration Studies,
Spain has a “nation-wide rolling local population register, which provides a comprehensive register of
the population, also including unauthorized immigrants.”
• The Immigrant Citizens Survey** conducted a random sample of this database, surveying recent
immigrants to Spain who were 15 and older.
𝑝 = ___________________ proportion
• Parameter
𝑝Ė‚ = _________________ proportion
• Statistic
• Estimator of p
The Immigrant Citizens Survey conducted a random sample of 15+ year-old recent migrants in Spain’s
comprehensive database. In Spain’s immigrant hub, the city of Madrid, 257 of the 583 immigrants
sampled were male.
Population:
Parameter:
Sample:
Statistic:
*Reichel and Morales, “Surveying immigrants without sampling frames – evaluating the success of alternative field methods,” Comparative
Migration Studies (2017) 5:1 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209407/pdf/40878_2016_Article_44.pdf
**Huddleston, Thomas and Dag Tjaden, Jasper. ”Immigrant Citizens Survey: How immigrants experience integration in 15 European cities,” King
Baudouin Foundation and The Migration Policy Group. May 2012.
Sampling Distribution for a Proportion
Welcome to Theory Land – “The not real place.”
Population of all migrants
Realm of Perfect Parameters
Realm of Sucky Statistics (n = 30)
What proportion of all Madrid immigrants are male?
Let’s get a sample!
p = 0.60
Random sample possibility #1:
Random sample possibility #2
Random sample possibility #3:
280
Sampling Distribution for a Proportion - the formula
Under certain conditions:
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
Centered at true proportion, meaning the
estimates are _______________.
𝑝(1−𝑝)
𝑛
)
We _____________________, so spread
affected by sample size. Higher sample
size means _______________________.
Assume the Trump/Farage statistical claim is true: 60% of all migrants to the EU are male.
a) Find the sampling distribution for 𝑝Ė‚ (sample proportion of males) among 30 randomly selected
migrants (n = 30). Assume all conditions are met:
b) If the Trump/Farage claim was true, what is the probability
that a random sample of 30 migrants would find 15 or fewer males?
Spread and Sample Size
Again, assume the Trump/Farage statistical claim is true: 60% of all migrants to the EU are male.
However, now we’ll use the sample size from the EU Migration Study: n = 583. Find the sampling
distribution for 𝑝Ė‚ (sample proportion of males) among 583 randomly selected migrants. Assume all
conditions are met:
Raising the sample size makes estimates
________________________.
281
What’s the proportion of orange Reese’s Pieces?
If we take a sample of Reese’s Pieces, what proportion of the candies will be orange?
Suppose a large bag of Reese’s Pieces has 1000 pieces. The manufacturer says that
exactly 40% of the candies are orange. If we select a sample of 50 pieces, how many will be
orange? Let X = the number of orange candies in the sample.
1. What type of probability distribution does X have? Justify.
2. Draw a sample of 50 Reese’s Pieces using the applet. How many pieces were
orange? Repeat this 5 times. Write the values below.
3. Write the values on sticker dots and add it to the dotplot on the board. Sketch the
dotplot below.
4. What does each dot represent?
5. What is the mean and the standard deviation for the distribution of X? Show work.
6. What is the approximate shape of the sampling distribution for X? Explain and
sketch it below.
282
Instead of finding the number of candies that are orange, we will now find the proportion of
candies that are orange.
7. Use your samples from #2 and turn each number of orange candies into the
proportion of orange candies in the sample ( pĖ‚ ). Write the proportions below and
add them to the second dotplot on the board.
8. Sketch the dotplot below.
9. What does each dot represent?
10. Find the new mean and standard deviation. Show work.
11. What is the approximate shape of the sampling distribution for !Ė‚ ? Explain and
sketch it below.
12. We know that bags of Reese’s Pieces contain exactly 40% that are orange. If we
select a random sample of 50 candies, what is the probability that the sample
proportion will be 50% or greater?
283
The Sampling Distribution of !Ė‚
Important ideas:
Check Your Understanding
According to the American Dental Association, 8% of adults have never had a cavity. A dental
graduate student contacts an SRS of 1000 adults and calculates the proportion !Ė‚ in this sample
who have never had a cavity.
a. Identify the mean of the sampling distribution of !Ė‚ .
b. Calculate and interpret the standard deviation of the sampling distribution of !Ė‚ . Check that the
10% condition is met.
c. Is the sampling distribution of !Ė‚ approximately Normal? Check that the Large Counts condition
is met.
d. Find the probability that the random sample of 1000 adults will give a result within 2
percentage points of the true value.
e. If the sample size were 9000 rather than 1000, how would this change the sampling
distribution of !Ė‚ ?
284
7.2 PRACTICE!
The Candy Machine
1. Suppose a VERY large candy machine has 15% orange candies. Imagine taking an SRS of 25 candies from the machine
and observing the sample proportion 𝑝Ė‚ of orange candies.
(a) What is the mean of the sampling distribution of 𝑝Ė‚ ? Why?
(b) Check to see if the 10% condition is met.
(c) Find the standard deviation of the sampling distribution of 𝑝Ė‚ .
(d) Is the sampling distribution of 𝑝Ė‚ approximately Normal? Check to see if the Normal condition is met.
(e) If the sample size were 75 rather than 25, how would this change the sampling distribution of 𝑝Ė‚ ? How would this
impact the Normal condition?
285
Planning for College
2. The superintendent of Miami-Dade County Public Schools wants to know what proportion of middle school students
in his district are planning to attend a four-year college or university. Suppose that 80% of all middle school students in
his district are planning to attend a four-year college or university. What is the probability that a SRS of size 125 will give
a result within 7 percentage points of the true value?
Who owns a Harley?
3. Harley-Davidson motorcycles make up 14% of all the motorcycles registered in the United States. You plan to
interview an SRS of 500 motorcycle owners. How likely is your sample to contain 20% or more who own Harleys?
286
Do Skittles or M&Ms have more orange candies?
Mr. Wilcox believes that Skittles have a higher proportion of
orange candies than M&Ms, while Mrs. Gallas believes the
opposite. Who is correct?
1. Take an SRS of 50 Skittles and an SRS of 50 M&Ms. Calculate the proportion of orange
candies in each sample and find the difference between proportions (Skittles – M&Ms).
Skittles:_______
M&Ms:_______
Difference (Skittles – M&Ms):_______
2. Write the difference on a sticker dot and place on the dot plot at the board. Copy the class
dot plot below.
Difference between proportions (Skittles – M&Ms)
3. What does each dot represent?
4. For the dotplot above, make a prediction about the following:
Shape:
Center (mean):
Variability (SD):
287
A Google search reveals that 21.6% of Skittles are orange and 20% of M&Ms are orange.
5. Describe the sampling distribution of the sample proportion of orange for Skittles (X) and
the sampling distribution of the sample proportion of orange for M&Ms (Y) for samples of
size 50.
Skittles (X)
M&Ms (Y)
Shape:
Mean:
SD:
6. Describe the sampling distribution of the difference between proportions of orange
Skittles and M&Ms (X – Y).
Shape:
Mean of difference between proportions:
Standard deviation of the difference between proportions:
7. Mr. Wilcox and Mrs. Gallas calculated a difference between proportions of 0.08 from their
samples. Calculate the probability of getting this difference in proportions or higher.
288
The Sampling Distribution of 𝑝Ė‚! − 𝑝Ė‚"
Important ideas:
Check Your Understanding
At Westville High School there are 315 seniors and 389 juniors. 65% of the seniors have parking
passes and 42% of the juniors have parking passes. The statistics teacher selects a SRS of 30
seniors and a separate SRS of 30 juniors. Let 𝑝Ė‚! − 𝑝Ė‚" be the difference in the sample proportions of
seniors and juniors that have parking passes.
a. What is the shape of the sampling distribution of 𝑝Ė‚! − 𝑝Ė‚" ? Why?
b. Find the mean of the sampling distribution.
c. Calculate and interpret the standard deviation of the sampling distribution.
d. What is the probability that the difference in sample proportions (senior – junior) of students
with parking passes is greater than 30%?
289
AP Statistics Handout: Lesson 7.2 part 2
Topics: conditions for sampling a proportion - random, 10%, large counts
Guided Notes
Random Condition
The EU Migration Debate and Gender of Immigrants
Vital International
AP Photo/Michael Svarnias
Sympathy for Refugees
Depictions of migrants in the EU
range from sympathetic portrayals
of families escaping violence to
fear-inducing portrayals of groups
of single adult men illegally
crossing borders.
Fear for Jobs & Terrorism
“As you can see from this picture, most of the people coming are young males and,
yes, they may be coming from countries that are not in a very happy state, they may
be coming from places that are poorer than us, but the EU has made a fundamental
error that risks the security of everybody.”
– Nigel Farage, 2016 (U.K.)
Speaking about EU migration: “I said to myself, ‘Wow. They’re all men.’ You look at it.
There are so few women and there are so few children. And not only are they men,
they’re young men. And they’re strong as can be—they’re tough looking cookies. I say,
what’s going on here?”
– Trump, 2015 (during campaign speech)
Trump/Farage statistical claim: 60% of migrants are male
Let’s assume:
1. Trump / Farage’s claim is true (60%) of all EU migrants are male.
2. We collect a sample of 30 recent immigrants and collect data on their gender identity
Under certain conditions:
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
1) Where does the formula come from?
__________________________
𝑝(1−𝑝)
𝑛
)
2) Why is this formula useful?
It gives us a model for ___________________________.
290
Sampling Bias: Why didn’t we analyze U.S. migration data?
Example U.S. immigration data:
Authored by Jill H. Wilson, “Recent Migration to the
United States from Central America: Frequently
Asked Questions.” Congressional Research Service,
Jan 29, 2018: https://fas.org/sgp/crs/row/R45489.pdf
Even though single adult males used to make up over 90% of people apprehended at the border, the
report found that now “the majority of apprehended migrants are families and unaccompanied
children.” (pg. 2)
Ė‚ < 0.50 (proportion who are single adult men)
Found: 𝒑
Problem: Apprehensions are ____________________________________ of all migrants. Single men
may be more successful at crossing the border and are, therefore, undercovered.
Random Sample
𝝁𝒑Ė‚ = 𝒑
Non-random sampling leads to
____________________________!
In this case, using only
apprehensions tends to
______________________ the
true proportion of adult men in the
population.
Non-Random Sample
𝝁𝒑Ė‚ ≠ 𝒑
Condition 1: Random Condition
a) What is the condition?
You must ensure the sample was ___________________________ from the
_______________________ (in the case of an experiment, you must ensure there was random
assignment to treatment)
b) Why must this condition be satisfied?
If condition is not satisfied, your estimator is _________________. If it is satisfied, your sampling
distribution is ___________________ at the true proportion value → 𝜇𝑝Ė‚ = 𝑝
291
10% Condition
Condition 2: 10% Condition
a) What is the condition?
The _______________________ must be less than 10% of the ____________________________.
Formula: n < 0.10(N), where n = sample size & N = population size
b) Why must this condition be satisfied?
It ensures this is true: ___________________
Otherwise, we can’t model the standard deviation of the sampling distribution (in this course).
Intuition behind 10% condition
Situation 1: You are sampling, with replacement, 3 individuals from a population of 10 people. What is
the probability all three individuals identify as male? 60% of the population is male.
Situation 2: You are sampling, without replacement, 3 individuals from a population of 10 people. What
is the probability all three individuals identify as male? 60% of the population is male.
Situation 3: You are sampling, without replacement, 3 individuals from a population of 640 people.
What is the probability all three individuals identify as male? 60% of the population is male.
If n (sample size) is _______________ compared to N (pop. size), process is almost independent
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
This requires __________________ sampling
→ that’s why we need n < 0.10(N)
𝑝(1−𝑝)
)
𝑛
292
Large Counts Condition
Condition 3: Large Counts
a) What is the condition?
The number of “successes” and “failures” must both be greater than or equal to 10.
Formula: ________________ ______________________
b) Why must this condition be satisfied?
This provides evidence that the sampling distribution is ______________________
________________ in shape.
Intuition behind Large Counts
a) n = 11, p = 0.05 → Check the large counts conditions for this situation using the space below:
Extreme p → _______________________
Low sample size → _________________________
Result: skewed distribution (___________________)
b) n = 800, p = 0.05 → Check the large counts conditions for this situation using the space below:
Extreme p → _______________________
High sample size → _________________________
Result: __________________________________
c) n = 21, p = 0.5 → Check the large counts conditions for this situation using the space below:
Non-extreme p → _______________________
Low sample size → _________________________
Result: __________________________________
293
Putting it all together
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
3) Large counts
→ approx. normal
___________
1) Random condition
→ unbiased ___________
𝑝(1−𝑝)
𝑛
)
2) 10% condition
→ calculable __________
Assume the Trump/Farage claim is true: 60% of all EU migrants are male (p = 0.60). If you randomly
sample 30 migrants and calculate the proportion of males (𝑝Ė‚ ):
Ė‚. Check conditions.
a) Describe the sampling distribution of ðĐ
b) Calculate the probability of sampling 15 or fewer males
294
How tall are we?
How tall are high school seniors in Michigan? Attached are the heights of all 50 high school
seniors at a small high school in the upper peninsula.
1. Make a guess at the mean of all 50 students. Make another guess of the standard
deviation of all 50 students.
2. Select a random sample of 5 students and calculate the mean height for the sample.
Repeat for 4 samples total.
Heights:__________________________
Heights:__________________________
Heights:__________________________
Heights:__________________________
!Ė…
!Ė…
!Ė…
!Ė…
=___________
=___________
=___________
=___________
3. Add your sample means to the dotplot on the board. Sketch it below.
4. Describe the shape, center, and variability of this dotplot.
5. Compare the two dotplots above. How are the dotplots similar? How are they
different?
295
Sample Means
Important ideas:
Check Your Understanding
Every day people watch 1 billion hours of videos on YouTube. That breaks down to every single
person on earth watching YouTube videos for about 8.4 minutes per day. For U.S. teens, in any
given day, the amount of time spent watching YouTube videos is approximately Normal with mean
18.5 minutes and standard deviation 5.3 minutes.
a. Find the probability that a randomly chosen U.S. teen watches YouTube for more than 25
minutes in a given day.
Suppose we choose an SRS of 10 U.S. teens. Let x = the mean amount of time spent watching
YouTube videos for the sample.
b. What is the mean of the sampling distribution of x ?
c. Calculate and interpret the standard deviation of the sampling distribution of x . Verify that
the 10% condition is met.
d. Find the probability that the mean amount of time spent watching YouTube for the teens
in the sample exceeds 25 minutes.
296
AP Statistics Handout: Lesson 7.3
Topics: µ vs. ð‘ĨĖ…, Sampling distribution for a mean, central limit theorem and conditions
Guided Notes
µ vs. ð‘ĨĖ…
In your own words, describe the purpose of SNAP benefits:
SNAP benefits should cover the following essential monthly bundle of groceries for one person:
• 4 boxes of cereal, 1 gallon of milk, 5 banana bunches, 2 gallons of orange juice
• 3 loaves of bread, 4 packages of sliced turkey (9-oz each), 3 packages of sliced provolone (10
count), 4 packages of romaine lettuce (7 oz each), 12 tomatoes, small mayo jar (15 oz), 30
apples, 4 bags of mixed nuts (8 oz each), 5 diet coke 6-packs.
• 15 lbs boneless chicken breasts, 6 packs of frozen mixed veggies (16 oz each), 4lbs white rice, 10
cans of black beans
• Salt (5 oz), pepper (5 oz), olive oil (17 oz)
We’ll call this grocery list the essentials bundle. We will use sampling to determine if SNAP benefits
cover the average (mean) cost of this grocery bundle in U.S. grocery stores.
µ = ___________________ mean
• Parameter
ð‘ĨĖ… = _________________ mean
• Statistic
• Estimator of µ
How much SNAP funding would it take to support an essentials-only diet? We obtain an SRS of 31 U.S.
grocery stores, find the cost for the monthly essentials bundle at each store, and calculate the mean
cost among the sample: $201.24
Population:
Parameter:
Sample:
Statistic:
297
Sampling Distribution for a Mean
Welcome to Theory Land – “The not real place.”
In fiscal year 2020, the maximum monthly SNAP allotment* for a household of one person (who could
not earn income) was $194 in the U.S. Let’s assume this is the true mean cost of the essentials bundle in
the United States.
*This is the max federal allotment for the continental U.S. (not Alaska/Hawaii). Source: U.S. Department of Agriculture, “SNAP Fiscal Year 2020
Cost-of-Living Adjustments,” https://fns-prod.azureedge.net/sites/default/files/media/file/COLA%20Memo%20FY%202020.pdf
Data
µ = $194
Prices for bundle
at all stores
Realm of Perfect Parameters
Population Distribution
Data
Realm of Sucky Statistics
Prices for bundle
at 16 sampled
stores (n = 16)
Means
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Sample Distribution
Sampling Distribution
Means from each
sample
Screenshots taken from sample means applet at Rice Virtual Lab in Statistics (onlinestatbook.com):
http://onlinestatbook.com/stat_sim/sampling_dist/index.html
Sampling Distribution for a Mean - the formula
Under certain conditions:
ð‘ĨĖ… ~ 𝑁𝑜𝑟𝑚 (𝜇ð‘ĨĖ… = 𝜇, 𝜎ð‘ĨĖ… =
We are centered at the true population
mean, meaning our estimates are
___________________.
Notation:
𝜇ð‘ĨĖ… → center of sampling dist. of ð‘ĨĖ…
𝜇 → population mean
𝜎
√𝑛
)
We _____________________, so spread
affected by sample size. Higher sample
size means _______________________.
𝜎ð‘ĨĖ… → standard deviation of the sampling dist. of ð‘ĨĖ…
𝜎 → standard deviation of population values
298
Spread and Sample Size
µ = $194
Population Distribution
of Bundle Costs
Assume the standard deviation of bundle
costs among the population (σ) is $26:
Calculate the standard deviation of the
sampling distribution (𝜎ð‘ĨĖ… )…
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė… vc
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
𝒙
Ė…vc𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė… vc
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
𝒙
Ė…𝒙
Ė… v𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Sampling Distribution
of Sample Means
(n = 25)
a) When n = 25
Sampling Distribution
of Sample Means
(n = 5)
b) When n = 5
v
v
Since we divide by sample size (n), spread ____________________ as sample size _________________.
Full sampling distribution calculation: SNAP benefits example
If the monthly amount SNAP provides ($194) is the exact true mean cost of the essentials bundle in the
U.S., what is the probability that a random sample of 31 grocery stores finds an average essentials
bundle cost of $201.24 or more? Assume the population standard deviation of bundle prices is $26 and
all conditions are met:
299
Central Limit Theorem and Conditions
Condition 1: Random Condition
a) What is the condition?
You must ensure the sample was ___________________________ from the
_______________________ (in the case of an experiment, you must ensure there was random
assignment to treatment)
b) Why must this condition be satisfied?
If condition is not satisfied, your estimator is _________________. If it is satisfied, your sampling
distribution is ___________________ at the true population mean value → 𝜇ð‘ĨĖ… = 𝜇
µ = $194
Population Distribution
of grocery bundle costs
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…𝒙
Ė…
𝒙
What if the SNAP program only sampled
conveniently-located grocery stores near its
offices in downtown San Francisco?
Sampling distribution from random samples
µð‘ĨĖ… = ___
Sampling distribution from convenience
samples (only stores in San Francisco)
µð‘ĨĖ… ≠ ___
When we don’t randomly sample, we get __________________________. In this case, we tended to
overestimate the mean price of the groceries (cost of living in San Francisco is high).
Condition 2: 10% Condition
a) What is the condition?
The _______________________ (n) must be less than 10% of the ____________________________ (N).
Formula: n < 0.10(N)
b) Why must this condition be satisfied?
It ensures this is true: ___________________
Otherwise, we can’t model the standard deviation of the sampling distribution (in this course).
300
Condition 3: Normal/Large Sample
a) What is the condition?
i) The _____________________ distribution must be normal _____
ii) The sample size is ___________________ (n ≥ 30)
b) Why must this condition be satisfied?
This provides evidence that the sampling distribution is __________________________ in shape.
Normal Population
Non-Normal Population
Sampling distribution
is approximately
normal (___________
________________)
Sampling distribution
is approximately
normal only for ____
__________________
Central limit theorem (CLT): when n is large, the sampling distribution for ð‘ĨĖ… is ______________________
________________________
• Even with a highly skewed population, the sampling distribution is close to normal at n = 25. So
we make the rule ___________ just to be safe.
• Useful because we ______________________ the population’s shape! No matter the
population shape, we can model the _______________________________________________,
if there’s a high enough sample size (n).
All the conditions summarized:
ð‘ĨĖ… ~ 𝑁𝑜𝑟𝑚 (𝜇ð‘ĨĖ… = 𝜇, 𝜎ð‘ĨĖ… =
3) Normal/Large Sample
→ approx. normal
___________ (by CLT)
1) Random condition
→ unbiased ___________
301
𝜎
√𝑛
)
2) 10% condition
→ calculable __________
Assume the monthly amount SNAP provides ($194) is the true mean cost of the essentials bundle in U.S.
grocery stores. Also assume the population standard deviation of bundle prices is $26. If you obtain a
random sample of 31 grocery stores and calculate their average price (ð‘ĨĖ… ) for the essentials bundle…
Ė…. Check conditions.
a) Describe the sampling distribution of 𝒙
b) Calculate the probability of obtaining a sample mean bundle price of $201.24 or higher:
Discussion
Discussion Question: A real random sample of 31 U.S. grocery stores found the average cost of the
essentials bundle was $199.45. If we assume the true average costs for this bundle in American grocery
stores is $194 (the max allotment provided by SNAP), what is the probability of finding a sample mean of
$199.45 or higher from 31 stores? Does this probability make you doubt that the assumption that the
SNAP allotment covers the true mean price of the essential groceries? Why or why not?
302
Who has better ACT scores?
The ACT test is scored with whole numbers from 0 to 36. We will use the applet at
http://onlinestatbook.com/stat_sim/sampling_dist/ to take samples of ACT scores from EK and
Rockford.
Click “Begin” and you will see the population distribution of ACT scores from EK.
1. Describe the shape, center, and variability of the distribution of ACT scores for EK.
2. Click “Animated” to take a sample of 5 ACT scores.
List 5 estimated scores here:________________ Estimated mean (blue box):________
Click “Animated” several more times. Then click “10,000” to take 10,000 samples of size 5.
3. The blue boxes make the sampling distribution of x . How do we know that the
sampling distribution of x is approximately normal (hint: see previous lesson)?
4. Now let’s look at the distribution of ACT scores for Rockford. Click “Clear lower 3”
and then change the distribution from “Normal” to “Skewed”. What is the shape of
this distribution? Why does this distribution make sense for our archrival Rockford?
Change both of the bottom two dropdown menus to “Mean”. The first one should be “N=2”
and the second one should be “N=25”. The click “10,000” to take 10,000 samples.
5. Describe the shape of the sampling distribution of x when N = 2.
6. Describe the shape of the sampling distribution of x when N = 25.
303
The Central Limit Theorem
Important ideas:
Check Your Understanding
Among iPhone users who share their data with Apple, the weekly screen time is skewed to the
right with µ = 13.5 hours and σ = 3.75 hours. A random sample of 100 iPhone users are selected
and the mean weekly screen time (ð‘ĨĖ… ) of the sample is calculated.
a. Describe the shape of the sampling distribution of ð‘ĨĖ… for samples of 100 randomly selected
iPhone users. Justify your answer.
b. Find the mean and standard deviation of the sampling distribution of ð‘ĨĖ… . Be sure to check
the 10% condition.
c. Calculate the probability that the weekly screen time for the sample is between 12 and 13
hours.
304
ACT Scores: Which school is better?
ACT scores at Ardrey Kell High School are Normally distributed
with mean 26 and standard deviation 3. ACT scores at Providence
HS are skewed to the right with mean 25 and standard deviation 5.
1. We randomly select 25 students from AKHS and 30 students from PHS. Use the
information given to describe the sampling distributions of the average ACT scores for the
two samples.
AKHS
PHS
Shape:
Mean:
SD:
2. Suppose we took a sample of 25 students from AKHS and a sample of 30 students from
PHS and found the difference in the sample means. Describe the sampling
distribution of the difference in mean ACT scores (AKHS – PHS).
Mean:
Standard Deviation:
Shape:
3. Calculate the probability that random sample of 25 AKHS students has a higher mean
ACT score than the random sample of 30 PHS students.
305
The Sampling Distribution of !Ė…! − !Ė…"
Important ideas:
Check Your Understanding
Is it more expensive to live in California or Florida? Monthly cost of living expenses for California are
approximately Normal with mean µC = $10,800 and standard deviation s C = $3,200. Monthly cost of
living expenses for Florida are approximately Normal with mean µF = $8,500 and standard
deviation s F = $2,700. Suppose we select independent SRSs of 16 residents of California and 9
residents of Florida and calculate the sample mean monthly cost of living, xC and xF .
a. What is the shape of the sampling distribution of xC - xF ? Why?
b. Find the mean and standard deviation of the sampling distribution of xC - xF .
c. Calculate the probability that the average monthly cost of living expense of the 16 randomly
selected California residents is less than the average monthly cost of living expense of the 9
randomly selected Florida residents.
306
Sampling Distributions for Sample Proportions
One Sample
Two Samples
Parameter
Statistic
Mean of Sampling
Distribution
Standard Deviation
of Sampling
Distribution
What condition must
be met to use the
standard deviation
formula?
When is the
sampling distribution
approximately
normal?
z-score formula
307
Sampling Distributions for Sample Means
One Sample
Two Samples
Parameter
Statistic
Mean of Sampling
Distribution
Standard Deviation
of Sampling
Distribution
What condition must
be met to use the
standard deviation
formula?
When is the
sampling distribution
approximately
normal?
z-score formula
308
CENTRAL LIMIT THEOREM PENNIES LAB
In this lab we will be interested in the age of the pennies (in years) and not the year the pennies
were produced. To find the age of a penny, simply subtract the year on the penny from the current
year.
1. Make a dotplot of the ages of the 25 pennies below.
8
7
6
5
4
3
2
1
0 <----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o >
0 2 4
6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38
This is the population distribution. Describe the distribution:
_______________________________________________________________________________
2. Now take a random sample of three pennies. Calculate the average age of the sample. This is
called a sample mean. Record your sample mean: ______
3. Replace the three pennies. Then take another sample of three pennies. Find the sample mean and
record it. ______
4. Do this a total of five times. Record your three other sample means below:
_______
_______
______
309
5. Write each of your sample means on a different post-it note and then take your post-it note to the
histogram on the whiteboard. Record the results of the histogram on the whiteboard below:
10
9
8
7
6
5
4
3
2
1
0 <----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o >
0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38
6. We will now change the size of the sample. Take a random sample of 10 pennies. Calculate the
average age of the pennies. Record it here: ________
7. Do this a total of five times. Record your four other sample means below:
_______
_______
_______
_______
8. Write each of your sample means on a different post-it note and then take your post-it note to the
histogram on the whiteboard. Record the results of the histogram on the whiteboard below:
20
18
16
14
12
10
8
6
4
2
0 <----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o----o >
0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38
Summary:
•
If the population distribution is normal, the sampling distribution of x is also
________________. This is true for any sample size n.
•
If the population distribution is NOT normal, the Central Limit Theorem (CLT)
tells us that the sampling distribution of x will be approximately normal in most
cases if ____________.
310
7.3 PRACTICE!
Buy Me Some Peanuts and Sample Means
1. At the P. Nutty Peanut Company, dry roasted, shelled peanuts are placed in jars by a machine. The distribution of
weights in the bottles is approximately Normal, with a mean of 16.1 ounces and a standard deviation of 0.15 ounces.
(a) Without doing any calculations, explain which outcome is more likely, randomly selecting a single jar and finding the
contents to weigh less than 16 ounces or randomly selecting 10 jars and finding the average contents to weigh less than
16 ounces.
(b) Find the probability of each event described above. Since the distribution is normal you can use “normalcdf” on your
calculator.
Single jar weighing 16 oz or less:
10 jars weighing 16 oz or less:
311
Mean Texts
2. Suppose that the number of texts sent during a typical day by a randomly selected high school student follows a rightskewed distribution with a mean of 15 and a standard deviation of 35. Assuming that students at your school are typical
texters, how likely is it that a random sample of 50 students will have sent more than a total of 1000 texts in the last 24
hours?
Bad carpet
3. The number of flaws per square yard in a type of carpet material varies with mean 1.6 flaws per square yard and
standard deviation 1.2 flaws per square yard. The population distribution cannot be Normal, because a count takes only
whole-number values. An inspector studies 200 square yards of the material, records the number of flaws found in each
square yard, and calculates X, the mean number of flaws per square yard inspected. Find the probability that the mean
number of flaws exceeds 2 per square yard.
312
Estimating a Population Proportions
Important ideas:
Check Your Understanding
A community activist group in Austin, Texas wanted a particular issue to be placed on the ballot of the
upcoming election. To make it on the ballot, 20,000 valid signatures were needed. The group turned
in their petition with 24,598 signatures. To pass the validity test 20,000/24,598 = 81.3% of the
signatures must be valid. It is too time consuming to check all of the signatures, so a random sample
of signatures are checked. The individual checking the signatures needs to be 95% confident that the
true proportion of valid signatures are estimated with, at most, a 2% margin of error.
1. Using a conservative estimate for 𝑝Ė‚, how large of a sample is needed?
2. In the activist group’s previous petition, 85% of the signatures were valid. Using this value as
a guess for 𝑝Ė‚, find the sample size needed for a margin of error of at most 2 percentage
points with 95% confidence. How does this compare with the required sample size from
Question 1?
3. What if the company president demands 99% confidence instead of 95% confidence? Would
this require a smaller or larger sample size, assuming everything else remains the same?
Explain your answer.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
313
®
2014 AP STATISTICS FREE-RESPONSE QUESTIONS
3. Schools in a certain state receive funding based on the number of students who attend the school. To
determine the number of students who attend a school, one school day is selected at random and the number
of students in attendance that day is counted and used for funding purposes. The daily number of absences at
High School A in the state is approximately normally distributed with mean of 120 students and
standard deviation of 10.5 students.
(a) If more than 140 students are absent on the day the attendance count is taken for funding purposes, the
school will lose some of its state funding in the subsequent year. Approximately what is the probability
that High School A will lose some state funding?
(b) The principals' association in the state suggests that instead of choosing one day at random, the state should
choose 3 days at random. With the suggested plan, High School A would lose some of its state funding in
the subsequent year if the mean number of students absent for the 3 days is greater than 140. Would High
School A be more likely, less likely, or equally likely to lose funding using the suggested plan compared
to the plan described in part (a)? Justify your choice.
(c) A typical school week consists of the days Monday, Tuesday, Wednesday, Thursday, and Friday. The
principal at High School A believes that the number of absences tends to be greater on Mondays and
Fridays, and there is concern that the school will lose state funding if the attendance count occurs on
a Monday or Friday. If one school day is chosen at random from each of 3 typical school weeks, what
is the probability that none of the 3 days chosen is a Tuesday, Wednesday, or Thursday?
314
®
2015 AP STATISTICS FREE-RESPONSE QUESTIONS
STATISTICS
SECTION II
PartB
Question 6
Spend about 25 minutes on this part of the exam.
Percent of Section II score-25
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
6. Com tortillas are made at a large facility that produces 100,000 tortillas per day on each of its two production
lines. The distribution of the diameters of the tortillas produced on production line A is approximately normal
with mean 5.9 inches, and the distribution of the diameters of the tortillas produced on production line B is
approximately normal with mean 6.1 inches. The figure below shows the distributions of diameters for the
two production lines.
Production Line A
Production Line B
I
I
I
I
I
I
I
I
I
I
I
I
,,....\
I
I
I
5.7
5.8
6.0
5.9
6.1
Diam t r (inches)
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
''
6.2
6.3
The tortillas produced at the factory are advertised as having a diameter of 6 inches. For the purpose of quality
control, a sample of 200 tortillas is selected and the diameters are measured. From the sample of 200 tortillas, the
manager of the facility wants to estimate the mean diameter, in inches, of the 200,000 tortillas produced on a
given day. Two sampling methods have been proposed.
Method 1: Take a random sample of 200 tortillas from the 200,000 tortillas produced on a given day.
Measure the diameter of each selected tortilla.
Method 2: Randomly select one of the two production lines on a given day. Take a random sample of
200 tortillas from the 100,000 tortillas produced by the selected production line. Measure the diameter of
each selected tortilla.
(a) Will a sample obtained using Method 2 be representative of the population of all tortillas made that day,
with respect to the diameters of the tortillas? Explain why or why not.
315
®
2015 AP STATISTICS FREE-RESPONSE QUESTIONS
(b) The figure below is a histogram of 200 diameters obtained by using one of the two sampling methods
described. Considering the shape of the histogram, explain which method, Method 1 or Method 2, was most
likely used to obtain a such a sample.
25
(/)
Cd
20
15
t
10
5
0
5.775
5.850
5.925 6.000 6.075
Diameter (inche )
6.150
6.225
(c) Which of the two sampling methods, Method 1 or Method 2, will result in less variability in the diameters of
the 200 tortillas in the sample on a given day? Explain.
Each day, the distribution of the 200,000 tortillas made that day has mean diameter 6 inches with standard
deviation 0.11 inch.
(d) For samples of size 200 taken from one day's production, describe the sampling distribution of the sample
mean diameter for samples that are obtained using Method 1.
(e) Suppose that one of the two sampling methods will be selected and used every day for one year (365 days).
The sample mean of the 200 diameters will be recorded each day. Which of the two methods will result in
less variability in the distribution of the 365 sample means? Explain.
(f) A government inspector will visit the facility on June 22 to observe the sampling and to determine if the
factory is in compliance with the advertised mean diameter of 6 inches. The manager knows that, with both
sampling methods, the sample mean is an unbiased estimator of the population mean. However, the manager
is unsure which method is more likely to produce a sample mean that is close to 6 inches on the day of
sampling. Based on your previous answers, which of the two sampling methods, Method 1 or Method 2, is
more likely to produce a sample mean close to 6 inches? Explain.
316
Unit 6
Chapter 8
Estimating
Proportions with
Confidence
317
AP Statistics Handout: Lesson 8.1
Topics: estimation, margin of error, interpreting confidence intervals and confidence levels
Guided Notes
Estimation and margin of error
AP Photos
Getty Images
Hurricane Katrina – Sheltering at the Superdome
FEMA (Federal Emergency Management Agency) is trying to determine the amount of relief aid (medical
supplies, beds, food, etc.) to provide to the SuperDome. You are FEMA’s head statistician, and you are
tasked with estimating the number of people sheltering at the SuperDome.
Cluster Random Sample
-Unbiased
• If sections equal size
-Saves time
• Avoids counting everyone
• Stadium already in sections
Sampled section 1: 152 people
Sampled section 2: 122 people
Sampled section 3: 192 people
Sampled section 4: 84 people
Sampled section 5: 145 people
Sampled section 6: 202 people
Sampled section 7: 96 people
Sampled section 8: 104 people
Sampled section 9: 156 people
Sampled section 10: 124 people
Average number of people per section:
ð‘ĨĖ… = 137.7
Number of seating sections in stadium:
120 sections
Estimate for total number of people:
137.7 ∗ 120 = _______________
Is this the exact number of people sheltering in the SuperDome?
____ → Even though our estimator should be unbiased, there is
___________________________ in our random sample. We may
overestimate or underestimate the true population size by chance.
318
Confidence Interval: point estimate ± margin of error
___________________
16,524 people
______________________
16,524 people
Narrower intervals are _______________________, but they make you _______________________
you’ve captured the true parameter value.
16,524 people
Wider intervals give _____________________ you’ve captured the true parameter, but they’re often
not as ______________.
Constructing and interpreting a confidence interval
138 is probably not the exact true mean number of people per section. What confidence interval should
you give so that you capture the true mean?
Imagine the sampling distribution for ð‘ĨĖ… is normal with 𝜇ð‘ĨĖ… = 138, 𝜎ð‘ĨĖ… = 3. Recall the 68-95-99.7 rule:
68% of all possible sample means are
within 1𝝈𝒙Ė… of the true mean
95% of all possible sample means are
within 2𝝈𝒙Ė… of the true mean
319
Interval: __________________
Interpretation:
We are _______ confident the interval from 132 to 144 _________________ the true mean number of
people per section.
Wrong Interpretation: There is a 95% probability that the interval from 132 to 144 captures the true
mean number of people per section.
• Why not “probability”?
If it _______ in the interval, the probability
it’s in the interval is _________.
If it’s _______ in the interval, the probability
it’s in the interval is _________.
The parameter has a fixed value. It’s already in the interval or it’s not.
Instead, we say we are 95% _________________ we captured the true mean. Even though our interval
already did/didn’t capture the true value, we (as ignorant humans) _____________________. So, we
can only be certain levels of ‘confident’ that we captured the true value.
320
Interpreting a confidence level
Confidence Level:
95%
Sample Size (n):
20
Confidence Level:
If you raise the __________________
____________, the confidence interval
becomes _____________.
99.7%
Sample Size (n):
20
You have higher confidence of capturing
the true value, but your interval is
_________________________.
Confidence Level:
If you raise the ____________________,
the confidence interval becomes
_________________.
95%
Sample Size (n):
80
You have more precise interval, even at
the __________ confidence level.
321
Applet (confidence levels)
http://digitalfirst.bfwpub.com/stats_applet/stats_applet_4_ci.html
Source: Digital First project from Bedford, Freeman, & Worth publishers
Interpret a 95% confidence level:
“If we take many samples of the
______________ from this population, about
95% of them will result in an interval that
_______________ the actual parameter value.”
Discussion
Discussion Question: Imagine you calculated the following three confidence intervals for the number of
people taking shelter in the Superdome:
a) Which interval would you report to FEMA?
Why?
b) How would you explain to FEMA what the interval means?
322
Guess the Mystery Proportion
Mrs. Mason and Goodenough wants to implement a new reward system. When a student
does somethinggreat, they randomly pick a bead from the bead jar. If the bead is red,
they get a piece ofcandy. If not, they don’t win a prize. The question is, what are the
chances that a studentchooses a red bead? Each group will get a sample of beads and
will create a confidence interval to estimate the true proportion of red beads. The group
with the smallest intervalthat captures the true proportion wins a prize!
1. You will select a random sample of 20 beads from the jar. Calculate the proportion of
beads that are red (write this as a decimal).
Proportion red:_______. This is your point estimate for the true proportion of red beads.
2. Identify the population, parameter, sample, and statistic.
Population:______________________________
Parameter:____________________
Sample:________________________________
Statistic:____________________
3. Now you are going to change your point estimate into an interval of values by adding and
subtracting some value from your point estimate (the number you add and subtract is called
your margin of error). You can choose any amount to add and subtract, but remember, the
smallest interval that captures the truth is the winner. What margin of error do you want to
use? Why?
4. Use your point estimate and chosen margin of error to write an interval that you think
contains the true proportion of red beads.
5. How confident do you feel that your interval captures the true proportion? Answer with a
percentage.
6. One of the groups got (0.27, 0.33) as their interval. What was their point estimate? What
was their margin of error?
7. One group claims that the true proportion of red beads 0.25. Does your interval support or
deny this claim? Why?
323
The Idea of a Confidence Interval
Important ideas:
Check Your Understanding
A random sample of 100 adults are asked if they pay for monthly subscriptions that they do not use,
like a magazine, app, or online program simply. Many do, because they have never taken the time
to cancel the subscription. A 95% confidence interval for the proportion of adults who to pay for
subscriptions they do not use is 0.352 to 0.548.
a. Interpret the confidence interval.
b. Calculate the point estimate and the margin of error.
c. Based upon this survey, a reporter claims that a majority of adults continue to pay for monthly
subscriptions they do not use. Use the confidence interval to evaluate this claim.
324
8.1 PRACTICE!
Losing Weight
A Gallup Poll in November 2014 found that 59% of the people in its sample said “Yes” when asked,
“Would you like to lose weight?” Gallup announced: “For results based on the total sample of national
adults, one can say with 95% confidence that the margin of (sampling) error is ±3 percentage points.”
(a) Explain what the margin of error means in this setting.
(b) State and interpret the 95% confidence interval.
(c) Interpret the confidence level.
The admissions director from Florida International University found that (107.8, 116.2) is a 95%
confidence interval for the mean IQ score of all freshmen. Comment on whether or not each of the
following explanations is correct.
(a) There is a 95% probability that the interval from 107.8 to 116.2 contains μ.
325
(b) There is a 95% chance that the interval (107.8, 116.2) contains X.
(c) This interval was constructed using a method that produces intervals that capture the true mean in
95% of all possible samples.
(d) 95% of all possible samples will contain the interval (107.8, 116.2).
(e) The probability that the interval (107.8, 116.2) captures μ is either 0 or 1, but we don’t know which.
How do Confidence Intervals Change…
(a) How does the shape of the confidence interval change if the confidence level increases from 90% to
95%?
(b) How would the shape of a confidence change if the sample size was decreased? Assume the new,
smaller sample size still meets all of the normality conditions.
326
What does “95% confident” mean?
In this Activity, you will use the applet at www.tinyurl.com/appletCI to learn what it means to
say we are “95% confident” that our confidence interval captures the true proportion.
1. Use the Confidence Intervals for Proportions applet. Set the population proportion to
0.5, the confidence level to 95% and the sample size to 75.
2. Click “Sample” to choose an SRS and display the resulting confidence interval. The
confidence interval is displayed as a horizontal line segment with a dot representing
the sample proportion in the middle of the interval. The true proportion (p) is the green
vertical line.
Did the first confidence interval capture the
true proportion?
Repeat this 10 times and sketch what you
see to the right. How many of the intervals
capture the true proportion?
3. “Reset” and then take a total of 100 confidence intervals (sample 25 four times). How
many out of 100 captured the true proportion? Is this surprising? Why?
4. Watch your confidence intervals as you drag the confidence level from 95% to 99%
(don’t “Reset). What happens to the intervals when the confidence level is increased?
Why does this make sense?
5. “Reset”, then sample 100 times at an 80% confidence interval. What percent of the
intervals capture the true proportion?
Interpret the confidence level:
6. Now we will see what happens when we adjust the sample size. Change the sample
to 20 and sample for 1 interval. Then change it to 250 and sample for 1 interval.
What happens to the interval when the sample size is increased? Why?
327
Interpreting the Confidence Level
Important ideas:
Check Your Understanding
The gym teacher of a large high school wants to estimate the mean number of pushups students at
this school can do in one minute. He selects a random sample of 30 students from those who are
there after school for sports practices. He records how many pushups each of the students in the
sample can do in one minute. He determines that he is 90% confident that the interval from 24.1 to
28.5 captures the mean number of pushups that students at this school can do in one minute.
a. Interpret the confidence level.
b. Explain what would happen to the length of the interval if the confidence level were increased
to 99%.
c. How would a 90% confidence interval based on a sample of size 200 compare to the original
90% interval?
d. Describe one potential source of bias in this study that is not accounted for by the margin of
error.
328
AP Statistics Handout: Lesson 8.2
Topics: review sampling distribution for 𝑝Ė‚ , 95% confidence interval for 𝑝Ė‚ , four step process
Guided Notes
Review: Sampling Distribution for 𝑝Ė‚
Vital International
AP Photo/Michael Svarnias
Sympathy for Refugees
Depictions of migrants in the EU
range from sympathetic portrayals
of families escaping violence to
fear-inducing portrayals of groups
of single adult men illegally
crossing borders.
Fear for Security
“As you can see from this picture, most of the people coming are young males and,
yes, they may be coming from countries that are not in a very happy state, they may
be coming from places that are poorer than us, but the EU has made a fundamental
error that risks the security of everybody.”
– Nigel Farage, 2016 (U.K.)
Speaking on EU migration: “I said to myself, ‘Wow. They’re all men.’ You look at it.
There are so few women and there are so few children. And not only are they men,
they’re young men. And they’re strong as can be—they’re tough looking cookies. I say,
what’s going on here?”
– Trump, during 2015 campaign speech
Trump/Farage statistical claim (conservatively): 60% of migrants are male
Let’s assume:
1. Trump / Farage’s claim is true (60%) of all EU migrants are male.
2. We collect a sample of 30 recent immigrants and collect data on their gender identity
We ________________ the true
proportion by 0.10
In most real-life scenarios:
-You _______________________ the true proportion.
-You survey a random sample of 30 immigrants. 15 of them are male.
So, you calculate the proportion of sample who are male:
15
- 𝑝Ė‚ = = ___________
30
- Your estimate for the true proportion (p) is the sample proportion (𝑝Ė‚ )
329
How do we determine how far off our estimates will usually be? We use the sampling distribution.
Under certain conditions:
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
Centered at true proportion (______________)
𝑝(1−𝑝)
𝑛
)
____________________________ of our
estimates from true proportion value
Calculations:
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
𝑝(1−𝑝)
)
𝑛
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = .6, 𝜎𝑝Ė‚ = √
.6(.4)
)
30
𝑝Ė‚ ~ Norm(𝜇𝑝Ė‚ = .6, 𝜎𝑝Ė‚ = .089)
Typical distance of estimates
from true proportion value.
________________________
by .089
We ________________ the true
proportion by 0.10
a) Was our estimate’s error amount (0.10) pretty typical? Justify using the sampling distribution above.
There should be a way to report both:
a) Our estimate
b) How far off this estimate might be
One way to do this is with a _________________________________!
330
95% Confidence Interval for 𝑝Ė‚
The EU Migration Study – Spain
We will use data from Spain, which has the best records on EU migrants (both documented and
undocumented migrants). According to a report* in Comparative Migration Studies, Spain has a “nationwide rolling local population register, which provides a comprehensive register of the population, also
including unauthorized immigrants.” The Immigrant Citizens Survey** conducted a random sample of
this database, surveying recent immigrants to Spain who were 15 and older.
Recall:
𝑝 = ___________________ proportion
• Parameter
𝑝Ė‚ = _________________ proportion
• Statistic
• Estimator of p
Scenario (Mock Data): In reality, we don’t know the true proportion of recent immigrants to Madrid
(Spain’s immigration hub) who are male because it’s almost impossible to collect records about each
individual. So, instead, imagine we surveyed a random sample of 34 immigrants in Madrid’s address
registries. Of the random sample, 15 identified as male. Estimate the true proportion of Madrid’s recent
immigrants who identify as male, with 95% confidence.
Since we don’t know the true proportion (p),
we use the sample proportion (𝑝Ė‚ ) to generate
our sampling distribution:
Under certain conditions:
𝑝(1−𝑝)
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝, 𝜎𝑝Ė‚ = √
Consider all possible
random samples that could
have happened (sampling
distribution)
𝑛
Ė‚ , 𝑆ðļ𝑝Ė‚ = √
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝒑
)
Ė‚ (1−𝒑
Ė‚)
𝒑
𝑛
)
.441 (1−.441 )
)
34
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = .441 , 𝑆ðļ𝑝Ė‚ = √
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = . 441, 𝑆ðļ𝑝Ė‚ = . 𝟎𝟖𝟓)
Confidence Interval:
𝑝Ė‚ ± 1.96 ∗ 𝑆ðļ𝑝Ė‚
= .441 ± 1.96 ∗ (. 𝟎𝟖𝟓)
= .441 ± 1.67 = (0.274, 0.608)
Side note: Why do we use 1.96 in the formula?
Interpretation:
We are 95% confident the interval from 27.4% to 60.8% ________________ the ___________________
of Madrid immigrants who are male.
*Reichel and Morales, “Surveying immigrants without sampling frames – evaluating the success of alternative field methods,” Comparative
Migration Studies (2017) 5:1 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209407/pdf/40878_2016_Article_44.pdf
**Huddleston, Thomas and Dag Tjaden, Jasper. ”Immigrant Citizens Survey: How immigrants experience integration in 15 European cities,” King
Baudouin Foundation and The Migration Policy Group. May 2012.
331
Formula for 95% confidence interval for one proportion:
𝑝Ė‚(1−𝑝Ė‚)
𝑝Ė‚ ± 1.96 ∗ 𝑆ðļ𝑝Ė‚ →
𝑝Ė‚ ± 1.96 ∗ √
𝑛
Imagine we surveyed a random sample of 34 immigrants in Madrid’s address registries. Of the random
sample, 15 identified as male.
a) Construct and interpret a 95% confidence interval for the true proportion of Madrid’s recent
immigrants who identify as male. (use the four step process)
b) Use your interval to evaluate the validity of Trump/Farage’s claim that 60% of immigrants are male.
332
Which way will the Hershey Kiss land?
When you toss a Hershey Kiss, it sometimes lands flat and sometimes lands on its side.
What proportion of tosses will land flat?
Each group of four selects a random sample of 50 Hershey’s Kisses to bring back to their desks.
Toss the 50 Kisses and then calculate the proportion that land flat. Let pĖ‚ = the proportion of the
Kisses that land flat.
1. What is your point estimate for the true proportion that land flat? ________
2. Identify the population, parameter, sample and statistic.
Population:
Parameter:
Sample:
Statistic:
3. Was the sample a random sample? Why is this important?
4. What is the formula for calculating the standard deviation of the sampling distribution of pĖ‚ ?
5. What condition must be met to use this formula? Has it been met?
6. We don’t know the value of p (that’s the whole point of a confidence interval) so we will use pĖ‚
instead. Calculate the standard deviation.
7. Would it be appropriate to use a normal distribution to model the sampling distribution of pĖ‚ ?
Justify your answer.
333
Name:
Hour:
Date:
8. In a normal distribution, 95% of the data lies within ______ standard deviations of the mean.
This value is called the critical value. Use table A or InverseNorm to find these critical
values:
80% of the data lies within ________ standard deviations of the mean
90% of the data lies within ________ standard deviations of the mean
95% of the data lies within ________ standard deviations of the mean
99% of the data lies within ________ standard deviations of the mean
9. Calculate the margin of error for a 95% interval by multiplying the critical value and standard
deviation you found. Show your work.
10. Find the 95% confidence interval using point estimate +/- margin of error.
11. Add your interval to the graph on the board. Sketch the graph below.
12. What do you think is the true proportion of kisses that land flat is?
334
Name:
Hour:
Date:
Constructing a Confidence Interval for p
Important ideas:
Check Your Understanding
What do you want to be when you grow up? According a nationwide survey of a random sample of
1000 kids under the age of 12, some kids want to be a ninja, a dragon keeper, a dancing unicorn,
and even an octopus. Fifty-five of the 1000 kids want to be a doctor. We would like to use this study
to find a 98% confidence interval for the true proportion of kids who want to be a doctor.
a. Identify the parameter of interest.
b. Check if the conditions for constructing a confidence interval for p are met.
c. Find the critical value for a 98% confidence interval. Then calculate the interval.
d. Interpret the interval in context.
335
AP Statistics Handout: Lesson 8.2 part 2
Topics: critical values, confidence interval calculator steps, sample size calculations
Guided Notes
Vital International
AP Photo/Michael Svarnias
Sympathy for Refugees
Depictions of migrants in the EU
range from sympathetic portrayals
of families escaping violence to
fear-inducing portrayals of groups
of single adult men illegally
crossing borders.
Fear for Security
The EU Migration Study – Spain
We will use data from Spain, which has the best records on EU migrants (both documented and
undocumented migrants). According to a report* in Comparative Migration Studies, Spain has a “nationwide rolling local population register, which provides a comprehensive register of the population, also
including unauthorized immigrants.” The Immigrant Citizens Survey** conducted a random sample of
this database, surveying recent immigrants to Spain who were 15 and older.
95% confidence interval (with mock data): Imagine we surveyed a random sample of 34 immigrants in
Madrid’s address registries. Of the random sample, 15 identified as male. Estimate the true proportion
of Madrid’s recent immigrants who identify as male, with 95% confidence.
Ė‚ , 𝑆ðļ𝑝Ė‚ = √
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝒑
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = .441 , 𝑆ðļ𝑝Ė‚ = √
Consider all possible
random samples that could
have happened (sampling
distribution)
Ė‚ (1−𝒑
Ė‚)
𝒑
𝑛
)
.441 (1−.441 )
)
34
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = . 441, 𝑆ðļ𝑝Ė‚ = . 𝟎𝟖𝟓)
Confidence Interval:
𝑝Ė‚ ± 1.96 ∗ 𝑆ðļ𝑝Ė‚
= .441 ± 1.96 ∗ (. 𝟎𝟖𝟓)
Question: Why do we use 1.96 in the formula?
= (. 𝟐𝟕𝟒, . 𝟔𝟎𝟖)
*Reichel and Morales, “Surveying immigrants without sampling frames – evaluating the success of alternative field methods,” Comparative
Migration Studies (2017) 5:1 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209407/pdf/40878_2016_Article_44.pdf
**Huddleston, Thomas and Dag Tjaden, Jasper. ”Immigrant Citizens Survey: How immigrants experience integration in 15 European cities,” King
Baudouin Foundation and The Migration Policy Group. May 2012.
336
Critical Values
95% interval:
𝑝Ė‚ ± 𝟏. 𝟗𝟔 ∗ 𝑆ðļ𝑝Ė‚
𝑝Ė‚ ± 𝟏. 𝟗𝟔 ∗ √
80% interval:
𝑝Ė‚(1−𝑝Ė‚)
𝑛
𝑝Ė‚ ± (? ) ∗ 𝑆ðļ𝑝Ė‚
𝑝Ė‚ ± (? ) ∗ √
𝑝Ė‚(1−𝑝Ė‚)
𝑛
For a C% confidence interval (𝑝Ė‚ ):
𝑧 ∗ : the __________________________ of the z-interval
• Tells you how many _________________________
you’re including in each side of your interval.
• Determines the _____________________.
𝑝Ė‚ ± (𝑧 ∗ ) ∗ 𝑆ðļ𝑝Ė‚
𝑝Ė‚ ± (𝑧 ∗ ) ∗ √
𝑝Ė‚ (1 − 𝑝Ė‚ )
𝑛
How to get the critical value → use the standard normal curve (μ = 0, σ = 1)
The _________________ on the standard normal curve tells you the ____________________________
away from the center.
Find and interpret z* for 95% confidence level (show calculator input):
Find and interpret z* for 80% confidence level (show calculator input):
337
80% confidence interval (with mock data):
a) A random sample of 34 asylum-seekers had 15 adult men. Estimate the proportion of asylum-seekers
that are adult men, with 80% confidence.
b) Which is wider: the 95% or 80% interval? Why?
Confidence Interval Calculator Steps (one sample conf interval for a proportion)
STAT → TESTS →
A:1-PropZInt…
Input x, n, C-level
Calculate
338
Output
Sample Size Calculations
The mock data above had a low sample size (n = 34).
The sampling distribution and resulting confidence
interval is displayed to the left.
In the actual EU migration survey, researchers
randomly selected 583 Madrid immigrants and 257
were male
257
𝑝Ė‚ =
= 44%
583
Same sample proportion (same point estimate),
higher sample size
Sample size intuition: a higher sample size (n) means…
• More data points per sample
• Unusual sample proportion values are _________________ (easy to get a ‘wonky’ sample when
you only collect a few data points)
• Less variability in our _______________________
• Confidence interval is ________________
Sample size calculations and the margin of error:
𝑝Ė‚ ± margin of error
Standard error: measures variability of your
estimation process (specifically - the _________
______________ of sample proportions from
true proportion)
𝑧 ∗ ∗ 𝑆ðļ𝑝Ė‚
Critical value – the number
of standard errors you
include in your interval
-Determined by the
__________________, not
the variability of your
estimation process
𝑧∗ ∗ √
𝑝Ė‚(1−𝑝Ė‚)
𝑛
Divide by n, so the higher n gets, the _________
_________________________ of your estimates
339
Example: You are designing the Immigrant Citizens Survey (data discussed earlier). You want to provide a
95% confidence interval for the proportion of recent immigrants who are adult men, and you anticipate
that 40% of immigrants will be adult men.
a) If you want your margin of error to be only 5%, how many recent immigrants should you randomly
sample (assume 40% of immigrants sampled are adult men)?
b) By what factor would you need to increase the sample size to cut the margin of error in half? You can
use your answer for part (a) as an example.
Discussion
Discussion Question: You are designing the Immigrant Citizens Survey (data discussed earlier). You want
to provide a 95% confidence interval for the proportion of recent immigrants who are adult men. You
have no prior data about this proportion. If you want to ensure your margin of error is no more than 1%,
how many recent immigrants should the study randomly sample?
340
How much of the Earth is covered by water?
What proportion of the Earth is covered by water? We will investigate this question by
taking a random sample of locations on the globe.
1. How many locations did your class sample? ______ How many locations were water? ______
2. Calculate the proportion of locations from your sample that are water. pĖ‚ = _______
!
3. Construct a 95% confidence interval to estimate the proportion of the Earth that is water.
341
Estimating a Population Proportion:
Important ideas:
Check Your Understanding
A community activist group in Austin, Texas wanted a particular issue to be placed on the ballot of the
upcoming election. To make it on the ballot, 20,000 valid signatures were needed. The group turned in
their petition with 24,598 signatures. To pass the validity test 20,000/24,598 = 81.3% of the signatures
must be valid. It is too time consuming to check all of the signatures, so a random sample of signatures
are checked. The individual checking the signatures needs to be 95% confident that the true proportion
of valid signatures are estimated with, at most, a 2% margin of error.
1. Using a conservative estimate for 𝑝Ė‚ , how large of a sample is needed?
2. In the activist group’s previous petition, 85% of the signatures were valid. Using this value as a
guess for 𝑝Ė‚ , find the sample size needed for a margin of error of at most 2 percentage points
with 95% confidence. How does this compare with the required sample size from Question 1?
3. What if the company president demands 99% confidence instead of 95% confidence? Would
this require a smaller or larger sample size, assuming everything else remains the same?
Explain your answer.
342
Immigration
In October of 2018, a caravan of migrant asylum-seekers from central America traveled to the U.S.
There were thousands in the caravan, many escaping violence in their home countries. Liberalleaning news sources like MSNBC displayed mostly images of women and children in the caravan,
which evoked compassion and sympathy. Conservative-leaning news sources like Fox displayed
mostly images of single adult men acting aggressively, evoking fear and a need for security.
So, what proportion of asylum-seekers are single adult men vs. families? Unfortunately, the U.S.
doesn’t have very good data to analyze on this topic. But in Europe, good migration data has been
collected. We’ll use a sample from the population of EU migrants to create a one-sample zinterval for the proportion of EU migrants who are adult men.
The EU Migration Debate and Gender of Immigrants
Vital International
AP Photo/Michael
Svarnias
Depictions of migrants in the EU
range from sympathetic
portrayals of families escaping
violence to fear-inducing
portrayals of groups of single
adult men illegally crossing
borders.
“As you can see from this picture, most of the people coming are young
males and, yes, they may be coming from countries that are not in a very
happy state, they may be coming from places that are poorer than us, but the
EU has made a fundamental error that risks the security of everybody.”
– Nigel Farage, 2016 (U.K.)
“I said to myself, ‘Wow. They’re all men.’ You look at it. There are so few
women and there are so few children. And not only are they men, they’re
young men. And they’re strong as can be—they’re tough looking cookies. I
say, what’s going on here?”
– Trump, speaking about the EU while campaigning in 2015
Let’s test these claims by collecting our own sample!
The beads represent the immigrant population of Madrid, which has some of the best records* on
EU migration. The orange beads represent males. The blue beads represent non-males.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
343
1. How many people did your class sample? ______ How many people were male? ______
2. Calculate the proportion of people from your sample that were male. pĖ‚ = _______
!
3. Construct a 95% confidence interval to estimate the proportion of immigrants who are male.
4. The Immigrant Citizens Survey** conducted a random sample of recent migrants in Madrid’s
comprehensive database. 257 of the 583 immigrants sampled were male. Without performing the
whole procedure, describe how you believe their confidence interval would differ from the interval
above.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
344
AP Statistics Handout: Lesson 8.3
Topics: one vs. two sample inference, two-sample z-interval for a difference of proportions
Guided Notes
Context for Today’s Lesson: The Race/Resumé Study
In this study*, investigators created mock identical resumés, which were sent to job placement ads in
Chicago and Boston. Each resumé was randomly assigned either a commonly-white or commonly-black
name. In total, 246 out of 2445 commonly-white named resumés received a callback and 164 out of
2445 commonly-black named resumés received a callback.
*Bertrand, Marianne and Sendhil Mullainathan. "Are Emily And Greg More Employable Than Lakisha
And Jamal? A Field Experiment On Labor Market Discrimination," American Economic Review, 2004,
v94(4,Sep), 991-1013. https://www.nber.org/papers/w9873
One vs. Two Sample Inference
Study results
CommonlyWhite Names
CommonlyBlack Names
Total
Called back
246
164
410
Not called back
2199
2281
4480
Total
2445
2445
4890
Find the following quantities (show
calculations): 𝑛1 , 𝑛2 , 𝑝Ė‚1, 𝑝Ė‚ 2 , 𝑝Ė‚1 − 𝑝Ė‚ 2
Ė‚𝟏 > 𝒑
Ė‚𝟐
If there’s hiring discrimination, 𝒑
Group 1: White
𝑝Ė‚1 = proportion of commonly-white
name apps that got callback.
𝑝Ė‚1 = _________
>
Group 2: Black
𝑝Ė‚2 = proportion of commonly-white
name apps that got callback.
𝑝Ė‚2 = _________
Are these proportions _____________________________ to show discrimination, or could this
difference have been a result of _________________________?
345
One vs. Two Samples
One-sample situations: you compare a statistic in __________ population against a ____________about
that population.
-Ex: Is the proportion of recent EU migrants who are male
actually lower than the claimed 75%?
Two-sample situations: you measure the ________________________ in _____________________
_____________________ and see if they are significantly different.
-Ex: Is the proportion of callbacks for commonly-white
name apps higher than for commonly-black name apps?
Two-Sample Z-Interval for a Difference of Proportions
Hypotheses: We don’t need to set up hypotheses to construct a confidence interval, but doing so is going
to help us conceptualize why our interval is useful.
ðŧ0 : 𝑝1 = 𝑝2
ðŧðī : 𝑝1 > 𝑝2
The null (______________) hypothesis: there is ___________________,
so the callback rate is the same in both groups. You’re seeing if there’s
evidence to reject this claim.
The alternative (________________) hypothesis: there is discrimination,
in which case the commonly-white named applications received a
________________________ of callbacks.
Rewrite these hypotheses in a more mathematically convenient way:
Where:
𝑝1 is the proportion of _______ applicants with commonly-white names who’d receive callbacks
when applying to jobs like the ones in this study.
𝑝2 is the proportion of _______ applicants with commonly-black names who’d receive callbacks
when applying to jobs like the ones in this study.
Making the interval
Under certain conditions:
𝑝Ė‚2 = _________
Ė‚1 )
𝑝Ė‚ (1−𝑝Ė‚ )
~ Norm(𝜇 = 𝑝Ė‚1 - 𝑝Ė‚2 , 𝜎 = √𝑝Ė‚1 (1−𝑝
+ 2 𝑛 2)
𝑛
1
2
(1−.101)
.067 (1−.067)
~ Norm(𝜇 = .034, 𝜎 = √.1012445
+
)
2445
~ Norm(𝜇 = .034, 𝜎 = 0.0079)
Confidence Interval: .034 ± 1.96(0.0079)
(1.85%, 4.95%)
346
Confidence Interval: (1.85%, 4.95%)
Interpretation: We are __________________ the interval from _____________________ captures the
________________________ in proportion of callbacks for resumés with commonly white vs. black
names (among jobs similar to the ones in this study).
Significance: Since ________________ our interval, it’s _____________________ to assume 𝑝1 = 𝑝2 .
Therefore, we have convincing evidence that commonly white name resumés receive a _____________
callback rate (among jobs similar to the ones in this study).
Formula for confidence interval:
95% Interval
95% Confidence
Interval Formula
C% Interval
statistic ± margin of error
C% Confidence
Interval Formula
𝑝Ė‚ (1 − 𝑝Ė‚ )
𝑛
95% One-Sample 𝑝Ė‚ ± 𝟏. 𝟗𝟔 ∗ √
Interval Formula
C% One-Sample
Interval Formula
𝑝Ė‚1 (1 − 𝑝Ė‚1 ) 𝑝Ė‚2 (1 − 𝑝Ė‚2 )
95% Two-Sample
(𝑝Ė‚1 - 𝑝Ė‚2 ) ± 𝟏. 𝟗𝟔 ∗ √
+
Interval Formula
𝑛1
𝑛2
C% Two-Sample
Interval Formula
𝟏. 𝟗𝟔: the critical value for a 95% z-interval. It means
we’re include ~2 standard errors in the interval
statistic ± margin of error
𝑝Ė‚ (1 − 𝑝Ė‚ )
𝑝Ė‚ ± 𝒛∗ ∗ √
𝑛
(𝑝Ė‚1 - 𝑝Ė‚2 ) ± 𝒛∗ ∗ √
𝑝Ė‚1 (1 − 𝑝Ė‚1 ) 𝑝Ė‚ 2 (1 − 𝑝Ė‚2 )
+
𝑛1
𝑛2
𝒛∗: the critical value of the z-interval. Tells you how
many standard errors you’re including in your interval.
In the Bertrand-Mullainathan race/resumé study, mock identical resumés were sent to job placement ads
in Chicago and Boston. Each resumé was randomly assigned either a commonly-white or commonlyblack name. In total, 246 out of 2445 commonly-white named resumés received a callback and 164 out
of 2445 commonly-black named resumés received a callback.
a) Construct and interpret a 95% confidence interval for the difference in callback rates for commonly
white vs. black named resumes (among jobs like the ones in this study).
b) Use your interval to draw a conclusion about whether callback rates differ by commonly white vs.
black name.
347
Lesson 8.3 Discussion
Statistically Significant vs. Practically Important
Statistically Significant Difference: Difference is too large to arise by ____________________.
• Confidence intervals and hypothesis tests tell us this
Practically Important Difference: Difference is large enough to have _____________________________
in the world.
• Descriptive statistics and our experiences tell us this.
Let’s look at the study results one more time:
CommonlyWhite Names
CommonlyBlack Names
Total
Called back
246
164
410
Not called back
2199
2281
4480
Total
2445
2445
4890
𝑝Ė‚1 = 10.1%
𝑝Ė‚2 = 6.7%
𝑝Ė‚1 − 𝑝Ė‚2 = 3.4%
Confidence interval for difference in
callback rates: (1.8%, 4.9%)
Discussion Question: We found the difference between callback rates was statistically significant, but is
it practically important? Look at the data, the callback rates, and the final confidence interval. Do you
think the margin between black/white callback rates is practically important? Explain your reasoning.
348
8.3 PRACTICE!
Presidential Approval
Many news organizations conduct polls asking adults in the United States if they approve of the job the
president is doing. How did President Obama’s approval rating change from August 2009 to September 2010?
According to a CNN poll of 1024 randomly selected U.S. adults on September 1-2, 2010, 50% approved of
Obama’s job performance. A CNN poll of 1010 randomly selected U.S. adults on August 28-30, 2009, showed
that 53% approved of Obama’s job performance. Use the results of these polls to construct and interpret a 90%
confidence interval for the change in Obama’s approval rating among all US adults. Based on your interval, is
there convincing evidence that Obama’s job approval rating has changed?
349
Which grade is more likely to go to Prom?
At many high schools, Prom is an annual dance that only Juniors and Seniors can purchase
tickets for. The student council at a large high school is wondering if Juniors or Seniors are
more likely to attend Prom. They take a random sample of 50 Juniors and find that 28 are
planning on attending Prom. They select a random sample of 45 Seniors and 29 are
planning on attending. Construct and interpret a 95% confidence interval for the difference
in proportions of Juniors and Seniors who are planning on attending Prom.
What is the point estimate for…
the proportion of Juniors planning on attending prom? 𝑝
"! = ________
"" = ________
the proportion of Seniors planning on attending prom? 𝑝
the difference in the proportion of Jrs and Srs planning on attending prom? 𝑝
"! − 𝑝
"" = ________
1. Construct and interpret a 95% confidence interval for the difference in proportions of Juniors
and Seniors who are planning on attending prom.
2.
Does the interval provide convincing evidence that Juniors have a lower proportion planning
on going to prom or is it plausible that there is no difference between the two classes?
Explain.
350
Constructing a Confidence Interval for p1 – p2
Important ideas:
Check Your Understanding
In a social study, a random sample of 150 teachers were selected and an independent random sample
of 100 nurses were selected. Each person was asked if they currently have a second job. The results
showed that 48 of the 150 teachers and 21 of the 100 nurses had a second job. Construct and interpret
a 95% confidence interval for the difference in the proportion of all teachers and nurses that have a
second job.
351
AP Stats Chapter 8 Formula Study Sheet
Lesson
One Sample Proportion
Symbol for statistic
Symbol for parameter
Name of the procedure
RANDOM condition
Independent condition
NORMAL condition
Formula for standard error
Formula for margin of error
General formula for confidence
interval
Specific formula for confidence
interval
352
Two Sample Proportion
Sample size practice
1. Edgar Martinez and Ingrid Gustafson are the candidates for mayor in a
large city. We want to estimate the proportion p of all registered voters in
the city who plan to vote for Gustafson with 95% confidence and a margin
of error no greater than 0.03. How large a random sample do we need?
2. High school students who take the SAT Math exam a second time
generally score higher than on their first try. Past data suggest that the
score increase has a standard deviation of about 50 points. How large a
sample of high school students would be needed to estimate the mean
change in SAT score to within 2 points with 95% confidence?
3. A college student organization wants to start a nightclub for students
under the age of 21. To assess support for the idea, the organization will
select an SRS of students and ask each if he or she would patronize this
type of establishment. What sample size is required to obtain a 90%
confidence interval with a margin of error of at most 0.04 if they suspect
pˆ = .75?
353
1. Suppose we are interested in finding out the proportion of the population at EKHS that
has seen The Office. We contact an SRS of 100 students in the school. Of these 100
students, 63 report seeing The Office. Find a 95% confidence interval for the true
proportion of EKHS students who have seen The Office.
2. Mrs. Statistics was an all-star basketball player in high school. To prove that she still has
skills, she took 50 free throws and made 31 of them. Think of these 50 shots as being a
random sample of all the free throws she has ever taken. Find a 99% confidence interval for
the true proportion of free throws Mrs. Statistics would make.
354
355
2010 AP® STATISTICS FREE-RESPONSE QUESTIONS
3. A humane society wanted to estimate with 95 percent confidence the proportion ofhouseholds in its county that
own at least one dog.
(a) Interpret the 95 percent confidence level in this context.
The humane society selected a random sample of households in its county and used the sample to estimate the
proportion of all households that own at least one dog. The conditions for calculating a 95 percent confidence
interval for the proportion of households in this county that own at least one dog were checked and verified, and
the resulting confidence interval was 0.417 ± 0.119.
(b) A national pet products association claimed that 39 percent ofall American households owned at least one
dog. Does the humane society's interval estimate provide evidence that the proportion ofdog owners in its
county is different from the claimed national proportion? Explain.
(c) How many households were selected in the humane society's sample? Show how you obtained your answer.
356
Unit 6
Chapter 9
Testing Claims
About Proportions
357
AP Statistics Handout: Lesson 9.1
Topics: hypotheses, logic of hypothesis tests, p-values and alpha levels
Guided Notes
Flint Water Crisis:
Samuel Wilson
(MLive.com)
Jake May (The
Flint Journal,
AP Images)
WNEM Newsroom
2014
2014/2015
2015
Flint city officials celebrate as they
cut costs by switching the water
supply from Detroit to the Flint River.
Residents voice concerns
about declining water quality.
Mayor drinks tap water on local
television to “prove” the water is safe.
Outside investigators start testing
Flint’s water systems.
Hypotheses
Null hypothesis (ðŧ0 ): The __________________ belief about a parameter’s value.
• Often called the “________” hypothesis – nothing new or interesting happening
• Assumed ___________________________, until evidence convinces us otherwise
Alternative hypothesis (ðŧðī ): Our unproven belief about a _______________________________, for
which we have to gather evidence.
• Often called the “___________________” hypothesis – it’s what our studies aim to demonstrate
• Not assumed to be true: we have to ____________________________________ to support it.
Court System Analogy
Person on trial is assumed to be innocent (the default/dull belief - ðŧ0 ), until proven to be guilty through
evidence (ðŧðī )
Two possible outcomes:
• There’s enough evidence to reject their innocence (Reject ðŧ0 )
• There’s not enough evidence to reject their innocence (Fail to Reject ðŧ0 )
Note: you ________________________. It’s assumed to be true already, so our research isn’t proving it.
• This is why you’re declared “_______________________” rather than “innocent” in court →
your innocence was assumed at the beginning.
358
Flint Hypotheses
Non-numerical version:
ðŧ0 : Flint’s water is _____________ to drink
• Default/dull belief (___________________)
ðŧðī : Flint’s water is ________ safe to drink
• Unproven research hypothesis we aim to
demonstrate (_______________________)
Numerical version:
ðŧ0 :
ðŧðī :
p: true proportion of all homes in Flint
with high lead content in their water
According to EPA regulations, if ___________________________ of homes in a city have high lead
content in their water (>15 parts per billion), the city’s water supply is unsafe.
Logic of Hypothesis Tests
Hypothesis tests are statistical jiu-jitsu:
• You assume the _________________
• You calculate how
___________________ your sample
data is if ðŧ0 is true
• If it’s unlikely enough, you
_______________
You assume ðŧ0 to be true in order to
eventually ___________________________.
Flint hypothesis test logic (numeric example):
Goal: Find enough statistical evidence to _______________ and
support the ________
ðŧ0 : p = 0.10
ðŧðī : p > 0.10
1. Start by assuming the null (the default) ____________.
• We assume that water in Flint is safe by EPA standards:
_________________________ of all homes have lead levels
above 15 ppb.
p: true proportion of all homes in
Flint with high lead content in their
water
2. Ask yourself: if the null is in fact true, how ___________ is the data that you’ve gathered?
• I randomly sampled water from 100 homes and 30 of them had high lead content (30%). This sample
______________________ under my earlier assumption that only 10% of homes in the whole city
had high lead content.
3. Draw a conclusion:
• Under my assumption that only 10% of homes have high lead content, the actually observed data
(30% of 100 homes had high lead content) is highly unlikely. So, ____________________________
___________________________. There’s convincing evidence that __________________________
of homes have high lead content.
359
p-values and Alpha (α) Levels
What is “unlikely?”
ðŧ0 : p = 0.10
ðŧðī : p > 0.10
For each case, comment on the conclusion we’d draw about our hypotheses:
Case A: Sample 1000 homes, 300 have high lead. 𝑝Ė‚ =
Case B: Sample 10 homes, 2 have high lead. 𝑝Ė‚ =
2
10
300
1000
= 0.30
= 0.20
Case C (real study* data): Sample 252 homes, 42 has high lead. 𝑝Ė‚ =
42
252
= 0.17
p-value: The ____________________ of observing your measured statistic value (or one more extreme)
if you ____________________________________.
o Stands for “probability value”
o Helps us determine how ___________________ our sample data is under the null
Is there convincing statistical evidence that the Flint water system is unsafe? A Virginia Tech study*
42
randomly sampled water from 252 homes in Flint. Of those, 42 had high lead content. 𝑝Ė‚ =
= 0.17
252
This phrase means you should do
a _____________________
ðŧ0 : p = 0.10
ðŧðī : p > 0.10
If it’s true that only 10% of all homes have high lead content (ðŧ0 ), the probability that 17% (or more) of
the sampled homes would have high lead content is only ________
Since the p-value is _______________, we have serious doubts about our null
hypothesis assumption. We can reject the null hypothesis!
*Edwards, M., et al. Virginia Tech’s Flint Water Study (published Sept. 2015, accessed July 2020):
http://flintwaterstudy.org/information-for-flint-residents/results-for-citizen-testing-for-lead-300-kits/
360
_______________
p-values, alpha levels, and conclusions
When the p-value is low…
• The data we actually observed is ________________ under the null hypothesis assumption
• We can _______________________ assumption
When the p-value isn’t that low…
• The data we actually observed ______________________ under the null hypothesis assumption
• We ________________________ the null assumption
Significance level (α): the ‘___________________’ for p-values at which we reject the null hypothesis
• Commonly, we use α = 0.05
Rules:
(p-value < α) → we reject ðŧ0 → _________________________________ of ðŧðī
(p-value ≥ α) → we fail to reject ðŧ0 → ______________________ convincing evidence of ðŧðī
Conclusions template: Because our p-value (____) is less/greater than our alpha level (___), we
reject/fail to reject ðŧ0 . We do/don’t have convincing evidence that (ðŧðī in context).
Write the full conclusion: Is there convincing statistical evidence that the Flint water system is unsafe?
361
Lesson 9.1 Discussion
You: The p-value is low, so we have convincing evidence the lead levels are unsafe!
The Mayor: What are you talking about? I tested my own home’s water, and it’s fine.
You: But statistically…
The Mayor: Statistically, I know every person on my block still drinks their water. It’s
perfectly fine!
WNEM Newsroom
Discussion Question: How would you explain what this p-value means to someone who doesn’t know
any statistics? Be clear and specific.
362
Is Mrs. Gallas a good free throw shooter?
Mrs. Gallas claims she is an 80% free throw shooter. To prove her skills she shoots 50 free
throws and makes 32 shots. Is Mrs. Gallas exaggerating about her free throw skills?
1. Identify the population, parameter, sample and statistic.
Population:
Parameter:
Sample:
Statistic:
2. There are two possible explanations for why Mrs. Gallas only made 32/50 shots.
1.)
2.)
To test Mrs. Gallas’ claim, we will assume #1, she is an 80% free throw shooter, and
examine the likelihood that she makes 32/50 shots through simulation.
3. Use the spinner provided to simulate 50 free throws shot by an 80% free throw shooter
by spinning 50 times. What is your sample proportion of shots made?
4. Repeat for another sample of 50 spins. Calculate the sample proportion.
5. Add your sample proportions to the dotplot on the board. Each person in your group
should add two dots to the board. Sketch the dotplot below.
proportion of FT made
363
6. What does each dot represent?
7. One student says, “Each dot represents the proportion of free throws made out of 50 free
throws shot by Mrs. Gallas.” Is this correct? Explain.
8. What percentage of the dots represent a percentage of 64% or less?
Interpret this percentage in context.
9. Based on your answer to Question 8, does the observed pĖ‚ = 0.64 result give convincing
evidence that Mrs. Gallas is exaggerating? Or is it plausible that an 80% shooter can have a
performance this poor by chance alone?
364
AP Statistics Handout: Lesson 9.1 part 2
Topics: bias vs. error, Type I/II errors, problem of multiple tests, power
Guided Notes
Poll after poll after poll predicted Hillary Clinton to trounce Donald Trump in the 2016
election. Yet…
Actual election results from Nov 8th, 2016:
Statistical predictions from Nov 7th, 2016
(chance of winning):
Trump wins electoral college
One of the biggest surprises: Wisconsin
Every 2016 Wisconsin poll shows lead for Clinton
Voted for democrat every election since 1992
Election Day 2016:
Trump Wins Wisconsin
365
Bias vs. Error
Bias: when our estimators _____________________________ under or overestimate a parameter value.
Error: when an _______________ estimator produces an estimate that slightly “misses its mark” due to
____________________________ in the sampling process.
Example: Among Wisconsin voters planning to vote for a major party candidate (Clinton or Trump),
what proportion will vote for Trump? Imagine 60% of all Wisconsin major party voters will vote for
Trump.
Welcome to Theory Land – “The not real place.”
All Wisc. major party voters
Realm of Perfect Parameters
Realm of Sucky Statistics (n = 30)
What proportion of all Wisconsin major party voters
will vote Trump? Let’s get a sample!
Random sample possibility #1:
Random sample possibility #2
Random sample possibility #3:
Resulting sampling distribution (all possible random samples of size n = 30):
Unbiased sampling method: doesn’t tend to
systematically under/overestimate
(_____________________ at true parameter value).
A random sample (from an unbiased estimation process) produces an underestimate (an error):
By chance, we randomly sampled fewer Trump voters
than expected. This produces ________________.
This is why we give confidence intervals ___________
____________________. For unbiased estimators, we
still expect errors because of chance variation in our
random sampling process.
366
Were the Wisconsin polls systematically biased or did they underestimate Trump by chance alone (i.e.
make an error)?
If the polls underestimated Trump by chance alone…
Polls would ___________ at true proportion of Trump voters, with
some errors __________________ due to chance variation in
sampling. Most would capture true proportion
___________________________.
What we actually saw from Wisconsin polls…
Why were Wisconsin polls biased? Several sources of possible bias…
1. Nonresponse bias: Education
• Less educated people are less responsive to polls.
• Usually this isn’t a big problem since less educated people tend to split their votes, but this time
they voted heavily towards Trump. The polls didn’t adjust for this and underestimated Trump’s
vote totals.
• Effect was big in Wisconsin, since state has many white voters without college degrees.
2. Undercoverage bias: Undecided Voters
• Polls cannot figure out how undecided voters will cast their ballots on election day.
• Usually, this is not a problem since undecided voters tend to split votes between candidates.
• Some results suggest that undecided voters broke more towards Trump than expected in
Wisconsin, so these votes were undercovered by the polls.
3. Other potential bias from higher than expected Trump voter turnout and lying to pollsters (although
these theories have less evidence than the first two).
Why do we always have to check conditions? → One reason: making estimates (confidence intervals) or
evaluating claims (hypothesis tests) based on biased sampling is __________________.
• Our inference methods account for _____________. They don’t account for ___________.
• The random condition is one way to check for bias.
367
Type I and Type II Errors
Let’s take an unbiased situation: It’s November 8th, 2016. Voting just closed in Wisconsin and you want
to quickly estimate the proportion of voters who voted for Clinton. It would take a long time to count all
the votes, so you get an SRS of 500 votes from all Wisconsin ballot boxes.
According to many election experts, candidates that get 47% or more of the vote tend to win elections
(about 3% of votes go to third party candidates or “Mickey Mouse”). Of the 500 sampled votes, 255
were votes for Clinton. Use a hypothesis test to determine if there is convincing evidence that Clinton
will win Wisconsin
Hypotheses:
ðŧ0 : ____________
ðŧðī : ____________
Where p is the __________________ of voters who will choose Clinton.
Significance Level: α = 0.05
Calculations:
Let’s assume ðŧ0 is true: exactly 47% of Wisconsin will vote Clinton. We get the following sampling
distribution for random samples of 500 voters each (n = 500):
α = 0.05 → will only reject with a p-value of 0.05
or less
5% chance of getting an unusual sample that
makes us reject 𝑝0 _______________________!
Should we actually reject 𝒑𝟎 and conclude
_____________________ Wisconsin? Or did we
just get an unusually Clinton-supportive sample
by ________________________?
Type I Error
If the Clinton voter rate is truly 0.47, the probability of getting a sample proportion of .51 is less than 5%.
So, we reject ðŧ0 and conclude Clinton will win Wisconsin.
But she ___________________ Wisconsin!
By chance, the 500 people we randomly sampled happened to be more supportive of Clinton than the
__________________________. We falsely rejected ðŧ0 because of __________________________.
Type I Error: When you _____________________ but ðŧ0 is actually true.
• Type I Errors happen α% of the time (so, for most tests, 5% of the time we make a Type I error)
• α (the significance level) is the ______________________________.
• If you are scared of a Type I error, ________________________ to below 0.05. This makes
rejecting the ðŧ0 more difficult, but it also ______________________________.
368
Type II Error
Type II Error: When you fail to reject ðŧ0 but ðŧ0 _________________________. In other words, you
should have rejected it but didn’t.
Problem of Multiple Tests
Scenario:
• Imagine you really want Clinton to win Wisconsin. You take one random sample of 500 ballots
230
and find 230 voted for Trump (𝑝Ė‚ =
= 46%).
500
•
You know Wisconsin usually votes Blue, so you think this must be an unusual sample. You take
215
another sample of size 500. In this one, 215 voted for Clinton (𝑝Ė‚ =
= 43%).
•
You keep taking samples until finally you find one where 257 out of 500 voted for Clinton (𝑝Ė‚ =
257
= 51.4%). You conclude there is statistically significant evidence Clinton will win!
500
500
What is the problem with this scenario?
The Problem of Multiple Tests
• When you run a test and sample many times, you may be “p-hacking” → fishing for a sample
that will prove your point, even though all you’re really doing is
___________________________________.
• If 𝛞 = 0.05, then 5% of the time we’ll get a falsely significant result from an unusual sample
(_________________________).
1
• 0.05 = → this means that roughly __________________ we take, we’ll get a significant result
20
by chance. So if you take samples over-and-over again, you’re bound to get a significant result
eventually, ____________________________.
• This is called the ___________________________________.
Avoiding the Problem of Multiple Tests
Get one sample of the highest sample size you can muster → whatever results from that sample are
your final results.
______________________________ or do many ________________ tests. If you have to repeat a test,
____________________ level.
369
Power
Power: the __________________ of correctly rejecting ðŧ0 when it’s false.
Effect size: the __________________ between a proposed null hypothesis value and the actual
parameter value.
Power is the __________________ of the Type II Error rate. It’s the probability of correctly rejecting ðŧ0
when it’s false (Type II Error is failing to reject ðŧ0 when it’s false).
Type II Error Rate = _________________
Usually, you want a test to have ______________________: so you know that if ðŧ0 is not true, you have
a __________________________ of rejecting it.
Example: New Drug to Lower Blood Pressure
Your research team is testing the effectiveness of a new systolic blood pressure drug—Drug B. The
doctors on the research team say that if you can lower systolic blood pressure by 10 points or more,
such a difference will have meaningful health benefits. If you lower blood pressure by fewer than 10
points, it’s not a medically meaningful difference.
Blood pressure changes are distributed normally and have a standard deviation of about 21.5 points
between people. You give the drug to a group of 20 subjects. What is your power to detect a true
average blood pressure drop by 10 points?
Hypotheses: Where μ = average difference in blood pressure after taking the drug.
ðŧ0 : ___________
ðŧðī : ___________
Significance Level: α = 0.05
Calculations: Start by assuming ð‘Ŋ𝟎 is true and determine the rejection zone
Any sample mean with a blood pressure drop
greater in magnitude than 7.88 points should
result in _________________________ the null.
α = 0.05
Next - Assume ð‘Ŋð‘Ļ is true and see what sample means we’re likely to get…
The ____________ of this test to detect a 10
point drop in blood pressure is 67%--if the true
__________________ is -10, we will reject ðŧ0
only 64% of the time.
370
Original power: 67%
α = 0.05
Increase power method 1: increase α
α = 0.15
Problem: Recall that α is the _________________
________. So even though increasing α increases
the power, it also increases the ___________ of a
Type I error. So, don’t increase α just to increase
power!
Increase power method 2: raise effect size
True average blood pressure drop from Drug B is
now 13 points!
Problem: _________________ to raise drug’s
effect size without changing the drug itself!
Increase power method 3: raise sample size (n)
Let’s raise the sample size (n) so that our estimates
become more _______________ (i.e. less chance
of an unusual sample).
Usually raising n is the _________ way to increase
power.
Why it makes sense that raising the sample size would also raise the power:
-If a drug is effective with 3 people, you may ___________________________ it will work for everyone.
-If a drug is effective with 10,000 people, you will be more sure it’s ____________________ effective.
-If the drug is effective, increasing the sample sizes makes you ____________________ to detect its
effectiveness because your estimates are more precise.
Summary: How to increase power
1. Increase α level  _______________ (most of the time)
2. Increase effect size  __________________ (most of the time)
3. Increase sample size (n)  ___________ (most of the time)
371
Lesson 7.6 Discussion
Comic from XKCD: xkcd.com/882
Discussion Question: What statistical problem is this cartoon pointing out? How might this problem be
an issue for scientists testing new ideas, medicines, or theories?
372
Is this gender discrimination?
A local engineering firm had to conduct a series of layoffs recently. The company has 180
employees that could be laid off and they will choose 10. All are equally qualified so the
company decides to use a lottery system to be carried out by the manager to decide who
will be laid off. The manager posts a list of the employees to be laid off. Five employees are
women and five are men. One of the women claims this is gender discrimination and starts
a lawsuit against the company.
1. The manager responds, “How could there be gender discrimination when half of the
employees laid off were female and half were male?” What additional information do
you need to evaluate this statement?
2. How can you use a simulation to investigate the gender discrimination claim? Detail a
process that could be used.
3. Perform one repetition of the simulation. Proportion of females in sample = _______
4. Add your proportion to the class dotplot. Copy below.
5. What percentage of the dots represent half or more females being laid off?
6. Interpret this percentage in context.
7. Do you have convincing evidence of gender discrimination? Explain.
373
Significance Tests: The Basics
Important ideas:
Check Your Understanding
Factinate.com claims that 84% of teenagers think highly of their mother. To investigate this claim, a
school psychologist selects a random sample of 150 teenagers and finds that 135 think highly of
their mother. Do these data provide convincing evidence that the true proportion of teens who think
highly of their mother is greater than 0.84?
1. State appropriate hypotheses for performing a significance test. Be sure to define the parameter
of interest.
The school psychologist performed the significance test and obtained a P-value of 0.0225.
2. Explain what it would mean for the null hypothesis to be true in this setting.
3. Interpret the P-value.
4. What conclusion would you make at the α = 0.05 level?
374
Are you sure Mrs. Magic isn’t a good free throw shooter?
In our introduction to significance tests, we used simulation to estimate a P-value to
decide whether or not Mrs. Magic as exaggerating about her free throw percentage.
Today, we will use a formula to find a P-value.
1. We’re going to carry out the significance test from lesson 9.1 again. Begin by writing
the hypotheses.
2. a. Each class found a different P-value because each dotplot was different. Would it be
appropriate to use a Normal distribution to model the sampling distribution of pĖ‚ ?
Justify your answer.
b. Are there any other conditions we should check?
3. Now that conditions have been met, find the mean and standard deviation of the
sampling distribution of pĖ‚ .
4. Use the mean and standard deviation you found to label the Normal curve.
5. How many standard deviations below the
mean (z-score) is pˆ = 0.64 ? Label it on
the normal curve.
6. Find the probability of an 80% shooter
making 32/50 ( pˆ = 0.64 ) or less.
7. What conclusion can we make?
375
Significance Test for p
Important ideas:
Check Your Understanding
Sharon claims that 90% of students can identify the smell of a skunk. She carries out a study to test
this theory. She selects a random sample of 100 students and asks them each to take a whiff from a
bag that is filled with skunk smell. She finds that 84 are able to correctly identify the smell as that of
a skunk. She would like to know if these data provide convincing evidence that less than 90% of
students can identify the smell of a skunk. Use α = 0.05.
a. State appropriate hypotheses for performing a significance test. Be sure to define the
parameter of interest.
b. Explain why the sample result gives some evidence for the alternative hypothesis.
c. Check if the conditions for performing the significance test are met.
d. Calculate the standardized test statistic and P-value.
e. What conclusion should Sharon make?
376
AP Statistics Handout: Lesson 9.2
Topics: one-sample z-test for a proportion, four-step procedure, two-sided tests
Guided Notes
Flint Water Crisis:
Samuel Wilson
(MLive.com)
Jake May (The
Flint Journal,
AP Images)
WNEM Newsroom
2014
2014/2015
2015
Flint city officials celebrate as they
cut costs by switching the water
supply from Detroit to the Flint River.
Residents voice concerns
about declining water quality.
Mayor drinks tap water on local
television to “prove” the water is safe.
Outside investigators start testing
Flint’s water systems.
One-sample z-test for a proportion
Researchers’ claim: the new water from the Flint river corrodes pipes, so lead gets into the water.
• Among many other side effects, ingesting lead can hurt children’s brain development
According to EPA regulations, if ___________________________ of homes in a city have high lead
content in their water (>15 parts per billion), the city’s water supply is unsafe.
Is there convincing statistical evidence that the Flint water system is unsafe? A Virginia Tech study*
42
randomly sampled water from 252 homes in Flint. Of those, 42 had high lead content. 𝑝Ė‚ =
= 0.17
252
Hypotheses
Non-numerical version:
ðŧ0 : Flint’s water is _____________ to drink
• Default/dull belief (___________________)
ðŧðī : Flint’s water is ________ safe to drink
• Unproven research hypothesis we aim to
demonstrate (_______________________)
Numerical version:
ðŧ0 :
This phrase means you should
do a _____________________
ðŧðī :
p: true proportion of all homes in Flint
with high lead content in their water
*Edwards, M., et al. Virginia Tech’s Flint Water Study (published Sept. 2015, accessed July 2020):
http://flintwaterstudy.org/information-for-flint-residents/results-for-citizen-testing-for-lead-300-kits/
377
Under the hood: calculating z-test statistics and p-values for a one proportion test
Goal: Find enough statistical evidence to reject the ð‘Ŋ𝟎 and
support the ð‘Ŋð‘Ļ
ðŧ0 : p = 0.10
ðŧðī : p > 0.10
1. Start by assuming the null (the default) is true.
• We assume that water in Flint is safe by EPA standards:
no more than 10% of all homes have lead levels above
15 ppb.
p: true proportion of all homes in
Flint with high lead content in their
water
1. We assume _______________________
in the population of flint have high lead
water.
p = 0.10
2. Ask yourself: if the null is in fact true, how likely is the
data that you’ve gathered?
2. The flint study collected water from 252
homes (n = 252). At this sample size, what
estimates for p are we _______________?
Let’s find the ________________________ of
𝑝Ė‚ ’s from all possible random samples.
Under certain conditions:
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = 𝑝0 , 𝜎𝑝Ė‚ = √
𝑝0 (1−𝑝0 )
𝑛
)
𝑝Ė‚ ~ Normal(𝜇𝑝Ė‚ = _____, 𝜎𝑝Ė‚ = _______)
Now we look at the data we gathered (sample).
Ė‚ = _________
42 out of 252 homes had high-lead water: 𝒑
Under the null assumption, how
Ė‚)?
unlikely was the data we observed (𝒑
Two ways to measure that:
1. z-test statistic
2. p-value
378
Using the z-test statistic:
Z-Scores
Shows how many σ a data value
is above/below the mean.
Z-Test Statistics
Shows how many σ a ___________ (sample) is above/below _____.
𝑧=
data value − mean
𝑧=
standard deviation
z=
ð‘Ĩ𝑖 − µ
z=
𝜎
𝑠𝑎𝑚𝑝𝑙𝑒 ð‘ ð‘Ąð‘Žð‘Ą − 𝑛ð‘Ē𝑙𝑙 ð‘Ģ𝑎𝑙ð‘Ē𝑒
ð‘ ð‘Ąð‘Žð‘›ð‘‘ð‘Žð‘Ÿð‘‘ 𝑑𝑒ð‘Ģð‘–ð‘Žð‘Ąð‘–ð‘œð‘›
𝑝Ė‚−𝑝0
𝜎𝑝
Ė‚
→ z=
𝑝Ė‚−𝑝0
𝑝 (1−𝑝0 )
√ 0
𝑛
The ______________ the test statistic (in magnitude), the more _____________ your measurements are
under the null assumption. You start to ________________________________.
𝑧=
𝑝Ė‚ − 𝑝0
=
𝜎𝑝Ė‚
𝑧=
If 2 standard deviations away from 𝑝0
would be unlikely, then 3.5 standard
deviations away is ___________________.
Using the p-value: exactly how unlikely was the proportion we sampled?
a) This probability is the p-value. Explain why:
b) Show the calculator commands and final
result for the p-value:
3. Draw a conclusion:
Under my assumption that only 10% of homes have high lead content, the actually observed data
(16.7% of 252 homes had high lead content) is highly unlikely (__________________). So, ____________
_________________________. There’s convincing evidence that ___________________of homes have
high lead content.
379
According to the EPA, no more than 10% of homes in a water system should have high lead levels (>15
parts per billion). A Virginia Tech study* randomly sampled water from 252 homes in Flint. Of those, 42
42
had high lead content. 𝑝Ė‚ =
= 0.167
252
Is there convincing statistical evidence that the Flint water system is unsafe?
Calculator steps for one-sample z-test for a proportion:
𝑝0 : null value
x: # of ‘successes’
n: sample size
Prop: ðŧðī direction
STAT → TESTS →
5: 1-PropZTest…
Calculate
380
Output
Two-Sided Tests
One-Sided Test
Two-Sided Test
According to the EPA, no more than 10% of
homes in a water system should have high lead
levels (>15 parts per billion). A Virginia Tech
study* randomly sampled water from 252 homes
in Flint. Of those, 42 had high lead content.
42
𝑝Ė‚ =
= 0.167
252
Is there convincing statistical evidence that the
Flint water system is unsafe?
A city official claims that 10% of all homes in Flint
have high lead content in their water. A Virginia
Tech study* randomly sampled water from 252
homes in Flint. Of those, 42 had high lead content.
42
𝑝Ė‚ =
= 0.167
252
Is there convincing statistical evidence that the
percent of homes with high lead content differs
from the official’s claim?
“Differs” means that the percent of
homes with high lead content is
_______________________ than 10%.
“Unsafe” means that _________ of
homes have high lead content in water.
Conceptual Hypotheses
ðŧ0 : Flint’s water is ___________to drink
ðŧðī : Flint’s water is _______ safe to drink
Formal Hypotheses
ðŧ0 : p = 0.10
ðŧðī : p ___ 0.10
Conceptual Hypotheses
ðŧ0 : The proportion of homes in Flint with high lead
content ____ 10%
ðŧðī : The proportion of homes in Flint with high lead
content _________ 10%
Testing whether __________
_____ of all homes have high
lead content in their water.
Formal Hypotheses
ðŧ0 : p = 0.10
ðŧðī : p ___ 0.10
One-sided test: when you’re testing an alternative
in ____________________ (ðŧðī either uses < or >)
Testing whether the actual
proportion of homes with
high lead content in their
water ______________ 10%.
Two-sided test: when you’re testing an alternative
in ___________________________ (ðŧðī uses ≠)
p-value for two-sided test: the probability of
getting results as different or more different from
ðŧ0 in __________________, assuming ðŧ0 is true.
p-value for one-sided test: the probability of
getting results as different or more different from
ðŧ0 in __________________, assuming ðŧ0 is true.
Calculate the two-sided p-value (show your work):
One-sided p-value = 0.0002
381
Two-sided test calculations:
To find the p-value for a two-sided test, just multiply the one-sided p-value _____!
One-sided conclusion:
Two-sided conclusion:
(p-value)
(α)
0.0002 < 0.05
(p-value)
(α)
_______ < 0.05
In both cases,
______________
In fields like medicine, two-sided tests are preferred because their ______________ p-values mean they
are more _________________—you are less likely to get a statistically significant result by chance alone.
Discussion
The confidence interval for the proportion of Flint homes with a high lead level in their water is
(0.121, 0.213).
Discussion Question: Is this confidence interval consistent with the results of our hypothesis test?
Explain.
382
Can you taste the rainbow?
Many students claim that they can taste the different colors of Skittles. Today we will conduct
an experiment and perform a significance test to see if students really can “taste the rainbow”.
Collect data:
How many correct? ______
How many total? ______
383
Significance Test for a Proportion
Important ideas:
Check Your Understanding
1. Sometimes parents and grandparents like to recount how difficult life was when they were kids, such as having to
walk 10+ miles to school (in the snow, uphill both ways). A random sample of 180 teenagers were selected and
40% had heard stories from their parents or grandparents about how difficult life was when they were kids. Do these
data provide convincing evidence at the α = 0.05 significance level that the proportion of all teenagers who have
heard stories from their parents or grandparents about how difficult life was when they were kids differs from 0.50?
2. A 95% confidence interval for the proportion of all teenagers who have heard stories from their parents or
grandparents about how difficult life was when they were kids is (0.328, 0.472). Explain how the confidence interval
is consistent with, but gives more information than, the test.
384
Is Yawning Contagious?
Mythbusters investigated this question. Here’s a brief recap. Each subject was placed in a
booth for an extended period of time and monitored by hidden camera. 34 subjects were
given a “yawn seed” by one of the experimenters: that is, the experimenter yawned in the
subject’s presence before leaving the room. The remaining 16 subjects were given no yawn
seed.
1. Draw an outline of Mythbuster’s experiment.
50 subjects
2. Here are the Mythbusters results.
Subject Yawned?
Yawn seed?
Yes
No
Yes
10
24
No
4
12
Total
14
36
Total
34
16
50
Call p1 the true proportion of yes yawn seed people who yawn. pĖ‚1 = ________
Call p2 the true proportion of no yawn seed people who yawn. pĖ‚ 2 = ________
What
is the
pĖ‚1 - pĖ‚ 2? ___________
he front
of difference
the room. in
Is proportions
it a fair deck?
3. Do the data provide some evidence that yawning is contagious? Why?
4. Adam Savage and Jamie Hyneman, the cohosts of Mythbusters used these data to
conclude that yawning is contagious. Do you agree?
385
In this Activity, your class will investigate whether the results of the experiment are
statistically significant OR if they could have occurred purely by chance due to random
assignment.
4. What is the null hypothesis?
The 50 people in the experiment are represented by the cards. A person is either a yawner
or a non-yawner, no matter which treatment they are randomly assigned.
5. Shuffle the 50 cards and put them into two piles, one group of 34 that gets the yawn
seed and one group of 16 that does not get the yawn seed. Record the proportion of
people who yawned in each group. You will do this three times.
Trial
Proportion who yawned
in yawn seed group, pĖ‚1
Proportion who yawned
no yawn seed group, pĖ‚ 2
Difference in
proportions, pĖ‚1 - pĖ‚ 2
1
2
3
6. Make a class dotplot of the difference in proportions. Sketch below:
7. In what percent of the class’s trials did the difference in proportions equal or exceed
29% - 25% = 4% (what Mythbusters got in their experiment)?
8. What conclusion can you make about whether yawning is contagious?
386
Tests About a Difference in Proportions - Intro
Important ideas:
Check Your Understanding
Mythbusters investigated the question “Is Yawning Contagious?” Here’s a brief recap.
Each subject was placed in a booth for an extended period of time and monitored by
hidden camera. 34 subjects were given a “yawn seed” by one of the experimenters: that is,
the experimenter yawned in the subject’s presence before leaving the room. The
remaining 16 subjects were given no yawn seed.
Yawn seed?
Yes
No
Total
Subject Yawned?
Yes
No
10
24
4
12
14
36
Total
34
16
50
387
Race and Hiring
Your class just performed a modified version of the famous race/resumé study* conducted by
researchers at the University of Chicago. You will now conduct a two-sample z-test for
proportions with the resulting data.
*Bertrand, Marianne and Sendhil Mullainathan. "Are Emily And Greg More Employable Than Lakisha And Jamal? A Field
Experiment On Labor Market Discrimination," American Economic Review, 2004, v94(4,Sep), 991-1013.
https://www.nber.org/papers/w9873
Surprise! The resumé you evaluated was fake. You just took part in an experiment. Each resumé the
class evaluated was completely identical except for one component: the first name of the applicant.
Half of the class got “Emily Jones,” and the rest got “Lakisha Jones.” According to birth certificate
records, the name “Emily” is almost exclusively given to white children. The name “Lakisha” is
almost exclusively given to black children.
Fill in the following based on data from your whole class:
Number of callbacks for Emily: _______
Number of callbacks for Lakisha: _______
Sample size for Emily group: _______
Sample size for Lakisha group: ______
Callback proportion for Emily: ______
Callback proportion for Lakisha: _______
1. Without performing any further calculations, do you think the difference in callback proportions
between the Emily and Lakisha resumés is significant? Why or why not?
2. Is there convincing evidence of a difference in callback rates, based on the name alone?
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
388
3. In the original study, the researchers sent resumés with commonly-white or commonly-black
names (randomly assigned) to firms in Boston and Chicago. In total, 246 out of 2445 commonlywhite named resumés received a callback and 164 out of 2445 commonly-black named resumés
received a callback. Is this evidence more convincing or less convincing of racial bias than the data
from our class experiment? Explain. Don’t perform another full test!
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
389
Tests About a Difference in Proportions
Important ideas:
Check Your Understanding
To study the long-term effects of preschool programs for poor children, researchers designed an
experiment. They recruited 123 children who had never attended preschool from low-income
families in Michigan. Researchers randomly assigned 62 of the children to attend preschool (paid for
by the study budget) and the other 61 to serve as a control group who would not go to preschool.
One response variable of interest was the need for social services as adults. Over a 10-year period,
38 children in the preschool group and 49 in the control group have needed social services.
Do these data provide convincing evidence that preschool reduces the later need for social services
for children like the ones in this study? Justify your answer.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
390
AP Statistics Handout: Lesson 9.3
Topics: two-sample z-test for a difference of proportions and its connection to intervals
Guided Notes
Context for Today’s Lesson: The Race/Resumé Study
In this study*, investigators created mock identical resumés, which were sent to job placement ads in
Chicago and Boston. Each resumé was randomly assigned either a commonly-white or commonly-black
name. In total, 246 out of 2445 commonly-white named resumés received a callback and 164 out of
2445 commonly-black named resumés received a callback.
*Bertrand, Marianne and Sendhil Mullainathan. "Are Emily And Greg More Employable Than Lakisha
And Jamal? A Field Experiment On Labor Market Discrimination," American Economic Review, 2004,
v94(4,Sep), 991-1013. https://www.nber.org/papers/w9873
Two Sample Z-Test for a Difference of Proportions
Study results
CommonlyWhite Names
CommonlyBlack Names
Total
Called back
246
164
410
Not called back
2199
2281
4480
Total
2445
2445
4890
Find the following quantities (show
calculations): 𝑛1 , 𝑛2 , 𝑝Ė‚1, 𝑝Ė‚ 2 , 𝑝Ė‚1 − 𝑝Ė‚ 2
Ė‚𝟏 > 𝒑
Ė‚𝟐
If there’s hiring discrimination, 𝒑
Group 1: White
𝑝Ė‚1 = proportion of commonly-white
name apps that got callback.
𝑝Ė‚1 = _________
>
Group 2: Black
𝑝Ė‚2 = proportion of commonly-white
name apps that got callback.
𝑝Ė‚2 = _________
Are these proportions _____________________________ to show discrimination, or could this
difference have been a result of _________________________?
391
Hypotheses: Recall court analogy: “Innocent until proven guilty”
There is no discrimination,
so the callback rate is the
_______________________
. You’re seeing if there’s
evidence to reject this
default claim.
Non-numerical version:
Numerical version:
ðŧ0 : Employers __________________
ðŧ0 :
• Default/dull belief (___________________)
ðŧðī : Employers ____________ white applicants
ðŧðī :
• Unproven research hypothesis we aim to
demonstrate (_______________________)
Where:
𝑝1 is the proportion of ______ applicants with commonly-white names
who’d receive callbacks when applying to jobs like the ones in this study.
𝑝2 is the proportion of ______ applicants with commonly-black names
who’d receive callbacks when applying to jobs like the ones in this study.
There is discrimination, in
which case the commonlywhite named applications
received a ______________
of callbacks.
Rewrite these hypotheses in a more mathematically convenient way:
Under the hood: calculating z-test statistics and p-values for a two-sample test
Investigating if we find enough statistical evidence to reject the
ð‘Ŋ𝟎 and support the ð‘Ŋð‘Ļ
ðŧ0 : 𝑝1 − 𝑝2 = 0
ðŧðī : 𝑝1 − 𝑝2 > 0
1. Start by assuming the null (the default) is true.
• We assume there is no discrimination: callback rates are the same for both groups.
2. Ask yourself: if the null is in fact true, how likely is the
data that you’ve gathered?
2. What estimates for 𝑝1 - 𝑝2 are we _________
_______ if there’s actually no discrimination?
Let’s find the ________________________ of
𝑝Ė‚1 - 𝑝Ė‚2 from all possible random assignments.
Under certain conditions:
𝑝Ė‚𝑐 (1−𝑝Ė‚𝑐 )
~ Norm(𝜇 = 0, 𝜎 = √
𝑛1
+
𝑝Ė‚𝑐 (1−𝑝Ė‚𝑐 )
~ Normal(𝜇𝑝Ė‚ = 0, 𝜎𝑝Ė‚ = _______)
392
𝑛2
Now we look at the data we gathered (difference in callback rates from experiment): 𝑝Ė‚1 − 𝑝Ė‚ 2 = 𝟎. 𝟎𝟑𝟒
Under the null assumption, how unlikely
was the data we observed (𝑝Ė‚1 − 𝑝Ė‚2 = 𝟎. 𝟎𝟑𝟒)?
Two ways to measure that:
1. z-test statistic
2. p-value
Write down the z-test statistic and the p-value. Interpret their meaning and describe if they suggest
there is reason to doubt the null hypothesis in this case.
3. Draw a conclusion:
Under my assumption that there is no difference in callback rates, the actually observed data (a 3.4%
difference in callback rates among 4890 employers) is highly unlikely (__________________). So, _____
___________________________. There’s convincing evidence that commonly-white named resumés
receive a _________________________.
Connection to Confidence Intervals
Confidence Interval:
(1.85%, 4.95%)
Since 𝑝1 − 𝑝2 = 0 was ________________ our interval, it’s not plausible to assume 𝑝1 = 𝑝2 . Therefore,
we ______________. There’s evidence of a preference for “white name” resumés.
Usually (especially for two-sided tests): If ðŧ0 is outside the interval, you will reject it. If ðŧ0 is __________
the interval, you _________________________.
393
In the Bertrand-Mullainathan race/resumé study, mock identical resumés were sent to job placement ads
in Chicago and Boston. Each resumé was randomly assigned either a commonly-white or commonlyblack name. In total, 246 out of 2445 commonly-white named resumés received a callback and 164 out
of 2445 commonly-black named resumés received a callback.
Do the results give convincing statistical evidence that employers favored commonly-white name
applicants (in terms of callbacks)?
Calculator steps for two-sample z-interval for proportions:
For groups 1&2:
x: # of ‘successes’
n: sample size
p1: ðŧðī direction
Calculate
STAT → TESTS →
6: 2-PropZTest…
394
Output
Lesson 9.3 Discussion
CommonlyWhite Names
CommonlyBlack Names
Total
Called back
246
164
410
Not called back
2199
2281
4480
Total
2445
2445
4890
Discussion Question: We found that commonly-white name resumés had a statistically significant
advantage in attracting callbacks. Does this result suggest discrimination at other levels of the hiring
process (e.g. job offers rates after an interview, starting salary estimates, etc.)? Explain.
395
Should Rockford switch to bottled water?
The Wolverine Worldwide (a shoe company in Rockford) improperly disposed of chemicals
(PFAS), which have leaked into the ground water. The state of Michigan says that if more
than 7% of households in a city exceed the safe limit, the city needs to switch to bottled water.
A concerned citizen takes a random sample of 100 households and finds that 12 have unsafe
water. Do the data provide convincing evidence that Rockford should switch to bottled water?
1. State appropriate hypotheses for performing a significance test. Use α = 0.05.
2. (a) After conducting a significance test, a P-value of 0.025 is found. Interpret this value.
(b) Reject H0 or fail to reject H0? Keep the current water or switch to bottled water? Explain.
(c) Let’s suppose this decision is wrong. What would be a consequence of this error?
(d) If the water is safe, what is the probability that this error will occur?
3. (a) Now suppose the P-value was 0.217. Reject H0 or fail to reject H0? Keep the current water
or switch to bottled water? Explain.
(b) Let’s suppose this decision is wrong. What would be a consequence of this error?
4. Are the consequences in question #2 or question #3 more serious? Explain.
396
Type I and Type II Errors
Important ideas:
Check Your Understanding
Mr. Wilcox purchased a trick coin that is supposed to land heads up 75% of the time. One of his
students volunteer to test this claim. The student flips the coin 50 times and finds that the coin lands
heads up 35 times. The student then performs a test of the following hypotheses at the α = 0.10
significance level:
ðŧ! : 𝑝 = 0.75
ðŧ" : 𝑝 < 0.75
where p = the true proportion of tosses of this coin that would land heads-up.
1. Describe a Type I error and a Type II error in this setting.
2. Which type of error may result in Mr. Wilcox returning the coin and writing a negative review
of the product?
3. If the student were to use α = 0.05 instead of α = 0.10, would this make it more or less likely to
reject the null hypothesis when the null hypothesis is true? Explain.
397
Will Mrs. Magic Prove Herself??
Recall that Mrs. Magic claims she is an 80% free throw
shooter. Mr. Wilcox is skeptical and believes that she is a
less than 80% free throw shooter. Mrs. Magic has decided to
take another SRS of 50 free throws to prove herself.
1. Write the appropriate hypotheses for the significance test. Be sure to define the
parameter of interest.
2. Describe a Type II error in this setting.
3. Suppose that Mrs. Magic s true free throw percentage is actually 66% (Ha is true!).
Enter all information into the applet at https://istats.shinyapps.io/power/. Click to
Display Type II error. It will show as a blue shaded region.
(a) If Ha is true, what is the probability of a Type II error (bad decision)? _______
(b) If Ha is true, what would be the probability of a good decision here? _______
(c) Interpret the probability of this good decision in the context of the problem.
4. Click Display Power. We want to increase the power of our test. How could we adjust
each of the following factors to increase our power? Use the applet to explore each.
a. Sample size:
b. Alpha level:
c. Alternative p:
398
Power of a Test
Important ideas:
Check Your Understanding
A statistics major and a finance major decide to get married. In order to investigate their per person
expenses they select a random sample guests they have invited to the wedding and records
whether or not each person plans to attend the wedding. They decide to test H0: p = 0.75 versus Ha: p
≠ 0.75 where p = the true proportion of all guests that will attend the wedding.
a. The power of the test to reject H0: p = 0.75 when p = 0.70 using α = 0.05 and n = 25 subjects is
0.10. Interpret this value.
b. Find the probability of a Type I error and the probability of a Type II error for the test in part (a).
c. Determine whether each of the following changes would increase or decrease the power of the
test. Explain your answers.
•
Use α = 0.01 instead of α = 0.05.
•
Use n = 100 instead of n = 25.
399
400
Formulas for Significance Tests
Formulas for Confidence
Intervals
Formula for standard deviation
of the sampling distribution
Formula for mean of the
sampling distribution
Name the procedure
Conditions
Symbol for statistic (sample)
Symbol for parameter
(population)
Lesson
One Proportion
AP Stats Chapter 8,9 Study Sheet
Difference of Proportions
®
2015 AP STATISTICS FREE-RESPONSE QUESTIONS
4. A researcher conducted a medical study to investigate whether taking a low-dose aspirin reduces the chance
of developing colon cancer. As part of the study, 1,000 adult volunteers were randomly assigned to one of two
groups. Half of the volunteers were assigned to the experimental group that took a low-dose aspirin each day,
and the other half were assigned to the control group that took a placebo each day. At the end of six years,
15 of the people who took the low-dose aspirin had developed colon cancer and 26 of the people who took
the placebo had developed colon cancer. At the significance level a = 0.05, do the data provide convincing
statistical evidence that taking a low-dose aspirin each day would reduce the chance of developing colon cancer
among all people similar to the volunteers?
401
2012 AP ® STATISTICS FREE-RESPONSE QUESTIONS
5. A recent report stated that less than 35 percent of the adult residents in a certain city will be able to pass a
physical fitness test. Consequently, the city's Recreation Department is trying to convince the City Council to
fund more physical fitness programs. The council is facing budget constraints and is skeptical of the report. The
council will fund more physical fitness programs only if the Recreation Department can provide convincing
evidence that the report is true.
The Recreation Department plans to collect data from a sample of 185 adult residents in the city. A test of
significance will be conducted at a significance level of a = 0.05 for the following hypotheses.
H 0 : p = 0.35
H a : p < 0.35,
where p is the proportion of adult residents in the city who are able to pass the physical fitness test.
(a) Describe what a Type II error would be in the context of the study, and also describe a consequence of
making this type of error.
(b) The Recreation Department recruits 185 adult residents who volunteer to take the physical fitness test. The
test is passed by 77 of the 185 volunteers, resulting in a p-value of 0.97 for the hypotheses stated above. If it
was reasonable to conduct a test of significance for the hypotheses stated above using the data collected
from the 185 volunteers, what would the p-value of 0.97 lead you to conclude?
(c) Describe the primary flaw in the study described in part (b), and explain why it is a concern.
402
Unit 7
Chapter 10
Estimating Means
with Confidence
403
AP Statistics Handout: Lesson 10.1
Topics: review sampling distribution for ð‘ĨĖ…, 95% confidence interval for ð‘ĨĖ…, four step process
Guided Notes
Students: “I’m gonna drop out and become a…”
YouTube Creator
Insta Influencer
“TikToker”
Mr. Beast
2020: $24 Million
Kendall Jenner
2019: $16 Million
Josh Richards
2020: $1.5 Million
Source: Forbes
Can you make this kind of money by making social media your full-time career? Probably not, but
maybe you could make a living…
Today’s Key Analysis: Do social media creators, on average, make a livable wage?
The Data:
Searched “How much I make on YouTube.” ___________________________ from
hundreds of results
Reliable data: They show their
private channel revenue pages in the
videos
Source:
Forbes
404
Review: Sampling Distribution for ð‘ĨĖ…
The Population vs. The Sample Mean
𝜇 = _________________ mean
• Parameter
• Ex: mean salary _____________________
ð‘ĨĖ… = _____________ mean
• Statistic used to estimate µ
• Ex: mean salary ________________
Estimation
In a world where…
1. The true mean yearly salary among YouTubers is $55,000
2. The true standard deviation of salaries is $29,500
We ________________ the
true mean by $5,000
In most real-life scenarios:
-You _______________________ the true mean.
-You survey a random sample of 35 YouTubers. Among them, the average yearly earnings was
ð‘ĨĖ… = $60,000.
-Your estimate for the true mean (µ) is the sample mean (ð‘ĨĖ… )
The Sampling Distribution
Under certain conditions:
ð‘ĨĖ… ~ Normal(𝜇ð‘ĨĖ… = 𝜇, 𝜎ð‘ĨĖ… =
Centered at true mean (______________)
𝜎
√𝑛
)
____________________________ of our
estimates from true mean value
Calculations:
ð‘ĨĖ… ~ Normal(𝜇ð‘ĨĖ… = 𝜇, 𝜎ð‘ĨĖ… =
𝜎
)
√𝑛
ð‘ĨĖ… ~ Normal(𝜇ð‘ĨĖ… = 55000, 𝜎ð‘ĨĖ… =
Means from repeated
random samples of 35
YouTubers (in a world
where µ = $55,000).
29500
)
√35
ð‘ĨĖ… ~ Normal(𝜇ð‘ĨĖ… = 55000, 𝜎ð‘ĨĖ… = 4986)
Typical distance of estimates from true mean value.
________________by $4,986.
405
We overestimated the
true mean by $5,000
1) Was our estimate’s error amount ($5,000) pretty typical? Justify using the sampling distribution
above.
There should be a way to report both:
a) Our estimate
b) How far off this estimate might be
One way to do this is with a _________________________________!
The t-distribution and interval for ð‘ĨĖ…
Sampled YouTuber Yearly Revenues ($)
1) Label the U.S. individual poverty line, the U.S. mean wage, the sample mean, and the
sample standard deviation on the dotplot above.
2) What concerns might you have about assuming the true mean matches the sample
mean above? Why?
406
First Attempt: Creating the Confidence Interval for our Data
Under certain conditions:
ð‘ĨĖ… ~ Normal(𝜇ð‘ĨĖ… = 𝜇, 𝜎ð‘ĨĖ… =
𝜎
√𝑛
)
1) What prevents us from directly calculating 𝜎ð‘ĨĖ… ? What can we do to provide a reasonable estimate?
2) Using the YouTube data, what is 𝑆ðļð‘ĨĖ… ? Show your work.
3) Why can’t we directly calculate the confidence interval
using the method show on the diagram to the right?
Note: we can’t do this to calculate the interval
The t-distribution to the Rescue!
t distribution
• ___________ than normal curve
• Width captures ___________________________
when using 𝑆ð‘Ĩ to estimate σ
degrees of freedom (df) = _______
• df measures how _______________ 𝑆ð‘Ĩ estimates σ
• The higher n, the more precisely we estimate σ, and
the more our t-curve ________________________!
407
t critical values
ð‘Ą ∗ : the _____________________ of the t-interval
• Tells you how many _____________________ you’re including in your interval.
• Determines the _____________________.
Normal (z)
t (df = 34)
For normal curve, z* was
1.96 for 95% confidence
For t-distribution, we need
to go wider to capture 95%
t (df = 34)
Calculator steps to find t*
t* = 2.03
2nd → VARS → 4: InvT
-Area: percent below
interval (2.5% = 0.025)
-df: n – 1 (35 – 1 = 34)
As expected, the t-critical value is __________________ than 1.96 needed for a normal curve.
Confidence Interval for a Mean
Using the formula and the info above, calculate and interpret
the confidence interval for the mean YouTube salary:
408
Formula:
• point estimate ± margin of error
• ð‘ĨĖ… ± ð‘Ą ∗ (𝑆ðļð‘ĨĖ… )
a) Construct and interpret a 95% confidence interval for the true mean salary of all YouTubers.
Calculator steps
→Stats
ð‘ĨĖ… : sample mean
𝑆ð‘Ĩ : sample stdev
n: sample size
Calculate
STAT → TESTS →
8: TInterval
409
Output
Lesson 10.1 Discussion
We can directly estimate the true mean income of YouTubers from Ad Revenue:
/
Creator Share of 2019 YouTube Ad Revenue
$8,331,950,000
# of Creators
18,000,000
=
Average Yearly Pay Per Creator
$463
Confidence interval we estimated: $41,877 to $98,339
Discussion Question: Why was our confidence interval so far off?
Sources:
- Alphabet 4th quarter earnings release: https://abc.xyz/investor/static/pdf/2019Q4_alphabet_earnings_release.pdf?cache=05bd9fe
- “How YouTube Ad Revenue Works,” Investopedia, June 4, 2020: https://www.investopedia.com/articles/personal-finance/032615/howyoutube-ad-revenue-works.asp#:~:text=Enabling%20ads%20on%20your%20YouTube,get%20the%20remaining%2055%20percent.
- “How Many YouTube Channels Are There?”, Tubics, https://www.tubics.com/blog/number-of-youtube-channels
- SocialBlade, data accessed 2019
410
10.1 PRACTICE!
How much homework?
Ms. Garcia wants to estimate how much time students spend on homework, on average, during a typical week.
She wants to estimate at the 90% confidence level with a margin of error of at most 30 minutes. A pilot study
indicated that the standard deviation of time spent on homework per week is about 154 minutes.
SAT Math Scores
High school students who take the SAT Math exam a second time generally score higher than on their first try.
Past data suggest that the score increase has a standard deviation of about 50 points. How large a sample of
high school students would be needed to estimate the mean change in SAT score to within 2 points with 95%
confidence? Show your work.
Travel Time to Work
A study of commuting times reports the travel times to work of a random sample of 20 employed adults in New
York State. The mean (ð‘Ĩð‘ĨĖ… ) is 31.25 minutes, and the standard deviation is (𝑠𝑠) 21.88 minutes.
What is the standard error of the mean? Interpret this value in context.
411
Vitamin C Content
Several years ago, the U.S. Agency for International Development provided 238,300 metric tons of corn- soy
blend (CSB) for emergency relief in countries throughout the world. CSB is a highly nutritious, low- cost fortified
food. As part of a study to evaluate appropriate vitamin C levels in this food, measurements were taken on
samples of CSB produced in a factory.
The following data are the amounts of vitamin C, measured in milligrams per 100 grams (mg/100 g) of blend, for
a random sample of size 8 from one production run:
26
31
23
22
11
22
14
31
Construct and interpret a 95% confidence interval for the mean amount of vitamin C (𝜇𝜇) in the CSB from this
production run.
412
How much does an Oreo weigh?
Mrs. Gallas wanted to estimate the average weight of an Oreo cookie to determine if the
average weight was less than advertised. She selected a random sample of 30 cookies and
found the weight of each cookie (in grams). The mean weight was x = 11.1921 grams with a
standard deviation of sx = 0.0817 grams. Make a 95% confidence interval to estimate the true
mean weight of an Oreo.
1. What is the point estimate for the true mean? ________
2. Identify the population, parameter, sample and statistic.
Population:
Parameter:
Sample:
Statistic:
3. Was the sample a random sample? Why is this important?
4. What is the formula for calculating the standard deviation of the sampling distribution of ð‘ĨĖ… ?
5. What condition must be met to use this formula? Has it been met?
6. In the formula for standard deviation of the sampling distribution of ð‘ĨĖ… , we don’t know the value
of σ (if we did, we would have known µ) so we will use 𝑠! instead. Find the standard error.
7. Would it be appropriate to use a normal distribution to model the sampling distribution of ð‘ĨĖ… ?
Justify your answer.
8. When finding the margin of error for a confidence interval for a proportion we use z*. For a
mean we will use _____ as the critical value. Why???
413
Name:
Hour:
Date:
9. What t* is needed for this confidence interval? Use Table B and the degrees of freedom
(df) = n - 1 to find it.
10. Calculate the margin of error using t* and the standard error.
11. Calculate the 95% confidence interval using point estimate +/- margin of error.
12. Interpret the interval.
13. Write a specific formula for a confidence interval for a population mean.
14. According to Nabisco, an Oreo weighs 11.3 grams. Does our confidence interval provide
convincing evidence that the true average weight is less than 11.3 grams? Explain.
414
Estimating a Population Mean Day 1
Important ideas:
Check Your Understanding
1. Use Table B to find the critical value t* that you would use for a confidence interval for a
population mean μ in each of the following settings. If possible, check your answer with
technology.
(a) A 98% confidence interval based on a random sample of 26 observations
(b) A 99% confidence interval from an SRS of 85 observations
2. A national poll of a random sample of 1,640 adults was carried out by Morning Consult. Each
person reported how much they (or their significant other) spent on an engagement ring. The
histogram displays the results. Determine if the conditions for constructing a confidence
interval for a mean have been met in this context.
415
How many states can you name?
How many states can you name in one minute? We will use this class as a random sample
of all AP Stats students to estimate a 95% confidence interval for the mean number of
states an AP Stats student can name in one minute.
1. When the timer starts, list as many states as you can on a piece of paper. Write the
number of states you listed on the board.
2. What type of data is this? Categorical or quantitative?
2. Enter the class data at stapplet.com. Find the sample mean and standard deviation. Sketch
the dotplot of the sample data.
n=
x=
sx =
3. Construct a 95% confidence interval to estimate the mean # of states a senior can name.
416
Important ideas:
Check Your Understanding
City council members want to estimate how many pounds of trash households in their community
produce per week. To determine an estimate for the standard deviation of the weight of trash produced
a small random sample of households was selected and their trash was weighed on garbage day. This
produced an estimated standard deviation of 36 pounds.
a. How many households need to be surveyed to estimate µ at the 95% confidence level with a margin
of error of at most 3 pounds?
b. After solving part (a), the city council realizes that it would be too much work to weigh the garbage
of that many households. They give up on their hopes of estimating the true mean weight of trash
produced within 3 pounds and select a random sample of 15 households and weigh the trash for
each of these households on garbage day. Here are the results:
114.8
74.3 80.1 41.5 99.1 31.0 93.1 118.9 26.5 33.1 88.3
46.1 119.7 46.4 19.8
Calculate and interpret a 95% confidence interval for the mean weight of trash for all households in
this community.
417
AP Statistics Handout: Lesson 10.2
Topics: two-sample t-interval for a difference of means
Guided Notes
Two Facts about College:
1. On average, college graduates make ___________________________ than high school
graduates. ($30,000 more per year, in 2018)
2. ________________________________ of college, the average economic return to a college
degree is still _____________________________________________.
Are college and income causally related?
Causation
Correlation
For “margin” students (those on the
fence of attending college based on
motivation and qualifications)…
• If college effect is causal: Educators
should encourage margin students
to attend college
• If college effect is correlation:
Educators shouldn’t always
encourage margin students to
attend college
Florida College Study*
A Yale economist used a regression discontinuity (a “quasi-experimental” method) to explore whether
college has a causal impact on wages. AP Stats doesn’t cover quasi-experimental methods, but they’re
really cool. We’ll analyze a modified version of this study using a concept we do learn in AP Stats: the
two-sample t-interval. In the space below, describe how the Florida College Study obtained its two
comparison samples. Then, describe why their methods allow for a causal comparison:
*Study: Zimmerman, Seth. “The Returns to College Admission for
Academically Marginal Students.” Journal of Labor Economics, 2014,
vol. 32 (4). https://www.jstor.org/stable/10.1086/676661?seq=1
418
Technical notes for technical folks:
• Regression Discontinuity uses multiple regression, which isn’t covered in AP Stats. We’re doing a
modified version with a two-sample t-interval (which is covered in AP Stats)
• Technically, we’re not really doing regression discontinuity. But what we’re doing uses similar
logic and reflects the paper’s main outcomes.
• We modified the data to make sure our results (including standard errors) would be similar to
Zimmerman’s (specifically, his IV effect estimates for students with GPA’s within 0.15 points of
the cutoff - see Table 5 in his paper – using annual rather than quarterly earnings as the
outcome).
Study results, for students near the GPA cutoff:
Mean Income
Stdev. Income
n
College
$35,764
$27,147
202
High School
$28,964
$21,899
190
“Income” is students’ annual income 8-14
years after high school (in 2005 dollars)
If college made a difference for margin
students, just-above cutoff students should
have a ________________________________
than just-below cutoff students.
Show calculations for ð‘ĨĖ…ðķ − ð‘ĨĖ…ðŧ𝑆 :
Could the difference in outcome have happened by chance alone? Or did college admission actually
raise wages? Let’s make a ____________________________ to see if difference between these means is
significantly large.
Two-sample t-interval for a difference of means
Hypotheses: We don’t need to set up hypotheses to construct a confidence interval, but doing so is going
to help us conceptualize why our interval is useful.
ðŧ0 : 𝜇ðķ = 𝜇ðŧ𝑆
ðŧðī : 𝜇ðķ > 𝜇ðŧ𝑆
The null (______________) hypothesis: there is ___________________
in average wage between the college and high school groups. You’re
seeing if there’s evidence to reject this claim.
The alternative (________________) hypothesis: the college group
_______________________ (on average) than the high school group
Rewrite these hypotheses in a more mathematically convenient way:
Where:
μC is the mean earnings among _____ college students (just above GPA cutoff)
μHS is the mean earnings among _____ high school students (just below GPA cutoff)
419
Making the interval
Mean Income
Stdev. Income
n
College
$35,764
$27,147
202
High School
$28,964
$21,899
190
Under certain conditions:
𝜎 2
~ Norm(𝜇 = 𝜇ðķ − 𝜇ðŧ𝑆 , 𝜎 = √ 𝑛𝑐 +
𝑐
𝜎ðŧ𝑆 2
𝑛ðŧ𝑆
1. Show the steps we take to get our t-distribution (and its final parameters):
2. Show the steps for calculating the final confidence interval (and write down the interval):
3. Interpret your final interval and comment on whether 0 is inside or outside your interval (and why
that’s important):
420
)
In the Florida College Study, two “as good as randomly” assigned groups of students (with similar GPA’s)
were given admission to college or were denied. Summary statistics are provided.
a) Construct and interpret a 95% confidence interval for the difference in mean incomes between
marginal students who attend and do not attend college.
b) Use your interval to draw a conclusion about whether college creates an “income boost” for marginal
students.
Calculator steps for two-sample t-interval for means:
STAT → TESTS →
0: 2-SampTInt…
421
For groups 1&2:
ð‘ĨĖ… : sample mean
𝑠ð‘Ĩ : sample stdev.
n: sample size
C-lvl: Confidence Lvl
Pooled → NO
Output
Lesson 10.2 Discussion
Confidence interval: $1,906 to $11,693
Interval shows us that the “college boost” is statistically significant, but is it practically
• Study author (Zimmerman) shows, even with rising college costs, this “college boost” represents
important?
a substantial net gain over time.
• So, yes, it’s practically important! College “pays off” for students at the margin of admission.
Policy Implications
Many colleges have tried to recruit more students from underrepresented backgrounds (e.g. students of
color and low-income students).
• One problem: Because students from these backgrounds disproportionately attend lowerperforming schools, many have lower GPA’s and academic qualifications.
• Colleges worry underqualified students will have bad outcomes (not graduate, student debt,
etc.)
Discussion: To help recruit students from a wider array of backgrounds, Universities in Florida are
considering lowering their admissions standard for high school GPA’s. Would this be a wise move?
Explain your answer using the Florida College Study results.
422
10.2 PRACTICE!
Plastic Grocery Bags
Do plastic bags from Target or plastic bags from Publix hold more weight? A group of AP Statistic students
decided to investigate by filling a random sample of 5 bags from each store with common grocery items until the
bags ripped. Then they weighed the contents of items in each bag to determine its capacity. Here are their
results, in grams:
Target:
12,572
13,999
11,215
15,447
10,896
Publix:
9552
10,896
6983
8767
9972
Construct and interpret a 99% confidence interval for the difference in mean capacity of plastic grocery bags
from Target and Publix. Does your interval provide convincing evidence that there is a difference in the mean
capacity among the two stores?
423
Which cookie has the most chips?
Is there a difference in the number of chocolate chips in Chips Ahoy cookies versus the
number of chocolate chips in Meijer brand cookies? Each pair of students will count the
number of chocolate chips in 1 Chips Ahoy cookie and 1 Meijer Chipsters cookie. Due to
the factories processes, we can assume the population distributions of # of chips are
approximately normal and that the samples are random.
1. Record the number of chocolate chips in each cookie. Write them on the board.
# in Chips Ahoy = _______
# in Meijer Chipsters = _______
2. Find the mean number of chocolate chips for each type of cookie, the standard
deviation and the difference.
Chips Ahoy: x1 =
Meijer Chipsters: x2 =
s1 =
s2 =
Difference: x1 - x2 =
3. If we repeated this process many times and created a dotplot, we would have the
sampling distribution of x1 - x2. Describe the shape, center and spread of the sampling
distribution.
Shape:
Center:
Spread:
4. Have the conditions for constructing a confidence interval been met? Explain.
5. Construct a 95% confidence interval for the true difference in the mean number of
chocolate chips in Chips Ahoy and Meijer Chipsters.
6. Do we have evidence that there is a difference in the average number of chocolate
chips in a Chips Ahoy and a Meijer Chipsters cookie?
424
Confidence Intervals for a Difference in Means
Important ideas:
Check Your Understanding
The most recent American Time Use Survey, conducted by the Bureau of Labor Statistics, found
that many Americans barely spend any time reading for fun. People ages 15 to 19 average only 7.8
minutes of leisurely reading per day with a standard deviation of 5.4 minutes. However, people ages
75 and over read for an average of 43.8 minutes per day with a standard deviation of 35.5 minutes.
These results were based on random samples of 975 people ages 15 to 19 and 1050 people ages
75 and over.
Construct and interpret a 95% confidence interval for the difference in mean amount of time
(minutes) that people age 15 to 19 and people ages 75 and over read per day.
425
Does Memory Training Help?
Does memory training help? You will be given a list of words. You will be given 3 minutes to
memorize as many as possible by just rereading the words. You will record as many words as
you can remember. You will then be given another list of words to memorize for 3 minutes using a
memorization strategy. You will record as many words as you can remember.
1. How many words did you get correct using strategy 1 (rereading)?
2. How many words did you get correct using the strategy 2 (story)?
3. Add your data to the table on the board. Copy the data below and calculate the difference
in each pair.
Strategy 1
Strategy 2
Difference
(S2 – S1)
Strategy 1
Strategy 2
Difference
(S2 – S1)
Strategy 1
Strategy 2
Difference
(S2 – S1)
4. Use the 1 Quantitative Variable (single group) applet at stapplet.com to enter the
Differences only. Make a dotplot of the differences below:
What does the dotplot suggest about the memory training?
Mean:
Interpret:
SD:
426
5. Construct a 95% confidence interval for the true mean difference in words remembered by
students using story rather than rereading.
6. Do we have convincing evidence that there is more words remembered using a story
instead of rereading?
427
Confidence Interval for Paired Data
Important ideas:
Check Your Understanding
Teenagers spend, on average, approximately 5 hours online every day. Do parents realize how many
hours their children are spending online? A family psychologist conducted a study to find out. A random
sample of 10 teenagers were selected. Each teenager was given a Chromebook and free internet for 6
months. During this time their internet usage was measured (in hours per day). At the end of the 6
months, the parents of each teenager were asked how many hours per day they think their child spent
online during this time frame. Here are the results.
Teenager
Actual time spent online
(hours/day)
Parent perception
(hours/day)
Difference (A – P)
1
2
3
4
5
6
7
8
9
10
5.9
6.2
4.7
8.2
6.4
3.8
2.9
7.1
5.2
5.8
2.5
3
3
3.5
1.5
2
2
3
2.5
3
3.4
3.2
1.7
4.7
4.9
1.8
0.9
4.1
2.7
2.8
a. Make a dotplot of the difference (A – P) in time spent online (hours/day) for each teenager. What
does the dotplot reveal?
b. Find the mean and standard deviation of the difference (A – P) in time spent online. Interpret the
mean difference in context.
c. Construct and interpret a 90% confidence interval for the true mean difference (A – P) in time
spent online.
428
AP Statistics Handout: Lesson 10.2 part 2
Topics: matched-pairs t-interval and t-test
Guided Notes
Climate Change Skepticism:
97% of climate scientists:
current climate change is
real and caused by humans
3% of climate scientists and
author Michael Crichton
State of Fear: A popular book by science fiction author Michael Crichton –
who has an MD but did not pursue medicine. The book was a best-seller
and remains one of the most cited works by climate change skeptics.
The book is framed as fiction, but it has tons of graphs of real data,
footnotes to scientific articles, and a 20-page bibliography.
Senator James Inhofe, who used to chair the Senate committee on
Environment and Public Works, when asked about Crichton: "I think I've
read most of his books; I think I've read them all. I enjoyed most 'State of
Fear' and made it required reading for this committee."
Quote from: Jankofsky, M. “Michael Crichton, Novelist, Becomes Senate Witness.” The New York Times. Sept. 29, 2005.
https://www.nytimes.com/2005/09/29/books/michael-crichton-novelist-becomes-senate-witness.html
Identifying a matched-pairs situation
“It’s the record from the weather station
at Punta Arenas, near here. It’s the
closest city to Antarctica in the world.” He
tapped the chart and laughed. “There’s
your global warming.”
Punta Arenas
- from State of Fear
429
1. What is the general trend shown in the chart shown from the weather station at Punta Arenas? How
does this support Crichton’s skepticism of global warming?
2. Fill in the chart and complete the blank below:
Station
Punta Arenas
1901-1950 Temp
1951-2000 Temp
6.828 (oC)
Difference
6.752 (oC)
Temperatures tended to __________________ during the 1900s in Punta Arenas.
Selected Station
Sable Island
Manila Int Airport
Perm
Hobart Ellerslie
Bulawayo Goetz
Veraval
Yokohama
Punta Arenas
Aldergrove
Harare Kutsaga
Bahia Blanca Aero
Cape Leeuwin
Maliye Karmakuly
Hobarttasmanwas
Svyatoy
Apia
Aparri
Syktyvkar
Upernavik
Gabo Island
Antananarivoville
Kumasi
Khartoum
Mahe Seychellesbri
Onslow
Rarotonga Intl
Ponta Delgada
Viljujsk
Andenes
Kyzylorda
Port Blair
Chatham Islands
1901-1950 1951-2000 Difference
Temp
Temp
(oC)
6.803
7.420
0.617
26.779
27.416
0.637
1.670
2.208
0.538
12.549
13.062
0.513
18.891
19.183
0.292
26.404
26.779
0.375
14.534
15.428
0.894
6.828
6.752
-0.077
8.888
9.012
0.124
18.816
19.055
0.239
14.963
15.204
0.241
16.608
16.983
0.375
-5.036
-5.044
-0.008
12.439
12.638
0.199
-0.263
0.263
0.527
26.380
26.479
0.099
25.288
26.091
0.803
0.435
0.894
0.460
-7.012
-7.286
-0.274
14.925
14.924
-0.001
17.387
17.741
0.354
25.652
25.854
0.202
28.535
28.874
0.339
26.414
26.872
0.459
24.154
24.540
0.387
24.105
24.157
0.052
14.942
15.676
0.734
-8.854
-9.057
-0.203
2.600
3.160
0.560
9.631
10.411
0.780
26.506
26.778
0.271
10.392
11.132
0.739
Here’s the thing: Punta Arenas is one location. Climate
patterns vary a lot by location. Instead of one location,
let’s ask: What is the average trend across many of these
stations? We collected temperature readings from 32
NASA-GISS stations based on a random sample of
latitude-longitude coordinates.
This data has a _________________________ structure.
•
•
Each individual (station) has _________
____________________
We find differences _____________________
_______________ (within station)
The matched-pairs data structure is useful because we
are __________________________________ in weather
patterns between stations.
Data Note: Temperatures are averages of the meteorological annual
mean temperatures from those years (in Celsius). Data is collected
from the "Adjusted cleaned" dataset from NASA, which they describe
as, "adjusted data after removal of some outliers and duplicate
records.“ Crichton appears to use the unadjusted and uncleaned data
in the chart displayed in State of Fear, but the patterns at Punta
Arenas and other areas are largely the same.
Data Source:
https://data.giss.nasa.gov/gistemp/station_data_v4_globe/#form
430
The station data
Punta Arenas: -0.076 oC
Temperature Differences from Stations (Late 1900’s – Early 1900’s) in Celsius
1. Did more stations see warming during the 1900’s or cooling during the 1900’s? How do you know?
2. Does Punta Arenas appear to be representative of the general trend we see in the station data? How
does this influence your evaluation of Michael Crichton’s evidence against global warming?
Punta Arenas
3. What is the sample mean of the differences? Draw this on the dot plot.
4. Is the sample mean more representative of the
general trend than Punta Arenas? Why or why not?
Matched-pair t-interval
From: https://climate.nasa.gov/interactives/climate-time-machine
Thank you Amy Hogan (@alittlestats) for sharing!
The cool thing about inference for matched-pairs: It’s the same as one-sample inference! We’re finding
a ___________________ of the differences.
Summary statistics you’ll be working with: ð‘ĨĖ…𝐷 = 0.35 , 𝑠ð‘Ĩ = 0.296, n = 32
431
a) Construct and interpret a 95% confidence interval for the true global average difference in
temperature (Late 1900’s – Early 1900’s). Display your full proceess below:
For “DO” phase: Calculator steps
→Stats
ð‘ĨĖ… : sample mean
𝑆ð‘Ĩ : sample stdev
n: sample size
Calculate
STAT → TESTS →
8: TInterval
432
Output
Matched-pair t-test
b) Determine if we have statistically convincing evidence of true global warming (on average). One more
time, show the full process. You don’t have to check conditions again.
Calculator steps
STAT → TESTS →
2: T-Test
→Stats
𝜇0 : null mean
ð‘ĨĖ… : sample mean
𝑆ð‘Ĩ : sample stdev
> 𝜇0 : alternative
c) Is your confidence interval (part a) consistent with your answer to part b? Explain.
433
Output
Discussion
We’re often taught: “There are two sides to every story.” In popular media, climate change is often
framed as a debate between two sides:
News stories on climate often interview two experts:
Expert 1
Expert 2
“Climate change
is real and
human-caused.”
“Climate change is
not real or humancaused.”
97% of climate
scientists agree
3% of climate
scientists agree &
Discussion Question: Should popular media portray both sides of the climate change “debate?” Explain
your reasoning.
434
®
2013 AP STATISTICS FREE-RESPONSE QUESTIONS
STATISTICS
SECTION II
Part A
Questions 1-5
Spend about 65 minutes on this part of the exam.
Percent of Section II score-75
Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the
correctness of your methods as well as on the accuracy and completeness of your results and explanations.
1. An environmental group conducted a study to determine whether crows in a certain region were ingesting food
containing unhealthy levels of lead. A biologist classified lead levels greater than 6. 0 parts per million (ppm) as
unhealthy. The lead levels of a random sample of23 crows in the region were measured and recorded. The data
are shown in the stemplot below.
Lead Levels
2
3
3
4
4
5
5
6
6
8
0
588
112
688
0122 34
99
34
68
Key: 218 = 2.8 ppm
(a) What proportion of crows in the sample had lead levels that are classified by the biologist as unhealthy?
(b) The mean lead level of the23 crows in the sample was 4.90 ppm and the standard deviation was1.12 ppm.
Construct and interpret a95 percent confidence interval for the mean lead level of crows in the region.
435
2008 AP ® STATISTICS FREE-RESPONSE QUESTIONS
3. A local arcade is hosting a tournament in which contestants play an arcade game with possible scores ranging
from 0 to 20. The arcade has set up multiple game tables so that all contestants can play the game at the same
time; thus contestant scores are independent. Each contestant's score will be recorded as he or she finishes, and
the contestant with the highest score is the winner.
After practicing the game many times, Josephine, one of the contestants, has established the probability
distribution of her scores, shown in the table below.
Josephine's Distribution
16
17
18
19
0.10
0.30
0.40
0.20
Score
Probability
Crystal, another contestant, has also practiced many times. The probability distribution for her scores is shown in
the table below.
Crystal's Distribution
Score
Probability
17
18
19
0.45
0.40
0.15
(a) Calculate the expected score for each player.
(b) Suppose that Josephine scores 16 and Crystal scores 17. The difference (Josephine minus Crystal) of
their scores is -1. List all combinations of possible scores for Josephine and Crystal that will produce
a difference (Josephine minus Crystal) of -1, and calculate the probability for each combination.
(c) Find the probability that the difference (Josephine minus Crystal) in their scores is -1.
(d) The table below lists all the possible differences in the scores between Josephine and Crystal and some
associated probabilities.
Distribution (Josephine minus Crystal)
Difference
-3
Probability
0.015
-2
-1
0
1
2
0.325
0.260
0.090
Complete the table and calculate the probability that Crystal's score will be higher than Josephine's score.
436
Unit 7 Chapter
11 Testing
Claims About
Means
437
AP Statistics Handout: Lesson 11.1
Topics: one sample t-test for a mean
Guided Notes
SNAP benefits (food stamps) should cover the following essential monthly bundle of groceries for one
person:
• 4 boxes of cereal, 1 gallon of milk, 5 banana bunches, 2 gallons of
orange juice
• 3 loaves of bread, 4 packages of sliced turkey (9-oz each), 3
packages of sliced provolone (10 count), 4 packages of romaine
lettuce (7 oz each), 12 tomatoes, small mayo jar (15 oz), 30
apples, 4 bags of mixed nuts (8 oz each), 5 diet coke 6-packs.
• 15 lbs boneless chicken breasts, 6 packs of frozen mixed veggies
(16 oz each), 4lbs white rice, 10 cans of black beans
• Salt (5 oz), pepper (5 oz), olive oil (17 oz)
We’ll call this grocery list the essentials bundle. We will use sampling to determine if SNAP benefits
cover the average (mean) cost of this grocery bundle in U.S. grocery stores.
One Sample t-test for a Mean
Central question: In fiscal year 2020, the maximum monthly SNAP allotment for a household of one
person (who could not earn income) was $194 in the U.S.* An SRS of 31 U.S. grocery stores found the
average cost of the essentials bundle was $199.45, with a sample standard deviation of $25.98.
Is there convincing statistical evidence that the maximum SNAP individual monthly allotment ($194)
doesn’t cover the true mean price of the essentials-only grocery bundle in the U.S.?
How can we tell this question is asking us to do a hypothesis test?
*This is the max federal allotment for the continental U.S. (not Alaska/Hawaii). Source: U.S. Department of Agriculture, “SNAP
Fiscal Year 2020 Cost-of-Living Adjustments,” https://fnsprod.azureedge.net/sites/default/files/media/file/COLA%20Memo%20FY%202020.pdf
Building Hypotheses
Null hypothesis (ðŧ0 ): The ________________ belief about a parameter’s value.
Alternative hypothesis (ðŧðī ): Our unproven belief about a ____________________________, for which
we have to gather evidence.
438
Convert the following hypotheses into numerical statements:
Hypotheses in words
ðŧ0 : Food stamps cover the true
average cost of essentials in U.S.
ðŧðī : Food stamps don’t cover it
Hypotheses in numbers
ðŧ0 :
ðŧðī :
Let µ = true mean price of the essentials-only grocery bundle in the U.S.
Performing the Test, Step 1
Step 1: Start by assuming the null (the default) ________________.
We assume SNAP provides enough:
the true mean price of the essentialsonly bundle in U.S. stores is
_______________________________
______________________.
Performing the Test, Step 2
Step 2: Ask yourself: if the null is in fact true, how likely is the data that you’ve gathered?
We collected bundle prices from 31
random grocery stores (n = 31). At this
sample size, what estimates for µ are
we __________________? Let’s find
the ___________________________
________ of ð‘ĨĖ… ’s from all possible
samples.
Under certain conditions:
ð‘ĨĖ… ~ 𝑁𝑜𝑟𝑚 (𝜇ð‘ĨĖ… = 𝜇, 𝜎ð‘ĨĖ… =
Now, we look at the data we sampled: among 31 stores,
Ė… = $199.45
the mean price of bundle was: 𝒙
439
ð‘ĨĖ… ~ ð‘Ą (df = 30)
𝜇ð‘ĨĖ… = ______
𝑆ðļð‘ĨĖ… = ______
𝜎
√𝑛
)
Using the t-test statistic:
Z-Scores
Shows how many σ a data value
is above/below the mean.
𝑧=
t-test statistics
Shows how many SE a _________________ is above / below _____,
using a t-dist.
data value − mean
standard deviation
z=
t=
ð‘ĨĖ… −𝜇0
𝑆ðļð‘ĨĖ…
→ t=
ð‘ĨĖ… −𝜇0
𝑠ð‘Ĩ
√𝑛
ð‘Ĩ𝑖 − µ
𝜎
The _________________ the test statistic (in magnitude), the less ________________ your
measurements are under the null assumption. You can’t _________________________________.
ð‘Ą=
ð‘ĨĖ… − 𝜇0
=
𝑆ðļð‘ĨĖ…
ð‘Ą=
Getting a sample mean about 1 standard
error away from the hypothesized mean is
_________________________.
Using the p-value: exactly how unlikely was the proportion we sampled?
a) This probability is the p-value. Explain why:
b) Show the calculator commands and final
result for the p-value:
Performing the Test, Step 3: Write the Conclusion
Under my assumption that the true mean price of the essentials bundle is $194 in U.S. stores, the actual
observed data ($199.45 sample mean at 31 stores) is somewhat likely (________________). So, ______
___________________________________. We don’t have convincing evidence that the average price of
the bundle is higher than SNAP benefits.
440
Is there convincing statistical evidence that SNAP ($194) doesn’t cover the average cost of essential
food? An SRS of 31 U.S. grocery stores found the average cost of the essentials bundle was $199.45,
with a sample standard deviation of $25.98.
Calculator steps for one-sample z-test for a proportion:
→Stats
μ0 : null value
xĖ…: sample mean
sx : sample stdev.
n: sample size
µ: HA direction
STAT → TESTS →
2: T-Test
441
Output
11.1 PRACTICE!
Anemia in Children
Hemoglobin is a protein in red blood cells that carries oxygen from the lungs to body tissues. People with less
than 12 grams of hemoglobin per deciliter of blood (g/dl) are anemic. A public health official in Jordan suspects
that Jordanian children are at risk of anemia. He measures a random sample of 50 children.
(a) State the appropriate null hypothesis Ho and alternative hypothesis Ha. Be sure to define your parameter.
For the study of Jordanian children, the sample mean hemoglobin level was 11.3 g/dl and the sample standard
deviation was 1.6 g/dl. A significance test yields a P-value of 0.0016.
(b) Interpret the p-value in context.
(c) What conclusion would you make if α = 0.05? α = 0.01? Justify your answer.
442
Healthy Streams
The level of dissolved oxygen (DO) in a stream or river is an important indicator of the water’s ability to support
aquatic life. A dissolved oxygen level below 5 mg/l puts aquatic life at risk. A researcher measures the DO level
at 15 randomly chosen locations along a stream. Here are the results in milligrams per liter:
4.53
5.04
3.29
5.23
4.13
5.50
4.83
5.42
6.38
4.01
4.66
2.87
5.73
5.55
Is there evidence that aquatic life is at risk in this stream?
443
4.40
Pineapples
At the Hawaii Pineapple Company, managers are interested in the sizes of the pineapples grown in the
company’s fields. Last year, the mean weight of the pineapples harvested from one large field was 31 ounces. A
new irrigation system was installed in this field after the growing season. Managers wonder whether this
change will affect the mean weight of future pineapples grown in the field. To find out, they select and weigh a
random sample of 50 pineapples from this year’s crop. The Minitab output below summarizes the data.
Is there evidence that the mean weight of pineapples is different with the new irrigation system?
444
Are you getting enough sleep?
It’s recommended that teenagers get 8 hours of sleep a night. Mrs. Gallas believes her AP
Stats students are getting less than the recommended 8 hours of sleep per night. To test
her belief, take a random sample of 10 students in class and record the number of hours
of sleep for each. Do these data provide convincing evidence that the AP stats students
get less than 8 hours of sleep per night using α = 0.05?
1. Calculate the sample mean and standard deviation.
2. State the appropriate hypotheses for a significance test. Be sure to define the
parameter of interest.
3. What conditions must be met? Check them.
4. Give the formulas for the mean and standard deviation of the sampling distribution of x
and calculate the values.
5. Draw a picture and then calculate the test statistic.
6. Remember, since we are working with means, the test statistic is a t value. Use table
B to find the P-value.
7. What conclusion can we make?
445
Significance Test for µ
Important ideas:
Check Your Understanding
According to the AAA Foundation for Traffic Safety’s American Driving Survey, U.S. drivers spend,
on average, 51 minutes behind the wheel each day. A researcher believes this is an overstatement.
To investigate, a random sample of 75 drivers were selected. The study revealed that the mean
time behind the wheel for the sample of 75 drivers was 46.4 minutes with a standard deviation of
18.8 minutes. Is there convincing evidence that the mean time behind the wheel for all U.S. drivers
is less than 51 minutes? Use α = 0.01.
446
What is normal body temperature?
For many years, doctors have told people that “normal” body
temperature is 98.6 degrees Fahrenheit. Today, we will try to
find out if this is true.
Take your body temperature and record it on whiteboard. Record the following for the data for
the whole class (think of our class as an SRS of all high school students)
sx =
x=
n=
Do the data provide convincing evidence that the mean normal body temperature is different
than the doctor’s claim? Assume the conditions have been met.
Another class did the same activity with these results: x = 97.9
sx = 1.6
n = 30
1. Use T-test on the calculator to find the P-value =
Reject H0 at α = 0.10 ?
Reject H0 at α = 0.05?
Reject H0 at α = 0.01?
2. Use TInterval on the calculator to find the following confidence intervals.
90%:________________
95%:________________
99%:________________
Reject H0?___________
Reject H0?___________
Reject H0?___________
3. What connection do you notice between your answers to #1 and #2?
447
Significance Test for µ
Important ideas:
Check Your Understanding
According to a flyer created by BroadwayPartyRental.com, their 18-inch helium balloons fly,
on average, for 32 hours. You purchase a SRS of 50 18-inch helium balloons from this
company and record how long they fly. You would like to know if the actual mean flight time
of all balloons differs from the advertised 32 hours.
1. State an appropriate pair of hypotheses for a significance test in this setting. Be sure to
define the parameter of interest.
2. A 95% confidence interval for the mean flight time (in hours) for all helium balloons is (28.5,
31.4). Based on this interval, what conclusion would you make for a test of the hypotheses
in #1 at the α = 0.05 significance level?
448
Name: _________________
AP Statistics Handout: Lesson 11.2
Topics: two-sample t-test for a difference of means
Guided Notes
The Stroke Stent Study
In this study*, researchers randomly assigned stroke
patients to two groups: one received the current standard
care (control) and the other received a stent surgery in
addition to the standard care (stent treatment). Mr. YoungSaver’s father, Dr. Jeffrey Saver, was a principal researcher
for the study.
Dr. Saver
Mr. Young-Saver
The paternity is not in question!
Stent in action, working
on a stroke clot
(UT Health San Antonio)
*Saver JL, Goyal M, Bonafe A, Diener HC, Levy EI, Pereira VM, et al. “Stent-retriever thrombectomy after
intravenous t-pa vs. T-pa alone in stroke.” N Engl J Med. 2015; 372:2285-2295
https://www.nejm.org/doi/full/10.1056/nejmoa1415061
Two sample t-test for a difference of means
Study results
Stent
Control
Mean Disability
2.26
3.23
Stdev. Disability
1.78
1.78
98
93
n
If the stents work, the treatment group should
have a ____________________ disability score.
Could the difference in outcome have
happened by chance alone? Or did the
stents actually improve patient outcomes?
449
Hypotheses: Take default belief until evidence convinces us otherwise
Non-numerical version:
Numerical version:
ðŧ0 : ___________ standard of care is best
ðŧ0 :
• Default/dull belief (___________________)
ðŧðī : Stents ____________ patient outcomes
ðŧðī :
• Unproven research hypothesis we aim to
demonstrate (_______________________)
Where:
𝜇𝑠 is the mean disability score of ____ patients who’d receive stents.
𝜇𝑐 is the mean disability score of ____ patients who’d receive current
standard of care.
The stents don’t improve
outcomes over the current
standard care, so the mean
mRS is the _____________
___________.
The stents did improve care,
so the stent group had a
______________________.
Rewrite these hypotheses in a more mathematically convenient way:
Under the hood: calculating t-test statistics and p-values for a two-sample test
Investigating if we find enough statistical evidence to reject the ð‘Ŋ𝟎 and support the ð‘Ŋð‘Ļ
1. Start by assuming the null (the default) is true.
• We assume stents do not help: disability level is same for both groups.
2. Ask yourself: if the null is in fact true, how likely is the
data that you’ve gathered?
2. The study enrolled 191 patients (93 in
control, 98 in treatment). At this sample size,
what estimates for 𝜇𝑠 - 𝜇𝑐 are we ___________
if stents don’t actually help? Let’s find the
______________________________________
of ð‘ĨĖ…𝑠 - ð‘ĨĖ…𝑐 from all possible random assignments.
Under certain conditions:
~ Norm(𝜇ð‘ĨĖ…𝑠−ð‘ĨĖ…𝑐 = 0, 𝜎ð‘ĨĖ… 𝑠−ð‘ĨĖ…𝑐 = √
Under the null assumption, how unlikely was the
data we observed (ð‘ĨĖ…𝑠 − ð‘ĨĖ…𝑐 = −𝟎. 𝟗𝟕)?
450
ð‘ĨĖ… ~ ð‘Ą (df = 92)
𝜇ð‘ĨĖ… = ______
𝑆ðļð‘ĨĖ… = ______
𝜎𝑠 2
𝑛𝑠
+
𝜎𝑐 2
𝑛𝑐
Write down the t-test statistic and the p-value. Interpret their meaning and describe if they suggest
there is reason to doubt the null hypothesis in this case.
3. Draw a conclusion:
Under my assumption that stents provide no added benefit, the actually observed data (0.97 point
decline in disability in stent group) is highly unlikely (________________). So, I ___________ my earlier
assumption. There’s convincing evidence that the ____________________________________________.
In the stroke stent study, patients were randomly assigned to
receive either the current standard of care or current care with
a stent surgery. The results are summarized below.
Do the results give convincing statistical evidence that the
stent treatment reduces the average disability from stroke?
Mean Disability
Stdev. Disability
n
Stent
2.26
1.78
98
Control
3.23
1.78
93
: Calculator steps for two-sample z-interval for proportions:
For groups 1&2:
ð‘ĨĖ… : sample mean
𝑠ð‘Ĩ : sample stdev.
n: sample size
µ1: ðŧðī direction
Pooled → NO
STAT → TESTS →
4: 2-SampTTest…
451
Output
Lesson11.2 Discussion
Researchers enroll patients until reaching a sample size goal. Independent reviewers monitor the trial
data and stop the trial early if either:
1. The treatment seems to be hurting patients
2. The treatment seems so good that it’d be unethical to not give it to everyone.
Discussion Question: You are on the monitoring committee for a new drug that aims to lower
cholesterol among men. Researchers aim to enroll 300 patients in the trial. You look at the results for
the first 150 patients. Would you stop the trial early? Why or why not?
Mean cholesterol
Stdev. cholesterol
n
Control
209 mg/dL
53 mg/dL
75
Treatment
195 mg/dL
55 mg/dL
75
Healthy cholesterol level is < 200 mg/dL
452
11.2 PRACTICE!
The Stronger Picker-upper?
In commercials for Bounty paper towels, the manufacturer claims that they are the “quicker picker-upper.” But
are they also the stronger picker-upper? Two AP Statistics students, decided to find out. They selected a
random sample of 30 Bounty paper towels and a random sample of 30 generic paper towels and measured their
strength when wet. To do this, they uniformly soaked each paper towel with 4 ounces of water, held two
opposite edges of the paper towel, and counted how many quarters each paper towel could hold until ripping,
alternating brands. Here are their results:
Bounty: 106, 111, 106, 120, 103, 112, 115, 125, 116, 120, 126, 125, 116, 117, 114
118, 126, 120, 115, 116, 121, 113, 111, 128, 124, 125, 127, 123, 115, 114
Generic: 77, 103, 89, 79, 88, 86, 100, 90, 81, 84, 84, 96, 87, 79, 90
86, 88, 81, 91, 94, 90, 89, 85, 83, 89, 84, 90, 100, 94, 87
(a) Display these distributions using parallel boxplots and briefly compare these distributions. Based only on the
boxplots, discuss whether or not you think the mean for Bounty is significantly higher than the mean for
generic.
(b) Use a significance test to determine if there is convincing evidence that wet Bounty paper towels can hold
more weight, on average, than wet generic paper towels.
453
Is one form of the AP exam harder?
Last year,South Aiken High School had 30 students take the AP Statistics exam.
They were informed later that the College Board gave two forms of the exam, which
were randomly assigned to the students. Here are the results:
Form A
3
3
3
3
4
4
4
4
5
5
5
5
5
5
5
Form B
2
2
3
3
4
4
4
4
4
5
5
5
5
5
5
Mean score Form A ( x A ) ? ______
Mean score Form B ( x B ) ? ______
What is the difference in means x A − x B ? ___________
Assume the two forms are the same difficulty, so if Doug scored a 5 on Form A, he
would also score a 5 on Form B. In other words, Doug is a 5 no matter which form he is
randomly assigned.
1. The 30 AP scores from the class are written on 30 cards. Randomly assign half
of the students to get Form A and the other half to get Form B. What is the
difference in mean scores for this random assignment?
x A = _________
x B = _________
x A − x B = _________
2. Write the difference of mean scores on a sticker dot and take it to the poster at
the front of the room. Sketch the dotplot below.
difference of mean scores ( x A − x B )
3. East Kentwood had a difference of mean scores of 4.20 – 4.0 = 0.2. Is this
outcome surprising if we assume both forms are the same difficulty? Explain.
4. Based on the simulation, do we have convincing evidence that one form of the
exam is harder? Explain.
454
Significance Test for a Difference in Means
Important ideas:
Check Your Understanding
Last year, South Aiken High School had 30 students take the AP Statistics exam. They
were informed later that the College Board gave two forms of the exam, which were
randomly assigned to the students. The two forms had a difference of mean scores of
4.20 – 4.0 = 0.2. Do the data provide convincing evidence that one form of the AP
exam is harder than the other?
Complete the first two steps of a significance test.
455
Does labeling menus reduce calories?
According to a Stanford Business article, Americans may eat fewer calories at restaurants if
the calories of the food items are labeled on the menu. To investigate this, researchers
compared Starbucks receipts from locations where the menus were labeled to receipts from
stores where the menus were not labeled. A random sample of 30 receipts from stores with
the menus labeled had an average number of calories of 225 calories with a standard
deviation of 100 calories. A random sample of 40 receipts from stores without menus labeled
showed an average of 265 calories per receipt with a standard deviation of 75 calories. Does
this provide convincing evidence that the average calories per receipt at Starbucks with a
labeled menu is less than at a Starbucks without labeled menus?
456
Significance Test for a Difference in Means
Important ideas:
Check Your Understanding
Can balloons hold more air or more water before bursting? A student purchased a large bag of 12-inch
balloons. He randomly selected 10 balloons from the bag and then randomly assigned half of them to be
filled with air until bursting and the other half to be filled with water until bursting. He used devices to
measure the amount of air and water was dispensed until the balloons burst. Here are the data.
Air (ft3)
Water (ft3)
0.52
0.44
0.58
0.41
0.50
0.45
0.55
0.46
0.61
0.38
Do the data give convincing evidence air filled balloons can attain a greater volume than water filled
balloons?
457
Does Memory Training Help? Part 2
Suppose that 32 seniors were randomly selected to try both memory strategies. Each student
calculated the difference in the # words (Strategy 2 – Strategy 1). The mean of the differences was
x diff = 1.5 and the standard deviation of the differences is s diff = 2.7 . Do these data give convincing
evidence that strategy 2 improves memory at the α = 0.05 significance level?
458
Significance Test for A Mean of Differences
Important ideas:
Check Your Understanding
In each of the following settings, decide whether you should use two-sample t procedures to
perform inference about a difference in means or paired t procedures to perform inference about a
mean difference. Explain your choice.
1. A random sample of 30 adults were selected. Each adult reported the number of pieces of
junk mail they received in their mailbox at home that day as well as the number of junk
emails they received in their inbox that day. A researcher would like to know if adults
receive significantly more junk email than junk mail in their mailbox.
2. A random sample of 100 people who live on the west coast and 100 people who live on the
east coast are selected. Each person reports how many spam calls they receive in a oneweek time period. We would like to know if people who live on the west coast receive
significantly more spam calls than those who live on the east coast.
3. A researcher randomly selects a variety of 15 cell phones from different manufacturers and
different service providers. The researcher measures the signal strength from two different
remote areas: one in the state of Pennsylvania and one in the state of Virginia. The
researcher would like to know if the signal strength is significantly weaker in the remote
area in the state of Virginia.
459
Climate Change Part 2
Suppose that 32 weather stations were randomly selected across the globe. For each station, the
peak annual temperature (oC) between 1900-1940 was found. Then, the peak annual temperature
between 1960-2000 was found. Differences were calculated for each station (later temperature –
earlier temperature). The mean of the differences was ð‘ĨĖ…!"## = 0.35 oC and the standard
deviation of the differences was 𝑠!"## = 0.296 oC. Do these data give convincing evidence that
the peak global temperatures increased (on average) during the 1900’s?
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
460
Significance Test for A Mean of Differences
Important ideas:
Check Your Understanding
In each of the following settings, decide whether you should use two-sample t procedures to perform
inference about a difference in means or paired t procedures to perform inference about a mean
difference. Explain your choice.
1. A random sample of 30 adults were selected. Each adult reported the number of pieces of
junk mail they received in their mailbox at home that day as well as the number of junk
emails they received in their inbox that day. A researcher would like to know if adults receive
significantly more junk email than junk mail in their mailbox.
2. A random sample of 100 people who live on the west coast and 100 people who live on the
east coast are selected. Each person reports how many spam calls they receive in a oneweek time period. We would like to know if people who live on the west coast receive
significantly more spam calls than those who live on the east coast.
3. A researcher randomly selects a variety of 15 cell phones from different manufacturers and
different service providers. The researcher measures the signal strength from two different
remote areas: one in the state of Pennsylvania and one in the state of Virginia. The
researcher would like to know if the signal strength is significantly weaker in the remote area
in the state of Virginia.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
461
462
How to find P-value
Formulas for Significance
Tests
Formulas for Confidence
Intervals
Formula for standard
deviation of the sampling
distribution
Formula for mean of the
sampling distribution
Conditions
Name the procedure
Symbol for parameter
(population)
Symbol for statistic
(sample)
Lesson
10.1 – One Mean
AP Stats Chapter 10, 11 Study Sheet
10.2 – Difference of Means
10.2 – Mean Difference
463
2009 AP ® STATISTICS FREE-RESPONSE QUESTIONS (Form B)
5. A bottle-filling machine is set to dispense 12.1 fluid ounces into juice bottles. To ensure that the machine is
filling accurately, every hour a worker randomly selects four bottles filled by the machine during the past hour
and measures the contents. If there is convincing evidence that the mean amount of juice dispensed is different
from 12.1 ounces or if there is convincing evidence that the standard deviation is greater than 0.05 ounce, the
machine is shut down for recalibration. It can be assumed that the amount of juice that is dispensed into bottles
is normally distributed.
During one hour, the mean number of fluid ounces of four randomly selected bottles was 12.05 and the standard
deviation was 0.085 ounce.
(a) Perform a test of significance to determine whether the mean amount of juice dispensed is different from
12.1 fluid ounces. Assume the conditions for inference are met.
(b) To determine whether this sample of four bottles provides convincing evidence that the standard deviation
of the amount of juice dispensed is greater than 0.05 ounce, a simulation study was performed. In the
simulation study, 300 samples, each of size 4, were randomly generated from a normal population with
a mean of 12.1 and a standard deviation of 0.05. The sample standard deviation was computed for each
of the 300 samples. The dotplot below displays the values of the sample standard deviations.
0.01
0.02
0.03
0.04
0.05
0.06
0.07
I. •• • ••• •••••
0.08
0.09
0.10
Use the results of this simulation study to explain why you think the sample provides or does not provide
evidence that the standard deviation of the juice dispensed exceeds 0.05 fluid ounce.
464
Unit 8 & 9
Chapter 12
Inference for
Distributions and
Relationships
465
AP Statistics Handout: Lesson 12.1
Topics: problem of multiple tests, chi-square test for goodness of fit
Guided Notes
Fall, 2014: Harvard is sued by a group of Asian-American
applicants who were rejected in admissions. They claimed racial
discrimination.
Summer, 2018: Courts release reports on Harvard admissions
data, giving an unprecedented window into private University
admissions data. The data we’ll look at today is reconstructed
from parts of the plaintiff report* in which the defense and
plaintiffs generally agreed on the findings.
Today’s Key Analysis: Is there convincing evidence that Harvard discriminates
against Asian applicants?
*Plaintiff report accessible here: https://samv91khoyt2i553a2t1s05i-wpengine.netdnassl.com/wp-content/uploads/2018/06/Doc-415-1-Arcidiacono-Expert-Report.pdf
The problem of multiple tests
Class of 2019 Applicants,
Top 10% Academic Index
Group
%
Class of 2019
Admitted Students
Group
%
Asian-American
57.5%
Asian-American
21.4%
Hispanic
3.1%
Hispanic
12.2%
African-American
0.8%
African-American
11.2%
White
38.7%
White
55.3%
Source: Plaintiff’s report, table B.5.7
Notes:
- Academic Index ratings are internal measures of
academic qualification produced by the Harvard
admissions office. They’re calculated based on
standardized test scores and high school
grades/performance.
- Other groups (Native American, Mixed Race, etc.)
and international students were not included in
the court’s main analysis. Percentages are
calculated just out of these 4 groups.
Source: Harvard Gazette
Imagine Harvard claims: “We only accept the top academic applicants and we treat those applicants
equally. Our admitted class is as good as a random sample from the pool of top applicants.” You’d like to
test if there’s convincing evidence against this claim. Would it be appropriate to do four z-test for
proportions (one for each racial group above)? Why or why not?
Source:
Forbes
466
The null (__________________) hypothesis: there is no
racial discrimination, so the distribution of admitted
students ______________________________________.
Chi-Square test for goodness of fit
Hypotheses
ðŧ0 : The racial distribution of admitted students is the ____________ as the claimed distribution
ðŧðī : The racial distribution of admitted students is _____________________ as the claimed distribution
The alternative (______________________) hypothesis:
the distribution of admitted students _______________
________________________________.
Logic of Hypothesis Tests
1. Start by assuming the null (the default) is true. In your own words, describe what that means in
context of the example above:
2. Ask yourself: if the null is in fact true, how likely is the data that you’ve gathered? In your own words,
describe what that means in context of the example above:
3. Draw a conclusion. In your own words, describe what that means in context of the example above:
Calculating the Test Statistic:
Population of Top
Academic Applicants
In total, Harvard admitted n = 2023 applicants from these racial groups to the Class of 2019. How many
would we expect to admit if it was a random sample?
Group
%
Asian-American
57.5%
Hispanic
3.1%
African-American
0.8%
White
38.7%
Expected Counts
Step 1: Assuming
the null is true, this
is what we’d expect
467
Step 2: Compare our null expectation to the actual data
Group
Expected Observed (ð‘ķ𝒃𝒔𝒆𝒓𝒗𝒆𝒅 − 𝑎𝒙𝒑𝒆𝒄𝒕𝒆𝒅)𝟐
Count
Count
𝑎𝒙𝒑𝒆𝒄𝒕𝒆𝒅
AsianAmerican
432
Hispanic
247
AfricanAmerican
226
a) Why do we subtract the observed and
expected values?
b) Why do we square the differences?
c) Why do we divide by the expected values?
White
1118
Calculated with our data:
Formula:
=∑
(𝑂𝑏𝑠𝑒𝑟ð‘Ģ𝑒𝑑 − ðļð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘)2
ðļð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘
=
When chi-square is _______, that means your observed data is very ________________ from what was
expected under ðŧ0 . Therefore, if ðŧ0 is true, your observations are unlikely (___________________). You
reject ðŧ0 .
When chi-square is __________, that means your observed data is very _____________ to what was
expected under ðŧ0 . Therefore, if ðŧ0 is true, your observations are likely (___________________). You
fail to reject ðŧ0.
Step 3: Conclude → Next page
468
Imagine that Harvard claims: “We only accept the top academic applicants and we treat those applicants
equally. Our admitted class is as good as a random sample from the pool of top applicants.” Do the data
below provide convincing evidence to refute Harvard’s admissions claim?
%
Asian-American
57.5%
Hispanic
3.1%
African-American
0.8%
White
38.7%
Admitted
Top Academic
Applicants
Group
469
Observed
432
247
226
1118
12.1 PRACTICE!
Cheap Dice?
Mrs. Daniel purchased a bunch of inexpensive dice from Amazon. She is now wondering if the dice are fair. So, Mrs.
Daniel randomly selects 6 dice to roll 10 times each, for a total of 60 observations to evaluate. Do we have statistically
convincing evidence that the dice are fair?
Outcome
1
2
3
4
5
6
Total
470
Observed
13
11
6
12
10
8
60
Landline surveys
According to the 2010 Census, of all US residents age 20 and older, 19.1% are in their 20’s, 21.5% are in their 30’s, 21.1
% are in their 40’s, 15.5% are in their 50’s, and 22.8% are 60 and older. The table below shows the age distribution for a
sample of US residents age 20 and older. Members of the sample were chosen by randomly dialing landline telephone
numbers.
Category Count
20-29
141
30-39
186
40-49
224
50-59
211
60+
286
Total
1048
Do these data provide convincing evidence that the age distribution of people who answer landline telephone surveys is
not the same as the age distribution of all US residents?
471
Which color M&M is the most common?
The company that makes milk chocolate M&Ms claims the following distribution:
13% Brown, 14% Yellow, 20% Orange, 16% Green, 24% Blue, and 13% Red. Is this true?
1. Observed values: Brown:_____ Yellow:_____ Orange:_____ Green:_____ Blue:_____ Red:_____
Total number of M&Ms:_______
2. As a class, write down hypotheses for a significance test.
H0:
Ha:
3. Let’s suppose that M&Ms claimed distribution is correct. If they are correct, how many of each
color would we expect to get in our sample.
Expected values: Brown:_____ Yellow:_____ Orange:_____ Green:_____ Blue:_____ Red:_____
Use the table to calculate the test statistic.
Observed
Expected
(Observed - Expected)
(Observed - Expected)2
(Observed - Expected ) 2
Expected
Brown
Yellow
Orange
Green
Blue
Red
Add up all the numbers in the last column. This is our test statistic:_________
4. What value would we get for the test statistic if our sample was very close to what is expected?
Explain.
5. What value would we get for the test statistic if our sample was very far from what is expected?
Explain.
472
Chi-Square Test: Goodness of Fit
Important ideas:
Check Your Understanding
Are births equally likely across the days of the week? A random sample of 150 births give the following
sample distribution:
Day of
the week
Count
Sunday
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
11
27
23
26
21
29
13
a. State the appropriate hypotheses.
b. Calculate the expected count for each of the possible outcomes.
c. Calculate the value of the chi-square test statistic.
d. Which degrees of freedom should you use?
e. Use Table C to find the P-value. What conclusion would you make?
473
Which color M&M is the most common part two?
The company that makes milk chocolate M&Ms claims the following distribution:
13% Brown, 14% Yellow, 20% Orange, 16% Green, 24% Blue, and 13% Red. Is this true?
1. Record the information from yesterday.
Observed values: Brown:_____ Yellow:_____ Orange:_____ Green:_____ Blue:_____ Red:_____
Expected values: Brown:_____ Yellow:_____ Orange:_____ Green:_____ Blue:_____ Red:_____
Test statistic: χ 2 = _________
2. Check conditions:
Random:
10%:
Large counts: Which expected count is the lowest? Are all of the expected counts greater than 5?
3. Calculate the P-value.
For this test df = n – 1, but n represents the number of categories (colors).
What is the df for this test?_______
What is the test statistic for this test?________
Use Table C to find the P-value:________
4. Make a conclusion. Use α = 0.05 .
5. Which color M&M had an observed value the farthest from the expected?
474
Do the data provide significant evidence that the company was lying about the distribution of
colors of M&Ms? Use α = 0.05
475
Chi-Square Test for Goodness of Fit:
Important ideas:
Check Your Understanding
A traffic light is installed to allow traffic from a seldom used side street to cross a 4-lane highway.
Because the side street doesn’t get a lot of traffic the light is set to provide a red light for the side street
80% of the time, yellow 5% of the time, and green 15% of the time. A resident who must pass through
the light several times per day is suspicious that the light is not functioning according to the claimed
distribution. He sets up a trail camera and programs it to snap a picture of the light at 200 randomly
selected times throughout the day. Here are the results: Red: 173, Yellow: 13, and Green: 14.
a. Do these data provide convincing evidence that the light is not functioning according to the claimed
distribution?
b. If there is convincing evidence of a difference in the distribution of car color, perform a follow-up
analysis.
476
Does Harvard Discriminate Against Asian Applicants? (Pt 2)
Below is the true distribution of top academic applicants to Harvard’s Class of 2019, along with the
actually admitted class. Recall: We’re imagining that Harvard claims their admitted students are
essentially a random sample from this pool of top academic applicants. Do the data provide convincing
evidence against Harvard’s claim? Use α = 0.05.
Class of 2019 Applicants with
Top Academic Ratings
Group
%
Class of 2019 Actually
Admitted Students
Group
Count
Asian-American
57.5%
Asian-American
432
Hispanic
3.1%
Hispanic
247
African-American
0.8%
African-American
226
White
38.7%
White
1118
Source: Plaintiff’s report, table B.5.7
Source: Harvard Gazette
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
477
1. Were white applicants admitted at higher or lower rates than expected? Was this surprising? Why or
why not?
3. Discussion Question: Despite the evidence above, how do you believe Harvard’s lawyer would
defend the school’s admissions policy against claims of discrimination? Explain.
4. Discussion Question: What further evidence would you like to know when assessing whether
Harvard’s admissions process is truly discriminatory?
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
478
Chi-Square Test for Goodness of Fit
Important ideas:
Check Your Understanding
A traffic light is installed to allow traffic from a seldom used side street to cross a 4-lane highway. Because
the side street doesn’t get a lot of traffic the light is set to provide a red light for the side street 80% of the
time, yellow 5% of the time, and green 15% of the time. A resident who must pass through the light several
times per day is suspicious that the light is not functioning according to the claimed distribution. He sets up
a trail camera and programs it to snap a picture of the light at 200 randomly selected times throughout the
day. Here are the results: Red: 173, Yellow: 13, and Green: 14.
a. Do these data provide convincing evidence that the light is not functioning according to the claimed
distribution?
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
479
AP Statistics Handout: Lesson 12.2
Topics: chi-square tests for independence and homogeneity
Guided Notes
Force Level
No Force Used
Hands Used
Higher Force Level*
Race of Suspect
Black
Hispanic White
1371
853
260
263
188
30
109
72
18
*Includes push to wall/ground, handcuffs, draw/point weapon, pepper
spray, baton. Data source: NYC.GOV,
https://www1.nyc.gov/site/nypd/stats/reports-analysis/stopfrisk.page
This 2011 data is a random sample of police stops from New York’s “Stop and Frisk” program. The
program allowed police officers to stop people on the street and search them for weapons or
contraband. The program was controversial. Critics alleged that it led to heightened police
discrimination of people of color.
Conditional distributions
i) Fill in the table to the right with the
conditional distribution of force levels among
each racial group:
ii) A television commentator says, “Police had
‘no force’ interactions with only 260 white
suspects. Meanwhile, a much higher number
of black suspects—1,371—didn’t experience
force. Clearly, black people experience ‘forcefree’ interactions with police more frequently
than white people.” Was that a good analysis
of this data? Why or why not?
Black
No Force Used
Hands Used
Higher Force Level
480
White
Imagine a statistician is investigating if there is convincing evidence of an association between race of
suspect and level of force used. The statistician performs a set of z-tests for proportions on the various
rates of force levels used between groups. Why is this not a good idea?
Chi-Square Test for Independence
Hypotheses
ðŧ0 : There ____________________________between race of suspects and levels of force used.
• This is the null (dull / default / innocent) hypothesis: force is used at the __________________
between all races.
ðŧðī : There ____________________________ between race of suspects and levels of force used.
• This is the alternative (guilty / proof-needed) hypothesis: force is used at ___________________
between racial groups.
Logic of Hypothesis Tests
1. Start by assuming the null (the default) is true. In your own words, describe what that means in
context of the example above:
2. Ask yourself: if the null is in fact true, how likely is the data that you’ve gathered? In your own words,
describe what that means in context of the example above:
3. Draw a conclusion. In your own words, describe what that means in context of the example above:
481
Calculating the Test Statistic:
Step 1: Assume the null is true. In a world where force is used equally across racial groups, determine
the expected counts of suspects receiving the various levels of force. Show your work in the “Expected
Counts” table below:
Expected Counts
Observed Data
Force
None
Hands
Greater
Total
Black
1,371
263
109
1,743
Expected Count =
Hispanic
853
188
72
1,113
White
260
30
18
308
Total
2,484
481
199
3,164
Force
Black
Hispanic
White
None
2,484
Hands
481
Greater
199
row total × column total
grand total
i) In your own words, explain how the
expected counts were calculated and why
they represent a world under the null
hypothesis assumption:
Total
1,743
1,113
308
Step 2: Compare your expected counts (under null assumption) to the actual observed data
ii) Compare the expected counts to the observed data. What trends do you notice between racial
groups?
Chi-Square Statistic Formula:
=∑
Total
Show the first few terms and final result using our data:
(𝑂𝑏𝑠𝑒𝑟ð‘Ģ𝑒𝑑 − ðļð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘)2
ðļð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘
=
When chi-square is _______, that means your observed data is very ________________ from what was
expected under ðŧ0. Therefore, if ðŧ0 is true, your observations are unlikely (___________________). You
reject ðŧ0.
When chi-square is _______, that means your observed data is very __________ to what was expected under
ðŧ0. Therefore, if ðŧ0 is true, your observations are likely (___________________). You fail to reject ðŧ0.
482
3,164
Step 3: Conclude → We’ll go through that in the process below!
Below are 3,164 randomly sampled records from the 685 thousand stops made by NYC police (as a part
of stop-and-frisk) in 2011. Is there convincing evidence of an association between race of suspect and
level of force used?
Race of Suspect
Force Level
Black
Hispanic White
No Force Used
1371
853
260
Hands Used
263
188
30
Higher Force Level*
109
72
18
*Includes push to wall/ground, handcuffs, draw/point weapon, pepper
spray, baton. Data source: NYC.GOV,
https://www1.nyc.gov/site/nypd/stats/reports-analysis/stopfrisk.page
Calculator steps
Data
2nd → MATRIX→
Edit → [A]
Input Observed
STAT → TESTS →
C: ð‘ŋ𝟐 − 𝐓𝐞𝐎𝐭 …
Observed: [A]
Expected: [B]
Expected Values
483
Output
2nd → MATRIX→
Edit → [B]
Chi-Square Test for Homogeneity
It’s the _______________________ as the chi-square test for independence. With two key differences:
• The data ________________
• The ________________________and (therefore) conclusions
Independence Situation
Homogeneity Situation
Data “setup” - We are testing the association
of…
• ____________________: race of suspect
and force level
• One sample from __________________:
stops in NYC
Data “setup” - We are testing the association
of…
• ___________________: race of suspects
• 3 samples from ___________________:
NYC, Boston, and LA stops
City
Force Level
No Force Used
Hands Used
Higher Force Level
Race of Suspect
Black
Hispanic White
1371
853
260
263
188
30
109
72
18
Race of Suspect
Black
Hispanic
N.Y.C Boston
1,743 457
1,113 1,034
L.A.
1,367
1,467
White
Asian
Native American
308
126
56
780
245
145
893
76
22
Note: this is fake data
i) Write the hypotheses for this situation:
i) Write the hypotheses for this situation:
ii) In your own words, what makes this an
independence situation, rather than a homogeneity
situation?
ii) In your own words, what makes this a
homogeneity situation, rather than an independence
situation?
484
Lesson 12.2 Discussion
Force level
Black
Hispanic
White
No Force
78.7%
76.6%
84.4%
Hands Used
15.1%
16.9%
9.7%
Higher Force
6.3%
6.5%
5.8%
Recap: We found that race of suspect and level of
force used were significantly associated. In other
words, the differences in use of force rates between
racial groups were unlikely due to chance alone.
Discussion Questions
i) Are the differences in force rates across racial groups practically important?
ii) Does our data provide enough evidence to prove that New York police are racially biased (in terms of
use of force)? Why or why not?
485
12.2 PRACTICE!
Ibuprofen or Acetaminophen?
In a study reported by the Annals of Emergency Medicine, researchers conducted a randomized, double-blind clinical
trial to compare the effects of ibuprofen and acetaminophen plus codeine as a pain reliever for children recovering from
arm fractures. There were many response variables recorded, including the presence of any adverse effect, such as
nausea, dizziness, and drowsiness. Is there evidence to suggest a difference between the drug and the side effects?
Here are the results:
Ibuprofen
Adverse effects
No adverse effects
Total
36
86
122
486
Acetaminophen
Total
plus Codeine
57
93
55
141
112
234
Tide vs New Tide
Before bringing a new product to market, firms carry out extensive studies to learn how consumers react to the product
and how best to advertise its advantages. Here are data from a study of a new laundry detergent. The participants are a
random sample of people who don’t currently use the established brand that the new product will compete with. Give
subjects free samples of both detergents. After they have tried both for a while, ask which they prefer. The answers may
depend on other facts about how people do laundry.
Determine whether or not the sample provides convincing evidence that laundry practices and product preference are
independent in the population of interest.
487
Does gummy bear brand matter?
Is the distribution of gummy bear color the same for Haribo gummy bears and Meijer
gummy bears? We’ll collect data as a class and determine if we have convincing evidence
of a difference.
1. Add your data to the board and fill in the table below with the class totals.
Observed:
Brand
Haribo
Meijer
Total
Red
Green
Color
Yellow
Orange
White
Total
2. How many samples do we have? What population are they from? Explain.
3. How many variables are we examining? Explain.
4. As a class, write down hypotheses for a significance test.
H0:
Ha:
5. Now we will use a chi-square test to test if there is a difference between the two
populations. We first need to find the expected values. Complete the table below.
Expected:
Brand
Haribo
Meijer
Red
Green
Color
Yellow
Orange
White
Total
488
Total
6. Use your work on the front page to complete a significance test.
df = (rows – 1)(columns – 1)
7. Explain how this test is different from a chi-square test for goodness of fit?
489
Chi-Square Test for Homogeneity
Important ideas:
Check Your Understanding
A social scientist selects a random sample of 25 freshmen, 25 sophomores, 25 juniors, and 25 seniors
from various high schools across the state Kentucky. Each student was asked if they preferred inperson or remote learning. Here are the results:
Remote
In Person
Freshman
3
22
Sophomore
12
13
Junior
14
11
Senior
15
10
a. State the appropriate null and alternative hypotheses.
b. Show the calculation for the expected count in the Remote/Senior cell. Then provide a
complete table of expected counts.
c. Calculate the value of the chi-square test statistic.
490
Are Taco Tongue and Evil Eyebrow independent?
Is there an association between the Taco Tongue and the Evil Eyebrow? Below is the data
for a random sample of 600 Senior students. Do we have convincing evidence that the
ability to do the Taco Tongue and Evil Eyebrow are associated for all Seniors?
1. Describe what it means for two events to be independent.
2. Calculate the expected counts.
Observed:
Expected:
3. Do the data provide significant evidence that there is an association between the ability to Taco
Tongue and Evil Eyebrow for all Seniors? Use α = 0.05
491
Chi-Square Test for Independence
Important ideas:
Check Your Understanding
For each of the following situations decide what type of chi square test is appropriate. Explain.
1. A random sample of 200 students was asked to sample a new type of pizza that the school was
considering using as a replacement for the current pizza. Each student stated if they were a
freshman, sophomore, junior, or senior and also if they liked the new pizza more than the current
pizza (or not). The school would like to know if there is a relationship between grade level and
pizza opinion.
2. Another school is also considering changing their pizza vendor. This school selects separate
random samples of 50 freshmen, 50 sophomores, 50 juniors, and 50 seniors. Each student tries
the new pizza and states whether they like it more than the current pizza (or not). The school
would like to know if the distribution of opinion differs across the grade levels.
3. A pizza shop claims that 30% of orders are placed on Fridays, 20% are placed on Saturdays,
and 10% of orders are placed on the other days of the week. A global pandemic may have
changed this distribution. The manager investigates so he knows how to staff the pizza shop
appropriately. He selects a random sample of 300 orders and classifies each one according to
the day of the week the order was placed. He wants to know if the distribution of orders is the
same as it was before the global pandemic.
492
Was “Stop and Frisk” Biased?
Today, we’ll analyze 2011 data from New York’s “Stop and Frisk” program. The program allowed
police officers to stop people on the street and search them for weapons or contraband. The
program was controversial. Critics alleged that it led to heightened police discrimination of people of
color. We will explore that claim using a chi-square test for independence.
Open the following links:
• The program data (may take some time to load): tinyurl.com/stop-and-frisk-data
• Random number generator: random.org/integers
• NYPD precinct map: tinyurl.com/nypd-precincts
The data we’re using contains every stop made by NYPD police officers in 2011 (more than half a
million stops in total). Each row represents a single stop. The second tab in the spreadsheet
contains a data key and further information. Note: These data were reported by the police officers
who made the stops.
1. Use the random number generator to obtain 10 random integers between the values 2 and
632722 (the number of dataset rows). Find the dataset row numbers that correspond with your 10
random integers and record the following information for each selected stop:
Race
Force
Level
2. For one of your selected stops, look at all the variables listed: race, gender, and age of suspect
along with the suspected crime and whether an arrest was actually made. If you’d like, use the
precinct number and precinct map (linked above) to see the general area in which this stop took
place. Write a two-sentence description of this stop, as if you’re writing a news brief for an article:
3. Often, police interactions are portrayed at an individual level in the news media, with vivid details
given about the people involved and the interactions themselves. We are about to conduct a
statistical analysis of many interactions, focusing solely on the relationship between race and force
level used. What are the strengths and weaknesses of each type of analysis (individual &
statistical)?
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
493
4. As a class, combine your samples to fill in the observed table. Then, calculate the expected
counts. Note: all force levels other than “hands” (including push to wall/ground, handcuffs,
draw/point weapon, pepper spray, baton) are all considered “higher level” force. The NYPD did not
include police shootings in this dataset.
Observed
Force
Black
Hispanic
Expected
White
Total
Force
None
None
Hands
Only
Hands
Only
Higher
Level
Higher
Level
Total
Total
Black
Hispanic
White
Total
5. Do the data provide convincing evidence of an association between race of suspects and the
levels of force used by police officers? Use α = 0.05.
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
494
The population data (all 2011 stops):
Raw Counts
Force Level
Black
None
Hispanic
274,260
Conditional Distribution (By Race)
White
Total
170,633 52,040 496,933
Hands Only
52,676
37,620
5,904
96,200
Higher Level
21,739
14,321
3,528
39,588
Total
348,675
222,574 61,472 632,721
Force level
Black
Hispanic
White
None
78.7%
76.7%
84.7%
Hands Only
15.1%
16.9%
9.6%
Higher Level
6.2%
6.4%
5.7%
There is a substantial association between race of suspects and the force levels used in the population.
Specifically, White suspects receive “no force” at a higher rate, and they receive higher level force at a
lower rate.
Discussion Questions
1. Were the results of your chi-square test consistent with the above conclusion about the population
data? Explain.
2. Do these data provide enough evidence to prove that New York police are racially biased (in terms of
use of force)? Why or why not?
3. (Optional Extension Question) Information about the residential population of NYC in 2011 is
displayed below. Compare these counts to the data above. If police stops were done completely at
random, which groups would be stopped proportionally more often? Less often? What conclusions can
you draw from this?
Race
Black
Hispanic
White
Population
2,054,101
2,373,304
2,731,173
Data source: 2011 American Community Survey (data.census.gov)
Lesson provided by Stats Medic (statsmedic.com) & Skew The Script (skewthescript.org)
Made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License
(https://creativecommons.org/licenses/by-nc-sa/4.0)
495
Coincidence or Murder?
Kristen Gilbert started working at the VAMC as a nurse in 1989. A proficiency report
obtained by the Boston Globe described her as “highly skillful,” calm and
compassionate. She organized charity drives and collections for the needy, one of her
defense attorneys said, and she once organized a memorial service for a colleague who
died of cancer.
However, Gilbert did have a tendency to be nearby when patients died. Co-workers
sometimes referred to her as the “angel of death,” the Boston Globe reported.
Federal prosecutor Bill Welch said that death rates seemed to increase significantly
during the shifts Gilbert was assigned to, according to the Globe. Eventually, her fellow
nurses grew suspicious.
In particular, Gilbert’s co-workers started to raise concerns about an increase in medical
emergencies (“codes”) and deaths in Ward C between August 1995 and February 1996,
court records show. A criminal investigation was launched in February, and Gilbert left
her job soon after learning she was the target of it.
According to a search warrant affidavit, a statistical analysis of codes and deaths in
Ward C between January 1, 1995 and February 19, 1996 showed that Gilbert was
present or on duty for 37 of them. Authorities determined the likelihood that her frequent
presence during these emergencies was merely coincidental was _________.
1. What other explanations might there be for why Gilbert had more deaths on her
shifts?
2. What statistical information would we need to know in order to make a decision
about whether the deaths on Gilbert’s shifts were coincidental, or in other words,
purely by chance?
496
An analysis of 1641 eight-hour shifts is presented below. Use the data to decide if there
is an association between Gilbert’s presence and death during a shift.
Death during shift?
Gilbert present?
Yes
No
Yes
40
217
No
34
1350
3. What type of test will you use to analyze the data? Why?
4. Conduct the test below using the 4 step process.
5. Do you think Kristen Gilbert murdered her patients? Explain using the results of
your study.
497
AP Statistics Handout: Lesson 12.3
Topics: chi-square vs. regression, t-interval for regression slope, t-test for regression slope
Note: Lesson inspired by “Silence the
Violence” materials from 21st Century
Math Projects
Guided Notes
Which country is this?
Data notes: Random sample of developed
nations from University of Sydney School of
Public Health database. Full database available
at gunpolicy.org. Data from most recent year
available for each nation (2014-2019).
"Developed" nations = top 65 countries in
2019 UN Human Development Index.
Houston Gun Show, photo by glasgows
Cartoon by David Simonds
Two positions for preventing gun homicide:
Gun Control
More Guns
Describe the arguments both of these sides make, as well as the nations they reference when making
these arguments:
Source:
Forbes
498
Chi-Square vs. Regression
Nation
Australia
Belgium
Bulgaria
Czech Republic
Finland
France
Germany
Greece
Hungary
Italy
Japan
Lithuania
Malta
Netherlands
Portugal
Qatar
Romania
Switzerland
Sweden
Civilian
Firearms per
100 People
Gun Homicides
per 1,000,000
People
13.7
12.7
8.4
12.5
32.4
19.6
32.0
17.6
10.55
12.89
0.3
13.6
28.3
2.6
24.22
16.7
2.6
34.8
23.1
1.5
2.5
2.0
1.0
2.0
1.2
0.6
1.9
0.5
2.9
0.1
1.1
4.6
1.6
2.4
0.5
0.3
1.6
4.0
Random sample of 19 developed nations
i) How many variable(s) are there in this dataset?
ii) Are the variable(s) quantitative or categorical?
iii) When we have two categorical variables, how do we
organize and test the data? What about two quantitative
variables?
iv) In this case, which variable is the explanatory and which
is the response? How do you know?
Data notes: Random sample of developed nations from University
of Sydney School of Public Health database. Full database available
at gunpolicy.org. Data from most recent year available for each
nation (2014-2019). "Developed" nations = top 65 in 2019 UNHDI.
Scatterplot and Least Squares Regression Line (LSRL)
i) Circle and label Japan and Germany on these graphs. Comment on how their locations in these graphs
relate to the “gun control” vs. “more guns” arguments:
ii) Comment on the overall direction and strength of the relationship:
499
t-Interval for the Slope of a Regression Line
Predictor
Constant
Firearms
S = 1.141
Coef
SE Coef
0.91618 0.51205
0.04675 0.02625
R-Sq = 15.73%
T
1.789
1.781
P
0.0914
0.0928
R-Sq (adj) = 10.77%
1) Write down the LSRL equation:
2) Interpret the estimate for the slope:
Making the Interval
Construct and interpret a 95% confidence interval for the population regression line. First, identify the
following:
Population:
Parameter:
Sample:
Statistic:
β
95% confident we’ll capture this
Statistic ± margin of error
Statistic ± (critical value)(standard error)
_________________ standard errors in
either direction.
Using the formula, the computer output, and technology,
calculate and interpret the t-interval for the slope of the
regression line:
Confidence Interval for Regression Slope:
𝑏 ± (ð‘Ą ∗ )(𝑆ðļ𝑏 )
To find 𝒕∗ with TI-84:
2nd → VARS → 4: InvT
500
-Area: percent below
interval (2.5% = 0.025)
-df: n – 2 (19 – 2 = 17)
t-Test for the Slope of a Regression Line
Predictor
Constant
Coef
0.91618
SE Coef
0.51205
T
1.789
P
0.0914
Firearms
0.04675 0.02625
1.781
0.0928
S = 1.141
R-Sq = 15.73%
R-Sq (adj) = 10.77%
Is there convincing evidence of a linear relationship between rates of gun ownership and rates of gun
homicide in developed nations?
Hypotheses:
ðŧ0 : ð›― = 0
ðŧðī : ð›― ≠ 0
β: The true slope of the population regression
line relating rates of gun ownership (per 100
people) and rates of gun homicide (per 1
million people) in developed nations.
i) In your own words, interpret what the null and alternative hypotheses mean in this context:
ii) Why do our hypotheses use β instead of b?
iii) Why do you think the above test is the same thing as testing if there is no correlation (𝜌 = 0)
between rates of gun ownership and homicide?
Conducting the test:
iv) You can use the interval we calculated earlier to figure out the result of this significance test. Please
do so and explain your results:
v) Report the p-value and test statistic. Circle where they came from in the computer output above.
Then, write the conclusion of your test:
501
Rates of gun ownership and rates of gun homicide from 19 randomly sampled developed nations is
shown in the scatterplot below. LSRL computer output, a residual plot, and a histogram of the residuals
are provided:
Predictor
Constant
Firearms
S = 1.141
Coef
SE Coef
0.91618 0.51205
0.04675 0.02625
R-Sq = 15.73%
T
1.789
1.781
P
0.0914
0.0928
R-Sq (adj) = 10.77%
Conditions for Regression Inference
L - _______________, to check this condition:
I - ________________, to check this condition:
N - _______________, to check this condition:
E - _______________, to check this condition:
R - _______________, to check this condition:
Plots A & B are made from mock data. If you were performing inference on their datasets, these plots
would be a bad sign for your conditions. Which conditions would they violate and why?
A
B
502
Lesson 12.3 Discussion
We ______________________ convincing evidence of a relationship between rates of gun ownership
and rates of gun homicide. This conclusion makes sense: the relationship was ___________________.
Original Sample (USA not included)
With USA Added
Discussion Question: Would reducing civilian gun ownership also reduce gun homicides in the U.S.?
Explain.
•
•
•
First: Answer this question using only the data we’ve analyzed, without making any assumptions.
Second: Answer by choosing a clear side and use a BS (Bad Statistics) argument to support it.
Third: Provide your actual personal opinion. Make sure, as always, to address the data.
503
Does seat location matter? Part 1
Do students who sit in the front rows do better than students who sit farther away?
Row
1
1
Score 76 77
1
94
1
1
1
2
2
2
2
2
2
3
3
3
3
3
3
99 88 90 83 85 74 79 77 79 90 88 68 78 83 79
Row
4
4
4
4
4
4
5
5
5
5
5
5
Score 94 72 101 70 63 76 76 65 67 96 79 96
1. Is this an observational study or an experiment? Why?
2. Why is it important to randomly assign the students to seats rather than letting each student choose
his or her own seat?
3. How many variables are we measuring?_____ Are they categorical or quantitative?_____________
What is the explanatory variable (x)?__________________ Response variable(y)?_____________
4. Use stapplet.com to make a scatterplot. Sketch it below.
5. Find the least squares regression line (LSRL):__________________________________________
6. What is the slope of the LSRL:_________
Interpret the slope in the context of the problem.
504
Does the negative slope provide convincing evidence that sitting closer causes higher achievement, or is
it plausible that the association is purely by chance because of random assignment?
In order to answer this question, we need to know more about “purely by chance because of random
assignment”. If we assume that seat location has NO effect on Exam Score, then we could just randomly
assign all 30 Exam Scores to each of the seat locations. We will do this by writing down each of the 30
Exam Scores onto an index card, shuffle the index cards, and then randomly assign them to seat
locations.
In pairs, shuffle up the note cards and randomly assign 6 students into each of the 5 rows. Record the
results:
Row 1: _____, _____, _____, _____, _____, _____
Row 2: _____, _____, _____, _____, _____, _____
Row 3: _____, _____, _____, _____, _____, _____
Row 4: _____, _____, _____, _____, _____, _____
Row 5: _____, _____, _____, _____, _____, _____ Now find the slope of the LSRL:____________
Repeat this process 2 more times for a total of 3 different samples. Record the results.
Row 1: _____, _____, _____, _____, _____, _____
Row 2: _____, _____, _____, _____, _____, _____
Row 3: _____, _____, _____, _____, _____, _____
Row 4: _____, _____, _____, _____, _____, _____
Row 5: _____, _____, _____, _____, _____, _____ Now find the slope of the LSRL:____________
Row 1: _____, _____, _____, _____, _____, _____
Row 2: _____, _____, _____, _____, _____, _____
Row 3: _____, _____, _____, _____, _____, _____
Row 4: _____, _____, _____, _____, _____, _____
Row 5: _____, _____, _____, _____, _____, _____ Now find the slope of the LSRL:____________
You have now calculated three different possible values for the slope based on random assignment.
Take these 3 values to the dotplot on the whiteboard in the front of the room. When everyone in class
has recorded their data, copy the dotplot below:
505
Sampling Distribution of b
Important ideas:
Check Your Understanding
You may have heard that your nose and ears grow through your whole life. While it is true that your
nose and ears get bigger throughout life, its not because they grow, but because of gravity. The
cartilage in your nose and ears break down as we age and the “growth” people observe is the result
of drooping. To quantify the expansion of ears over time, a random sample of 30 adults were
selected. For each adult, their age (in years) was recorded and their ear height (cm) was measured.
Below is the regression output. Is there evidence of a positive linear relationship between age and
ear height? Assume the conditions for inference are met.
a. What is the estimate for α? Interpret this value.
b. What is the estimate for β? Interpret this value.
c. What is the estimate for σ? Interpret this value.
d. Give the standard error of the slope SEb. Interpret this value.
506
Does seat location matter – Part 2?
Do students who sit in the front rows do better than students who sit farther away? Mrs.
Gallas took a random sample of 30 students from her classes and found these results.
Row
1
1
Score 76 77
1
94
1
1
1
2
2
2
2
2
2
3
3
3
3
3
3
99 88 90 83 85 74 79 77 79 90 88 68 78 83 79
Row
4
4
4
4
4
4
5
5
5
5
5
5
Score 94 72 101 70 63 76 76 65 67 96 79 96
Line of best fit:________________
Slope: b =
SEb = 1.33
1. If Mrs. Gallas were to take another random assignment of 30 students, do you think the slope of the
LSRL would be the same? Why?
2. We are going to construct a 95% confidence interval for the slope of the population regression line.
Identify the parameter and statistic.
Parameter:________________________________
3. There are five conditions to check.
(1) Linear: The scatterplot needs to show a linear
relationship. Also, the residual plot doesn’t have a
leftover curved pattern. Sketch each at right.
(2) Independent:
(3) Normal: A dotplot of the residuals cannot show
strong skew or outliers. Make one using the applet
and sketch it at right.
(4) Equal SD: The residual plot does not show a clear
sideways Christmas tree pattern.
(5) Random:
4. Construct the interval:
507
Statistic:____________
Confidence Intervals for Slope
Important ideas:
Check Your Understanding
A thrill-seeker wanted to try to travel across a large field while being suspended in the air by holding
onto balloons. In order to determine the number of balloons needed per pound of weight, he did a
preliminary study. He selects a random sample of 20 rocks of various sizes. He weighed each one
and also determined how many balloons are needed to lift the rock. Here is output from a leastsquares regression analysis of the data.
Construct and interpret a 90% confidence interval for the slope of the population regression line.
508
How does GPA relate to ACT score?
Mrs. Gallas is wondering if there is a relationship between GPA and ACT score. She took a
random sample of 9 out of her 101 students and recorded their GPA and ACT score. The
data are below.
Student #
GPA
ACT
83
3.7
23
69
2.3
20
96
4.0
35
89
3.8
33
57
3.0
22
13
1.8
13
24
2.0
17
37
2.3
20
1. What relationship would you expect GPA and ACT score to have? Explain.
Here is the Minitab ouput as well as graphs of the data.
Predictor
Coef
SE Coef
T
P
Constant
1.201
0.0874
13.72
0
GPA
7.507
1.29
5.82
0.0006511
S = 3.252686
R-Sq = 82.8%
R-Sq(adj) = 76.5%
2. Find the LSRL for the data.
509
91
3.9
29
3. Do the data provide significant evidence that there is a positive linear relationship
between GPA and ACT?
510
Significance Test for Slope
Important ideas:
Check Your Understanding
You may have heard that your nose and ears grow through your whole life. While it is true that your
nose and ears get bigger throughout life, it’s not because they grow, but because of gravity. The
cartilage in your nose and ears break down as we age and the “growth” people observe is the result
of drooping. To quantify the expansion of ears over time, a random sample of 30 adults were
selected. For each adult, their age (in years) was recorded and their ear height (cm) was measured.
Below is the regression output. Is there convincing evidence of a positive linear relationship
between age and ear height? Assume the conditions for inference are met.
511
512
513
2011 AP ® STATISTICS FREE-RESPONSE QUESTIONS
5. Windmills generate electricity by transferring energy from wind to a turbine. A study was conducted to examine
the relationship between wind velocity in miles per hour (mph) and electricity production in amperes for one
particular windmill. For the windmill, measurements were taken on twenty-five randomly selected days, and the
computer output for the regression analysis for predicting electricity production based on wind velocity is given
below. The regression model assumptions were checked and determined to be reasonable over the interval of
wind speeds represented in the data, which were from 10 miles per hour to 40 miles per hour.
Predictor
Constant
Wind velocity
Coef
0.137
0.240
S = 0.237
R-Sq = 0.873
SE Coef
0.126
0.019
T
1.09
12.63
p
0.289
0.000
R-Sq (adj) = 0.868
(a) Use the computer output above to determine the equation of the least squares regression line. Identify all
variables used in the equation.
(b) How much more electricity would the windmill be expected to produce on a day when the wind velocity is
25 mph than on a day when the wind velocity is 15 mph? Show how you arrived at your answer.
(c) What proportion of the variation in electricity production is explained by its linear relationship with wind
velocity?
(d) Is there statistically convincing evidence that electricity production by the windmill is related to wind
velocity? Explain.
514
EXAM REVIEW
515
AP Statistics Chapter 1 Notes - Exploring Data
1.1/2: Categorical Variables and Displaying Distributions with Graphs
Individuals and Variables
 Individuals are objects described by a set of data. Individuals may be people, but they
may also be animals or things.
 A variable is any characteristic of an individual. A variable can take different values for
different individuals.
Categorical and Quantitative Variables
 A categorical variable places an individual into one of several groups or categories.
 A quantitative variable takes numerical values for which arithmetic operations such as
adding and averaging make sense.
Distribution
The distribution of a variable tells us what values the variable takes and how often it takes
these variables.
Describing the Overall Pattern of a Distribution – Remember your SOCS
To describe the overall pattern of a distribution, address all of the following:
 Spread – give the lowest and highest value in the data set
 Outliers – are there any values that stand out as unusual?
 Center – what is the approximate average value of the data (only an estimation)
 Shape – does the graph show symmetry, or is it skewed in one direction (see below)
Outliers
An outlier in any graph of data is an individual observation that falls outside the overall
pattern of the graph.
Describing the SHAPE of a distribution – Symmetric and Skewed Distributions
Symmetric
Skewed Left
Skewed Right
Mean = Median
Mean < Median
Mean > Median
Time Plot
 A time plot of a variable plots each observation against the time at which it was
measured.
 Always mark the time scale on the horizontal axis and the variable of interest on the
vertical axis. If there are not too many points, connecting the points by lines helps show
the pattern of changes over time.
516
1.3: Describing Distributions with Numbers
The Mean ( x )
To find the mean of a set of observations, add their values and divide by the number of
observations. If the n observations are x1, x2, …, xn, their mean is:
x ï€Ŧ x ï€Ŧ x ï€Ŧ  ï€Ŧ xn
xï€― 1 2 3
n
n
or simply,
x ï€― ïƒĨ xi
i ï€―1
The Median (M)
 The median M is the midpoint of distribution, the number such that half the observations
are smaller and the other half are larger. To find the median of distribution:
 Arrange all observation in order of size, from smallest to largest.
 If the number of observations n is odd, the median M is the center observation in the
ordered list. The position of the center observation can be found at (n + 1) / 2
 If the number of observations n is even, the median M is the mean of the two center
observations in the ordered list. The position of the two middle values are n/2 and n/2 + 1
The Five-Number Summary
The five-number summary of a data set consists of the smallest observation, the first quartile,
the median, the third quartile, and the largest observation, written in order from smallest to
largest. In symbols, the five-number summary is:
Minimum – Q1 – M – Q3 – Maximum
The Quartiles (Q1 and Q3 )
 To calculate the quartiles, arrange the observations in increasing order and locate the
median M in the ordered list of observations.
 The 1st quartile (Q1) is middle number of the values that are less than the median.
 The 3rd quartile (Q3) is the middle number of the values that are greater than the median.
Example
2
14
28
29
30
32
33
34
40
42
52
Min
Q1
Med
Q3
Max
The Interquartile Range (IQR)
The IQR is the distance between the first and third quartiles, IQR = Q3 - Q1
Outliers: The 1.5 x IQR Criterion
Call an observation an outlier if it falls more than 1.5 x IQR below the first quartile or above
the third quartile. Using the 5-number summary from above as an example (IQR = 40-28=12)
 Low outlier cutoff: Q1  1.5 ï‚ī IQR (example: 28 – 1.5(12) = 28 – 18 = 10) Therefore,
the 2 is an outlier.
 High outlier cutoff: Q3 ï€Ŧ 1.5 ï‚ī IQR (example: 40 + 1.5(12) = 40 + 18 = 58) no outlier
517
1.3: Describing Distributions with Numbers
Boxplot
A boxplot is a graph of the five-number summary, with outliers plotted individually.
 A central box spans the quartiles.
 A line in the box marks the median.
 Observations more than 1.5 x IQR outside the central box are plotted individually.
 Lines extend from the box out to the smallest and largest observations, not the outliers.
Example:
The Standard Deviation (S or Sx)
The standard deviation of a set of observations is the average of the squares of the deviations
of the observations from their mean. The formula for the standard deviation of n observations
x1, x2, …, xn is:
sï€―
ïƒĨ ( x  x)
2
i
n 1
Calculation of the Standard Deviation
Consider the data below which has a mean of 4.8:
xi
6
3
8
5
2
Sum
xi – mean
6 – 4.8 = 1.2
3 – 4.8 = -1.8
8 – 4.8 = 3.2
5 – 4.8 = 0.2
2 – 4.8 = -2.8
0
So the standard deviation is
(xi-mean)2
(1.2)2 = 1.44
(-1.8)2 = 3.24
(3.2)2 = 10.24
(0.2)2 = 0.04
(-2.8)2 = 7.84
22.8
22.8 / (5  1) ï€― 22.8 / 4 ï€― 5.7 ï€― 2.387
518
AP Statistics Chapter 2 – Describing Location in a Distribution
2.1: Measures of Relative Standing and Density Curves
Density Curve
A density curve is a curve that
• is always on or above the horizontal axis, and
• has area exactly 1 underneath it.
A density curve describes the overall pattern of a distribution. The area under the curve and
above any range of values is the proportion of all observations that fall in the range.
Example
The density curve below left is a rectangle. The area underneath the curve is 4 i 0.25 = 1.
The figure on the right represents the proportion of data between 2 and 3 ( 1 i 0.25 = 0.25 ).
Median and Mean of a Density Curve
• The median of a density curve is the equal-areas point, the point that divides the area
under the curve in half.
• The mean of a density curve is the balance point, at which the curve would balance if
made of solid material.
• The median and mean are the same for a symmetric density curve. They both lie at the
center of the curve. The mean of a skewed curve is pulled away from the median in the
direction of the long tail.
Normal Distributions
A normal distribution is a curve that is
• mound-shaped and symmetric
• based on a continuous variable
• adheres to the 68-95-99.7 Rule
The 68-95-99.7 Rule
In the normal distribution with mean μ and standard deviation σ:
• 68% of the observations fall within 1σ of the mean μ.
• 95% of the observations fall within 2σ of the mean μ.
• 99.7% of the observations fall within 3σ of the mean μ.
519
2.2: Normal Distributions
Standardizing and z-Scores
If x is an observation from a distribution that has mean μ and standard deviation σ, the
standardized value of x is
z=
x−μ
σ
A standardized value is often called a z-score.
Standard Normal Distribution
• The standard normal distribution is the normal distribution N(0, 1) with mean 0 and
standard deviation 1.
• If a variable x has any normal distribution N(μ, σ) with mean μ and standard deviation σ,
then the standardized variable
x−μ
z=
σ
has the standard normal distribution (see diagram below).
The Standard Normal Table
Table A is a table of areas under the standard normal curve. The table entry for each value z is
the area under the curve to the left of z.
Standard Normal Calculations
Area to the left of z ( Z < z ) Area to the right of z ( Z > z )
Area = Table Entry
Area = 1 – Table Entry
Area between z1 and z2
Area = difference between
Table Entries for z1 and z2
Inverse Normal Calculations
Working backwards from the area, we find z, then x. The value of z is found using Table A in
reverse. The value of x is found, from z, using the formula below
x = μ + z iσ
520
AP Statistics Chapter 3 – Examining Relationships
3.1: Scatterplots and Correlation
Explanatory and Response Variables
A response variable measures an outcome of a study. An explanatory variable attempts to
explain the observed outcomes. The explanatory variable is sometimes referred to as the
independent variable and is typically symbolized by the variable x. The response variable is
sometimes referred to as the dependent variable and is typically symbolized by the variable y.
Scatterplot
A scatterplot shows the relationship between two quantitative variables measured on the same
individuals. The values of the explanatory variable appear on the horizontal axis, and the values
of the response variable appear on the vertical axis. If there is no clear explanatory/response
relationship between the two variables, then either variable can be placed on either axis. Each
individual in the data set appears as a single point in the plot fixed by the values of both variables
for that individual.
Examining a Scatterplot
In any graph of data, look for patterns and deviations from the pattern. Describe the overall
pattern of a scatterplot by the form, direction and strength of the relationship.
 Form can be described as linear or curved.
 Direction can be described as positive or negative or neither.
 Strength can be described as weak, moderate or strong.
A deviation from the overall pattern of a scatterplot is called an outlier.
Association
 Two variables are positively associated if as one increases the other increases.
 Two variables are negatively associated if as one increases the other decreases.
Correlation
Correlation measures the strength and direction of the relationship between two quantitative
variables. Correlation is usually represented by the letter r.
Facts about Correlation
1. When calculating correlation, it makes no difference which variable is x and which is y.
2. Correlation is only calculated for quantitative variables, not categorical.
3. The value of r does not change if the units of x and/or y are changed.
4. Positive r indicates a positive association between x and y. Negative r indicates a
negative association.
5. Correlation is always a number between -1 and +1. Values close to +1 or -1 indicate that
the points lie close to a line. The extreme values of +1 and -1 are only achieved when the
points are perfectly linear.
6. Correlation measures the strength of a linear relationship between two variables, not
curved relationships.
7. Correlation, like the mean and standard deviation, is nonresistant. Recall that this means
that it is greatly affected by outliers.
521
3.2: Least-Squares Regression
Regression Line
A regression line is a straight line that describes how a response variable y changes as an
explanatory variable x changes. The line is often to predict values of y for given values of x.
Regression, unlike correlation, requires an explanatory/response relationship. In other words,
when x and y are reversed, the regression line changes. Recall that correlation is the same no
matter which variable is x and which is y.
Least-Squares Regression Line
The least-squares regression line is the line that makes the sum of the squares of the vertical
distances from the data points to the line as small as possible.
Equation of the Least-Squares Regression Line
To find the equation of the regression line in the form y ï€― a ï€Ŧ bx , where a is the y-intercept and b
is the slope, use the following equations:
bï€―r
sy
sx
and a ï€― y  bx
The Role of r-squared (Coefficient of Determination)
The square of the correlation coefficient, or r-squared, represents the percentage of the change
in the y-variable that can be attributed to its relationship with the x-variable. So if r-squared for
the regression between x and y is .73, we can say that x accounts for 73% of the variation in y.
Residuals
A residual is the difference between an observed value of y and the value predicted by the
regression line. That is, residual = actual y - predicted y.
Residual Plot
A residual plot is a scatterplot of each x-value and its residual value. The residual plot is used to
determine whether a linear equation is a good model for a set of data, as follows:
 If the residual plot exhibits randomness, then a line is a good model for the data (see left)
 If the residual plot exhibits a pattern, then a line is NOT a good model for the data (right)
Outliers and Influential Points
A point that lies outside the overall pattern of the other observations is considered an outlier. If
the removal of such a point has a large effect on the correlation and/or regression, that point is
considered an influential point.
522
AP Statistics Chapter 4 – Designing Studies
4.1: Surveys and Samples
Population, Census and Sample
 The population in a statistical study is the entire group of individuals we want
information about. For example, all registered voters in a given county.
 A census collects data from every individual in the population.
 A sample is a subset of individuals in the population from which we actually collect data.
Bias
The design of a statistical study shows bias if it would consistently underestimate or consistently
overestimate the value you want to know.
Convenience Sampling
A convenience sample chooses the individuals easiest to reach. This will typically result in a
biased sample of like-minded individuals.
Voluntary Response Sample
A voluntary response sample consists of people who choose themselves by responding to a
general invitation. Voluntary response samples show bias because people with strong opinions
(often in the same direction) are most likely to respond.
Simple Random Sample
A simple random sample (SRS) of size n consists of n individuals from the population chosen
in such a way that every set of n individuals has an equal chance to be the sample actually
selected.
Random Digits
A table of random digits is a long string of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these two
properties:
1. Each entry in the table is equally likely to be any of the 10 digits 0 through 9.
2. The entries are independent of each other. That is, knowledge of one part of the table
gives no information about any other part.
Choosing an SRS
Choose an SRS in two steps:
Step 1: Label. Assign a numerical label to every individual in the population.
Step 2: Table. Use Table B to select labels at random.
523
Stratified Random Sample
To get a stratified random sample, start by classifying the population into groups of similar
individuals, called strata. Then choose a separate SRS in each stratum and combine these SRSs
to form the sample.
Cluster Sample
To get a cluster sample, start by classifying the population into groups of individuals that are
located near each other, called clusters. Then choose an SRS of the clusters. All individuals in
the chosen clusters are included in the sample.
Forms of Bias in Surveys and Samples
 Undercoverage occurs when some members of the population cannot be chosen in a
sample.
 Nonresponse occurs when an individual chosen for the sample can’t be contacted or
refuses to participate.
 A systematic pattern of incorrect responses in a sample survey leads to response bias.
 The wording of questions is the most important influence on the answers given to a
sample survey.
4.2: Experiments
Observational Study vs Experiment
 An observational study observes individuals and measures variables of interest but does
not attempt to influence the responses.
 An experiment deliberately imposes some treatment on individuals to measure their
responses.
 When our goal is to understand cause and effect, experiments are the only source of fully
convincing data. The distinction between observational study and experiment is one of
the most important in statistics.
Confounding occurs when two variables are associated in such a way that their effects on a
response variable cannot be distinguished from each other. Observational studies often fail to
provide valid causal links between variables due to confounding
The Language of Experiments
A specific condition applied to the individuals in an experiment is called a treatment. If an
experiment has several explanatory variables, a treatment is a combination of specific values of
these variables.
The experimental units are the smallest collection of individuals to which treatments are
applied. When the units are human beings, they often are called subjects.
524
Principles of Experimental Design
The basic principles for designing experiments are as follows:
1. Comparison. Use a design that compares two or more treatments.
2. Random assignment. Use chance to assign experimental units to treatments. Doing so
helps create roughly equivalent groups of experimental units by balancing the effects of
other variables among the treatment groups.
3. Control. Keep other variables that might affect the response the same for all groups.
4. Replication. Use enough experimental units in each group so that any differences in the
effects of the treatments can be distinguished from chance differences between the
groups.
Statistical Significance
 An observed effect so large that it would rarely occur by chance is called statistically
significant.
 A statistically significant association in data from a well-designed experiment does imply
causation.
Completely Randomized Design
 In a completely randomized design, the treatments are assigned to all the experimental
units completely by chance.
 Some experiments may include a control group that receives an inactive treatment or an
existing baseline treatment.
 The response to a dummy treatment is called the placebo effect.
 In a double-blind experiment, neither the subjects nor those who interact with them and
measure the response variable know which treatment a subject received.
Block Design
A block is a group of experimental units that are known before the experiment to be similar in
some way that is expected to affect the response to the treatments.
In a randomized block design, the random assignment of experimental units to treatments is
carried out separately within each block.
Matched Pairs Design
 A matched pairs design is a randomized blocked experiment in which each block
consists of a matching pair of similar experimental units.
 Chance is used to determine which unit in each pair gets each treatment.
 Sometimes, a “pair” in a matched-pairs design consists of a single unit that receives both
treatments. Since the order of the treatments can influence the response, chance is used to
determine with treatment is applied first for each unit.
525
AP Statistics Chapter 5 – Probability: What are the Chances?
5.1: Randomness, Probability and Simulation
Probability
The probability of any outcome of a chance process is a number between 0 and 1 that describes
the proportion of times the outcome would occur in a very long series of repetitions.
Simulation
The imitation of chance behavior, based on a model that accurately reflects the situation, is
called a simulation.
Performing of a Simulation – The 4-Step Process
1. State: Ask a question of interest about some chance process.
2. Plan: Describe how to use a chance device to imitate one repetition of the process. Tell
what you will record at the end of each repetition.
3. Do: Perform many repetitions of the simulation.
4. Conclude: Use the results of your simulation to answer the question of interest.
5.2: Probability Rules
Sample Space
The sample space S of a chance process is the set of all possible outcomes.
Probability Models
Descriptions of chance behavior contain two parts:
A probability model is a description of some chance process that consists of two parts:
• a sample space S and
• a probability for each outcome.
For example: When a fair 6-sided die is rolled, the Sample Space is S = {1, 2, 3, ,4,5, 6}.
The probability for a fair die would include the probabilities of these outcomes, which are all the
same.
Outcome
Probability
1
1/6
2
1/6
3
1/6
4
1/6
5
1/6
6
1/6
Event
An event is any collection of outcomes from some chance process. That is, an event is a subset
of the sample space. Events are usually designated by capital letters, like A, B, C, and so on.
For example: For the probability model above we might define event A = die roll is odd. The
elements of the sample space S that fits this event are {1, 3, 5}. The probability of the event A,
written as P(A) is the 3/6 or ½. So we would write P(A) = 0.5, in decimal form.
526
The Basic Rules of Probability
• For any event A, 0 ≤ P(A) ≤ 1.
• If S is the sample space in a probability model, P(S) = 1.
• In the case of equally likely outcomes,
number of outcomes corresponding to event A
P( A) ï€―
total number of outcomes in sample space
C
• Complement rule: P(A ) = 1 – P(A)
• Addition rule for mutually exclusive events: If A and B are mutually exclusive,
P(A or B) = P(A) + P(B). Also be familiar with the notation: 𝑷(ð‘Ļ ∪ ð‘Đ).
Mutually Exclusive Events
Two events A and B are mutually exclusive (or disjoint) if they have no outcomes in common
and so can never occur together—that is, if P(A and B ) = 0. Alternate notation: 𝑷(ð‘Ļ ∩ ð‘Đ).
For example: Using a deck of playing cards and drawing a card at random, the events A = card
is a King, and B = card is a Queen are mutually exclusive because a single card cannot be both a
King and a Queen. Thus we can calculate the probability of A or B as the sum of their individual
probabilities - P(A or B) = P(A) + P(B).
General Addition Rule
If A and B are any two events resulting from some chance process, then
P(A or B) = P(A) + P(B) – P(A and B)
Venn Diagrams and Probability
The complement Ac contains exactly the
outcomes that are not in A.
The events A and B are mutually exclusive
(disjoint) because they do not overlap. That
is, they have no outcomes in common.
The intersection of events A and B (A ∩ B)
is the set of all outcomes in both events A
and B.
The union of events A and B (A ∪ B) is the
set of all outcomes in either event A or B.
527
5.3: Conditional Probability and Independence
Conditional Probability
The probability that one event happens given that another event is already known to have
happened is called a conditional probability.
Suppose we know that event A has happened. Then the probability that event B happens given
that event A has happened is denoted by P(B | A). The symbol “” is read as “given that,” so we
read P(B | A) as the probability that B occurs given that A has already occurred.
Calculating Conditional Probability
To find the conditional probability P(A | B), use the formula
P( A | B) ï€―
P( A  B)
P( B)
The conditional probability P(B | A) is given by
P( B | A) ï€―
P( B  A)
P( A)
The General Multiplication Rule
The probability that events A and B both occur can be found using the general multiplication rule
P(A ∩ B) = P(A) • P(B | A),
where P(B | A) is the conditional probability that event B occurs given that event A has already
occurred.
Conditional Probability and Independence
Two events A and B are independent if the occurrence of one event does not change the
probability that the other event will happen. In other words, events A and B are independent if
P(A | B) = P(A) and P(B | A) = P(B).
The Multiplication Rule for Independent Events
If A and B are independent events, then the probability that A and B both occur is
P(A ∩ B) = P(A) • P(B)
528
AP Statistics Chapter 6 – Discrete, Binomial & Geometric Random Variables
6.1: Discrete Random Variables
Random Variable
A random variable is a variable whose value is a numerical outcome of a random phenomenon.
Discrete Random Variable
A discrete random variable X has a countable number of possible values. Generally, these values
are limited to integers (whole numbers). The probability distribution of X lists the values and
their probabilities.
Value of X
x1
x2
x3
…
xk
Probability
p1
p2
p3
…
pk
The probabilities pi must satisfy two requirements:
1. Every probability pi is a number between 0 and 1.
2. p1 + p2 + … + pk = 1
Find the probability of any event by adding the probabilities pi of the particular values xi that
make up the event.
Continuous Random Variable
A continuous random variable X takes all values in an interval of numbers and is measurable.
Mean (Expected Value) of A Discrete Random Variable
Suppose that X is a discrete random variable whose distribution is
Value of X
x1
x2
x3
…
xk
Probability
p1
p2
p3
…
pk
To find the mean of X, multiply each possible value by its probability, then add all the products:
𝝁𝒙 = 𝑎(𝒙) = ∑ 𝒙𝒊 ∙ 𝒑𝒊 = 𝒙𝟏 ∙ 𝒑𝒊 + 𝒙𝟐 ∙ 𝒑𝟐 + â‹Ŋ + 𝒙𝒌 ∙ 𝒑𝒌
529
6.3: The Binomial Distributions
A binomial probability distribution occurs when the following requirements are met.
1.
2.
3.
4.
Each observation falls into one of just two categories – call them “success” or “failure.”
The procedure has a fixed number of trials – we call this value n.
The observations must be independent – result of one does not affect another.
The probability of success – call it p - remains the same for each observation.
Notation for binomial probability distribution
n denotes the number of fixed trials
k denotes the number of successes in the n trials
p denotes the probability of success
1 – p denotes the probability of failure
Binomial Probability Formula
P( X ï€― k ) ï€―
n!
( p) k (1  p) n  k
k!(n  k )!
How to use the TI-83/4 to compute binomial probabilities *
There are two binomial probability functions on the TI-83/84, binompdf and binomcdf
binompdf is a probability distribution function and determines P( X ï€― k )
binomcdf is a cumulative distribution function and determines P( X ï‚Ģ k )
*Both functions are found in the DISTR menu (2nd-VARS)
Probability
Calculator Command
Example (assume n = 4, p = .8)
P( X ï€― k )
binompdf(n, p, k)
P( X ï€― 3) = binompdf(4, .8, 3)
P( X ï‚Ģ k )
binomcdf(n, p, k)
P( X ï‚Ģ 3) = binomcdf(4, .8, 3)
P( X  k )
binomcdf(n, p, k - 1)
P( X  3) = binomcdf(4, .8, 2)
P( X ï€ū k )
1 – binomcdf(n, p, k)
P( X ï€ū 3) = 1 – binomcdf(4, .8, 3)
P( X ï‚ģ k )
1 – binomcdf(n, p, k - 1)
P( X ï‚ģ 3) = 1 – binomcdf(4, .8, 2)
Mean (expected value) of a Binomial Random Variable
Formula:  ï€― np
Meaning: Expected number of successes in n trials (think average)
Example: Suppose you are a 80% free throw shooter. You are going to shoot 4 free throws.
For n = 4, p = .8,  ï€― (4)(.8) ï€― 3.2 , which means we expect 3.2 makes out of 4 shots, on average
530
6.3: The Geometric Distributions
A geometric probability distribution occurs when the following requirements are met.
1. Each observation falls into one of just two categories – call them “success” or “failure.”
2. The observations must be independent – result of one does not affect another.
3. The probability of success – call it p - remains the same for each observation.
4. The variable of interest is the number of trials required to obtain the first success.*
* As such, the geometric is also called a “waiting-time” distribution
Notation for geometric probability distribution
n denotes the number of trials required to obtain the first success
p denotes the probability of success
1 – p denotes the probability of failure
Geometric Probability Formula
P( X ï€― n) ï€― (1  p)n  1( p)
How to use the TI-83/4 to compute geometric probabilities *
There are two geometric probability functions on the TI-83/84, geometpdf and geometcdf
geometpdf is a probability distribution function and determines P( X ï€― n)
geometcdf is a cumulative distribution function and determines P( X ï‚Ģ n)
*Both functions are found in the DISTR menu (2nd-VARS)
Probability
Calculator Command
Example (assume p = .8, n = 3)
P( X ï€― n)
geometpdf (p, n)
P( X ï€― 3) = geometpdf(.8, 3)
P( X ï‚Ģ n)
geometcdf(p, n)
P( X ï‚Ģ 3) = geometcdf(.8, 3)
P( X  n)
geometcdf(p, n-1)
P( X  3) = geometcdf(.8, 2)
P( X ï€ū n)
1 – geometcdf(p, n)
P( X ï€ū 3) = 1 – geometcdf(.8, 3)
P( X ï‚ģ n)
1 – geometcdf(p, n-1)
P( X ï‚ģ 3) = 1 – geometcdf( .8, 2)
Mean (expected value) of a Geometric Random Variable
Formula:  ï€―
1
p
Meaning: Expected number of n trials to achieve first success (average)
Example: Suppose you are a 80% free throw shooter. You are going to shoot until you make.
For p = .8,  ï€―
1
ï€― 1.25 , which means we expect to take 1.25 shots, on average, to make first
.8
531
AP Statistics – Chapter 7 Notes: Sampling Distributions
7.1 – What is a Sampling Distribution?
Parameter – A parameter is a number that describes some characteristic of the population
Statistic – A statistic is a number that describes some characteristic of a sample
Symbols
used
Proportions
Means
Sample
Statistic
pĖ‚
x
Population
Parameter
p

Sampling Distribution – the distribution of all values taken by a statistic in all possible samples of the same
size from the same population
A statistic is called an unbiased estimator of a parameter if the mean of its sampling distribution is equal to
the parameter being estimated
Important Concepts for unbiased estimators
 The mean of a sampling distribution will always equal the mean of the population for any sample size
 The spread of a sampling distribution is affected by the sample size, not the population size.
Specifically, larger sample sizes result in smaller spread or variability.
7.2 – Sample Proportions
7.3 – Sample Means
Choose an SRS of size n from a large population with Suppose that x is the mean of a sample from a large
population proportion p having some characteristic of population with mean  and standard deviation ïģ .
interest.
Then the mean and standard deviation of the
sampling distribution of x are
Let 𝑝Ė‚ be the proportion of the sample having that
𝜎
characteristic. Then the mean and standard deviation
Mean: 𝜇ð‘ĨĖ… = 𝜇
Std. Dev.: 𝜎ð‘ĨĖ… =
of the sampling distribution of 𝑝Ė‚ are
√𝑛
Mean: 𝜇𝑝Ė‚ = 𝑝 Std. Dev.:
With the Z-Statistic: 𝑧
=
𝜎𝑝Ė‚ = √
𝑝(1−𝑝)
With the Z-Statistic: 𝑧
𝑛
𝑝Ė‚−𝑝
√
𝑝(1−𝑝)
𝑛
CONDITIONS FOR NORMALITY
The 10% Condition
Use the formula for the standard deviation of pĖ‚ only
when the size of the sample is no more than 10% of
1
the population size (𝑛 ≤ 𝑁).
10
The Large Counts Condition
We will use the normal approximation to the
sampling distribution of pĖ‚ for values of n and p that
satisfy np ï‚ģ 10 and n(1  p) ï‚ģ 10 .
ð‘ĨĖ… −𝜇
=𝜎
⁄ 𝑛
√
CONDITIONS FOR NORMALITY
If an SRS is drawn from a population that has the
normal distribution with mean  and standard
deviation ïģ , then the sample mean x will have the
normal distribution N ( , ïģ n ) for any sample size.
Central Limit Theorem
If an SRS is drawn from any population with mean
 and standard deviation ïģ , when n is large
(n ï‚ģ 30) , the sampling distribution of the sample
mean x will have the normal distribution
N ( , ïģ n ) .
532
AP Statistics – Chapter 8 Notes: Estimating with Confidence
8.1 – Confidence Interval Basics
Point Estimate
A point estimator is a statistic that provides an estimate of a population parameter. The value of that statistic
from a sample is called a point estimate.
The Idea of a Confidence Interval
A C% confidence interval gives an interval of plausible values for a parameter. The interval is calculated from
the data and has the form: point estimate ± margin of error
The difference between the point estimate and the true parameter value will be less than the margin of error in
C% of all samples.
The confidence level C gives the overall success rate of the method for calculating the confidence interval.
That is, in C% of all possible samples, the method would yield an interval that captures the true parameter
value.
Interpreting Confidence Intervals
To interpret a C% confidence interval for an unknown parameter, say, “We are C% confident that the interval
from _____ to _____ captures the actual value of the [population parameter in context].”
Interpreting Confidence Levels
To say that we are 95% confident is shorthand for “If we take many samples of the same size from this
population, about 95% of them will result in an interval that captures the actual parameter value.”
8.2 – Estimating a Population Proportion
Conditions for Inference about a Population Proportion
 Random Sample - The data are a random sample from the population of interest.
1
 10% Rule - The sample size is no more than 10% of the population size: 𝑛 ≤ 𝑁

10
Large Counts - Counts of successes and failures must be 10 or more: 𝑛𝑝Ė‚ ≥ 10 and 𝑛(1 − 𝑝Ė‚ ) ≥ 10
Ė‚ is
Standard Error of a Sample Proportion 𝒑
√
Ė‚(𝟏 − 𝒑
Ė‚)
𝒑
𝒏
One-Proportion z-interval
The form of the confidence interval for a population proportion is
Ė‚ ± 𝒛∗ √
𝒑
Ė‚(𝟏 − 𝒑
Ė‚)
𝒑
𝒏
Sample size for a desired margin of error
To determine the sample size (n) for a given margin of error m in a 1-proportion z interval, use formula
2
ïƒĶ z* ïƒķ
n = p (1  p )  
ïƒĻmïƒļ
where p* = 0.5, unless another value is given. Remember, that we will always round up to ensure a slightly
smaller margin of error than is required.
*
*
533
8.3 – Estimating a Population Mean
Conditions for Inference about a Population Mean
 Random Sample - The data are a random sample from the population of interest.
1
 10% Rule - The sample size is no more than 10% of the population size: 𝑛 ≤ 𝑁

10
Large Counts/Normality – If the sample size is large (𝑛 ≥ 30), then we can assume normality for any
shape of distribution. When sample is smaller than 30, the t procedures can be used except in the
presence of outliers or strong skewness. Construct a quick graph of the data to make an assessment.
Standard Error
When the standard deviation of a statistic is estimated from the data, the result is called the standard error of
the statistic. The standard error of the sample mean is
𝑠
√𝑛
One-Sample t-Interval for Estimating a Population Mean
The form of the confidence interval for a population mean with n  1 degrees of freedom is
ð‘ĨĖ… ± ð‘Ą ∗
𝑠
√𝑛
Paired Differences t-interval
To compare the responses to the two treatments in a paired data design, apply the one-sample t procedures to
the observed differences.
For example, suppose that pre and post test scores for 10 individuals in a summer reading program are:
Subject
Pre-test
Post-test
Difference
1
25
28
3
2
31
30
-1
3
28
34
6
4
27
35
8
5
30
32
2
6
31
31
0
7
22
26
4
8
18
16
-2
9
24
28
4
We would then use the data in the “difference” row and perform one-sample t analysis on it.
534
10
30
36
6
AP Statistics – Chapter 9 Notes: Testing a Claim
9.1: Significance Test Basics
Null and Alternate Hypotheses
The statement that is being tested is called the null hypothesis (H0). The significance test is
designed to assess the strength of the evidence against the null hypothesis. Usually the null
hypothesis is a statement of "no effect," "no difference," or no change from historical values.
The claim about the population that we are trying to find evidence for is called the alternative
hypothesis (Ha). Usually the alternate hypothesis is a statement of "an effect," "a difference,"
or a change from historical values.
Test Statistics
To assess how far the estimate is from the parameter, standardize the estimate. In many
common situations, the test statistics has the form
estimate  parameter
test statistic =
standard deviation of the estimate
P-value
The p-value of a test is the probability that we would get this sample result or one more
extreme if the null hypothesis is true. The smaller the p-value is, the stronger the evidence
against the null hypothesis provided by the data.
Statistical Significance
If the P-value is as small as or smaller than alpha, we say that the data are statistically
significant at level alpha. In general, use alpha = 0.05 unless otherwise noted.
A Plan for Carrying out a Significance Test:
1.
2.
3.
4.
Hypotheses: State the null and alternate hypotheses
Conditions: Check conditions for the appropriate test
Calculations: Compute the test statistic and use it to find the p-value
Interpretation: Use the p-value to state a conclusion, in context, in a sentence or two
Type I and Type II Errors
There are two types of errors that can be made using inferential techniques. In both cases, we
get a sample that suggests we arrive at a given conclusion (either for or against H0).
Sometimes we get a bad sample that doesn’t reveal the truth.
Here are the two types of errors:
Type I – Rejecting the Ho when it is actually true (a false positive)
Type II – Accepting the Ho when it is actually false (a false negative)
Be prepared to write, in sentence form, the meaning of a Type I and Type II error in the
context of the given situation. The probability of a Type I error is the same as alpha, the
significance level. You will not be asked to find the probability of a Type II error.
535
9.2: Tests about a Population Proportion
Z-test for a Population Proportion (one-proportion z-test)
1. Hypotheses: H0: 𝑝 = 𝑝0 ; Ha: 𝑝 < 𝑝0 or 𝑝 > 𝑝0 or 𝑝 ≠ 𝑝0
2. Conditions:
o Random – does the data come from a random sample?
o Independent – is the sample size less than 10% of the population size?
o Normal – Are np0 and n(1  p0 ) both at least 10?
3. Test-Statistic: 𝑧 =
𝑝Ė‚−𝑝0
𝑝 (1−𝑝0 )
√ 0
𝑛
where 𝑝Ė‚ is the sample proportion
P-value: The P-value is based on a normal z-distribution. This value can be estimated
using Table A or found accurately using the 1-Prop Z-test function on your calculator
4. Conclusion: If P < ïĄ, then Reject the H0, otherwise Fail to Reject H0.
9.3: Tests about a Population Mean
T-test for a Population Mean
1. Hypotheses: H0: = 0; Ha:  < 0 or  > 0 or  ï‚đ 0
2. Conditions:
o Random – does the data come from a random sample?
o Independent – is the sample size less than 10% of the population size?
o Normal – Is it given or is there a large sample size ( n ï‚ģ 30 )?
ð‘ĨĖ… −𝜇
3. Test-Statistic: ð‘Ą = 𝑠 0where s is the sample standard deviation
⁄ 𝑛
√
P-value: The P-value is based on a t-distribution with n  1 degrees of freedom. This
value can be estimated using Table C or found accurately using the T-test function on
your calculator
4. Conclusion: If P < ïĄ, then Reject the H0, otherwise Fail to Reject H0.
Paired Differences T-test
To compare the responses to the two treatments in a paired data design, apply the one-sample t
procedures to the observed differences.
For example, suppose that pre and post test scores for 10 individuals in a summer reading
program are:
Subject
Pre-test
Post-test
Difference
1
25
28
3
2
31
30
-1
3
28
34
6
4
27
35
8
5
30
32
2
6
31
31
0
7
22
26
4
8
18
16
-2
9
24
28
4
We would use the data in the differences row and perform one-sample t analysis on it.
536
10
30
36
6
AP Statistics – Chapter 10 Notes: Comparing Two Population Parameters
10.1: Comparing Two Proportions
Conditions for Comparing Two Proportions
 Random– We have two random samples, from two distinct populations
 Independence – Each sample must be selected independently of the other (no pairing or
matching) and each distinct population size must be 10 times greater than their samples.
 Normality – Counts of all “successes” and “failures” are at least 10.
Two-Proportion z Confidence Interval
To estimate the difference between two population proportions ( p1  p2 ) use the formula
𝑝Ė‚1 (1 − 𝑝Ė‚1 ) 𝑝Ė‚2 (1 − 𝑝Ė‚ 2 )
(𝑝Ė‚1 − 𝑝Ė‚ 2 ) ± 𝑧 ∗ √
+
𝑛1
𝑛2
Two-Proportion z-Test
To test the hypothesis H0: p1 ï€― p2 , compute the two-proportion z statistic
𝑝Ė‚1 − 𝑝Ė‚ 2
𝑧=
𝑝Ė‚ (1 − 𝑝Ė‚𝑐 ) 𝑝Ė‚𝑐 (1 − 𝑝Ė‚𝑐 )
√ 𝑐
+
𝑛1
𝑛2
Where 𝑝Ė‚𝑐 =
ð‘Ĩ1 +ð‘Ĩ2
𝑛1 +𝑛2
given that 𝑝Ė‚1 =
ð‘Ĩ1
𝑛2
and 𝑝Ė‚2 =
ð‘Ĩ2
𝑛2
10.2: Comparing Two Means
Two-Sample Problems
 The goal of inference is to compare the responses to two treatments or to compare the
characteristics of two populations.
 We have a separate sample from each treatment or each population.
Conditions for Comparing Two Means
 Random – We have two random samples, from two distinct populations
 Independence – Each sample must be selected independently of the other (no pairing or
matching) and each distinct population size must be 10 times greater than their samples.
 Normality – Both populations are normally distributed or 𝑛1 ≥ 30 𝑎𝑛𝑑 𝑛2 ≥ 30 .
Two-Sample t Confidence Interval
To estimate the difference between two population means ( 1  2 ) use the formula
( x1  x 2 ) ï‚ą t
*
s12 s2 2
ï€Ŧ
n1 n2
Two-Sample t-Test
To test the hypothesis H0: 1 ï€― 2 , compute the two-sample t statistic
tï€―
x1  x 2
s12 s2 2
ï€Ŧ
n1 n2
537
AP Statistics Chapter 12 Inference for Distributions of Categorical Data
12.1 – Chi-Square (𝜒2) Goodness of Fit Test
Goodness of Fit
A goodness of fit test is used to help determine whether a population has a certain hypothesized
distribution, expressed as proportions of individuals in the population falling into various outcome
categories. There are two types of goodness of fit tests
1. Equal Proportions (all proportions are expected to be the same)
2. Fixed or Given Proportions (proportions are expected to follow given values)
Hypotheses for the Goodness of Fit Test
Ho: The stated distribution of the categorical variables in the population of interest is correct.
Ha: The stated distribution in the population of interest is not correct.
The Chi-Square Statistic
The formula is
(𝑜𝑏𝑠𝑒𝑟ð‘Ģ𝑒𝑑 𝑐𝑜ð‘Ēð‘›ð‘Ą − 𝑒ð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘ 𝑐𝑜ð‘Ēð‘›ð‘Ą)2
(𝑂 − ðļ)2
𝜒 = ∑
=∑
𝑒ð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘ 𝑐𝑜ð‘Ēð‘›ð‘Ą
ðļ
2
Expected Counts


The expected counts for the equal proportions GOF test are all the same. They are found by
dividing the total of the counts by the number of outcome categories.
The expected counts for the given proportions GOF test are NOT all the same. They are found by
multiplying the total of the counts by each given percentage.
Conditions for the Chi-Square Goodness of Fit Test



Random – The data come from a well-designed random sample or randomized experiment.
Independent – is the sample size less than 10% of the population size?
Large Counts – All expected counts are at least 5.
Degrees of Freedom for the Goodness of Fit Test
The degrees of freedom for the GOF test are n  1 where n is the number of outcome categories.
Using the Calculator for the 𝜒 2 Test on the TI-83 Calculator*




The observed counts are to be stored in L1
The expected counts are to be stored in L2
Let L3 = (L1-L2)2/L2
The 𝜒 2 is the sum of L3 (which can be found by using 1-Var Stats)
*The TI-84 has a built-in test called 𝜒 2 GOF-Test. Find it in the STAT-TEST menu.
538
12.1 – The 𝜒2 Test of Association/Independence for Two-Way Tables
Two-Way Tables
When there are two categorical variables, data can be arranged in a row and column format, called a
Two-Way Table. Here is an example:
Color Choice
Grade
1st
blue
13
green
7
red
8
yellow
2
Totals
30
2nd
11
10
6
5
33
Totals
24
17
14
7
63
Hypotheses for the Test for Association between Two Categorical Variables
Ho: There is no association between the categorical variables
Ha: There is an association
OR
Ho: The categorical variables are independent (there is no association)
Ha: The variables are NOT independent (there is an association)
Degrees of Freedom for the Chi-Square Test of Association/Independence
The degrees of freedom for the Test of Association/Independence test are (r  1) x(c  1) where r is the
number of rows and c is the number of columns in the table.
Conditions for the Chi-Square Test of Association/Independence
The conditions are the same as for the goodness of fit test
The Chi-Square Statistic
The formula is the same as in the goodness of fit test.
Expected Counts
The expected counts for this test can be found as follows:
𝑒ð‘Ĩð‘ð‘’ð‘ð‘Ąð‘’ð‘‘ 𝑐𝑜ð‘Ēð‘›ð‘Ą =
(𝑟𝑜ð‘Ī ð‘Ąð‘œð‘Ąð‘Žð‘™)(𝑐𝑜𝑙ð‘Ē𝑚𝑛 ð‘Ąð‘œð‘Ąð‘Žð‘™)
ð‘Ąð‘Žð‘ð‘™ð‘’ ð‘Ąð‘œð‘Ąð‘Žð‘™
For example, for the expected count for 2nd grade/green in the table above, we would use the
calculation
(33)(17)
= 8.9
63
539
AP Statistics Chapter 12: More about Regression
122 – Inference for Linear Regression
Sample Computer Output for a Linear Data Analysis



For the above, the linear equation is y = 7.0647 + 0.36583x
The Standard Error of the slope (SEb) = 0.01048
S = the Standard Deviation of the Residuals. Since S = 0.544, predictions of y from x
based on this regression model will be off by an average of about 0.544.
Confidence Interval for the Slope of a Regression Line
The confidence interval for β has the familiar form
statistic ± (critical value) · (standard deviation of statistic)
The t Interval for the slope ð›―:
𝑏 ± ð‘Ą ∗ 𝑆ðļ𝑏
Where b is the slope, SEb is the standard error of the slope, and t is the critical value with df = n – 2.
Performing a Significance Test for the Slope
Ho: ð›― = ð›―0 (some hypothesized value – often 0)
Ha: either ð›― < ð›―0 or ð›― > ð›―0 or ð›― ≠ ð›―0
Test Statistic: ð‘Ą
=
𝑏−ð›―0
𝑆ðļ𝑏
P-Value: Use the t distribution with df = n - 2
12.2 – Transformations to Achieve Linearity
Finding an Exponential Model for Data
Form: 𝒚 = ð‘Ļ(ð‘Đ)𝒙
Transformation: (x, log y)
Process:
1. LinReg(x, log y)
2. Resulting line is y = a + bx
3. Let A = 10a and B = 10b
Finding a Power Model for Data
Form: 𝒚 = ð‘Ļ(𝒙)ð‘Đ
Transformation: (log x, log y)
Process:
1. LinReg(log x, log y)
2. Resulting line is y = a + bx
3. Let A = 10a and B = b
540
Formulas for AP Statistics
I. Descriptive Statistics
ï€―
x
1
ï€―
ïƒĨ xi
n
ïƒĨx
i
n
ïƒĨï€Ļx
i
xï€Đ
2
n 1
yï€― a ï€Ŧ bx
yĖ‚ï€― a ï€Ŧ bx
rï€―
1
2
ï€Ļ xï€―
ïƒĨ
i xï€Đ
n 1
sx
ï€―
ïƒĶ xi  x ïƒķ ïƒĶ yi  y ïƒķ
1



ïƒĨ
n  1 ïƒĻ sx ïƒļ ïƒĻ s y ïƒļ
bï€―r
sy
sx
II. Probability and Distributions
P ï€Ļ A  Bï€―
ï€Đ P ï€Ļ Aï€Đ ï€Ŧ P ï€Ļ B ï€Đ  P ï€Ļ A  B ï€Đ
Probability Distribution
P ( A | B) ï€―
P ( A  B)
P ( B)
Mean
Standard Deviation
μ X = E ( X ) = ïƒĨ xi ⋅ P ( x i )
Discrete random variable, X
If X has a binomial distribution with
parameters n and p, then:
σX =
 X ï€― np
ïƒĨ(x
ï€―
ïģX
− μ X ) ⋅ P ( xi )
2
i
np ï€Ļ1  p ï€Đ
ïƒĶn ïƒķ
n x
P ï€Ļ Xï€― xï€―
ï€Đ  x  p x ï€Ļ1  p ï€Đ
ïƒĻ ïƒļ
where x ï€― 0, 1, 2, 3,  , n
If X has a geometric distribution with
parameter p , then:
P ï€Ļ Xï€― x ï€―
ï€Đ
where x
ï€Ļ1  p ï€Đ
x 1
X ï€―
p
1
p
ïģX ï€―
1 p
p
ï€― 1, 2, 3, 
III. Sampling Distributions and Inferential Statistics
Standardized test statistic:
statistic − parameter
standard error of the statistic
Confidence interval: statistic ± ( critical value )( standard error of statistic )
2
Chi-square statistic: χ = ïƒĨ
( observed − expected )
2
expected
Appendix V.1 | 254
Return to Table of Contents
AP Statistics Course and Exam Description
© 2019 College Board
541
III. Sampling Distributions and Inferential Statistics (continued)
Sampling distributions for proportions:
Random Variable
For one population:
pĖ‚
For two populations:
pˆ1 − pˆ2
Parameters of Sampling Distribution
 pĖ‚ ï€― p
p ï€Ļ1  p ï€Đ
ïģ pˆ ï€―
1
2
1
p1 ï€Ļ1  p1 ï€Đ
ï€Ŧ
n1
2
pˆ ï€Ļ1  pˆ ï€Đ
s pˆ ï€―
n
 pˆ  pˆï€― p1  pï€―
ïģ pˆ  pˆ
2
Standard Error* of
Sample Statistic
p2 ï€Ļ1  p2 ï€Đ
n2
n
pˆ1 ï€Ļ1  pˆ1 ï€Đ
ï€―
s pˆ1  pˆ2
When
n1
ï€Ŧ
pˆ 2 ï€Ļ1  pˆ 2 ï€Đ
n2
p1 ï€― p2 is assumed:
ïƒĶ1 1ïƒķ
s pˆ1 − pˆ2 = pˆc (1 − pˆc )  + 
ïƒĻ n1 n2 ïƒļ
X ï€Ŧ X2
ˆc ï€― 1
where p
n1 ï€Ŧ n2
Sampling distributions for means:
Random Variable
Parameters of Sampling Distribution
For one population:
X ï€― 
ïģX ï€―
 X  Xï€― 1  2
ïģ Xï€―
X
X
For two populations:
X1  X2
1
2
1
Standard Error* of
Sample Statistic
ïģ
s
sX ï€―
n
ïģ 12
n1
2
ï€Ŧ
ïģ 22
s X1 ï€―
 X2
n2
n
s12 s22
ï€Ŧ
n1 n2
Sampling distributions for simple linear regression:
Random Variable
Parameters of Sampling Distribution
For slope:
b ï€― ïĒ
b
ïģb ï€―
ïģ
ïģx n
where ïģ x
ï€―
,
Standard Error* of
Sample Statistic
sb ï€―
ïƒĨï€Ļx
i
xï€Đ
n
s
sx n  1
2
where
and
sï€―
sx ï€―
,
ïƒĨï€Ļ y
i
ï€Đ 2
 yi ï€Đ
n2
ïƒĨï€Ļx
i
xï€Đ
2
n 1
*Standard deviation is a measurement of variability from the theoretical population. Standard error is the estimate of the standard deviation. If the standard
deviation of the statistic is assumed to be known, then the standard deviation should be used instead of the standard error.
Appendix V.1 | 255
Return to Table of Contents
AP Statistics Course and Exam Description
© 2019 College Board
542
2004 AP® STATISTICS FREE-RESPONSE QUESTIONS
Probability
Table entry for z is the
probability lying below z.
Table A
z
Standard normal probabilities
z
.00
.01
.02
.03
.04
.05
.06
.07
.08
.09
– 3.4
– 3.3
– 3.2
– 3.1
– 3.0
– 2.9
– 2.8
– 2.7
– 2.6
– 2.5
– 2.4
– 2.3
– 2.2
– 2.1
– 2.0
– 1.9
– 1.8
– 1.7
– 1.6
– 1.5
– 1.4
– 1.3
– 1.2
– 1.1
– 1.0
– 0.9
– 0.8
– 0.7
– 0.6
– 0.5
– 0.4
– 0.3
– 0.2
– 0.1
– 0.0
.0003
.0005
.0007
.0010
.0013
.0019
.0026
.0035
.0047
.0062
.0082
.0107
.0139
.0179
.0228
.0287
.0359
.0446
.0548
.0668
.0808
.0968
.1151
.1357
.1587
.1841
.2119
.2420
.2743
.3085
.3446
.3821
.4207
.4602
.5000
.0003
.0005
.0007
.0009
.0013
.0018
.0025
.0034
.0045
.0060
.0080
.0104
.0136
.0174
.0222
.0281
.0351
.0436
.0537
.0655
.0793
.0951
.1131
.1335
.1562
.1814
.2090
.2389
.2709
.3050
.3409
.3783
.4168
.4562
.4960
.0003
.0005
.0006
.0009
.0013
.0018
.0024
.0033
.0044
.0059
.0078
.0102
.0132
.0170
.0217
.0274
.0344
.0427
.0526
.0643
.0778
.0934
.1112
.1314
.1539
.1788
.2061
.2358
.2676
.3015
.3372
.3745
.4129
.4522
.4920
.0003
.0004
.0006
.0009
.0012
.0017
.0023
.0032
.0043
.0057
.0075
.0099
.0129
.0166
.0212
.0268
.0336
.0418
.0516
.0630
.0764
.0918
.1093
.1292
.1515
.1762
.2033
.2327
.2643
.2981
.3336
.3707
.4090
.4483
.4880
.0003
.0004
.0006
.0008
.0012
.0016
.0023
.0031
.0041
.0055
.0073
.0096
.0125
.0162
.0207
.0262
.0329
.0409
.0505
.0618
.0749
.0901
.1075
.1271
.1492
.1736
.2005
.2296
.2611
.2946
.3300
.3669
.4052
.4443
.4840
.0003
.0004
.0006
.0008
.0011
.0016
.0022
.0030
.0040
.0054
.0071
.0094
.0122
.0158
.0202
.0256
.0322
.0401
.0495
.0606
.0735
.0885
.1056
.1251
.1469
.1711
.1977
.2266
.2578
.2912
.3264
.3632
.4013
.4404
.4801
.0003
.0004
.0006
.0008
.0011
.0015
.0021
.0029
.0039
.0052
.0069
.0091
.0119
.0154
.0197
.0250
.0314
.0392
.0485
.0594
.0721
.0869
.1038
.1230
.1446
.1685
.1949
.2236
.2546
.2877
.3228
.3594
.3974
.4364
.4761
.0003
.0004
.0005
.0008
.0011
.0015
.0021
.0028
.0038
.0051
.0068
.0089
.0116
.0150
.0192
.0244
.0307
.0384
.0475
.0582
.0708
.0853
.1020
.1210
.1423
.1660
.1922
.2206
.2514
.2843
.3192
.3557
.3936
.4325
.4721
.0003
.0004
.0005
.0007
.0010
.0014
.0020
.0027
.0037
.0049
.0066
.0087
.0113
.0146
.0188
.0239
.0301
.0375
.0465
.0571
.0694
.0838
.1003
.1190
.1401
.1635
.1894
.2177
.2483
.2810
.3156
.3520
.3897
.4286
.4681
.0002
.0003
.0005
.0007
.0010
.0014
.0019
.0026
.0036
.0048
.0064
.0084
.0110
.0143
.0183
.0233
.0294
.0367
.0455
.0559
.0681
.0823
.0985
.1170
.1379
.1611
.1867
.2148
.2451
.2776
.3121
.3483
.3859
.4247
.4641
$
543
11
2004 AP® STATISTICS FREE-RESPONSE QUESTIONS
Probability
Table entry for z is the
probability lying below z.
z
Table A
(Continued)
z
.00
.01
.02
.03
.04
.05
.06
.07
.08
.09
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
1.1
1.2
1.3
1.4
1.5
1.6
1.7
1.8
1.9
2.0
2.1
2.2
2.3
2.4
2.5
2.6
2.7
2.8
2.9
3.0
3.1
3.2
3.3
3.4
.5000
.5398
.5793
.6179
.6554
.6915
.7257
.7580
.7881
.8159
.8413
.8643
.8849
.9032
.9192
.9332
.9452
.9554
.9641
.9713
.9772
.9821
.9861
.9893
.9918
.9938
.9953
.9965
.9974
.9981
.9987
.9990
.9993
.9995
.9997
.5040
.5438
.5832
.6217
.6591
.6950
.7291
.7611
.7910
.8186
.8438
.8665
.8869
.9049
.9207
.9345
.9463
.9564
.9649
.9719
.9778
.9826
.9864
.9896
.9920
.9940
.9955
.9966
.9975
.9982
.9987
.9991
.9993
.9995
.9997
.5080
.5478
.5871
.6255
.6628
.6985
.7324
.7642
.7939
.8212
.8461
.8686
.8888
.9066
.9222
.9357
.9474
.9573
.9656
.9726
.9783
.9830
.9868
.9898
.9922
.9941
.9956
.9967
.9976
.9982
.9987
.9991
.9994
.9995
.9997
.5120
.5517
.5910
.6293
.6664
.7019
.7357
.7673
.7967
.8238
.8485
.8708
.8907
.9082
.9236
.9370
.9484
.9582
.9664
.9732
.9788
.9834
.9871
.9901
.9925
.9943
.9957
.9968
.9977
.9983
.9988
.9991
.9994
.9996
.9997
.5160
.5557
.5948
.6331
.6700
.7054
.7389
.7704
.7995
.8264
.8508
.8729
.8925
.9099
.9251
.9382
.9495
.9591
.9671
.9738
.9793
.9838
.9875
.9904
.9927
.9945
.9959
.9969
.9977
.9984
.9988
.9992
.9994
.9996
.9997
.5199
.5596
.5987
.6368
.6736
.7088
.7422
.7734
.8023
.8289
.8531
.8749
.8944
.9115
.9265
.9394
.9505
.9599
.9678
.9744
.9798
.9842
.9878
.9906
.9929
.9946
.9960
.9970
.9978
.9984
.9989
.9992
.9994
.9996
.9997
.5239
.5636
.6026
.6406
.6772
.7123
.7454
.7764
.8051
.8315
.8554
.8770
.8962
.9131
.9279
.9406
.9515
.9608
.9686
.9750
.9803
.9846
.9881
.9909
.9931
.9948
.9961
.9971
.9979
.9985
.9989
.9992
.9994
.9996
.9997
.5279
.5675
.6064
.6443
.6808
.7157
.7486
.7794
.8078
.8340
.8577
.8790
.8980
.9147
.9292
.9418
.9525
.9616
.9693
.9756
.9808
.9850
.9884
.9911
.9932
.9949
.9962
.9972
.9979
.9985
.9989
.9992
.9995
.9996
.9997
.5319
.5714
.6103
.6480
.6844
.7190
.7517
.7823
.8106
.8365
.8599
.8810
.8997
.9162
.9306
.9429
.9535
.9625
.9699
.9761
.9812
.9854
.9887
.9913
.9934
.9951
.9963
.9973
.9980
.9986
.9990
.9993
.9995
.9996
.9997
.5359
.5753
.6141
.6517
.6879
.7224
.7549
.7852
.8133
.8389
.8621
.8830
.9015
.9177
.9319
.9441
.9545
.9633
.9706
.9767
.9817
.9857
.9890
.9916
.9936
.9952
.9964
.9974
.9981
.9986
.9990
.9993
.9995
.9997
.9998
$
12
544
2004 AP® STATISTICS FREE-RESPONSE QUESTIONS
Table entry for p and
C is the point t* with
probability p lying
above it and
probability C lying
between −t * and t*.
Probability p
t*
Table B
t distribution critical values
Tail probability p
df
.25
.20
.15
.10
.05
.025
.02
.01
.005
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
40
50
60
80
100
1000
!
1.000
.816
.765
.741
.727
.718
.711
.706
.703
.700
.697
.695
.694
.692
.691
.690
.689
.688
.688
.687
.686
.686
.685
.685
.684
.684
.684
.683
.683
.683
.681
.679
.679
.678
.677
.675
.674
1.376
1.061
.978
.941
.920
.906
.896
.889
.883
.879
.876
.873
.870
.868
.866
.865
.863
.862
.861
.860
.859
.858
.858
.857
.856
.856
.855
.855
.854
.854
.851
.849
.848
.846
.845
.842
.841
1.963
1.386
1.250
1.190
1.156
1.134
1.119
1.108
1.100
1.093
1.088
1.083
1.079
1.076
1.074
1.071
1.069
1.067
1.066
1.064
1.063
1.061
1.060
1.059
1.058
1.058
1.057
1.056
1.055
1.055
1.050
1.047
1.045
1.043
1.042
1.037
1.036
3.078
1.886
1.638
1.533
1.476
1.440
1.415
1.397
1.383
1.372
1.363
1.356
1.350
1.345
1.341
1.337
1.333
1.330
1.328
1.325
1.323
1.321
1.319
1.318
1.316
1.315
1.314
1.313
1.311
1.310
1.303
1.299
1.296
1.292
1.290
1.282
1.282
6.314
2.920
2.353
2.132
2.015
1.943
1.895
1.860
1.833
1.812
1.796
1.782
1.771
1.761
1.753
1.746
1.740
1.734
1.729
1.725
1.721
1.717
1.714
1.711
1.708
1.706
1.703
1.701
1.699
1.697
1.684
1.676
1.671
1.664
1.660
1.646
1.645
12.71
4.303
3.182
2.776
2.571
2.447
2.365
2.306
2.262
2.228
2.201
2.179
2.160
2.145
2.131
2.120
2.110
2.101
2.093
2.086
2.080
2.074
2.069
2.064
2.060
2.056
2.052
2.048
2.045
2.042
2.021
2.009
2.000
1.990
1.984
1.962
1.960
15.89
4.849
3.482
2.999
2.757
2.612
2.517
2.449
2.398
2.359
2.328
2.303
2.282
2.264
2.249
2.235
2.224
2.214
2.205
2.197
2.189
2.183
2.177
2.172
2.167
2.162
2.158
2.154
2.150
2.147
2.123
2.109
2.099
2.088
2.081
2.056
2.054
31.82
6.965
4.541
3.747
3.365
3.143
2.998
2.896
2.821
2.764
2.718
2.681
2.650
2.624
2.602
2.583
2.567
2.552
2.539
2.528
2.518
2.508
2.500
2.492
2.485
2.479
2.473
2.467
2.462
2.457
2.423
2.403
2.390
2.374
2.364
2.330
2.326
63.66
9.925
5.841
4.604
4.032
3.707
3.499
3.355
3.250
3.169
3.106
3.055
3.012
2.977
2.947
2.921
2.898
2.878
2.861
2.845
2.831
2.819
2.807
2.797
2.787
2.779
2.771
2.763
2.756
2.750
2.704
2.678
2.660
2.639
2.626
2.581
2.576
127.3
14.09
7.453
5.598
4.773
4.317
4.029
3.833
3.690
3.581
3.497
3.428
3.372
3.326
3.286
3.252
3.222
3.197
3.174
3.153
3.135
3.119
3.104
3.091
3.078
3.067
3.057
3.047
3.038
3.030
2.971
2.937
2.915
2.887
2.871
2.813
2.807
318.3
22.33
10.21
7.173
5.893
5.208
4.785
4.501
4.297
4.144
4.025
3.930
3.852
3.787
3.733
3.686
3.646
3.611
3.579
3.552
3.527
3.505
3.485
3.467
3.450
3.435
3.421
3.408
3.396
3.385
3.307
3.261
3.232
3.195
3.174
3.098
3.091
636.6
31.60
12.92
8.610
6.869
5.959
5.408
5.041
4.781
4.587
4.437
4.318
4.221
4.140
4.073
4.015
3.965
3.922
3.883
3.850
3.819
3.792
3.768
3.745
3.725
3.707
3.690
3.674
3.659
3.646
3.551
3.496
3.460
3.416
3.390
3.300
3.291
50%
60%
70%
80%
90%
95%
96%
98%
99%
99.5%
99.8%
99.9%
Confidence level C
13
$
545
.0025
.001
.0005
2004 AP® STATISTICS FREE-RESPONSE QUESTIONS
Probability p
Table entry for p is the point
( χ 2 ) with probability p lying
above it.
(χ2 )
Table C
χ 2 critical values
Tail probability p
df
.25
.20
.15
.10
.05
.025
.02
.01
.005
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
40
50
60
80
100
1.32
2.77
4.11
5.39
6.63
7.84
9.04
10.22
11.39
12.55
13.70
14.85
15.98
17.12
18.25
19.37
20.49
21.60
22.72
23.83
24.93
26.04
27.14
28.24
29.34
30.43
31.53
32.62
33.71
34.80
45.62
56.33
66.98
88.13
109.1
1.64
3.22
4.64
5.99
7.29
8.56
9.80
11.03
12.24
13.44
14.63
15.81
16.98
18.15
19.31
20.47
21.61
22.76
23.90
25.04
26.17
27.30
28.43
29.55
30.68
31.79
32.91
34.03
35.14
36.25
47.27
58.16
68.97
90.41
111.7
2.07
3.79
5.32
6.74
8.12
9.45
10.75
12.03
13.29
14.53
15.77
16.99
18.20
19.41
20.60
21.79
22.98
24.16
25.33
26.50
27.66
28.82
29.98
31.13
32.28
33.43
34.57
35.71
36.85
37.99
49.24
60.35
71.34
93.11
114.7
2.71
4.61
6.25
7.78
9.24
10.64
12.02
13.36
14.68
15.99
17.28
18.55
19.81
21.06
22.31
23.54
24.77
25.99
27.20
28.41
29.62
30.81
32.01
33.20
34.38
35.56
36.74
37.92
39.09
40.26
51.81
63.17
74.40
96.58
118.5
3.84
5.99
7.81
9.49
11.07
12.59
14.07
15.51
16.92
18.31
19.68
21.03
22.36
23.68
25.00
26.30
27.59
28.87
30.14
31.41
32.67
33.92
35.17
36.42
37.65
38.89
40.11
41.34
42.56
43.77
55.76
67.50
79.08
101.9
124.3
5.02
7.38
9.35
11.14
12.83
14.45
16.01
17.53
19.02
20.48
21.92
23.34
24.74
26.12
27.49
28.85
30.19
31.53
32.85
34.17
35.48
36.78
38.08
39.36
40.65
41.92
43.19
44.46
45.72
46.98
59.34
71.42
83.30
106.6
129.6
5.41
7.82
9.84
11.67
13.39
15.03
16.62
18.17
19.68
21.16
22.62
24.05
25.47
26.87
28.26
29.63
31.00
32.35
33.69
35.02
36.34
37.66
38.97
40.27
41.57
42.86
44.14
45.42
46.69
47.96
60.44
72.61
84.58
108.1
131.1
6.63
9.21
11.34
13.28
15.09
16.81
18.48
20.09
21.67
23.21
24.72
26.22
27.69
29.14
30.58
32.00
33.41
34.81
36.19
37.57
38.93
40.29
41.64
42.98
44.31
45.64
46.96
48.28
49.59
50.89
63.69
76.15
88.38
112.3
135.8
7.88
10.60
12.84
14.86
16.75
18.55
20.28
21.95
23.59
25.19
26.76
28.30
29.82
31.32
32.80
34.27
35.72
37.16
38.58
40.00
41.40
42.80
44.18
45.56
46.93
48.29
49.64
50.99
52.34
53.67
66.77
79.49
91.95
116.3
140.2
14
$
546
.0025
9.14
11.98
14.32
16.42
18.39
20.25
22.04
23.77
25.46
27.11
28.73
30.32
31.88
33.43
34.95
36.46
37.95
39.42
40.88
42.34
43.78
45.20
46.62
48.03
49.44
50.83
52.22
53.59
54.97
56.33
69.70
82.66
95.34
120.1
144.3
.001
.0005
10.83
13.82
16.27
18.47
20.51
22.46
24.32
26.12
27.88
29.59
31.26
32.91
34.53
36.12
37.70
39.25
40.79
42.31
43.82
45.31
46.80
48.27
49.73
51.18
52.62
54.05
55.48
56.89
58.30
59.70
73.40
86.66
99.61
124.8
149.4
12.12
15.20
17.73
20.00
22.11
24.10
26.02
27.87
29.67
31.42
33.14
34.82
36.48
38.11
39.72
41.31
42.88
44.43
45.97
47.50
49.01
50.51
52.00
53.48
54.95
56.41
57.86
59.30
60.73
62.16
76.09
89.56
102.7
128.3
153.2
P1: OSO
FREE013-TABLE
FREE013-Moore
September 4, 2008
Random digits
LINE
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
19223
73676
45467
52711
95592
68417
82739
60940
36009
38448
81486
59636
62568
45149
61041
14459
38167
73190
95857
35476
71487
13873
54580
71035
96746
96927
43909
15689
36759
69051
05007
68732
45740
27816
66925
08421
53645
66831
55588
12975
96767
72829
88565
62964
19687
37609
54973
00694
71546
07511
95034
47150
71709
38889
94007
35013
57890
72024
19365
48789
69487
88804
70206
32992
77684
26056
98532
32533
07118
55972
09984
81598
81507
09001
12149
19931
99477
14227
58984
64817
16632
55259
41807
78416
55658
44753
66812
68908
99404
13258
35964
50232
42628
88145
12633
59057
86278
05977
05233
88915
05756
99400
77558
93074
69971
15529
20807
17868
15412
18338
60513
04634
40325
75730
94322
31424
62183
04470
87664
39421
29077
95052
27102
43367
37823
36809
25330
06565
68288
87174
81194
84292
65561
18329
39100
77377
61421
40772
70708
13048
23822
97892
17797
83083
57857
66967
88737
19664
53946
41267
28713
01927
00095
60227
91481
72765
47511
24943
39638
24697
09297
71197
03699
66280
24709
80371
70632
29669
92099
65850
14863
90908
56027
49497
71868
74192
64359
14374
22913
09517
14873
08796
33302
21337
78458
28744
47836
21558
41098
45144
96012
63408
49376
69453
95806
83401
74351
65441
68743
16853
96409
27754
32863
40011
60779
85089
81676
61790
85453
39364
00412
19352
71080
03819
73698
65103
23417
84407
58806
04266
61683
73592
55892
72719
18442
77567
40085
13352
18638
84534
04197
43165
07051
35213
11206
75592
12609
47781
43563
72321
94591
77919
61762
46109
09931
60705
47500
20903
72460
84569
547
12531
42648
29485
85848
53791
57067
55300
90656
46816
42006
71238
73089
22553
56202
14526
62253
26185
90785
66979
35435
47052
75186
33063
96758
35119
88741
16925
49367
54303
06489
85576
93739
93623
37741
19876
08563
15373
33586
56934
81940
65194
44575
16953
59505
02150
02384
84552
62371
27601
79367
42544
82425
82226
48767
17297
50211
94383
87964
83485
76688
27649
84898
11486
02938
31893
50490
41448
65956
98624
43742
62224
87136
41842
27611
62103
48409
85117
81982
00795
87201
45195
31685
18132
04312
87151
79140
98481
79177
48394
00360
50842
24870
88604
69680
43163
90597
19909
22725
45403
32337
82853
36290
90056
52573
59335
47487
14893
18883
41979
08708
39950
45785
11776
70915
32592
61181
75532
86382
84826
11937
51025
95761
81868
91596
39244
41903
36071
87209
08727
97245
96565
97150
09547
68508
31260
92454
14592
06928
51719
02428
53372
04178
12724
00900
58636
93600
67181
53340
88692
03316
Revised Pages
7:30
548
2
A z-score is the number of standard
deviations a data value is away from the
mean. The sign indicates the direction –
above or below.
Standard deviation is the average distance
between the data values and the mean.
Interpret standard 1
deviation
Interpret z-score
SOCS: Describe shape, outliers, center, and
spread (SOCS). Choose appropriate
measures of center and spread based on
shape.
What to do
Describe distribution of 1
quantitative variable
Chapter
The z-score for Bob’s test equals -1.5. Bob's score is 1.5 standard
deviations below the mean of 80.
On average, test scores are 10 points away from the mean of 80.
The distribution of test scores is unimodal and nearly symmetric.
The mean equals 80 and the standard deviation equals 10.
Example
Graph only approximates the distribution. Your graph should have axes labeled.
Scores on a stats test had a mean equal to 80 and a standard deviation equal to 10. A student named Bob scored a 65.
Scenario for examples
Quantitative Distributions
AP Statistics
Interpreting and Describing
549
Prepare side-by-side graphs. Compare
distributions, using comparison words
(higher/equal/lower).
The distribution of males scores is bimodal and skewed negative
while the distribution of female scores is unimodal and nearly
symmetric. Males had a lower mean score than females (75 vs.
84) and the standard deviation is higher for males, which
indicates their scores were less consistent.
Example
relationship
Describe 3, 12
Chapters
Describe strength, direction and form. Identify
outliers, including influential observations.
What to do
There is a strong, positive, linear association between hours
studied and test score. The point representing a student who
studied five hours and scored a 100 is an influential observation.
Example
Graph only approximates the association. Your graph should have labels.
Scores on a stats test had a mean equal to 80 and a standard deviation equal to 10. Students studied a mean of 2 hours for the test with a
standard deviation of 1. The relationship between is study time and test score is linear and the correlation equals 0.7. Bob studied 30 minutes
and scored a 65 on the test.
Scenario for examples
When both variables are quantitative
Describe relationship
1
Chapter What to do
The mean score of males on a stats test equals 75 with a standard deviation of 13. The mean score of females on the same test equals 84 and
the standard deviation equals 9.
Scenario for examples
When at least one variable is categorical
Two-variable Relationships
550
The estimated value of y when x equals 0.
Strength and direction of linear associations.
Do not use when association isn’t linear.
The percentage of <y> variance explained by
<x>
Estimated value of <y> for a given value of <x>
Difference between the true value of <y> and
the estimated value of <y>
The points in the residual plot are distributed
randomly about the x-axis. The association of
the original data is linear.
Interpret 3, 12
correlation
Interpret 3, 12
coefficient of
determination
3, 12
3, 12
Interpret y-hat
Interpret residual
Describe and 3, 12
interpret residual
plot
Rise over run: Slope is predicted amount <y>
changes by <rise amount> for per <1 unit
change> in <x>.
3, 12
Interpret y- 3. 12
intercept
Interpret slope
What to do
Chapters
Graph only approximates the association. Your graph should have
labels
The residuals are distributed randomly about the x-axis. The least
squares model is a good fit for the data.
Bob's actual score is 4.5 points lower than predicted.
The estimated grade of students who study 0.5 hours equals 69.5
49% of test score variance is explained by time spent studying
r= 0.7 means that there is a moderately strong positive
relationship between time spent studying and test score.
The estimated grade of students who don't study equals 66.
On average, test scores increase 7 points for every extra hour of
study.
Example
551
P-value
9, 10, 11,
12
Explain Type II error 12
9, 10, 11,
9, 10, 11,
Explain Type I error 12
Probability of a sample result that is at least as
extreme.
False negative
False positive
Probability that a test detects a difference
9, 10, 11,
12
Explain power
Linkage + context: Compare p-value to
significance level; reject fail/fail to reject;
answer with context
<CL%> of intervals capture the true
<parameter>
Explain confidence
8, 10, 12
level
Hypothesis test 9, 10, 11,
conclusion 12
Identify the confidence level and the
boundaries of the interval. Use context.
Confidence
8, 10, 12
interval conclusion
Chapters What to do
If male stats teachers have the same mean weight as other male
math teachers, the probability is 0.16066 that a sample of 61
male stats teachers would be 175 pounds or more.
Though male stats teachers have a higher mean weight than
other male math teachers, the test fails to detect the difference.
Though male stats teachers have the same mean weight as other
male math teachers, the test indicates male stats teachers are
heavier.
This test has a 35% probability of detecting that male stats
teachers have a mean weight that is 5 pounds heavier than the
mean weight of other math teachers. (Note: I made up the
power. It may actually be different).
Because the 𝑝 = 0.16066 is greater than 𝛞 = 0.05, do not
reject the null hypothesis. The mean weight of male stats
teachers may equal 173 pounds.
95% of intervals calculated using samples of the same size will
include the true mean weight of male stats teachers.
With 95% confidence, the mean weight of male stats teachers is
between 171 and 179 pounds.
Example
Male math teachers, other than stats teachers, have a mean weight of 173 pounds. The mean weight of a random sample of 61 male stats
teachers equals 175 pounds with a standard deviation equal to 15.6 pounds. Is this evidence that the mean weight of male stats teachers is
greater than the mean weight of other male math teachers?
Scenario for examples
Inference
552
553
554
555
556
557
558
559
560
561
2
C
I
S
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
STATS MEDIC
REVIEW
MC and FRQ
587
Unit 1 Practice Free Response Question
Below are boxplots that summarize the weights (in pounds) of large samples from two breeds of
dog: the Anatolian Shepherd and the Black Russian Terrier.
(a) Compare the distributions of weights for the two dog breeds.
(b) This sample of Black Russian Terriers does not contain any outliers. What weights would a
Black Russian Terrier have to be to be considered an outlier?
588
Unit 1 Multiple Choice
FREQUENCY
1. The graph below displays the amount of time to the nearest hour spent on homework per week
for a sample of students. Which measures of center and variability would be most appropriate to
describe the given distribution?
16
14
12
10
8
6
4
2
0
14
12
15 15
11
8
5
4 4
3
2
1 1
1 2 3 4 5 6 7 8 9 10 11 12 13
TIME (HOURS)
(A) Mean and standard deviation
(B) Mean and IQR
(C) Median and standard deviation
(D) Median and IQR
(E) Median and Range
589
2. The boxplot below summarizes the heights (in centimeters) of a sample of 200 3-year-old males in
the United States. Which of the following statements is false?
(A) 50% of the children in the sample had heights between 93 and 98 cm.
(B) 25 children had heights greater than 98 cm.
(C) The 25th percentile in this sample is a height of 93 cm.
(D) The tallest child in this sample was 111 cm tall.
(E) Every child in the sample was at least 82 cm tall.
3. A college professor calculates the standard deviation of all the grades from the midterm exams
she most recently administered. Which of the following is the best description of the standard
deviation?
(A) The difference between the highest score on the midterm and the lowest score on the midterm.
(B) The difference between the score representing the 75th percentile of all midterm exams and the
score representing the 25th percentile of all midterm exams.
(C) Approximately the mean distance between each individual grade of the midterm exams.
(D) Approximately the mean distance between the individual grades of the midterm exams and the
mean grade of all midterm exams.
(E) Approximately the median distance between the individual grades of the midterm exams and the
median grade of all midterm exams.
590
4. Consider the stemplot below, which gives the number of people attending a matinee show at the
local movie theatre for the past 26 days.
1
2
3
4
5
6
7
8
9
10
11
12
8
0
0
1
3
3
1
1
2
7
1
3
2
3
1
5
5
5 7
2 8
5 8
9
7
2
Key: 1|8 = 18 people
Which of the following statements are true?
I. The five number summary is 18, 35, 54, 87, 123
II. Half of the data values are larger than 54.
III. The shape of the distribution is skewed left.
(A) I and II
(B) II
(C) II and III
(D) III
(E) I, II, and III
5. Which of the following statements is true?
(A) Histograms have gaps between each bar.
(B) Dotplots do not provide enough information to determine if there are outliers in the data.
(C) Bar graphs can display both quantitative and categorical data.
(D) Stemplots are the best graphs for displaying data sets with two variables.
(E) Boxplots clearly show the five-number summary of a data set.
591
6. A distribution has a shape that is strongly skewed left. Which of the following statements is most
likely true about this distribution?
(A) Mean > Median.
(B) Mean = Median.
(C) Mean < Median.
(D) Only the mode can be determined.
(E) There are no outliers in the distribution.
7. Several students in an AP Statistics class administer a survey to a sample of students in their
school, asking the students how far they travel to school in miles. Which of the following statistics
would not be measured in miles?
(A) Mean
(B) Median
(C) Interquartile range
(D) Standard deviation
(E) Variance
8. The following table shows how high school students in four grade levels ate their lunch on a
particular day – purchased in the cafeteria, brought a bag lunch, or purchased lunch off campus.
What is the approximate proportion of Juniors that purchased lunch off-campus on this particular
day?
Cafeteria
Bagged Lunch
Off-Campus
Freshman
65
120
0
Sophomore
70
100
20
Junior
50
70
45
(A) 0.06
(B) 0.37
(C) 0.24
(D) 0.27
(E) 0.35
592
Senior
30
60
65
9. A club bowling team has 12 members. The first ten members bowl a game, and their average
score is 156. If you know the next two members bowl scores of 173 and 151, do you have enough
information to find the total of all 12 scores?
(A) Yes, the total is 157.
(B) Yes, the total is 1872.
(C) Yes, the total is 1884.
(D) No, we would need to know the ten individual scores.
(E) No, we would need to know the standard deviation.
10. The mosaic plot shows the distribution of favorite math class for students and teachers at East
Kentwood High School. Based on the graph, which of the following statements is true?
(A) There are more teachers than students who chose their favorite math class as AP Calc.
(B) 100% of students chose Intro Stats as their favorite math class.
(C) When combining students and teachers, AP Stats has the highest overall proportion who
chose it as their favorite math class.
(D) 80% of students chose AP Calc.
(E) There are more teachers than students at East Kentwood High School.
593
Unit 2 Practice Free Response Question
The computer output below gives results from the linear regression analysis for predicting the
pounds of fuel consumed based on the distance traveled in miles for passenger aircraft. Data used
for this analysis were obtained from ten randomly selected flights.
Predictor
Constant
Distance (miles)
Coef
-4702.64
21.282
SE Coef
1657
0.833
S = 2766.57
R-Sq = 98.8%
R-Sq(adj)=98.3%
T
-2.84
25.54
P
0.022
0.000
(a) What is the equation of the least-squares regression line that describes the relationship between
the distance traveled in miles and the pounds of fuel consumed? Define any variables used in this
equation.
(b) Below is a residual plot for the ten flights. Is it appropriate to use the linear regression equation
to make predictions? Explain.
(c) Interpret the y-intercept in the context of the problem. Is this value statistically meaningful?
594
Unit 2 Multiple Choice
1. Which of the following relationships between two variables could be described using correlation,
r?
(A) Number of books read and gender of a student.
(B) Number of football games played and the position of a football player.
(C) High temperature of the day and number of zoo visitors that day.
(D) Type of beverage ordered and time of day it was ordered.
(E) Brand of cell phone and number of cell phones sold.
2. A scatterplot shows a strong, positive, linear relationship between the number of rebounds a
basketball team averages and the number of wins that team records in a season. Which conclusion is
most appropriate?
(A) A team that increases its number of rebounds causes its chances of winning more games to
increase.
(B) If the residual plot shows no pattern, then it is safe to conclude that getting more rebounds
causes more wins, on average.
(C) If the residual plot shows no pattern, then it is safe to conclude that getting more wins causes
more rebounds, on average.
(D) If the 𝑟 ! value is close enough to 100%, then it is safe to conclude that getting more rebounds
causes more wins, on average.
(E) Rebounds and wins are positively correlated, but we cannot conclude that getting more
rebounds causes more wins, on average.
3. Two variables, x and y, have a correlation of 0.75. If x has a mean of 25 and a standard deviation
of 3, and y has a mean of 12 and a standard deviation of 6, which of the following is the leastsquares regression line for the two variables?
(A) ð‘Ķ = −25.5 + 1.5ð‘Ĩ
(B) ð‘Ķ = 12 + 1.5ð‘Ĩ
(C) ð‘Ķ = 12 + 0.75ð‘Ĩ
(D) ð‘Ķ = 16 + 0.75ð‘Ĩ
(E) Not enough information
595
4. Data are collected on the amount of fat (in grams) and calories in the french fry orders at nine fast
food restaurants. The least-squares regression line for the data is ð‘Ķ = 274.34 + 9.55ð‘Ĩ, where ð‘Ķ is the
predicted number of calories and x is grams of fat. Which of the following is the correct
interpretation of the slope of the least-squares regression line?
(A) The calories increase by 9.55, on average.
(B) For every increase in fat, the calories increase as well.
(C) Every increase of 1 gram of fat causes an increase of 9.55 calories.
(D) For every increase of 1 gram of fat, the predicted calories increase by 9.55.
(E) For every increase of 1 calorie, the predicted grams of fat increase by 9.55.
5. The scatterplot below shows data for the nine french fry orders from the previous problem, with
the least-squares regression line displayed. Which of the following is the best estimate of the value
of the residual for the point indicated by the arrow?
(A) -570
(B) 570
(C) -60
(D) 60
(E) 630
596
6. The scatterplot below shows data for the nine french fry orders from the previous problem. A
tenth fast food chain has been added, as indicated by the arrow. How would this tenth data point
affect the slope and correlation in this scenario?
(A) Slope decreases, correlation increases
(B) Slope increases, correlation increases
(C) Slope increases, correlation decreases
(D) Slope decreases, correlation decreases
(E) Cannot be determined without the full set of data
7. Battery life has a strong, negative, linear relationship with temperature. If the least-squares
regression line using x = temperature explains 90% of the variation in battery life, which of the
following must be the correlation, r, between battery life and temperature?
(A) -0.90
(B) 0.90
(C) -0.95
(D) 0.95
(E) Cannot be determined without the original data.
597
8. A random sample of households is taken. For each household, the number of hours spent
watching television and the power consumption (in kWh) during a day are recorded. The table below
shows computer output from a linear regression analysis on the data.
Predictor
Constant
Hours television
Coef
19.31
0.891
SE Coef
2.8621
0.2715
S = 4.185
R-Sq = 30.0%
R-Sq(adj)=28.9%
T
6.75
3.28
P
0.000
0.002
Which of the following is the equation of the least-squares regression line?
(A) ð‘Ķ = 19.31 + 0.891ð‘Ĩ
(B) ð‘Ķ = 2.8621 + 0.2715ð‘Ĩ
(C) ð‘Ķ = 0.891 + 19.31ð‘Ĩ
(D) ð‘Ķ = 0.2715 + 2.8621ð‘Ĩ
(E) ð‘Ķ = 0.2715 + 0.891ð‘Ĩ
9. A random sample of households is taken. For each household, the number of hours spent
watching television and the power consumption (in kWh) during a day are recorded. The table below
shows computer output from a linear regression analysis on the data.
Predictor
Constant
Hours television
Coef
19.31
0.891
SE Coef
2.8621
0.2715
S = 4.185
R-Sq = 30.0%
R-Sq(adj)=28.9%
T
6.75
3.28
P
0.000
0.002
Which of the following is a correct interpretation of r2?
(A) Number of hours of television explains 30% of the variability in power consumption.
(B) 30% of the increase in number of hours of television is explained by power consumption.
(C) 30% of the data will lie on the least-squares regression line.
(D) 30% of the residuals will be less than 4.185.
(E) All of the above are correct interpretations.
598
10. Using the least-squares regression line, yĖ‚ = −25.5+1.5x , what is the residual for the data point at
(28,19)?
(A) -2.5
(B) 2.5
(C) 4.33
(D) 16.5
(E) 19
599
Unit 3 Practice Free Response Question
The two tires on a bicycle experience wear much in the same way car tires do; repeated contact with the road
or sidewalk will wear down the raised rubber, giving the tire less traction. A major tire manufacturer has
developed a new bicycle tire design they hope will reduce wear. After running several rounds of testing in
laboratory conditions, they want to test the tires in real-life biking conditions. They find 20 volunteers who
bike on a daily basis. The volunteers are each given a new bike of the same model for the duration of the
experiment. The volunteers are to bike as they normally would for a month, then return the bike to have the
wear of the tires measured, determined by millimeters of tire depth remaining. The manufacturer knows
these results must be compared to their standard model tire, so they will use the standard model in the
experiment as a control group.
a) The manufacturer determines a matched pair design would be most appropriate to compare the wear of
the new tire design to the standard design. Describe how to implement a matched pair design for this
experiment.
b) Describe the primary advantage of using a matched pair design in the context of this experiment.
c) At the conclusion of the experiment, the manufacturer finds that the new tire design has experienced
statistically significantly less wear than the standard tire design. Can the manufacturer infer that the new tire
will cause significantly less tire wear for all bikers? Explain.
600
Unit 3 Multiple Choice
1. An AP Statistics class starts a project to estimate the average number of hours a student at their
high school sleeps per night. Their high school has 1200 students, and they take a sample of the first
120 students that arrive at school on a particular day. They ask each of the 120 students how many
hours of sleep they got the night before and then calculate an average. Which of the following
statements is an accurate description of the elements of this survey?
(A) Population: the 120 students surveyed. Parameter of interest: the average number of hours a
student at this high school sleeps per night.
(B) Sample: the 1200 students of the school. Population: all high school students in the United
States. Statistic: the average number of hours a student at this high school sleeps per night.
(C) Sample: the 120 students surveyed. Population: the 1200 students at the high school. Parameter
of interest: the average number of hours a student at this high school sleeps per night.
(D) Sample: the 120 students surveyed. Population: the 1200 students at the high school. Statistic:
The average number of hours a student at this high school sleeps per night.
(E) This is not a survey because a convenience sample was used.
2. At a high school, the cafeteria staff provides optional comment cards to students eating in the
cafeteria. Students who wish to make a comment can obtain and fill out a comment card and return
it to a drop box in the front office. At the end of the semester, the staff analyzes the responses and
finds that 72% of the students who filled out a comment card recommend major changes to the food
selection. What conclusions can the cafeteria staff draw from these responses?
(A) Since all of the students had an opportunity to respond, this is evidence that a majority of all the
students supports major changes.
(B) The cafeteria staff should not use the results to draw any conclusions since some students may
not have filled out a comment card, which is nonresponse.
(C) If the students recorded their grade level with their response, the staff could make the original
sample a stratified sample and use the responses to learn what proportion of each grade
recommends major changes to food selection.
(D) The responses were from a voluntary response sample, so the cafeteria staff should not use the
results to draw any conclusions.
(E) The opportunity to fill out comment cards should be offered to students at all high schools in the
school district to get a larger sample size, which will make the results more reliable.
601
3. The Career Placement department at a university wants to administer a survey to determine
student awareness of the resources offered. The department expects that student awareness of the
Career Placement resources will vary significantly depending on how many years a student has
attended the university, so they plan to randomly select 30 freshmen, 30 sophomore, 30 juniors, and
30 seniors. This type of sample can be best described as a
(A) simple random sample.
(B) systematic sample.
(C) stratified sample.
(D) cluster sample.
(E) block sample.
4. Major League Baseball (MLB) has recently been evaluating the timing of various events during
games in an effort to improve the pace of a game. MLB wants to know how long a mound visit,
defined as when a coach pauses the game to visit the pitcher on the mound, takes on average. MLB
randomly selects 100 games over the course of a season, and records the length, in seconds, of
every mound visit that occurs in that game. This sample of mound visits can be best described as a
(A) simple random sample.
(B) systematic sample.
(C) stratified sample.
(D) cluster sample.
(E) block sample.
5. Twenty volunteers with high cholesterol were selected for a trial to determine whether a new diet
reduces cholesterol. The volunteers were given a low-carb, low-calorie diet. After 8 weeks of the
diet, the average cholesterol of the volunteers dropped a significant amount. This study is an
example of
(A) A controlled experiment.
(B) An uncontrolled experiment.
(C) A double blind experiment.
(D) A blocked design experiment.
(E) An observational study.
602
6. The Department of Natural Resources (DNR) is planning a study to determine whether
breakwaters help decrease erosion at a large lake. They divide the shoreline into 100-foot plots,
installing some with breakwaters and some without. However, the east and west shorelines of this
lake receive very different wave patterns due to the wind. The DNR suspects this will affect the
responses to the breakwaters. Because of this, the DNR plans to treat each shoreline as a different
group. This study is an example of
(A) An uncontrolled experiment.
(B) A blocked design experiment.
(C) A matched pairs experiment.
(D) A stratified random sample.
(E) A randomized observational study.
7. Forty adult males volunteered to participate in an experiment. Half of them are randomly assigned
to take a caffeine supplement before working out, and the other half are assigned to take a placebo
before completing the same workout. During the workout, heart rate monitors will be used to
measure each participant’s heart rate. The study found that those who took a caffeine supplement
had significantly higher average pulse rates during the workout. What conclusion can be drawn from
the study?
(A) Two experimental groups of 20 each is too small to conclude that the results are significant.
(B) Those that took the caffeine supplement had a higher average heart rate during the workout, but
it would not be correct to conclude that caffeine caused this to occur.
(C) The caffeine supplements caused the heart rates to increase due to the placebo effect.
(D) Caffeine supplements will, on average, raise the heart rates of all adult males during a workout
similar to the one performed in the experiment.
(E) Caffeine supplements will, on average, raise the heart rates of adult males similar to those in this
study during a workout similar to the one performed in the experiment.
603
8. The following 8 truck models will be used in an experiment with two treatments. The trucks are
listed according to their make and size. If the experiment uses a block design based on the size of
the trucks, which of the following is not a possible list of trucks that receive the first treatment?
Mid-size
Full-size
Chevy
Colorado
Silverado
GMC
Canyon
Sierra
Nissan
Frontier
Titan
Toyota
Tacoma
Tundra
(A) Colorado, Sierra, Frontier, Tundra
(B) Colorado, Canyon, Titan, Tacoma
(C) Colorado, Silverado, Canyon, Sierra
(D) Canyon, Frontier, Titan, Tundra
(E) Silverado, Frontier, Titan, Tacoma
9. Diverticulitis is a common intestinal ailment. To test a new medication, a hospital takes 50
volunteers who have a moderate case of the disease. The volunteers are randomly assigned to two
treatment groups, one receiving the old medication and one receiving the new medication. The pills
are distributed in unmarked pill form so that the patients and medical professionals do not know
which medication the patient is taking. Which of the following statements is/are true?
I. If the results show that the new medication is significantly more effective than the old
medication, we can conclude that the new medication will be more effective for all people
that have a moderate case of the disease.
II. The group receiving the old medication serves as a control group.
III. This is an example of a double-blind experiment.
(A) III
(B) I and II
(C) I and III
(D) II and III
(E) I, II, and III
604
10. A study is conducted to determine if the blue light from a tablet device will affect the fall asleep
time of people in various age groups differently. Volunteers for the study are grouped by age: 1830, 31-50, and 50+. Half of each group is assigned a standard tablet, the other half is assigned a
tablet with reduced blue light. People in each group are asked to use the tablet for 10 minutes
before bed and their fall asleep time is recorded. Which of the following is correct?
(A) Blocks: the three age groups. Treatments: using the standard tablet or the reduced blue light
tablet before bed. Response variable: How the eyes react to blue light.
(B) Blocks: the three age groups. Treatments: measuring how long it takes to fall asleep. Response
variable: The hours of sleep recorded.
(C) Strata: the three age groups. Treatments: using the standard tablet or the reduced blue light
tablet before bed. Response variable: How the eyes react to blue light.
(D) Blocks: the three age groups. Treatments: using the standard tablet or the reduced blue light
tablet before bed. Response variable: Time it takes to fall asleep.
(E) Blocks: the group receiving the standard tablet and the group receiving the reduced blue light
tablet. Treatments: using the standard tablet or the reduced blue light tablet before bed. Response
variable: Time it takes to fall asleep.
605
Unit 4 Practice Free Response Question
The Big River Casino is advertising a new digital lottery-style game called Instant Lotto. The player
can win the following monetary prizes with the associated probabilities:
•
5% probability to win $10
•
4% probability to win $15
•
•
3% probability to win $30
1% probability to win $50
•
0.1% probability to win the Grand Prize, $1000.
(a) Calculate the expected value of the prize for one play of Instant Lotto.
(b) As a promotion, a visitor to the casino is given 20 free plays of Instant Lotto. What is the
probability that the visitor wins some prize at least twice in the 20 free plays?
(c) The number of people who play Instant Lotto each day is approximately normally distributed with
a mean of 800 people and a standard deviation of 310 people. What is the probability that a
randomly selected day has at least 1000 people play Instant Lotto?
606
Unit 4 Multiple Choice
1. In the casino game Roulette, a bet on “red” will win if the ball lands on one of the 18 red numbers
of the 38 numbers on the wheel, with each number being equally likely. You want to run a simulation
that will estimate the probability of a player winning both bets when betting on red twice. Which of
the following would be an appropriate setup for the simulation:
I.
II.
III.
Use a table of random digits to select one number from 01 to 38 and then a second
number from 01 to 38. If the first number is between 01 and 18 and the second
number is between 01 and 18, then the player has won both rounds.
Use a table of random digits to select two numbers 01-38, without repeats. If both
numbers are between 01 and 18, then the player has won both rounds.
Use a table of random digits to select two numbers 01-19, allowing repeats. If both
numbers are between 01 and 09, then the player has won both rounds.
(A) I
(B) II
(C) III
(D) I and II
(E) I and III
2. At a summer camp, 72% of the campers participate in rope climbing and 26% participate in
canoeing. 83% of the campers participate in rope climbing, canoeing, or both. What is the
probability that a randomly selected camper participates in both rope climbing and canoeing?
(A) 0.11
(B) 0.15
(C) 0.69
(D) 0.98
(E) Cannot be determined
607
3. The table below gives information on the number of houses built in three different neighborhoods
and in three different decades. Which one of the following statements is false?
Shady Lane
Oakcrest
Pinewood Estates
1960s
40
60
0
1970s
30
15
45
1980s
10
5
15
(A) Houses in Pinewood Estates and houses built in the 1960s are mutually exclusive.
(B) The events “home is in Oakcrest” and “home was built in 1970s” are dependent.
(C) If a house is selected at random, the probability that the house was built in the 1980s is 30/220.
(D) The probability that a randomly selected home was built in the 1980s or is in the Oakcrest
neighborhood is 110/220.
(E) The probability that a randomly selected house in Shady Lane was built in the 1960s is ½.
4. According to school records, your school’s softball team wins 62% of the time when they play a
game on their home field and 26% of the time when they play at the other team’s field. This season,
they play 45% of their games at their home field. Assuming this team wins at the same pace as
previous teams, what is the probability that they win a randomly selected game this season?
(A) 0.040
(B) 0.143
(C) 0.279
(D) 0.422
(E) 0.440
5. Redundant monitoring means that a signal is sent in multiple forms or paths to reach its intended
target. For example, an alarm may be set up to send multiple signals to a security company, each
independent of each other in case of failure along one path. If an alarm sends off four signals down
different paths, and each has a 0.65 probability of reaching the security company, what is the
probability that at least one signal successfully reaches the security company?
(A) 0.015
(B) 0.111
(C) 0.178
(D) 0.821
(E) 0.985
608
6. A company advertises two car tire models. The number of thousands of miles that the standard
model tires last has a mean 𝜇( = 60 and standard deviation 𝜎( = 5. The number of miles that the
extended life tires last has a mean 𝜇. = 70 and standard deviation 𝜎. = 7. If mileages for both tires
follow a normal distribution, what is the probability that a randomly selected standard model tire will
get more mileage than a randomly selected extended life tire?
(A) 0.108
(B) 0.123
(C) 0.434
(D) 0.566
(E) 0.592
7. When playing the card game Blackjack, multiple decks are used and reshuffled often so that the
outcomes of the cards dealt are approximately independent. When a player receives two cards that
are a combination of an ace and a face card, this is called a “natural blackjack” and automatically
wins. A natural blackjack should occur in 4.5% of the rounds played. What is the probability that a
player plays 20 rounds of Blackjack and gets two or more natural blackjacks?
(A) 0.059
(B) 0.168
(C) 0.227
(D) 0.773
(E) 0.900
8. By one estimate, 3% of all Siberian Husky puppies are born with two different colored eyes (called
heterochromia iridum). For random samples of 50 Siberian Husky puppies, what are the mean and
standard deviation of the number of puppies that will have two different colored eyes?
(A) 𝜇 = 0.03, 𝜎 = 0.024
(B) 𝜇 = 0.03, 𝜎 = 0.171
(C) 𝜇 = 1.5, 𝜎 = 1.206
(D) 𝜇 = 1.5, 𝜎 = 1.455
(E) 𝜇 = 3, 𝜎 = 50
609
9. Using data collected from 1981 to 2010 for Ann Arbor, MI (GO BLUE!), the average “high”
temperature for days in July has a mean of 28.9° Celsius with a standard deviation of 3.3° Celsius.
What are the mean and standard deviation if the temperatures are converted to degrees Fahrenheit?
(A) Mean is 84.02°F, Standard deviation is 1.83°F.
(B) Mean is 52.02°F, Standard deviation is 5.94°F.
(C) Mean is 84.02°F, Standard deviation is 5.94°F.
(D) Mean is 84.02°F, Standard deviation is 35.28°F.
(E) Mean is 84.02°F, Standard deviation is 37.94°F.
10. The scores on the verbal section of the Graduate Records Examination (GRE) are approximately
normally distributed with a mean of 150 and a standard deviation of 8.5. What is the probability that
a randomly selected score on the verbal section is higher than 165?
(A) 0.018
(B) 0.039
(C) 0.961
(D) 0.982
(E) Cannot be determined
11. The finishing times of a 5K race run by competitive male runners age 15-18 are approximately
normal distribution with a mean time of 18 minutes and a standard deviation of 2 minutes. A
particularly tough 5K race tends to take these same runners 4.5 additional minutes. To earn a
completion medal in this particular 5K race, the runner must complete the race within 25 minutes.
Approximately, what percent of competitive male runners in the 15-18 age bracket typically do not
earn a completion medal at this race?
(A) Approximately 0%
(B) 10.56%
(C) 14.08%
(D) 28.93%
(E) 35.03%
610
12. A pollster asked 100 people “If money was not a factor, how many children would you like to
have?” The results and their frequencies are shown in the (estimated) probability distribution
function table below.
X
P(X)
0
?
1
2
3
4
0.26
0.29
0.15
0.05
What is the probability that the number of children a randomly selected person from this sample
would like to have is less than the mean of X?
(A) 0.25
(B) 0.49
(C) 0.51
(D) 0.80
(E) 1.49
13. Employees that work at a fish store must measure the level of nitrites in the water each day.
Nitrite levels should remain lower than 5 ppm as to not harm the fish. The nitrite level varies
according to a distribution that is approximately normal with a mean of 3 ppm. The probability that
the nitrite level is less than 2 ppm is 0.0918. Which of the following is closest to the probability that
on a randomly selected day the nitrite level will be at least 5 ppm?
(A) 0.0039
(B) 0.0266
(C) 0.0918
(D) 0.7519
(E) 0.9961
14. According to a study, 91% of all adults have a cell phone. An employee of a cell phone company
attends a community event and has a special offer to give to first time cell phone owners. If she
randomly selects adults in attendance at the event and cell phone ownership is independent from
adult to adult, what is the probability that she asks 20 adults before finding one that does not own a
cell phone?
(A) (0.91)20(0.09)
(B) (0.09)20(0.91)
(C) 20(0.91)20(0.09)
(D) 20(0.09)20(0.91)
(E) 20(0.09)(0.91)
611
15. In a large school district, it is known that 25% of all students entering kindergarten are already
reading. A simple random sample of 10 new kindergarteners is drawn. What is the probability that
fewer than three of them are able to read?
(A) 0.2503
(B) 0.2816
(C) 0.5000
(D) 0.5256
(E) 0.7759
16. A dice game at a local carnival uses a 10-sided die which has been rigged. The even numbered
outcomes (2, 4, 6, 8, and 10) have been made lighter so that they are three times more likely to
occur as the odd numbered outcomes (1, 3, 5, 7, and 9). A player that rolls an odd number wins the
game. What is the probability of winning this game?
(A) 0.05
(B) 0.15
(C) 0.20
(D) 0.25
(E) 0.50
17. Airlines must be aware of the weight of the aircraft plus everything on board. Based on
thousands of flights, the mean and standard deviation for the weights of several items are shown in
the table. The weights of the airplane, people/luggage, and fuel are assumed to be independent.
Item
Mean (in
tons)
Plane
45
People/luggage
10
Fuel
20
Standard Deviation (in
tons)
2
1
1
Let the random variable T represent the total weight of the plane, people/luggage, and fuel for a
randomly selected flight. Which of the following is closest to the standard deviation of T?
(A) 2.45 tons
(B) 4.00 tons
(C) 6.00 tons
(D) 75.0 tons
(E) 120 tons
612
18. According to the Guinness Book of World Records, a woman from Russia, Mrs. Vassilyeva, had
69 children between the years 1725 to 1765. She had 16 pairs of twins, 7 sets of triplets, and 4 sets
of quadruplets. Suppose one of the births is randomly selected. Given that Mrs. Vassilyeva gave
birth to at least 3 children (triplets), what is the probability that she gave birth to quadruplets?
(A) 4/27
(B) 7/11
(C) 3/11
(D) 3/27
(E) 4/11
19. A particular manufacturer of refrigerators has come under fire as their $1,700 LGA model has
failed for a significant portion of owners. The LGA model often failed approximately 1-2 years after
purchase, which is outside the warranty period. A class action lawsuit was filed by owners of the LGA
model claiming that the manufacturer should repair or replace the units. The lawyer representing the
class action lawsuit obtained a list of LGA owners who completed registration cards and asked a
simple random sample of 1000 LGA owners, “Is your LGA refrigerator still working?” He also
recorded the age of the refrigerator from information provided on the registration card. Here are the
data:
Working?
Yes
No
Total
Age of the LGA Refrigerator
Less than 1
1 to 2 years
More than 2
year
years
204
55
210
51
165
315
255
220
525
Total
469
531
1000
What is the probability a randomly selected LGA refrigerator is at least one year old and no longer
works?
(A) 0.051
(B) 0.135
(C) 0.480
(D) 0.531
(E) 0.904
613
20. Based on student records, 25% of the students at a large high school have a GPA of 3.5 or
better, 16% of the students are currently enrolled in at least one AP class, and 12% of the students
have a GPA of 3.5 or better and are enrolled in at least one AP class. If we select one student at
random, what is the probability that the student has a GPA lower than 3.5 and is not taking any AP
classes?
(A) 0.29
(B) 0.47
(C) 0.53
(D) 0.63
(E) 0.71
614
Unit 5 Practice Free Response Question
A regulation baseball can weigh no more than 149 grams. A factory produces baseballs with weights
that are normally distributed with a mean of 146 grams and a standard deviation of 2.3 grams.
(a) If a baseball produced by the factory is randomly selected, what is the probability that it is within
regulation weight?
(b) The baseballs are shipped in boxes of 16. What is the probability that at least 15 of the 16
baseballs in a pack are within regulation weight?
(c) The factory will not ship a box of 16 if the average weight of the baseballs in the box exceeds 147
grams. What is the probability that a pack of 16 baseballs would have an average weight of more
than 147 grams?
615
Unit 5 Multiple Choice
1. In a large city, 46% of adults support the local football team building a new stadium. If a poll is
taken from a random sample of 80 adults in the large city, which of the following properly describes
the sampling distribution of the sample proportion of adults who support the stadium?
(A) 𝜇! = 36.8, 𝜎! = 4.46, the distribution is approximately normal.
(B) 𝜇! = 36.8, 𝜎! = 4.46, shape of the distribution is unknown.
(C) 𝜇! = 0.46, 𝜎! = 0.056, the distribution is approximately normal.
(D) 𝜇! = 0.46, 𝜎! = 0.056, shape of the distribution is unknown.
(E) 𝜇! = 43.2, 𝜎! = 4.46, the distribution is binomial.
2. Which of the following statements is true?
(A) A parameter is a number that describes some characteristic of a sample.
(B) An unbiased estimator is any statistic that is taken from a sample chosen by random methods.
(C) A sampling distribution is the distribution of a statistic calculated from all possible samples of the
same size from the same population.
(D) The variability of a population distribution will decrease as the sample size increases.
(E) A normal approximation can always be used for the sampling distribution of 𝑝 as long as the
sample size is greater than 30.
3. The weight of a single bag checked by an airplane passenger follows a distribution that is right
skewed with a mean of 38 pounds and a standard deviation of 6.2 pounds. If a random sample of 96
bags is selected, what is the probability that the average weight of the bags exceeds 40 pounds?
(A) 0.0008
(B) 0.0011
(C) 0.3735
(D) 0.9992
(E) It is not appropriate to use a normal distribution to calculate probability in this situation.
616
4. Below are histograms displaying the values taken by three sample statistics in several hundred
samples from the same population. Each histogram has the same scale and the true value of the
population parameter is marked by an arrow on each histogram. Which statistic is the best estimator
of the parameter?
A
B
C
(A) Statistic A because A and B are unbiased but A has higher variability.
(B) Statistic B because A and B are unbiased but B has lower variability.
(C) Statistic C because it is always better to underestimate rather than overestimate.
(D) Statistics A or B because the population parameter is in the center of the sampling distributions
for both.
(E) Statistics A, B or C because all contain the population parameter in the sampling distributions.
5. A company intends to collect a random sample of size n from a population with population
proportion of interest p. Which of the following situations would result in the smallest standard
deviation of the sampling distribution of 𝑝?
(A) The population proportion 𝑝 = 0.5, and the sample size is 𝑛.
(B) The population proportion 𝑝 ≠ 0.5, and the sample size is 𝑛.
(C) The population proportion 𝑝 = 0.5, and the sample size is 2𝑛.
(D) The population proportion 𝑝 ≠ 0.5, and the sample size is 2𝑛.
(E) There is not enough information provided to determine the smallest standard deviation of the
sampling distribution.
617
6. Kent county and Oakland county are located on the west and east sides of Michigan, respectively.
Information about the yearly household income is given in the table below. Both distributions are
strongly skewed right. A random sample of 100 households is taken from each of the very large
populations. Which of the following describes the sampling distribution of the difference in sample
means?
Oakland County
Kent County
n
100
100
Mean (dollars)
66,400
54,340
Std. Dev. (dollars)
48,400
39,700
(A) 𝜇!! !!! = 12060, 𝜎!! !!! = 870, the distribution is approximately normal
(B) 𝜇!! !!! = 12060, 𝜎!! !!! = 6259.91, the distribution is strongly skewed right
(C) 𝜇!! !!! = 12060, 𝜎!! !!! = 6259.91, the distribution is approximately normal
(D) 𝜇!! !!! = 12060, 𝜎!!!! = 8810, the distribution is strongly skewed right
(E) 𝜇!! !!! = 12060, 𝜎!! !!! = 8810,the distribution is approximately normal
7. A recent report states that 89% of Americans consider themselves above average drivers. A local
newspaper is planning on conducting a survey to investigate whether this is true locally. If the
newspaper assumes that the 89% claim is true and plans to use the normal approximation to
calculate probabilities associated with their sample proportion, which sample size would be most
appropriate?
(A) 10
(B) 11
(C) 30
(D) 91
(E) The number depends on the size of the population.
8. Daren and Josh are pretty good free throw shooters. Daren makes 75% of the free throws he
attempts. Josh makes 80% of his free throws. Suppose we take separate random samples of 50 free
throws each from Daren and Josh, and record the proportion of free throws that are made by each.
Which of the following best describes the sampling distribution of pĖ‚D − pĖ‚ J ?
(A) Strong skew, with mean -0.05 and standard deviation 0.083
(B) Approximately normal, with mean -0.05 and standard deviation 0.083
(C) Shape cannot be determined, with mean of -0.05 and standard deviation 0.083
(D) Strong skew, with mean -0.05 and standard deviation 0.118
(E) Approximately normal, with mean -0.05 and standard deviation 0.118
618
9. Donner Summit, California, is a popular ski resort area. Over the past 60 years, the annual snowfall
totals of Donner Summit have followed a distribution that is strongly skewed right with a mean of
404 inches and a standard deviation of 129 inches. If many samples of size 9 were taken, which of
the following would best describe the shape of the sampling distribution of ð‘Ĩ?
(A) The shape is approximately normal since the sample size is reasonably large.
(B) The shape is skewed right since 𝑛𝑝 ≥ 10 and 𝑛(1 − 𝑝) ≥ 10 have not been met.
(C) The shape is equally as skewed right as the population distribution.
(D) The shape is skewed right but less so than the population distribution.
(E) Cannot be determined from the given information.
10. The heights of all adult males in Croatia are approximately normally distributed with a mean of
180 cm and a standard deviation of 7 cm. The heights of all adult females in Croatia are
approximately normally distributed with a mean of 158 cm and a standard deviation of 9 cm. If
independent random samples of 10 adult males and 10 adult females are taken, what is the
probability that the difference in sample means (males – females) is greater than 20 cm?
(A) 0.3463
(B) 0.6537
(C) 0.6827
(D) 0.7104
(E) 0.8687
619
Unit 6 Practice Free Response Question
How much do Americans pay for coffee on average? In a 2015 survey of 105 randomly selected U.S.
adults, the respondents were asked what price they paid for their most recent purchase of coffee.
The sample data had a mean of $3.42 and a standard deviation of $0.75.
(a) Construct and interpret a 95% confidence interval for the mean price paid for coffee in 2015 by all
U.S. adults.
(b) In 2013, tax receipts show that Americans paid an average of $2.98 per coffee. Do the 2015
survey results provide convincing evidence that the average price Americans pay for coffee has
changed? Explain your reasoning.
620
Unit 6 Multiple Choice
1. An AP Statistics class surveys 24 randomly selected female students from their high school, and
calculates a 95% confidence interval for the mean height of female students to be 63.4 ± 1.6 inches.
Which of the following is a correct interpretation of this interval?
(A) There is a 95% probability that the true mean height of female students at the school falls
between 61.8 and 65.0 inches.
(B) We can be 95% confident that the true mean height of female students at the school is 63.4
inches.
(C) 95% of the time, we can be confident that the mean sample height of female students will fall
between 61.8 and 65.0 inches.
(D) We can be 95% confident that the true mean height of female students at the school is between
61.8 and 65.0 inches.
(E) There is a 95% probability that the true mean height of female students at the school is 63.4
inches.
2. A random sample of 18 adults, chosen from the 1500 adults in the town, took a survey asking their
opinion on a recent property tax change. 25% of those who responded said they were in favor of the
change. The company running the survey wants to construct a confidence interval estimating the
proportion of all adults in the town who support the change. Which of the conditions for inference
have been satisfied?
I. Random condition
II. Normal condition
III. 10% condition
(A) I
(B) II
(C) I and III
(D) II and III
(E) I, II, and III
621
3. For which of the following samples would it be appropriate to use t-procedures for inference for
the population mean?
(A) III
(B) I and II
(C) I and III
(D) II and III
(E) I, II, and III
4. The students of a Statistics class want to estimate how many years it takes for a university
professor to earn a PhD. They survey a random sample of 40 professors with PhDs, which results in a
sample mean of 5.4 years and a standard deviation of 1.6 years. Which of the following represents
the 95% confidence interval for the true mean of the number of years it takes a professor to earn a
PhD?
(A) 5.4 ± 1.96(1.6)
(B) 5.4 ± 1.96
(C) 5.4 ± 1.96
(D) 5.4 ± 2.09
(E) 5.4 ± 2.09
+.,
-.
+.,
√-.
+.,
-.
+.,
√-.
5. A study intends to estimate a population mean with an unknown population standard deviation
and a sample size of 15. Which of the following is closest to the appropriate critical value to create a
98% confidence interval?
(A) 2.055
(B) 2.249
(C) 2.264
(D) 2.602
(E) 2.624
622
6. A newspaper plans to conduct a survey for the upcoming presidential election in order to estimate
the proportion of the population, p, who supports a certain candidate. What is the smallest sample
size needed to obtain an estimate that is within 4% of the true proportion p at the 96% confidence
level?
(A) 26
(B) 376
(C) 601
(D) 660
(E) Cannot be determined from information given.
7. A random sample of size n is collected from a considerably larger population of size N. That
sample is used to create a 95% confidence interval to estimate a population proportion. Using the
same sample proportion, the confidence interval would be narrower if:
(A) A smaller sample size was used.
(B) A t-procedure was used instead of a z-procedure.
(C) A higher confidence level was used.
(D) The population size N was larger.
(E) A lower confidence level was used.
8. Electric vehicles make up a very small proportion of the overall car market, but by how much has
that proportion increased? Independent surveys of randomly selected car dealerships were
completed, with 0.1% of the 10,000 cars sold in the 2012 sample being electric vehicles, and 0.5%
percent of the 20,000 cars in the 2016 sample being electric vehicles. Which of the following
represents a 95% confidence interval for the change in the proportion of electric cars sold from 2012
to 2016?
(A) (0.005 − 0.001) ± 1.964
(B) (0.005 − 0.001) ± 1.654
....5(+6....5)
....5(+6....5)
(C) (0.005 − 0.001) ± 1.964
(D) (0.004) ± 1.964
7.,...
7.,...
+
....+(+6....+)
+
....+(+6....+)
....:;(+6....:;)
7.,...
+.,...
+.,...
+
....:;(+6....:;)
+.,...
....-(+6....-)
:.,...
....:;(+6....:;)
(E) (0.0037) ± 1.964
:.,...
623
9. A recent study examining the effects of sugar consumption on a middle school student’s ability to
focus on a reading assignment used 18 volunteer subjects and divided them into 9 pairs based on
their reading speeds. One randomly assigned member of each pair was given a beverage containing
a substantial amount of sugar, and the other drank a sugar-free version of the beverage. Each
subject was given a passage to read and the time (in seconds) it took to read was recorded. The
difference for each pair is calculated (sugar – sugar-free). A 90% confidence interval for the mean
difference in reading times is (-5.8, 0.15).
(A) Because the center of the interval is -2.825, we have convincing evidence that sugar causes faster
reading times, on average.
(B) Because the confidence interval includes 0, we don’t have convincing evidence that sugar causes
faster reading times, on average.
(C) Because the confidence interval includes 0, we have convincing evidence that sugar causes faster
reading times, on average.
(D) Because the confidence interval includes more negative than positive values, we have convincing
evidence that sugar causes faster reading times, on average.
(E) Causation should not be inferred because the subjects were volunteers.
10. A social media developer wants to determine if the proportion of teenagers who use Facebook is
the same as the proportion of teenagers who use Snapchat. She takes a random sample of 100
teenagers and finds that 75 of the 100 students use Facebook and 89 of the 100 students use
Snapchat. Would it be reasonable for the social media developer to construct a 95% confidence
interval for the true difference in proportion of teenagers that use Facebook and Snapchat?
(A) No, the random condition is not satisfied.
(B) No, the normal condition is not satisfied.
(C) No, the two samples are not independent.
(D) Yes, all conditions have been met.
(E) Cannot be determined from the given information.
624
11. A political polling organization conducted a survey by selecting 100 random samples, each
consisting of 500 registered voters. The registered voters in each sample were asked whether they
planned to vote for the Republican or Democratic candidate in the next presidential election. For
each of the 100 samples, the polling organization created a 99 percent confidence interval for the
proportion of all registered voters who planned to vote for the Republican candidate. Which of the
following statements is the best interpretation of the 99 percent confidence level?
(A) We would expect about 99 of the 100 confidence intervals to contain the sample proportion of
the registered voters who plan to vote for the Republican candidate.
(B) We would expect about 99 of the 100 confidence intervals to contain the proportion of all
registered voters who plan to vote for the Republican candidate.
(C) We would expect the margin of error to be 0.01 because the polling organization constructed 99
percent confidence intervals.
(D) We would expect the margin of error to be less than 0.01 because the polling organization
constructed 100 different 99 percent confidence intervals.
(E) We would expect only 1 of the 100 confidence intervals to reveal that the Democratic candidate
is favored.
12. A pharmaceutical company claims that side effects will be experienced by fewer than 20% of the
patients who use medication X. A clinical trial with a random sample of 400 patients was conducted
in which half of the patients were randomly assigned to take medication X and the other half
received a placebo. Of those that received medication X they find 68 who experienced side effects.
Assuming all conditions for inference were met, which of the following is a 90 percent confidence
interval for the proportion of patients that will experience side effects while taking medication X?
(A) (0.1391, 0.2009)
(B) (0.2744, 0.4057)
(C) (0.2849, 0.3951)
(D) (0.6257, 0.7343)
(E) (0.6416, 0.7184)
625
13. Household income in the United States is strongly skewed to the right. The current presidential
administration claims that the mean household income is greater than it has ever been in the past.
An independent contractor will obtain a random sample of 100 households in the United States and
will calculate the mean household income. Which of the following statements is true?
(A) The sampling distribution of the sample mean household income is approximately normal
because the sample size of 100 is greater than 30.
(B) The distribution of household income for the sample is approximately normal because the
sample size of 100 is greater than 30.
(C) The sampling distribution of the sample mean household income is strongly skewed to the right
because the population standard deviation is unknown.
(D) The distribution of household income is strongly skewed to the left because the population
sample size of 100 is greater than 30.
(E) The sampling distribution of the sample mean household income is strongly skewed to the right
because the population distribution is strongly skewed to the right.
14. In a month-long randomized comparative experiment, participants were assigned to one of two
treatments. 35 of the 50 participants who received the first treatment, a fitness watch and training on
how to meet their daily goals as measured by the watch, lost weight. However, only 16 of the 40
participants who received the second treatment, instruction to recite positive affirmations each
morning, lost weight. Which of the following is an appropriate margin of error for a 99 percent
confidence interval to estimate the difference in the population proportion of all users of each
treatment who would lose weight?
(..;)(..:)
(A) 2.5764
(B) 1.9604
5.
(..;)(..:)
5.
+
(..-)(..,)
+
(..-)(..,)
-.
-.
(..5+)(..-=)
(C) 2.5764
+..
(..5+)(..-=)
(D) 1.9604
+..
(..;)(..:)
(E) 2.576 >
5.?
+
(..-)(..,)
-.?
@
626
15. As the temperature rises in Chicago, does the crime rate also rise? Using data available from the
Chicago Police Department, an interested citizen recorded the high temperature and number of
crimes reported for 8 randomly selected days.
Temperature ℉
17
35
46
55
64
78
84
89
Number of Crimes
56
60
66
70
71
78
74
76
The citizen wants to find a confidence interval that can be used to estimate the number of additional
crimes that can be expected to be reported for each degree that the daily high temperature
increases with 95% confidence. Which of the following is the most appropriate procedure for such an
investigation?
(A) A chi-square test of association
(B) A linear regression t-interval for slope
(C) A one-sample t-interval for a mean
(D) A two-sample t-interval for a difference of means
(E) A one-sample z-interval for a proportion
16. A large-sample 95 percent confidence interval for the proportion of credit card customers that
have reported fraudulent charges on their account is (0.028, 0.086). What is the point estimate for
the proportion of all credit card customers that have reported fraudulent charges on their account?
(A) 0.057
(B) 0.058
(C) 0.029
(D) 0.114
(E) 0.196
627
17. A writer for a career magazine is working on an article about the projected career earnings for
college graduates in various fields. He selects a random sample of 25 educators and surveys them
about their current salary, the number of years they have worked in the field, and their pay raise
structure. Based upon this information he computes the average projected career earnings for
graduates with degrees in education to be $2.5 million dollars with a standard deviation of 0.4
million dollars. Assuming all conditions for inference are met, which of the following is a 90 percent
confidence interval for the mean projected career earnings for graduates with degrees in education?
(A) 2.5 ± 1.711
..-
√75
(B) 2.5 ± 1.645(0.04)
(C) 2.5 ± 1.960
(D) 2.5 ± 1.645
..-
√75
..√75
(E) 2.5 ± 1.960(0.04)
18. A recent article claimed that women are waiting longer to have their first child. The article
estimates that the average age of first-time mothers is 26 years old, which is up from 21 years old in
1970. The margin of error for the estimate was 1.5 years. Based on the estimate and the margin of
error, which of the following is an appropriate conclusion?
(A) 95% of the women in the study were 26 years old when they had their first child.
(B) The age of every first-time mother in the sample must have been between 24.5 and 27.5 years
old.
(C) The age of every first-time mother in the sample must have been between 23 and 29 years old.
(D) This study proves that women are waiting longer to have their first child.
(E) It is plausible that the average age of first-time mothers is 27 years old.
628
19. James has a desk job and would like to become more fit, so he purchases a tread walker and a
standing desk which will allow him to walk at a slow pace as he works. However, he is concerned that
standing and walking while working may cause his productivity to decline. After working this way for
6 months he takes a simple random sample of 15 days. He records how long he walked that day (in
hours) as recorded by his fitness watch as well as his billable hours for that day as recorded by a work
app on his computer.
Regression Analysis: Billable hours versus Walk time
Predictor
Constant
Walk time
Coef
7.785
–0.245
S = 5.051
SE Coef
0.542
0.205
R-Sq = 84.3%
T
14.363
–1.195
P
0.000
0.127
R-Sq(adj) = 82.4%
Assuming that all conditions for inference are met, which of the following is a 95 percent confidence
interval for the average change in the number of billable hours for each increase of 1 hour spent
walking?
(A) –0.245 ± 1.960(0.205)
(B) –0.245 ± 2.131(0.205)
(C) –0.245 ± 2.160(0.205)
(D) 7.785 ± 1.960(0.542)
(E) 7.785 ± 2.160(0.542)
20. A curious student wanted to determine if there was a difference in the average price of a quarter
pound hamburger in the United States and Japan. The student randomly selected 15 McDonald’s
restaurants in the United States and 10 McDonald’s restaurants in Japan and recorded the prices of
their quarter pound hamburgers. Prices of the quarter pound hamburgers in Japan were converted
to U.S. dollars. The data are summarized in the table below.
Sample mean
Sample standard deviation
Sample size
U.S.
4.53
0.24
15
Japan
4.01
0.38
10
Calculate a 99 percent confidence interval for the difference in mean price for a quarter pound
hamburger in the United States and Japan.
(A) (4.53 − 4.01) ± 3.0124
..7- ?
(B) (4.53 − 4.01) ± 3.0124
..7- ?
+5
+5
+
−
..:B?
+.
..:B?
+.
(..7-)(..;,)
(C) (0.24 − 0.38) ± 3.012 >
+5?
(D) (0.24 − 0.38) ± 3.0124
(..7-)(..;,)
(E) (4.53 − 4.01) ± 3.0124
(..7-)(..;,)
+5
+5
+
(..:B)(..,7)
+
(..:B)(..,7)
+
(..:B)(..,7)
+.?
@
+.
+.
629
Unit 7 Practice Free Response Question
An online streaming service providing television programs claims that a 30-minute program will
stream with advertisements that average 45 seconds. A consumer advocacy group is investigating to
see if this claim is true. They recorded the times of 21 randomly selected advertisements. The times
are listed below:
Time
(seconds) 45 45 43 50 50 45 43 50 45 49 46 48 42 46 44 52 48 45 46 50 48
The mean and standard deviation for these times are 46.67 seconds and 2.78 seconds respectively.
Do these data provide convincing evidence that the true mean advertisement length is longer than
45 seconds?
630
Unit 7 Multiple Choice
1. A Statistics class from a high school with 4,000 students took a survey of the first 35 students who
walked through the front door of the school, and asked how far they traveled to school that day. The
class plans to run a one-sample t-test to determine if the average travel distance has increased since
last year. The students notice that the sample data are right skewed. Which conditions have been
satisfied for the t-test?
I.
II.
III.
The sample is from a random sample or randomized experiment.
The sampling distribution of sample means is approximately normal.
The sample size is small relative to the population.
(A) III
(B) I and II
(C) I and III
(D) II and III
(E) I, II, and III
2. An advertiser wants to find convincing evidence that television viewers remember more than 4
commercials, on average, after watching a 30 minute TV program. They take a random sample of
100 television viewers and ask them how many commercials they could remember after watching a
30 minute TV program. The appropriate t-test was conducted, which resulted in a P-value of 0.15.
Assuming all conditions were met, which of the following is an appropriate conclusion?
(A) Because the P-value is less than 0.05, at the 5% significance level, there is not convincing
evidence that television viewers remember more than 4 commercials, on average, after watching a
30 minute TV program.
(B) Because the P-value is greater than 0.05, at the 5% significance level, there is convincing
evidence that television viewers remember fewer than 4 commercials, on average, after watching a
30 minute TV program.
(C) Because the P-value is greater than 0.01, at the 1% significance level, there is not convincing
evidence that television viewers remember more than 4 commercials, on average, after watching a
30 minute TV program.
(D) Because the P-value is greater than 0.01, at the 1% significance level, there is convincing
evidence that television viewers remember exactly 4 commercials, on average, after watching a 30
minute TV program.
(E) Because the P-value is less than 0.01, at the 1% significance level, there is convincing evidence
that television viewers remember fewer than 4 commercials, on average, after watching a 30 minute
TV program.
631
3. After completing a statistical analysis of a survey of 40 students, the principal of North High
School made the following conclusion: reject the null hypothesis; there is convincing evidence that
more than 50% of students support a schedule change to have lunch occur earlier in the day. Which
error could have been committed?
(A) Type I error: Conclude that more than 50% of students want earlier lunch, when 50% or less want
earlier lunch.
(B) Type I error: Conclude that more than 50% of students want earlier lunch, when more than 50%
want earlier lunch.
(C) Type II error: Fail to reject that 50% of students want earlier lunch, when more than 50% want
earlier lunch.
(D) Type II error: Fail to reject that 50% of students want earlier lunch, when 50% or less want earlier
lunch.
(E) Type II error: Fail to reject that more than 50% of students want earlier lunch, when 50% or less
want earlier lunch.
4. A significance test was conducted using the hypotheses !" : $% − $' = 0, !+ : $% − $' < 0 where
$% is the true mean number of fouls called during games played at neutral sites and $' is the true
mean number of fouls called during games played at the home team’s stadium with a resulting Pvalue of 0.24. Which of the following is an accurate interpretation of this P-value?
(A) There is a 24% probability that the null hypothesis is true.
(B) If the test were repeated many times, we would correctly reject the null hypothesis 24% of the
time.
(C) If the test were repeated many times, we would incorrectly fail to reject the null hypothesis 24%
of the time.
(D) If the null hypothesis is true, there is a 24% probability of getting a sample difference in means as
far or farther below 0 as the difference found in the samples.
(E) If the null hypothesis is false, there is a 24% probability of getting a sample difference in means
as far or farther below 0 as the difference found in the samples.
632
5. Is the proportion of adults who watch the nightly news dropping? In a survey taken in 2013, 24 out
of 40 adults surveyed responded that they had watched the local TV news at least once in the last
month. In a similar survey in 2010, 40 out of 50 adults said they had watched the local TV news at
least once in the last month. Is this convincing evidence that the proportion of adults watching the
local TV news dropped between 2010 and 2013? For a significance test with the hypotheses !" : -. −
-/ = 0, !+ : -. − -/ < 0, where -. is the proportion of all adults who watched the local news at least
once a month in 2013, and -/ is the proportion for 2010, which of the following is closest to the
standard error for the test statistic?
(A)1
".3..(.5".3..)
7"
(B) 1
".:(.5".:)
(C) 1
(D)1
7"
+
+
9"
".:(.5".:)
(E) 1
+
<"
7"
7"
9"
".;(.5".;)
".3..(.5".3..)
".:(.5".:)
".3..(.5".3..)
+1
.
".3..(.5".3..)
<"
".;(.5".;)
9"
9"
.
.
".;(.5".;)
−1
.
.
6. A random sample of 95 vehicles is taken from a large parking lot at an office park. Below is the
type of each vehicle, and whether it is owned or leased by the driver. A chi-square test will be
conducted to determine if there is an association between type of vehicle and method of obtaining
the vehicle. Which of the following expressions represents the expected count of leased SUVs?
Owned
Leased
Car
29
21
SUV
20
10
Truck
11
4
(A) 10
(B)
(C)
(D)
(E)
."
<9
(=9)(=")
<9
(."5....)>
....
(....5.")>
."
633
7. A random sample of 300 balls hit into play in college baseball games is taken. For each ball hit
into play, it is recorded whether the ball was hit to the left, center, or right of the field, and whether
the ball resulted in a hit or an out. Which of the following is the appropriate null hypothesis to test if
there is a relationship between the direction of the ball and if it resulted in a hit or an out?
(A) !" : The proportion of balls hit to the left, right, and center is the same.
(B) !" : The proportion of balls that result in hits and outs is the same.
(C) !" : The distribution of the direction of the ball is the same for the population of hits and the
population of outs.
(D) !" : The direction of the ball and whether it results in a hit or an out are independent.
(E) !" : There is an association between the direction of the ball and whether it results in a hit or an
out.
8. Do stain-polyurethane mixes protect wood as well as stain and polyurethane applied in separate
coats? Five types of wood will be used, with two boards of each type of wood. One board of each
type of wood (randomly selected) will have the stain-polyurethane mix applied to it, and the other
will have stain and polyurethane applied in separate coats. Each board will have water poured on it,
and the amount of water retained will be measured. Which significance test is most appropriate?
(A) A chi-square test for goodness of fit
(B) A one-sample z-test for a proportion
(C) A matched-pair t-test for a mean difference
(D) A two-sample t-test for a difference between two means
(E) A two-sample z-test for a difference between two proportions
9. An analysis of 8 used trucks listed for sale in the 48076 zip code finds that the power model
?@(-ABCD
E ) = 3.748 − 0.1395?@(MN?DO), for price (in thousands of dollars) and miles driven (in
thousands), is an appropriate model of the relationship. If a used truck has been driven for 47,000
miles, which of the following is closest to the predicted price for the truck?
(A) $9.46
(B) $24.80
(C) $3,210.00
(D) $9,460.00
(E) $24,800.00
634
10. Below is computer output from the least squares regression analysis on the body mass index,
BMI, and percent body fat for 10 randomly selected adult males. Which of the following represents
the 95% confidence interval for the slope of the regression line relating BMI and percent body fat for
the population of adult males?
Predictor
Constant
BMI
Coef
-20.096
1.695
SE Coef
2.786
0.2280
S = 3.195
R-Sq = 87.3%
R-Sq(adj)=86.8%
T
-7.213
7.432
P
0.000
0.000
(A) −20.096 ± 2.21(2.786)
(B) 1.695 ± 7.432(0.2280)
(C) 1.695 ± 2.306(0.2280)
(D) 1.695 ± 2.262(0.2280)
(E) 1.695 ± 2.228(0.2280)
11. A 20-ounce soda bottle is supposed to contain 20 ounces of soda. The distribution of the actual
amount of soda in a 20-ounce soda bottle is approximately normal. A curious student randomly
selects fifteen 20-ounce soda bottles from various retailers and carefully measures their content to
see if the soda company is cheating the customers. The mean and standard deviation of the fifteen
bottles is 19.4 ounces and 0.25 ounces, respectively. Which of the following is the test statistic for
the appropriate test to determine if the average amount of soda that is contained in their 20-ounce
soda bottles is significantly less than 20-ounces?
(A) S =
(B) X =
(C) S =
(D) X =
(E) X =
.<.75/"
T.>U
√WU
.<.75/"
1(T.>U)(WYT.>U)
WU
/"5.<.7
Z("./9)(.5"./9)
/"5.<.7
T.>U
√WU
.<.75/"
W W
]
WU >T
1("./9)(".39)[ \
635
12. A study conducted by a professor at Waynesburg University investigated whether time
perception is impaired when completing the last example problem just prior to the start of fall break.
The professor timed how long it took to complete the last example problem (in seconds), then asked
a random sample of 10 students to estimate how long it took to complete the problem (in seconds).
Let $ represent the average difference in time (actual time – estimated time) it took to complete the
problem for all students. She would like to investigate whether there will be convincing statistical
evidence that the students overestimate the time it took to complete the problem, on average.
Which of the following is the correct hypotheses for this test?
(A) H0: $
(B) H0: $
(C) H0: $
(D) H0: $
(E) H0: $
= 0 and Ha: $
= 0 and Ha: $
= 0 and Ha: $
> 0 and Ha: $
< 0 and Ha: $
≠0
>0
<0
<0
=0
13. A 21-year old college student submits a photo to the website “Guess My Age”. At a later date
he checks back and thousands of users have made guesses about his age. Of the following, which is
the best procedure to investigate whether there is convincing statistical evidence that, on average,
he is perceived to be less than 21 years old?
(A) One sample t-test for a mean
(B) One sample z-test for a proportion
(C) Matched-pairs t-test for a mean difference
(D) Two-sample t-test for the difference between two means
(E) A chi-square test of association
14. Students in a statistics class would like to investigate if more than two-thirds of the Earth is water.
To answer this question, students tossed an inflatable globe back and forth. After each catch, the
student recorded whether the tip of their pointer finger of their right hand was on water or land. In
50 tosses, their finger was on water 38 times. Assuming all conditions for inference are met, do the
data provide convincing statistical evidence at the significance level of ^ = 0.05 that more than twothirds of the Earth is water?
(A) Yes, because the p-value of 0.76 is greater than the significance level of 0.05.
(B) Yes, because the p-value of 0.08 is greater than the significance level of 0.05.
(C) Yes, because the p-value of 0.05 is less than the significance level of 0.08.
(D) No, because the p-value of 0.08 is greater than the significance level of 0.05.
(E) No, because the p-value of 0.03 is less than the significance level of 0.05.
636
15. According to the National Association of Colleges and Employers, the mean salary for a new
college graduate is $45,327. A small college wants to know if the mean salary of their most recent
graduates is greater than $45,327. A random sample of 10 recent graduates from the small college
was selected and the mean and standard deviation of the salary for those graduates was found. With
all conditions for inference met, a significance test was conducted and a p-value of 0.045 was
obtained. Which of the following statements is the most appropriate conclusion using a significance
level of α = 0.05?
(A) There is convincing statistical evidence that the mean salary for all recent college graduates from
the small college is greater than $45,327.
(B) There is convincing statistical evidence that the mean salary for the sample of 10 college
graduates from the small college is greater than $45,327.
(C) There is not convincing statistical evidence that the mean salary for all recent college graduates
from the small college is greater than $45,327.
(D) There is not convincing statistical evidence that the mean salary for the sample of 10 college
graduates from the small college is greater than $45,327.
(E) There is not convincing statistical evidence that the graduates from this college will always earn
more than $45,327.
16. A financial analyst wants to investigate the relationship between the annual starting salary of new
employees at a large firm versus years of education the employee has (beyond high school). The
analyst selected a random sample of 18 new employees and recorded their starting salary and how
many years of education they have beyond high school. The computer output of an analysis of salary
versus years of education is shown in the table.
Regression Analysis: Salary (in $1,000) versus Education
Predictor
Constant
Education
Coef
25.840
13.4862
SE Coef
3.803
0.5310
T
***
***
P
***
***
S = 5.051
R-Sq
84.3%
= 82.4%is the appropriate test
Assuming that all conditions
for inference
are=met,
which R-Sq(adj)
of the following
statistic for testing the null hypothesis that the slope of the population regression line equals 0?
(A) 0
(B) 0.163
(C) 1
(D) 1.54
(E) 25.40
637
17. At the county fair there are many game booths. One booth has a prize
wheel with 8 equal sectors: 2 of which are red, 2 of which are white, 2 of
which are yellow, and 2 of which are green. It costs $1 to spin the wheel.
There are 4 possible “prizes”. If the player lands on one of the green
sectors, they win $5. If they land on a yellow sector, they win a small emoji
stuffed toy. If they land on a white sector, they get a ticket to ride the Ferris
wheel. If they land on a red sector, they win nothing. Because you suspect
that the wheel is not fair, you watch a random sample of 100 people play
the game. Here are the findings:
Color
Red
White
Green
Yellow
Observed
48
22
5
25
Expected
25
25
25
25
A chi-square goodness of fit test was conducted to determine whether the data provide convincing
evidence that the wheel is not fair. The test statistic was 37.52. Which statement is true?
(A) At the significance level ^ = 0.05, the data do not provide convincing evidence that the wheel is
not fair.
(B) At the significance level ^ = 0.05, the data provide convincing evidence that the wheel is not fair.
(C) No valid conclusion can be made because the observed frequency for one cell is 5.
(D) The chi-square statistic has 100 – 1 = 99 degrees of freedom.
(E) The wheel may not be fair, but the game is fair because all of the expected values are equal.
18. A credit card company claims that the mean time the customers spend on hold is 3.5 minutes.
An employee of this company believes that customers spend more than 3.5 minutes on hold. A
random sample of n = 36 calls is selected and the mean time the customers in this sample spent on
hold was 5.15 minutes. All conditions for inference were met and the p-value for the appropriate
hypothesis test was 0.031. Which of the following statements is the best interpretation of the pvalue?
(A) There is a 0.031 probability that the alternative hypothesis is true.
(B) There is a 0.031 probability that the null hypothesis is true.
(C) If the null hypothesis is true, there is a 0.031 probability of finding convincing evidence that 5.15
minutes is greater than 3.5 minutes.
(D) If the null hypothesis is true, the probability of finding convincing evidence that the null
hypothesis is true is 0.031.
(E) If the null hypothesis is true, the probability of observing a sample mean of at least 5.15 minutes
is 0.031.
638
19. A machine that makes dimes is adjusted to make them at diameter 1.9 cm. Production records
show that when the machine is properly adjusted, it will make dimes with a mean diameter of 1.9 cm
and with a standard deviation of 0.1 cm. During production, an inspector checks the diameters of
dimes to see if the machine has slipped out of adjustment. A random sample of 64 dimes is
selected. The inspector would like to detect if the true mean diameter happens to reach 1.95 cm at a
significance level of a = 0.01. He determines the power of this test to be 0.9228. What is the
probability that the inspector will make a Type II Error?
(A) 0.01
(B) 0.0385
(C) 0.05
(D) 0.0772
(E) 0.9228
20. An orange juice manufacturer advertises a new orange juice product that contains 50% less
sugar. This new product is expected to increase profits substantially because they create the 50%
less sugar product by replacing 50% of the juice with water while selling it for the same price. An
inspector wants to investigate whether the actual percentage of water in the product is within 5% of
the intended 50%. He selects a random sample of 500 containers of orange juice and determined
the percentage of each that is water. He plans to conduct a significance test at the ^ = 0.01 level to
see if there is convincing evidence that the proportion of water added is different from 0.50. What is
the probability that the inspector makes a Type I error?
(A) 0.01
(B) 0.05
(C) 0.10
(D) 0.45
(E) 0.50
639