Statistics Class 8

advertisement
Statistics Class 8
2/13/2013
Quiz 18
Listed below are brain sizes (in 𝑐𝑚3 ) and
Wechsler IQ scores of subjects. Is there
sufficient evidence to conclude that there is a
linear correlation between brain size and IQ
Score? Does it appear that people with larger
brains are more intelligent?
Brain 965
1029
1030
1285
1049
1077
1037
1068
1176
1105
IQ
85
86
102
103
97
124
125
102
114
90
Regression
Given a collection of sample data, the regression equation
𝑦 = 𝑏𝑜 + 𝑏1 𝑥
Algebraically describes the relationship between the two
variables x and y. The graph of the regression equation is
called the regression line (or line of best fit, or least-squares
line).
x is called the predictor variable, or Independent variable.
𝑦 is called the response variable, or dependent variable
Regression
Given a collection of sample data, the regression equation
𝑦 = 𝑏𝑜 + 𝑏1 𝑥
Algebraically describes the relationship between the two
variables x and y. The graph of the regression equation is
called the regression line (or line of best fit, or least-squares
line).
x is called the explanatory variable, or independent variable.
𝑦 is called the response variable, or dependent variable
The regression equation should remind you of
𝑦 = 𝑚𝑥 + 𝑏
Regression
Requirements
1. The sample of paired (x,y) data is a random sample of
quantitative data.
2. Visual examination of the scatterplot shows that the
points approximate a straight line pattern.
3. Outliers can have a strong effect on the regression, so
remove any outliers that are known to be errors. Consider
the effects of any outliers that are not known errors
Regression
Formulas
• 𝑏1 =
𝑛
𝑥𝑦 −( 𝑥)( 𝑦)
𝑛 𝑥 2 −( 𝑥)2
• 𝑏0 =
( 𝑦)
𝑛
𝑥 2 −( 𝑥)( 𝑥𝑦)
𝑥 2 −( 𝑥)2
𝑠𝑦
• 𝑏1 = 𝑟 , where 𝑠𝑦 is the standard deviation of the y values
𝑠𝑥
and 𝑠𝑥 is the standard deviation of the x’s
• 𝑏0 = 𝑦 − 𝑏1 𝑥
Regression
Formulas
• 𝑏1 =
𝑛
𝑥𝑦 −( 𝑥)( 𝑦)
𝑛 𝑥 2 −( 𝑥)2
• 𝑏0 =
( 𝑦)
𝑛
𝑥 2 −( 𝑥)( 𝑥𝑦)
𝑥 2 −( 𝑥)2
𝑠𝑦
• 𝑏1 = 𝑟 , where 𝑠𝑦 is the standard deviation of the y values
𝑠𝑥
and 𝑠𝑥 is the standard deviation of the x’s
• 𝑏0 = 𝑦 − 𝑏1 𝑥
Rounding
Round to 3 significant digits.
Regression
Cost of Pizza
0.15
0.35
1.00
1.25
1.75
2.00
Subway Fare
0.15
0.35
1.00
1.35
1.50
2.00
Regression
Cost of Pizza
0.15
0.35
1.00
1.25
1.75
2.00
Subway Fare
0.15
0.35
1.00
1.35
1.50
2.00
We are using the Pizza/Subway data from last time. Where
pizza is the x variable (explanatory) and the subway fare is
the y variable (response).
1. Put the data into the Calculator.
Regression
Cost of Pizza
0.15
0.35
1.00
1.25
1.75
2.00
Subway Fare
0.15
0.35
1.00
1.35
1.50
2.00
We are using the Pizza/Subway data from last time. Where
pizza is the x variable (explanatory) and the subway fare is
the y variable (response).
1. Put the data into the Calculator.
2. Hit the [STAT] button, then [→], then [4] or select
LinReg(ax+b).
Regression
Cost of Pizza
0.15
0.35
1.00
1.25
1.75
2.00
Subway Fare
0.15
0.35
1.00
1.35
1.50
2.00
We are using the Pizza/Subway data from last time. Where
pizza is the x variable (explanatory) and the subway fare is
the y variable (response).
1. Put the data into the Calculator.
2. Hit the [STAT] button, then [→], then [4] or select
LinReg(ax+b).
3. Skip down to Calculate and hit [ENTER].
For us a is 𝒃𝟏 and b is 𝒃𝟎 .
Regression
Cost of Pizza
0.15
0.35
1.00
1.25
1.75
2.00
Subway Fare
0.15
0.35
1.00
1.35
1.50
2.00
1. Put the data into the Calculator.
2. Hit the [STAT] button, then [→], then [4] or select
LinReg(ax+b).
3. Skip down to Calculate and hit [ENTER].
For us a is 𝒃𝟏 and b is 𝒃𝟎 .
4. Make a scatter plot.
Regression
Cost of Pizza
0.15
0.35
1.00
1.25
1.75
2.00
Subway Fare
0.15
0.35
1.00
1.35
1.50
2.00
1. Put the data into the Calculator.
2. Hit the [STAT] button, then [→], then [4] or select
LinReg(ax+b).
3. Skip down to Calculate and hit [ENTER].
For us a is 𝒃𝟏 and b is 𝒃𝟎 .
4. Make a scatter plot.
5. Press [Y=] with the flashing box in the the 𝑌1 spot, hit the
[VARS] button then [5] then [→] twice so that you are in
the EQ menu. Press [1] or select REGEQ.
6. Press [GRAPH].
Regression
Using the Regression Equation for Predictions
1. Use the regression equation only if the graph of the
regression line on the scatterplot fits the data reasonably
well.
Regression
Using the Regression Equation for Predictions
1. Use the regression equation only if the graph of the
regression line on the scatterplot fits the data reasonably
well.
2. Use the regression equation only if there is a linear
correlation.
Regression
Using the Regression Equation for Predictions
1. Use the regression equation only if the graph of the
regression line on the scatterplot fits the data reasonably
well.
2. Use the regression equation only if there is a linear
correlation.
3. Use the regression equation for predictions that do not go
very far past the sample data.
Regression
Using the Regression Equation for Predictions
1. Use the regression equation only if the graph of the
regression line on the scatterplot fits the data reasonably
well.
2. Use the regression equation only if there is a linear
correlation.
3. Use the regression equation for predictions that do not go
very far past the sample data.
4. If the regression equation does not appear to be useful for
making predictions the best predicted value of a variable
is its point estimate, which is its sample mean.
Regression
Using the Regression Equation for Predictions
1. Use the regression equation only if the graph of the
regression line on the scatterplot fits the data reasonably
well.
2. Use the regression equation only if there is a linear
correlation.
3. Use the regression equation for predictions that do not go
very far past the sample data.
4. If the regression equation does not appear to be useful for
making predictions the best predicted value of a variable
is its point estimate, which is its sample mean.
Try!
The paired value of the Consumer Price Index
(CPI) and the cost of subway fare are listed
below. Find the best predicted cost of Subway
fare when the Consumer Price index is 182.5
CPI
30.2
48.3
112.3
162.2
191.9
197.8
Subway
Fare
0.15
0.35
1.00
1.35
1.50
2.00
Try Some more!
Find the best predicted temperature (in °F) at a
time when cricket chirps 3000 times in one
minute. What is wrong with this predicted
value?
Chirps
882
1188 1104
864
1200 1032
960
900
Temp
(°F)
69.7
93.3
76.3
88.6
71.6
79.6
84.3
82.6
HOMEWORK!!
• 10-3: 1-9 odd, 13-17 odd, 23, 25, 27.
Download