The Best-Fit Line

CHM 103
Sinex
How do you determine the best-fit line through data points?

Linear Regression
Fortunately, technology such as the graphing calculator and Excel can do a better job than your eye and a ruler!
[Figure: plot of y-variable versus x-variable]
PGCC CHM 103
Sinex
The Equation of a Straight Line
y = mx + b
where m is the slope (∆y/∆x) and b is the y-intercept.
In some physical settings, b = 0, so the equation simplifies to:
y = mx

Linear regression minimizes the sum of the squared deviations of the data points from the line y = mx + b, where
deviation = residual = y(data point) - y(equation)
[Figure: y-variable versus x-variable, showing the deviation of a data point from the line y = mx + b]
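As a rough illustration of what "minimizing the sum of the squared deviations" means, the sketch below fits a line using the standard closed-form least-squares formulas for m and b. The function names and the data values are made up for illustration and do not come from the slides.

```python
def fit_line(xs, ys):
    """Least-squares slope m and intercept b for y = m*x + b."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Slope: sum of (x - x_mean)(y - y_mean) over sum of (x - x_mean)^2
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    m = sxy / sxx
    # The least-squares line passes through the point (x_mean, y_mean)
    b = y_mean - m * x_mean
    return m, b


def sum_squared_deviations(xs, ys, m, b):
    """Sum of (y data point - y equation)^2 for the line y = m*x + b."""
    return sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys))


# Made-up illustrative data (not taken from the slides).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

m, b = fit_line(xs, ys)
best = sum_squared_deviations(xs, ys, m, b)
print(f"y = {m:.4f}x + {b:.4f}, sum of squared deviations = {best:.4f}")

# Nudging the slope or intercept away from the fitted values
# should only make the sum of squared deviations larger.
print(sum_squared_deviations(xs, ys, m + 0.1, b) > best)
print(sum_squared_deviations(xs, ys, m, b + 0.1) > best)
```

Any other choice of m and b gives a larger sum of squared deviations for the same data, which is why this line is called the best fit.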
Linear Regression
• Minimizes the sum of the squares of the deviations between all the points and the best-fit line
• Judge the goodness of fit with r²
• r² × 100 tells you the percent of the variation of the y-variable that is explained by the variation of the x-variable (a perfect fit has r² = 1)
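One common way to compute r² for a fitted line is 1 minus the ratio of the squared deviations from the line to the squared deviations from the mean of y. The sketch below assumes that definition; the function name, the data, and the line coefficients are hypothetical and are not taken from the slides.

```python
def r_squared(xs, ys, m, b):
    """r^2 = 1 - SS_res / SS_tot for the line y = m*x + b."""
    y_mean = sum(ys) / len(ys)
    # Variation left over after the fit (squared residuals)
    ss_res = sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys))
    # Total variation of the y-variable about its mean
    ss_tot = sum((y - y_mean) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Made-up data; m = 1.99 and b = 0.05 are the least-squares
# slope and intercept for these particular points.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
r2 = r_squared(xs, ys, 1.99, 0.05)
print(f"r^2 = {r2:.4f}, so about {r2 * 100:.1f}% of the y-variation is explained")

# Points that lie exactly on y = 2x + 1 give a perfect fit.
perfect_ys = [2 * x + 1 for x in xs]
r2_perfect = r_squared(xs, perfect_ys, 2.0, 1.0)
print(r2_perfect)
```

Multiplying the result by 100 gives the "percent of variation explained" reading described in the bullet above.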
Goodness of Fit: Using r²
How about the value of r²?
[Figure: y-variable versus x-variable; labels: "r² is low", "r² is high"]
Strong direct relationship
[Figure: y-variable versus x-variable; trendline y = 2.0555x - 0.1682, R² = 0.9909]

Noisy indirect relationship
[Figure: y-variable versus x-variable; trendline y = -2.2182x + 25, R² = 0.8239]
Only 82% of the y-variation is due to the variation of the x-variable. What is the other 18% caused by?
In the strong direct relationship, 99.1% of the y-variation is due to the variation of the x-variable.

When there is no trend!
[Figure: y-variable versus x-variable; R² = 0.0285]
No relationship!

In Excel
• When the chart is active, go to Chart and select Add Trendline, choose the type, and under Options select display equation and display r²
• For calibration curves, select the set intercept = 0 option. Does this make physical sense?
Does the set intercept = 0 option make a difference?
[Figure: Calibration Curve, absorbance versus concentration; trendline with fitted intercept: y = 0.8461x + 0.0287, R² = 0.9954; trendline with intercept set to 0: y = 0.8888x, R² = 0.9911]
The equation becomes A = mc, or A = 0.89c.
Using the set intercept = 0 option lowers the r² value by a small amount and changes the slope slightly.
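The effect of forcing the intercept to zero can be sketched by fitting the same points both ways. The example below uses the usual through-origin formula m = Σxy / Σx² and computes r² as 1 minus the ratio of residual to total variation in both cases. That r² definition is an assumption, since the slides do not say how Excel calculates r² when the intercept is set to 0. The data values are invented for illustration and are not read from the calibration plot.

```python
def fit_with_intercept(xs, ys):
    """Least-squares m and b for y = m*x + b."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    m = sxy / sxx
    return m, y_mean - m * x_mean


def fit_through_origin(xs, ys):
    """Least-squares m for y = m*x (intercept forced to 0)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


def r_squared(xs, ys, m, b):
    """Assumed definition: 1 - SS_res / SS_tot about the mean of y."""
    y_mean = sum(ys) / len(ys)
    ss_res = sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - y_mean) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Invented calibration data: concentration (c) and absorbance (A).
conc = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]
absorbance = [0.03, 0.19, 0.38, 0.54, 0.70, 0.88]

m_free, b_free = fit_with_intercept(conc, absorbance)
m_zero = fit_through_origin(conc, absorbance)

r2_free = r_squared(conc, absorbance, m_free, b_free)
r2_zero = r_squared(conc, absorbance, m_zero, 0.0)

print(f"fitted intercept: A = {m_free:.4f}c + {b_free:.4f}, r^2 = {r2_free:.4f}")
print(f"intercept = 0:    A = {m_zero:.4f}c,          r^2 = {r2_zero:.4f}")
```

Because the line with a fitted intercept already minimizes the sum of squared deviations, constraining b = 0 cannot reduce that sum, so under this definition r² can only stay the same or drop.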
99.1% of the variation of the absorbance is due to the variation of the concentration.