The Best-Fit Line Linear Regression

advertisement
The Best-Fit Line
Linear Regression
How do you determine the
best-fit line through data
points?
y-variable
PGCC CHM 103
Sinex
Fortunately technology,
such as the graphing calculator
and Excel, can do a better
job than your eye and a ruler!
x-variable
The Equation of a Straight Line
y = mx + b
where m is the slope or Dy/Dx and
b is the y-intercept
In some physical settings, b = 0 so the
equation simplifies to:
y = mx
PGCC CHM 103
Sinex
Linear regression minimizes
the sum of the squared deviations
y = mx + b
y-variable
deviation = residual
= ydata point – yequation
PGCC CHM 103
Sinex
x-variable
Linear Regression
• Minimizes the sum of the square of
the deviations for all the points and
the best-fit line
• Judge the goodness of fit with r2
• r2 x100 tells you the percent of the
variation of the y-variable that is
explained by the variation of the xvariable (a perfect fit has r2 = 1)
PGCC CHM 103
Sinex
Goodness of Fit: Using r2
r2 is low
y-variable
r2 is high
How about the value of r2?
x-variable
PGCC CHM 103
Sinex
Strong direct relationship
y-variable
25
y = 2.0555x - 0.1682
R2 = 0.9909
20
15
10
5
0
0
2
4
6
8
x-variable
99.1% of the y-variation is due to
the variation of the x-variable
PGCC CHM 103
Sinex
10
y-variable
Noisy indirect relationship
y = -2.2182x + 25
R2 = 0.8239
30
25
20
15
10
5
0
0
2
4
6
8
10
x-variable
Only 82% of the y-variation is due to
the variation of the x-variable - what
is the other 18% caused by?
PGCC CHM 103
Sinex
When there is no trend!
y-variable
20
15
10
5
R2 = 0.0285
0
0
2
4
6
x-variable
No relationship!
PGCC CHM 103
Sinex
8
10
In Excel
• When the chart is active, go to
chart, and select Add Trendline,
choose the type and on option select
display equation and display r2
• For calibration curves, select the set
intercept = 0 option Does this make
physical sense?
PGCC CHM 103
Sinex
absorbance
Does the set intercept = 0 option
make a difference?
Calibration Curve
1
y = 0.8461x + 0.0287
0.8
R2 = 0.9954
0.6
0.4
y = 0.8888x
R2 = 0.9911
0.2
0
0
0.2
0.4
0.6
concentration
0.8
Using the set intercept = 0 option
lowers the r2 value by a small amount
and changes the slope slightly
PGCC CHM 103
Sinex
1
The equation becomes
A = mc
or
A = 0.89c
PGCC CHM 103
Sinex
99.1% of the variation of the
absorbance is due to the
variation of the concentration.
Download