CHM 103 Sinex How do you determine the best-fit line through data points? The Best-Fit Line y-variable Linear Regression Fortunately technology, such as the graphing calculator and Excel, can do a better job than your eye and a ruler! x-variable PGCC CHM 103 Sinex The Equation of a Straight Line y = mx + b where m is the slope or ∆y/∆x and b is the y-intercept Linear regression minimizes the sum of the squared deviations y = mx + b y-variable deviation = residual = y data point – y equation In some physical settings, b = 0 so the equation simplifies to: y = mx PGCC CHM 103 Sinex Linear Regression • Minimizes the sum of the square of the deviations for all the points and the best-fit line • Judge the goodness of fit with r2 • r2 x100 tells you the percent of the variation of the y-variable that is explained by the variation of the xvariable (a perfect fit has r2 = 1) x-variable PGCC CHM 103 Sinex Goodness of Fit: Using r 2 r2 is low y-variable r2 is high How about the value of r2 ? x-variable PGCC CHM 103 Sinex PGCC CHM 103 Sinex 1 CHM 103 Sinex Noisy indirect relationship Strong direct relationship y = 2.0555x - 0.1682 R2 = 0.9909 20 y-variable y-variable 25 15 10 5 0 0 2 4 6 8 10 y = -2.2182x + 25 R2 = 0.8239 30 25 20 15 10 5 0 0 2 x-variable 8 10 Only 82% of the y-variation is due to the variation of the x-variable - what is the other 18% caused by? PGCC CHM 103 Sinex PGCC CHM 103 Sinex In Excel When there is no trend! 20 y-variable 6 x-variable 99.1% of the y-variation is due to the variation of the x-variable 15 10 5 R2 = 0.0285 0 0 4 2 4 6 8 10 x-variable • When the chart is active, go to chart, and select Add Trendline, choose the type and on option select display equation and display r2 • For calibration curves, select the set intercept = 0 option Does this make physical sense? No relationship! PGCC CHM 103 Sinex PGCC CHM 103 Sinex absorbance Does the set intercept = 0 option make a difference? The equation becomes Calibration Curve 1 0.8 y = 0.8461x + 0.0287 2 R = 0.9954 0.6 0.4 A = mc or A = 0.89c y = 0.8888x 2 R = 0.9911 0.2 0 0 0.2 0.4 0.6 concentration 0.8 1 Using the set intercept = 0 option lowers the r2 value by a small amount and changes the slope slightly PGCC CHM 103 Sinex PGCC CHM 103 Sinex 99.1% of the variation of the absorbance is due to the variation of the concentration. 2