Simple Linear Regression Question Is annual carbon dioxide concentration related to annual global temperature? 1 Simple Linear Regression Response variable, Y. Annual global temperature (o C). Explanatory (predictor) variable, x. Annual atmospheric CO2 concentration. 2 Regression model Y y| x •Y represents a value of the response variable. • y| x represents the population mean response for a given value of the explanatory variable, x. • represents the random error 3 Linear Model y| x 0 1 x 0 The Y-intercept parameter. 1 The slope parameter. 4 Conditions The relationship is linear. The random error term, , is Independent Identically distributed Normally distributed with standard deviation, . 5 y| x 0 1 x 6 15.0 Temp 14.5 14.0 13.5 300 350 CO2 400 7 Describe the plot. Direction – positive/negative. Form – linear/non-linear. Strength. Unusual points? 8 CO2 and Temperature. There is a fairly strong, positive linear relationship between CO2 and temperature. Larger (smaller) values of CO2 are associated with larger (smaller) values of temperature. 9 Method of Least Squares Find estimates of 0 and 1 such that the sum of squared vertical deviations from the estimated straight line is the smallest possible. 10 residual residual 11 Least Squares Estimates Slope: Intercept: Line of Best fit: ˆ1 x x y y x x 2 ˆ ˆ 0 y 1 x ˆy ˆ ˆ x 0 1 12 Bivariate Fit of Temp By CO2 15.0 Temp 14.5 14.0 13.5 300 350 400 CO2 Linear Fit 13 Linear Fit yˆ ˆ0 ˆ1 x Predicted Temperature = 9.8815 + 0.012584*CO2 14 Interpretation Estimated Y-intercept. This does not have an interpretation within the context of the problem. Having no CO2 in the atmosphere is not reasonable given the data. 15 Interpretation Estimated slope. For each additional 1 ppmv of CO2, the annual global temperature goes up o 0.012584 C, on average. 16 Bivariate Fit of Temp By CO2 15.0 Temp 14.5 14.1755 14.0 341.2305 13.5 300 350 400 CO2 Linear Fit 17 How Strong? The strength of a linear relationship can be measured by R2, the coefficient of determination. RSquare in JMP output. 18 How Strong? SS Model R SSTotal 2 0.80145 R 0.806 0.99450 2 19 Interpretation 80.6% of the variation in the global temperature can be explained by the linear relationship with carbon dioxide concentration. 19.4% is unexplained. 20 Interpretation There is a fairly strong positive linear relationship between carbon dioxide concentration and global temperature. Cause and effect? 21 Cause and Effect? There is a strong positive linear relationship between the number of 2nd graders in communities and the number of crimes committed in those communities. 22 Connection to Correlation If you square the correlation coefficient, r, relating carbon dioxide to global temperature you get R2, the coefficient of determination. r R 0.806 0.898 2 23 Connection to Correlation s y ˆ1 r sx s y is the standard deviation of the y values s x is the standard deviation of the x values 24