Stat 301 – Lecture 7 Simple Linear Regression Question Is annual carbon dioxide concentration related to annual global temperature? 1 Bivariate Fit of Temp By CO2 15.0 Temp 14.5 14.0 13.5 300 350 400 CO2 Linear Fit 2 Linear Fit yˆ ˆ0 ˆ1 x Predicted Temperature = 9.8815 + 0.012584*CO2 3 1 Stat 301 – Lecture 7 R2 (RSquare) 80.6% of the variation in the global temperature can be explained by the linear relationship with carbon dioxide concentration. 19.4% is unexplained. 4 Interpretation There is a fairly strong positive linear relationship between annual carbon dioxide concentration and annual global temperature. 5 Testing Hypotheses Question Is the linear relationship between annual carbon dioxide concentration and annual global temperature statistically significant? 6 2 Stat 301 – Lecture 7 Step 1 - Hypotheses H 0 : 1 0 H A : 1 0 7 Step 2 – Test Statistic ˆ 0 Test Statistic: t Std Error: se( ˆ1 ) This image cannot currently be display ed. 1 se( ˆ1 ) MS Error 2 x x df n 2 8 Step 2 – Test Statistic ˆ1 0.012584 se( ˆ1 ) 0.001456 ˆ 0 0.012584 8.64 t 1 se( ˆ ) 0.001456 This image cannot currently be display ed. 1 P - value 0.0001 9 3 Stat 301 – Lecture 7 Step 3 - Decision Reject the null hypothesis because the P-value is so small (smaller than 0.05). 10 Step 4 – Conclusion Based on our sample data, there is a statistically significant linear relationship between annual carbon dioxide concentration and annual global temperature. 11 Confidence Interval ˆ1 t * se ˆ1 t * from t - table with df n 2 95% confidence, df 18, t * 2.101 12 4 Stat 301 – Lecture 7 Confidence Interval ˆ1 t * se ˆ1 0.012584 2.1010.001456 0.012584 0.003059 0.0095 to 0.0156 13 Interpretation – Part 1 The population slope parameter relating CO2 to temperature could be any value between 0.0095 and 0.0156. Units are o C/ppmv. 14 Interpretation – Part 2 We are 95% confident that intervals based on random samples from the population with capture the actual population slope parameter. This is confidence in the process. 15 5 Stat 301 – Lecture 7 Y-Intercept When the Y-intercept is interpretable within the context of the problem, it may be of interest to do inference based on the estimated yintercept, ̂ 0 . 16 Test of Hypotheses H 0 : 0 0 : 0 0 ˆ 0 0 t se ( ˆ ) H A 0 df n 2 17 Confidence Interval ˆ0 t * se( ˆ0 ) t * from t - table with df n 2 18 6