STATISTICS 401B Fall 2014 Laboratory Assignment 7 1. The following data are tensile strength measurements on wire that is used in industrial equipment. The wire is manufactured by drawing the base metal through a die that has a specified diameter. The data consist of tensile strengths of 18 samples from large spools of wire made with each of three dies. In manufacturing processes, attention is often focussed on variabiliy. The variability of the tensile strengths may be just as important as the mean strentghs of the wire produced by the dies. A testing agency wanted to test the hypothesis that there are differences among the three dies with respect to the variability in the tensile strength of the wire produced by each. Use the text file die.txt downloaded from the web page to create an appropriate JMP data table. Tensile Strength (103 psi) Die 1 85.769 86.725 86.725 84.292 87.168 84.513 84.513 86.725 84.513 84.513 83.628 82.692 82.912 83.407 84.734 84.070 82.964 85.337 Die 79.424 81.628 82.692 86.946 86.725 85.619 84.070 85.398 86.946 2 82.912 83.185 86.725 84.070 86.460 83.628 84.513 84.292 85.619 Die 82.423 81.941 81.331 82.205 81.986 82.860 82.641 82.592 83.026 3 81.768 83.078 83.515 82.423 83.078 82.641 82.860 82.592 81.507 (a) Examine evidence that the assumptions about the population distributions required to use Hartley’s test for the above purpose are satisfied or not. (b) Use the statistics output from executing a JMP analysis to conduct Hartley’s test by hand using α = .05. State the null and alternative hypotheses, test statistic, and the rejection region clearly. 1 (c) Extract from the JMP output the results of performing Levene’s test for testing the above hypotheses. State the value of the test statistic, the p-value and your decision based on α = .05 2. A construction company performed a study of the daily gas consumption of 20 homes heated with gas, installed with a new foam insulation material. They set up instruments to record the temperature both inside and outside of the homes over a six-month period of time (Oct-Mar). The average difference in these values over this period and the average daily gas consumption (in killowatt hours, kWh) are recorded. The gas usage and average temperature difference data were used to build a simple linear regression model to predict gas usage with the new insulation. Temperature Difference (◦ F ), x Gas Consumption(kWh), y x (Continued . . .) y (Continued . . .) 20.1 65.3 28.8 94.9 21.1 66.5 29.2 93.9 21.9 67.8 30.6 87.1 22.6 73.2 30.8 84.2 23.4 75.3 32.6 106.6 24.2 81.1 32.4 111.3 24.9 82.2 34.8 100.9 25.1 85.7 35.9 101.9 26.0 90.9 36.0 110.1 27.2 87.4 36.5 119.1 Perform the following computations by hand calculation. Must show work for all your numerical answers. Attach Excel sheets to show your computations for parts b), f), g), and h). Answers to some of the questions below must be provided on separate sheets when necessary. Finally, execute a JMP program to obtain all quantities needed and confirm that all numbers you computed by hand can be obtained from the JMP output by hi-liting and labeling them on the JMP output. Use the insulation.jmp JMP data file and the text file insulation.txt provided. a) Construct a plot (using software known to you or using JMP Graph Builder) which shows the scatter of (x, y) data points with y on the vertical axis. b) Compute the following using Excel: X xi = , X x2i = , X yi = , X yi2 = , X x i yi = c) Use the method of least squares to obtain estimates β̂0 and β̂1 of the parameters in the model y = β0 + β1 x + using the results of part (b). d) Give the least squares prediction equation (which is the equation of the line that yields minimum sum of squared residuals). e) According to this model, what is the expected increase in the average gas consumption (in kWh) associated with a 5◦ F increase in average difference in temperature? f) Compute a table of predicted values ŷ and residuals y − ŷ corresponding to the observed values y. (You may use Excel for this and the next two parts). g) Extend the table in part(f) to include columns y − ȳ and ŷ − ȳ. h) Compute the sums of squares (y − ȳ)2 , (ŷ − ȳ)2 and (y − ŷ)2 using the table in part (g). Explain how these give a decomposition of total variability in gas consumption into two parts and identify the parts. P P P i) What proportion of the total variability in the gas consumption is accounted for by the explanatory variable x (i.e., by the linear regression model)? What does this say about the role of x in predicting y? 2 j) Give the point estimate s2 of σ2 k) Compute the estimated standard error of β̂1 . `) Construct 95% confidence interval for β1 . m) Test the hypothesis H0 : β1 = 0 vs. Ha : β1 6= 0 using a t-statistic. Give your conclusion using α = .05. n) Construct an analysis of variance table using quantities computed in parts (b),(c), and Syy . Include a column for computing the F-statistic to test the hypothesis in part (m). Use the F-tables to determine the rejection region and state your decision. o) Construct a 95% confidence interval for the mean gas consumption for the population of months with an average temperature difference of 25◦ F . p) Construct a 95% prediction interval for the prediction of gas consumption for a month recording a average temperature difference of 25 ◦ F . q) (JMP analysis only) Save columns of predicted values, residuals, and confidence and prediction intervals into the JMP data table. Re-format numbers so that the output is more readable. Journal this table and save as a Word file. Obtain a printed copy to turn in along with the printed copy of the output from the JMP analysis. Make sure the JMP analysis contains the following plots and analyses (in addition to the standard output): i) A plot of the data superimposed by the fitted regression line, the confidence bands, and the prediction interval bands. ii) A plot of the residuals against the corresponding temperature difference (x) values. iii) A plot of the residuals against the corresponding predicted (ŷ) values. iv) A normal probability plot of the residuals. r) Study the plots obtained in (ii), (iii), and (iv) of part (q) above. Use these to comment on the adequacy of the model you fitted to this data. Are the model assumptions you made supported by these plots? s) The manufacturer of the insulation contends that for an average temperature difference of 25 ◦ F , the new insulation will keep the average monthly gas consumption below 80 kWh. Do the data support this contention? Due Thursday, November 6, 2014 (turn in the first 20 minutes during lab) 3