Chapter 4 – Statistics 1. Consider the following data. 45.5 44.2 59.4 55.2 49.6 47.3 46.7 Find the (a) mean (b) standard deviation and (c) variance Mean x = 49.7 St. Dev s = 5.6 Variance s2 = 31.2 2. Suppose a Gaussian population has a mean of 2000 and a standard deviation of 75. What fraction of the population lies in the following interval? (a) > 2000 (b) 1925 – 2075 (c) < 1775 (d) 1850 – 2150 (e) 1898 – 1958 a) z = (2000 -2000)/75 = 0 P(0 - ∞) = 0.5000 50.00 % b) z = (1975 - 2000)/75 = -1 P(-1 to 0 ) = P (0 to 1) = 0.3413 z = (2075 - 2000)/75 = 1 P (0 to 1) = 0.3413 P(-1 to 1) = P(-1 to 0 ) + P (0 to 1) = 0.3413 + 0.3413 = 0.6826 69.26 % c) z = (1775 – 2000)/75 = -3 P(-3 to 0) = P( 0 to 3) = 0.4987 P(3 to ∞) = P(0 to ∞) - P( 0 to 3) = 0.5000 – 0.4987 = 0.0013 0.13 % d) z = (1850 -2000)/75 = -2 z = (2150 -2000)/75 = 2 P(-2 to 0) = P(0 to 2) = 0.4773 P(0 to 2) = 0.4773 P(-2 to 2) = P(-2 to 0) + P(0 to 2) = 0.4773 + 0.4773 = 0.9546 95.46 % e) z = (1898 -2000)/75 = -1.4 z = (1958 -2000)/75 = -0.6 P(-1.4 to 0) = P(0 to 1.4) = 0.4192 P(-0.6 to 0) = P(0 to-0.6) = 0.2258 P(-1.4 to -0.6) = P(-1.4 to 0) - P(-0.6 to 0) = 0.4192 + 0.2258 = 0.1934 19.34 % 3. Two methods were used to measure the specific activity (units of enzyme activity per milligram of protein) of an enzyme. One unit of enzyme activity is defined as the amount of enzyme that catalyzes the formation of one micromole of product per minute under specified conditions. Enzyme activity (five replications) Method 1: 139 147 160 158 135 Method 2: 148 159 156 164 159 Is the mean value of method 1 significantly different from the mean value of method 2 at the 95% confidence level? Let’s assume that the measurements were made on the same sample by the two different testing methods; therefore, we use the comparison of individual differences test. Trial 1 2 3 4 5 di -9 -12 4 -6 -24 |di – d| 0.4 2.6 13.4 3.4 14.6 (di –d)2 0.16 6.76 179.56 11.56 213.16 411.2 Mean d = -9.4 St. Dev. s = (411.2/5-1)0.5 = 10.14 tcalc = |-9.4|(5)0.5/10.14 = 2.07 ttab for n = 5, DF = 4, at 95% confidence is 2.776 ttab > tcal ; therefore, there is no significant difference. 4. The Ti content (wt %) of two different ore samples was measured several times by the same method. Are the mean values significantly different at the 95% confidence level? Sample 1: 0.013 4 0.013 8 0.012 8 0.013 3 0.013 7 Sample 2: 0.013 5 0.014 2 0.013 7 0.014 1 0.014 3 Here we have two different samples measured using the same method. Sample 1 x1 = 0.0134 s1 = 3.9 *10-4 Sample 2 x2 = 0.0140 s2 = 3.4 *10-4 sp = {[s12(n1-1) + s22(n2-1)]/(n1 + n2 -2)}0.5 = = {[(3.9 *10-4)2(5 -1) + (3.4 *10-4)2(5 -1)]/(5 + 5 -2)} 0.5 = {1.07 *10-6/ 8}0.5 = 3.7*10-4 tcalc = |x1- x2| [n1n2/(n1+n2)]0.5/sp = |0.0134 – 0.0140|[5*5/(5+5)]0.5/3.7*10-4 = (6*10-4)(1.58)/ 3.7*10-4 = 2.56 ttab at 95 % confidence for 8 degrees of freedom is 2.306; therefore, tcalc > ttab => the samples are statistically significantly different. 5. Using the Q test, decide whether the value 1.459 should be rejected from the set of results: 1.422, 1.429, 1.459, 1.426, 1.426, 1.428. 1.422 1.426 1.426 1.428 1.429 1.459 Gap = 1.459 – 1.429 = 0.030 Range = 1.459 -1.422 = 0.037 Qcalc = Gap/Range = 0.030/0.037 = 0.81 Qtab for 6 measurements at 90 % confidence is 0.56. Since Qcalc > Qtab, the data point can be rejected. 6. A set of absorbance measurements were obtained as 0.985, 0.979, 0.961, 0.996 and 0.978 AU. What are the 50%, 95% and 99% confidence limits of there data. Mean St. dev n=5 x = 0.980 s = 1.27*10-2 DF = 4 50 % C.L. t = 0.741 = x ± ts/n0.5 = 0.980 ± 0.741*1.27*10-2/40.5 = 0.980 ± 4.71*10-3 = 0.975 … 0.985 95 % C.L. t = 2.776 = x ± ts/n0.5 = 0.980 ± 2.776*1.27*10-2/40.5 = 0.980 ± 1.76*10-2 = 0.962 … 0.998 99 % C.L. t = 4.604 = x ± ts/n0.5 = 0.980 ± 4.604*1.27*10-2/40.5 = 0.980 ± 2.92*10-2 = 0.951 … 1.009