CHAPTER 5 Normal Probability Distributions and Sampling Distributions Continuous Random Variables Outcomes of random experiments, but values occur in a continuum. Unlike discrete r.v.’s, cannot list all possible outcomes. Subtle change in description of probability: no longer meaningful to say P(X=x) since this will always equal 0. Ex. I have absolutely no idea what time it is. What is pb. that it is 1:00 p.m.? This pb. = 0 since it is never 1:00 p.m. for any length of time. However, it is meaningful to ask, what is pb. that it is between 1:00 and 2:00? Ans.: 1/24. Therefore, continuous random variable will be described by its probability density function. (for example, see next section) The Normal Probability Distribution Also called Gaussian Distribution Most important pb. distribution in statistics. - Appears to capture many naturally occurring random variables (eg. IQ) - Easily used as approximation to more cumbersome distributions - By Central Limit Theorem, many random variables central to classical statistical inference will be (approximately) normal. The probability density function for a normally distributed random variable is given by: f ( x) 2 1 e (1/ 2)[( x ) / ] 2 where e = 2.718… , π = 3.1415…. , μ = population mean, σ = pop. standard deviation. Note: a two-parameter distribution. Each individual member is described by a pair of values for μ and σ. Chapter 5 - 1 Appears complex, but f(x) simply defines a function. This function will always: 1. 2. 3. 4. 5. be bell-shaped and perfectly symmetrical about the mean have same values for mean, median, and mode have a range of values from minus to plus infinity have an area under the curve equal to 1 have c. 68% of the area within 1 std. dev. of the mean, 95% within 2 std. dev.’s, and 99% within 3 std.dev.’s. Changing μ shifts the curve horizontally, changing σ collapses or expands the distribution about the mean. Ex. μ determines where the midpoint of the distribution falls. Changes in μ , without changes in σ , result in moving the distribution to the right or left, depending upon whether the new value of μ was larger or smaller than the previous value, but does not change the shape of the distribution. An example of how changes in μ affect the normal curve are presented below: Changes in the value of σ, on the other hand, change the shape of the distribution without affecting the midpoint, because d affects the spread or the dispersion of scores. The larger the value of σ , the more dispersed the scores; the smaller the value, the less dispersed. Perhaps the easiest way to understand how σ affects the distribution is graphically. The distribution below demonstrates the effect of increasing the value of σ : Chapter 5 - 2 Note that the shape of the second distribution has changed dramatically, being much flatter than the original distribution. It must not be as high as the original distribution because the total area under the curve must be constant, that is, 1.00. The second curve is still a normal curve; it is simply drawn on a different scale on the X-axis. If X is a normally distributed random variable, this is denoted by X ~ N(μ,σ2) The Standard Normal Density Function: z There are an infinite number of normal r.v.’s (one for each combination of μ and σ. A special member of the normal class is found when μ = 0 and σ = 1. Given the notation z: z ~ N(0,1) is the standard normal random variable. This is the ONLY member of the normal family for which textbooks report probabilities. Refer to a standard normal distribution table. In our textbook, Table E.2 provides area under density function below the stated values of z, where z is available up to 2 decimal places. Ex. P(z < 1.35) = P(z ≤ 1.35) = 0.9115 Note: care should be exercised if using other texts. Most texts report area between z = 0 and required value of z; i.e., P(0 < z < 1.35) = 0.4115. Easy to calculate P(z < 1.35) since we know P(z < 0) = .5. Add this to 0.4115. Chapter 5 - 3 How to calculate P(z>1.35)? Use the fact that area under density function must equal 1. Then P(z>1.35) = 1 – P(z<1.35) = 1 – 0.9115 = 0.0885. Note that this is same as P(z<-1.35) : z density function is symmetric. When finding pb.’s, always draw a picture. Ex. Find P(-0.45 < z < 1.96) = P(z<1.96) – P(z<-0.45) = 0.9750 – 0.3264 = 0.6486. Some special cases P(-1 < z < 1) = P(z < 1) – P(z < -1) = 0.8413 – 0.1587 = 0.6826 P(-2 < z < 2) = P(z < 2) – P(z < -2) = 0.9772 – 0.0228 = 0.9544 P(-3 < z < 3) = P(z < 3) – P(z < -3) = 0.9987 – 0.0014 = 0.9973 Basis for empirical rule (or, normal approximation rule) used to interpret values of standard deviation. Finding pb.’s for other normal random variables Texts cannot produce tables for all possible normal random variables. Not necessary. Suppose X ~ N(μ,σ2). Compute a new random variable by subtracting μ and dividing by σ. This new random variable will be standard normal. i.e. (X - μ)/ σ ~ N(0,1) ~ z. Ex. Suppose X ~ N(24, 32). Then (X-24)/3 ~ N(0,1). Find P(X<19). P( x 19) P( x 19 24 ) P( z 1.67) 0.0475 3 Simple intuition. The value 19 lies 1.67 standard deviations below the mean of 24. The pb. that this random variable X takes a value below 19 is the same as the pb. that any normally distributed random variable takes a value more than 1.67 standard deviations below the Chapter 5 - 4 mean. We can use z table to calculate this. -1.67 is 1.67 standard deviations below its mean. NOTE: Excel can calculate any normal pb. using NORMDIST( ). Ex. A battery manufacturer produces a battery pack that will power an electric car at 90 kmh for an average of 8 hours. There is variability in the life of these packs. Data shows that lives of packs is normally distributed with mean 8 hrs. and standard deviation of 0.4 hours. X ~ N(8, 0.42). Consumers require reliability and company is concerned that market will disappear if too many consumers find a battery life of less than 7.5 hours. What are the chances that this will happen (i.e., what proportion of batteries will last less than 7.5 hours)? P( x 7.5) P( x 7.5 8 ) P( z 1.25) 0.1056 0.4 N.B.: Always subtract mean from x in the correct order. The negative sign on -1.25 is VERY meaningful. Suppose this risk is unacceptable and that maximum risk that life is less than 7.5 hours is 2%. How much would the mean life expectancy have to be increased to lower risk to this level, assuming standard deviation is unchanged at 0.4 hours? Find z* so that P(z < z*) = 0.02. From z table, required z* = -2.06. Then mean must lie 2.06 standard deviations above 7.5. Required mean is 7.5 + 2.06(0.4) = 8.32 hours. Ex. Fast food chain finds that amount of time spent by customers from entering the parking lot to receiving their food order is normally distributed with μ = 22.14 minutes and σ = 6.09 minutes. Considers providing free meals to any customer having to wait more than 30 minutes. What is the pb. of having to provide a free meal? X ~ N(22.14, 6.092) Chapter 5 - 5 P ( X 30 ) P ( x 30 22.14 ) P ( z 1.29 ) P ( z 1.29 ) .0985 6.09 Therefore, 9.85% of customers will be provided a free meal. This pb. can be lowered to 5% by providing free meal to customers waiting more than 32 minutes. Finding Values of X Corresponding to Known Probability What is value of z that leaves 90% of pb. below it? I.e., find z* so that P(z<z*)=0.9 Use table entries to find, z* = 1.28. Now suppose test scores in a population are normally distributed with X ~ N(137, 17.22). What is the test score such that 90% of the population scores less than this score? Find X* such that P(X < X*) = 0.90. We know that the required value is 1.28 standard deviations above the mean, so X* = 137 + (1.28)(17.2) = 159.02 This is the 90th percentile. Ex. If x ~ N(60,102), find X* so that 90% of pb. lies within the range μ – X* and μ + X*. Step 1: What is z* so that P(μ – z* < z < μ + z*). Find z* so that P(0 < z < μ + z*) = 0.45 z* = 1.645 Step 2: Calculate X* as the value lying 1.645 standard deviations above and below the mean; 60 – (10)(1.645) = 43.55 60 + (10)(1.645) = 76.45. Therefore, 90% of pb. is between 43.55 and 76.45. Chapter 5 - 6 Example of use of normal distribution A set of final examination grades in a statistics course was found to be normally distributed with a mean of 73 and a standard deviation of 8. a. What is the pb. that a randomly selected grade is 91% or less? Construct picture. P(X ≤ 91) = P((x-μ)/σ < (91-73)/8) = P(z < 2.25) = 0.9878. Intuitively, 91 is 2.25 std. dev.’s above the mean. b. What percentage of students scored between 65 and 89? Construct picture. 65 73 x 89 73 ) P ( 1 z 2 ) 8 8 P ( z 2 ) P ( z 1) 0.9772 0.1587 0.8185 P ( 65 x 89 ) P ( c. What is the grade such that only 5% of students achieved a higher grade than this? Construct picture. We need a grade that leaves 5% of the area above that value. What value of z leaves 5% above? The same value that leaves 95% below. Search standard normal table for a cell containing 0.9500. Use 1.645. Therefore, the grade required lies 1.645 standard deviations above the mean. Only 5% of students achieved a grade higher than 73 + (1.645)(8) = 86.16% d. If the professor grades on a curve and gives A’s to the top 10% of the class regardless of the actual grades, is a student better off with a grade of 81 on this exam or a grade of 68 on a different exam where the mean is 62 and the std. dev. is 3? Chapter 5 - 7 On this exam, is 81% in the top 10% of the grades? Draw the picture. 81% is (81-73)/8 = 1 standard deviation above the mean. Therefore 84.13% of the class scored 81% or less and the student is not in the top 10%. On the other exam, 68% is (68-62)/3 = 2 standard deviations above the mean. Therefore 97.72% of students scored 68% or less and the grade of 68% is in the top 10%. Therefore, the second situation is better. Chapter 5 - 8