Chapter 7 – The Normal Probability Distribution The normal distribution is a special type of bell-shaped curve. Defn: A random variable X is said to be normally distributed or to have a normal distribution if its distribution has the shape of a normal curve: f x 1 2 e x 2 2 2 , for - < x < , - < < , and > 0. Here and are the parameters of the distribution; = the mean of the random variable X (or of the probability distribution); and = the standard deviation of X. The two mathematical constants appearing in the formula, 3.141592654, and e 2.718281828. The normal distribution is not just a single distribution, but rather a family of distributions; each member of the family is characterized by a particular pair of values of and . The mean, , tells us the location of the center of the bell-shaped curve. The standard deviation, , tells us how wide the bell-shaped curve is; smaller values of denote narrower, more sharply peaked bell-shaped curves; larger values of denote wider, less sharply peaked bell-shaped curves. In particular, the points given by - and + are the inflection points of the curve. Examples of Normal Distributions: Characteristics of normal distributions: 1) The bell is symmetric around the mean value . 2) The standard deviation, , describes the width of the bell. 3) The distribution is unimodal (one peak), and the mean, median and mode are all equal. 4) The curve is continuous; i.e., there are no gaps. 5) For any two numbers, a and b, the probability that the random variable X will be found to have a value between a and b is denoted by P(a X b), and is the area under the curve above the interval from a to b. (Note that the probability is concentrated around ). This probability can be found using a technique of calculus, called numerical integration. 6) The total area under the curve is 1. 7) The curve extends indefinitely in both directions, approaching, but never touching, the horizontal axis. Example: p. 335, Exercise 25. The Empirical Rule: For the normal distribution, 1) The probability that X will be found to have a value between - and + is approximately 0.68; 2) The probability that X will be found to have a value between - 2 and + 2 is approximately 0.95; 3) The probability that X will be found to have a value between - 3 and + 3 is approximately 0.9974. Examples of approximately normal random variables: 1) Heights of people in the general population of adults; 2) Diameters of pine tree trunks at 3 ft. above the ground; 3) Scores on an IQ test; 4) Fill weights of 12-oz cans of Pepsi-Cola. 5) Lengths of 3–long ¼ -20 bolts produced by an assembly line. There is a particular, important member of the family, called the standard normal distribution, which has = 0 and = 1, so that f z 2 z 1 e 2 . 2 We use a different notation for a standard normal random variable, calling it Z, rather than X. The following fact is one of the reasons that the standard normal distribution is so important: If X has a normal distribution with mean and standard deviation , then the random variable Z X has a standard normal distribution. This means that if we have a random variable X with a Normal(, ) distribution, and we want to find the probability that X will be found to have a value between the numbers a and b, we can first convert to a standard normal distribution, and 1) find the desired probability in a table of the standard normal distribution or 2) find the probability using the TI-83 calculator. To find areas under a normal distribution curve (probabilities) using the Table 2 in Appendix A: 1) Convert the probability statement to a statement about a standard normal distribution. 2) Draw a picture, shading the desired area. 3) Use the properties of the normal distribution to get the desired probability in terms of a probability that Z will take on a value between 0 and some positive number. 4) Go to Table 3 and look up that positive number, finding the probability from step 3) in the body of the table. 5) Calculate the desired probability. To find areas under a normal distribution curve (probabilities) using the TI-83 calculator (the simpler way to do it): 1) Choose 2nd, DISTR, and normalcdf(. 2) Enter the left-hand endpoint of the interval, the right-hand endpoint of the interval, the mean, , of the normal distribution (can skip this if the distribution is standard normal), and the standard deviation, , of the interval (can skip this if the distribution is standard normal). 3) Hit ENTER. The desired probability appears. The following examples involve finding probabilities associated with a random variable Z which has a standard normal distribution. Example: p. 346, Exercise 5. Example: p. 346, Exercise 9. Example: p. 346, Exercise 10. The normal distribution occurs in many situations in nature, in manufacturing processes, in many instances of analysis of survey data, and in the data analysis for most scientific experiments. In the examples below, we are given situations in which the random variable of interest, X, has a normal distribution for which we know the mean, , and the standard deviation, . In most real-world applications, we do not know the values of and , but must estimate the values using sample data. We will talk about techniques of estimation when we take up the topic of statistical inference. Note: When calculating probabilities associated with random variables, please write complete probability statements; it will help to prevent confusion and mistakes. For example, if we want the probability that a random variable X will be found to have a value between the numbers a and b, we would write P(a X b). Example: p. 355, Exercise 23 Example: p. 355, Exercise 25 Example: p. 355, Exercise 26 If you want to check the results you get with the TI-83/84 calculator, you may do so by going to the following website, which does calculations of normal probabilities: http://www.math.csusb.edu/faculty/stanton/m262/normal_distribution/normal_distribution.html When we talk about statistical inference, we will need to invert the above process of finding probabilities. Instead of finding the probability associated with a given interval of values of a normally distributed random variable, we will need to find one or more values of the random variable which are the endpoints of an interval for which we are given the probability. To do this, we use the invNorm( function of the TI-83 calculator. In the examples below, we are given the mean, , and the standard deviation, , of the normal distribution of the random variable, X. Example: p. 355, Exercise 27 Example: p. 355, Exercise 39