Chapter 5 Normal Probability Distributions 5-1 Continuous Random Variables 5-2 Standard Normal Distribution 5-3 Applications of Normal Distributions: • Finding Probabilities • Finding Values 5-4 Determining Normality 5-5 The Central Limit Theorem 5-6 Normal Distribution as Approximation to Binomial Distribution 1 5-1 Continuous Random Variables Uniform (rectangular) distribution Normal distribution Standard Normal distribution Note: – Continuous random variables are typically represented by curves – Discrete random variables are typically represented by histograms 2 Definitions Density Curve (or probability density function) is the graph of a continuous probability distribution Properties 1. The total area under the curve must equal 1. 2. Every point on the curve must have a vertical height that is 0 or greater. 3 Two Important Points: 1. Because the total area under the density curve is equal to 1, there is a correspondence between area and probability. 2. To find probability in a continuous distribution just find the area under the curve (Calculus) 4 Uniform Distribution Rectangular Distribution • Simplest type of continuous probability distribution • random variable values are spread evenly over the range of possibilities; • graph results in a rectangular shape. • Finding probability is the same as finding the area of a rectangle 5 Using Area to Find Probability of Class Length Find P(51.5 < x < 52) 6 Definitions Uniform Distribution • Formal definition of density function y= 1 for a < x < b b-a From previous example: a = 50 and b = 52 1 1 y= = 2 52 - 50 7 Examples: Let X be a uniform random variable, find the density function for each and graph 1. If X is between 0 and 5 Find a. p(x > 4) b. p(1 < x < 5) 2. If X is between 6 and 12 Find a. Find p(x > 6) b. p(x < 5) c. p(x > 12) Go to excel 8 Normal Distribution Continuous random variable Normal distribution Curve is bell shaped and symmetric µ Score Formula y= e 1 2 x-µ 2 ( ) 2p 9 5-2 The Standard Normal Distribution 10 Definition Standard Normal Distribution: a normal probability distribution that has a mean of 0 and a standard deviation of 1, and the total area under its density curve is equal to 1. y= e 1 2 Area found in z - table or calculator z2 2p 0.4429 0 z = 1.58 Note: Typically ‘z’ denotes a standard normal random variable 11 Notation P(a < z < b) denotes the probability that the z score is between a and b P(z > a) denotes the probability that the z score is greater than a P (z < a) denotes the probability that the z score is less than a 12 Z - Table Uses calculus to find the area under density function curve associated with a particular zvalue Table available on website Calculator is easier 13 POSITIVE Z SCORES 14 NEGATIVE Z SCORES 15 To find: z Score the distance along horizontal scale of the standard normal distribution; refer to the leftmost column and top row of Table Area the region under the curve; refer to the values in the body of Table 16 TI-83 Calculator Finding Areas of Standard Normal Distribution 1. Press 2nd Distr 2. Choose normcdf 3. Enter normcdf(low z value, high z value) 4. Press enter Note: if low z value is negative infinity use –E99 if high z value is positive infinity use E99 17 Example: If thermometers have an average (mean) reading of 0 degrees and a standard deviation of 1 degree for freezing water and if one thermometer is randomly selected, find the probability that, at the freezing point of water, the reading is less than 1.58 degrees. P (z < 1.58) = 0.9429 The probability that the chosen thermometer will measure freezing water less than 1.58 degrees is 0.9429. 18 Example: If thermometers have an average (mean) reading of 0 degrees and a standard deviation of 1 degree for freezing water and if one thermometer is randomly selected, find the probability that, at the freezing point of water, the reading is less than 1.58 degrees. P (z < 1.58) = 0.9429 94.29% of the thermometers have readings less than 1.58 degrees. 19 Example: If thermometers have an average (mean) reading of 0 degrees and a standard deviation of 1 degree for freezing water, and if one thermometer is randomly selected, find the probability that it reads (at the freezing point of water) above –1.23 degrees. P (z > –1.23) = 0.8907 The probability that the chosen thermometer with a reading above –1.23 degrees is 0.8907. 20 Example: If thermometers have an average (mean) reading of 0 degrees and a standard deviation of 1 degree for freezing water, and if one thermometer is randomly selected, find the probability that it reads (at the freezing point of water) above –1.23 degrees. P (z > –1.23) = 0.8907 89.07% of the thermometers have readings above –1.23 degrees. 21 Example: A thermometer is randomly selected. Find the probability that it reads (at the freezing point of water) between –2.00 and 1.50 degrees. P (z < –2.00) = 0.0228 P (z < 1.50) = 0.9332 P (–2.00 < z < 1.50) = 0.9332 – 0.0228 = 0.9104 The probability that the chosen thermometer has a reading between – 2.00 and 1.50 degrees is 0.9104. 22 Example: A thermometer is randomly selected. Find the probability that it reads (at the freezing point of water) between –2.00 and 1.50 degrees. P (–2.00 < z < 1.50) = 0.9104 If many thermometers are selected and tested at the freezing point of water, then 91.04% of them will read between –2.00 and 1.50 degrees. 23 Let’s try several examples examples (sketch a graph) 1) P(Z < 1.45) 8) P(-1.35 < Z < 1.45) 2) P(Z < -1.45) 9) P(.56 < Z < 1.45) 3) P(Z > 1.45) 4) P(Z > -1.45) 5) P(0 < Z < 1.45) 6) P(-1.45 < Z < 0) 7) P(-1.45 < Z < 1.45) Test Problems 24 Frequently used Z – Scores 1) Z = 1.28 2) Z = 1.645 3) Z = 1.96 4) Z = 2.33 5) Z = 2.58 To illustrate why, find 1) P(Z > 1.28) 2) P(Z > 1.645) 3) P(Z > 1.96) 4) P(Z > 2.33) 5) P(Z > 2.58) 25 The Empirical Rule Standard Normal Distribution: µ = 0 and = 1 99.7% of data are within 3 standard deviations of the mean 95% within 2 standard deviations 68% within 1 standard deviation 34% 34% 2.4% 2.4% 0.1% 0.1% 13.5% x - 3s x - 2s 13.5% x-s x x+s x + 2s x + 3s 26 Finding a z - score when given a probability (Area) 1. Draw a bell-shaped curve, draw the centerline, and identify the region under the curve that corresponds to the given probability. 2.Use the invNorm calculator function to determine the corresponding z score. 27 Finding z Scores when Given Probabilities Finding the 95th Percentile 95% 5% 5% or 0.05 0.45 0.50 0 invNorm(.95) 1.645 (z score will be positive) 28 Finding z Scores when Given Probabilities Finding the 10th Percentile 90% 10% Bottom 10% 0.10 0.40 z 0 (z score will be negative) 29 Finding z Scores when Given Probabilities Finding the 10th Percentile 90% 10% Bottom 10% 0.10 0.40 -1.28 (z score will be negative) 0 invNorm(.10) 30 Examples: Find the Z score that corresponds the following percentiles 1) 50th 2) 25th 3) 75th Find the Z score(s) for the following areas from the mean 4) 0.4495 5) 0.3186 31 Find the Z score when given the probability (sketch each) 1) P(0 < z < a) = .34 2) P(-b < z < b) = .8442 3) P(z > c) = .056 4) P(z > d) = .8844 5) P(z < e) = .3400 These are similar to HW #4 (Extra Credit) 32 5-3 Applications of Normal Distributions If 0 or 1 (or both), we will convert values to standard scores using the z-score formula we learned in Chapter 2, then procedures for working with all normal distributions are the same as those for the standard normal distribution. z= x-µ Note: z scores are real numbers that have no units 33 Converting to Standard Normal Distribution z= x– 34 Probability of Sitting Heights Less Than 38.8 Inches = 36 = 1.4 38.8 – 36.0 z = = 2.00 1.4 35 Probability of Sitting Heights Less Than 38.8 Inches = 36 = 1.4 P ( x < 38.8 in.) = P(z < 2) normalcdf (-E99,2) = 0.9772 36 Probability of Weight between 150 pounds and 200 pounds µ = 150 σ = 25 x z= 150 200 200 - 150 25 = 2.00 Weight z 0 2.00 37 Probability of Weight between 150 pounds and 200 pounds normalcdf (0,2) = .4772 µ = 150 σ = 25 x 150 200 Weight z 0 2.00 38 Probability of Weight between 150 pounds and 200 pounds There is a 0.4772 probability of randomly selecting someone with a weight between 150 and 200 lbs. µ = 150 σ = 25 x 150 200 Weight z 0 2.00 39 Probability of Weight between 150 pounds and 200 pounds OR - 47.72% have weights between 150 lb and 200 lb. µ = 150 σ = 25 x 150 200 Weight z 0 2.00 40 Normal Distribution: Finding Values From Known Areas 41 Cautions to keep in mind 1. Don’t confuse z scores and areas. 2. Choose the correct (right/left) side of the graph. 3. A z score must be negative whenever it is located to the left of the centerline of 0. 42 Procedure for Finding Values Using TI-83 Calculator 1. 2. Sketch a normal distribution curve, enter the given probability or percentage in the appropriate region of the graph, and identify the value(s) being sought. x Find the z score invNorm(area) Make the z score negative if it is located to the left of the centerline. 3. Use x = µ + (z • ) to find the value for x or invNorm(area,µ,σ) 4. Refer to the sketch of the curve to verify that the solution makes sense. 43 Finding P10 for Weights x = + (z ● ) µ = 150 σ = 25 x = 150 + (-1.28 • 25) = 118.75 0.10 0.40 0.50 x z x=? 150 -1.28 0 Weight 44 Finding P10 for Weights The weight of 119 lb (rounded) separates the lowest 10% from the highest 90%. 0.10 0.40 x z x = 119 -1.28 0.50 150 Weight 0 45 In Class Example A tire company finds the lifespan for one brand of its tires is normally distributed with a mean of 48,500 miles and a standard deviation of 5000 miles. a) What is the probability that a randomly selected tire will be replaced in 40,000 miles or less? b) If the manufacturer is willing to replace no more than 10% of the tires, what should be the approximate number of miles for a warranty? Similar to hw #9 – Test problem 46 REMEMBER! Make the z score negative if the value is located to the left (below) the mean. Otherwise, the z score will be positive. 47 Extra Credit A teacher informs her statistics class that a test if very difficult, but the grades will be curved. Scores are normally distributed with a mean of 25 and standard deviation of 5. The grades will be curved according to the following scheme. A’s: Top 10% B’s: Between the top 10% and top 30%. C’s: Between the bottom 30% and top 30% D’s: Between the bottom 10% and bottom 30% F’s: Bottom 10% Find the numerical limits for each grade. Include sketch. 48 5-4 Determining Normality 49 Determining Normality Use Descriptive Methods 1. Histogram 2. Dot Plot 3. Stem-leaf 50 Determining Normality Use Normal Probability Plots • Plot the observed data versus expected z-score of the data. • Expected z-scores are based on the position of the data in ascending order similar to percentiles. • If the data are from a normal distribution then the Normal Probability Plot will be approximately linear. Note: X = µ + Zσ (linear relationship between X and Z) • Due to the tedious nature of this process, use technology. 51 TI-83 Calculator Normal Probability Plot 1. Enter data in L1 2. Choose statplot (2nd Y=) 3. Press Enter 4. Plot on should be ON 5. Cursor to Type and choose the last plot type 6. Press Zoom 7. Choose 9 52 5-5 The Central Limit Theorem (sampling distribution of the mean) 53 Distribution of 200 digits from Social Security Numbers Frequency (Last 4 digits from 50 students) 20 10 0 0 1 2 3 4 5 6 7 8 9 Distribution of 200 digits 54 x SSN digits 1 5 9 5 9 4 7 9 5 7 2 6 2 2 5 0 2 7 8 5 8 3 8 1 3 2 7 1 3 3 7 7 3 4 4 4 5 1 3 6 6 3 8 2 3 6 1 5 3 4 6 7 3 7 3 3 8 3 7 6 4 6 8 5 5 2 6 4 9 4.75 4.25 8.25 3.25 5.00 3.50 5.25 4.75 5.00 2 6 1 9 5 7 8 6 4 0 7 4.00 5.25 4.25 4.50 4.75 3.75 5.25 3.75 4.50 6.00 55 Frequency Distribution of 50 Sample Means for 50 Students 15 10 5 0 0 1 2 3 4 5 6 7 8 9 56 Definition Sampling Distribution of the mean the probability distribution of sample means, with all samples having the same sample size n. See Excel 57 Sampling Distributions http://onlinestatbook.com/stat_sim/sampling_dist/index.html 58 Central Limit Theorem Given: 1. The random variable x has a distribution (which may or may not be normal) with mean µ and standard deviation . 2. Samples all of the same size n are randomly selected from the population of x values. 59 Central Limit Theorem Conclusions: 1. The distribution of sample x will, as the sample size increases, approach a normal distribution regardless of the original population distribution. 2. The mean of the sample means will be the population mean µ. 3. The standard deviation of the sample means will approach n 60 Notation the mean of the sample means µx = µ the standard deviation of sample mean x = n (often called standard error of the mean) 61 Central Limit Theorem Conditions for Application: 1. Original population is not normal or unknown • Sample must be sufficiently large (n > 30) 2. The original population is normal. • Sample may be of any size Test Question 62 Describing the Sampling Distribution of the Mean As the sample size increases, the sampling distribution of sample means approaches a normal distribution with µx = µ Test Question x = n 63 Example: The mean age of students at a community is 25 with a standard deviation of 8. A sample of 64 students is drawn. Describe the sampling distribution of the mean. Solution: X~N(25,1) Test Question 64 Example: Given the population of women has normally distributed weights with a mean of 143 lb and a standard deviation of 29 lb, a.) if one woman is randomly selected, find the probability that her weight is greater than 150 lb. b.) if 36 different women are randomly selected, find the probability that their mean weight is greater than 150 lb. 65 Example: Given the population of women has normally distributed weights with a mean of 143 lb and a standard deviation of 29 lb, a.) if one woman is randomly selected, find the probability that her weight is greater than 150 lb. (same approach as section 5.3. z = 150-143 = 0.24 29 P (X > 150) = P(z > .24) Normalcdf (.24,E99)= 0.4052 0.5948 = 143 = 29 0 x 150 z 0.24 66 Example: Given the population of women has normally distributed weights with a mean of 143 lb and a standard deviation of 29 lb, a.) if one woman is randomly selected, the probability that her weight is greater than 150 lb. is 0.4052. P (X > 150) = P(z > .24) Normalcdf (.24,E99)= 0.4052 0.5948 = 143 = 29 0 x 150 z 0.24 67 Example: Given the population of women has normally distributed weights with a mean of 143 lb and a standard deviation of 29 lb, b.) if 36 different women are randomly selected, find the probability that their mean weight is greater than 150 lb. P (x > 150) x = 143 150 x = 29 = 4.83333 36 x 68 Example: Given the population of women has normally distributed weights with a mean of 143 lb and a standard deviation of 29 lb, b.) if 36 different women are randomly selected, the probability that their mean weight is greater than 150 lb is 0.0735. z = 150-143 = 1.45 29 36 P (x > 150) = P(z > 1.45) Normalcdf (1.45,E99)= 0.0735 0.9265 x = 143 x = 4.83333 0 150 x z 1.45 69 Example: Given the population of women has normally distributed weights with a mean of 143 lb and a standard deviation of 29 lb, a.) if one woman is randomly selected, find the probability that her weight is greater than 150 lbs. P(x > 150) = 0.4052 b.) if 36 different women are randomly selected, find the probability that their mean weight is greater than 150 lbs. P(x > 150) = 0.0735 It is much easier for an individual to deviate from the mean than it is for a group of 36 to deviate from the mean. 70 5-6 Normal Distribution as Approximation to Binomial Distribution 71 Approximate a Binomial Distribution with a Normal Distribution if: np 5 nq 5 Basically if ‘n’ is large your ok. Typically this approximation is only used when ‘n’ is large anyway. 72 Approximate a Binomial Distribution with a Normal Distribution if: np 5 nq 5 then µ = np and = npq and the random variable has a distribution. (normal) 73 Approximate a Binomial Distribution with a Normal Distribution: http://bcs.whfreeman.com/ips4e/cat_010/applets/CLT-Binomial.html 74 Definition: Continuity Correction: • Needed when we use the normal distribution (continuous) as an approximation to the binomial distribution (discrete) • Made to a discrete whole number x in the binomial distribution by representing the single value x by the interval from x - 0.5 to x + 0.5. 75 Procedure for Using a Normal Distribution to Approximate a Binomial Distribution 1. Verify np 5 and nq 5. 2. Calculate µ = np and = npq. 3. Identify the continuity correction x - 0.5 to x + 0.5. 4. Draw a normal curve and enter the values of µ 4. Change x by replacing it with x - 0.5 or x + 0.5, as appropriate. 5. Find the area corresponding to the desired probability. 76 Finding the Probability of “At Least” 520 Men Among 1000 Accepted Applicants 77 P(x 520) 520, 521, 522, …. Use P(x 519.5) . 520 519.5 P(x > 520) 521, 522, 523, . . . Use P(x > 520.5) P(x 520) 0, 1, . . . 518, 519, 520 Use P(x 520.5) 521 520.5 520 520.5 P(x < 520) 0, 1, . . . 518, 519 Use P(x < 519.5) 519 519.5 78 P(x=520) 520 519.5 520.5 Interval represents discrete number 520 79