Normal Distribution

advertisement
Normal Distribution
Consider the following: Four coins are flipped and the number of heads is recorded.
1. List all of the possible outcomes. Are all outcomes equally likely? Explain.
There are 16 possible outcomes.
If order is considered, then no outcome is the same. Thus,
each outcome has the same likelihood of occurring.
However, considering the number of heads possible for
each outcome, the outcomes have an unequal likelihood of
occurring.
2. What is the theoretical probability of getting 4 heads? 3 heads? 2 heads? 1 head? 0 heads?
1
4
6
4
1
4 heads= , 3 heads= , 2 heads= , 1 heads= , 0 heads=
16
16
16
16
16
Calculate the percent likelihood.
4 heads=6.25%, 3 heads=25%, 2 heads=37.5%, 1 heads=25%, 0 heads=6.25%
3. What observations can be made about the theoretical probabilities?
The data is symmetric; the fractions add up to 1; the percentages add up to 100%
4. On the axis below, complete the histogram of the probability of each number of heads.
a. Draw a point at the midpoint of the top of each bar.
Connect the points in a smooth curve.
b. What do you notice about the graph’s shape?
Symmetric, bell shaped
c. What do you observe about the graph’s mean,
median, and mode?
They are all the same (equal to 2)
Characteristics of a normal curve:
 Symmetric about the mean and bell-shaped
 Mean, median, and mode are equal
 Never crosses the x-axis
 Total area under the curve is 1 (sum of the probabilities)
Normal Distribution
Normally distributed data sets have a normal curve graph.
 Examples of normally distributed data: heights, tread life of tires, SAT scores
Normal curves follow the 68-95-99.7% Rule (aka Empirical Rule or 3 Sigma Rule)
Consider the following: Pulse rates are normally distributed with a mean of 72 and a standard
deviation of 12. Label the normal model below.
26
-3σ
48
-2σ
60
-1σ
72
µ
84
1σ
96
2σ
108
3σ
The 68-95-99.7% Rule tells us that
 68% of all people will have pulse rates within 1 standard deviation of the mean.
That means that 68% of people have pulse rates between 60 and 84
 95% of all people will have pulse rates within 2 standard deviations of the mean.
That means that 95 % of people have pulse rates between 48 and 96
 99.7% of all people will have pulse rates within 3 standard deviations of the mean.
That means that 99.7 % of people have pulse rates between 36 and 108
Values outside of 2σ or the 95% range are considered unusual.
5. Given a normally distributed data set of 500 observations measuring tree heights in a forest,
what is the approximate number of observations that fall within two standard deviations from
the mean?
Since 95% of the data falls within 2 standard deviations of the mean, then:
500 * 95% = 500 (0.95) = 475 Trees
6. A normally distributed data set containing the number of ball bearings produced during a
specified interval of time has a mean of 150 and a standard deviation of 10. What percentage
of the observed values fall between 140 and 160?
Since 140 is 1 standard deviation below the mean and 160 is 1 standard deviation above the
mean. Therefore the percentage of data that falls within 1 standard deviation is 68%.
Normal Distribution
Finding the area under a normal curve means to find the probability of a value being greater than
the rest of the data.
Areas can be found under a normal
curve by using the 68-95-99.7 rule if the
areas are bounded at places where an
exact deviation occurs.
Areas that are not bounded at ±1, ±2, or
±3 standard deviations can be found
using a z-table.
Consider the following: A corn chip factory packs chips in bags with normally distributed
weights with a mean of 12.4 oz. and a standard deviation of 0.14 oz.
7. On the graph to the right, label the mean and three
standard deviations above and below the mean.
8. Shade the region that indicates the percentage of bags
that contain less than 12.64 oz.
9. Calculate the z-score of 12.64.
12.64  12.4
z
 1.714
0.14
10. Use the z-table (Standard Normal Probabilities Table) to find the area associated with the
z-score and interpret your result.
a. The probability that a data value will fall below the data value associated with a z-score
of 1.174 is 0.9591 (95.91%).
b. 12.64 falls in the below percentile. This means that 95 percent of the data in the
distribution fall below the value associated with a z-score of 1.714
c. The probability that a value from the data set will fall above this value is
1 – 0.9591 = 0.0409 (or 4.09% of the data falls above this value)
Normal Distribution
11. On the graph at the right, label and shade the region
that represents the likelihood a bag will contain between
12.1 and 12.76 oz.
12. Calculate the z-score for each and find the probability.
12.1  12.4
z
 2.14  0.0162
0.14
z
12.76  12.4
 0.9949
0.14
13. How can we use these values to determine the probability of choosing a bag between 12.1
and 12.76 oz.?
Find the probability for each z-score. Then subtract the higher z-score’s probability from the
lower z-score’s probability.
0.9949 – 0.0162 = 0.9787 (or 97.87%)
Consider the following: The length of rhinoceros’ horns are normally distributed with a mean of
10.1 cm and a standard deviation of 1.4 cm.
14. On the graph to the right, label the mean and three
standard deviations above and below the mean.
15. What percent of rhinos have horns shorter than
10.1 cm? 50% Longer than 10.1cm? 50%
16. Calculate the probability of a rhinos’ horn being longer
than 9 cm. Interpret your results.
9  10.1
z
 0.78
1.4
Probability for a z-score of -0.78 is 0.2148; so 1-0.2148 = 0.7852
The probability that a rhino’s horn will be bigger than 9 cm is 78.52%.
17. Find P  9.5  x  10.5
z
10.5  10.1
 0.29  0.6141
1.4
z
9.5  10.1
 0.43  0.3336
1.4
0.6141-0.3336 = 0.2805 (28.05%)
Download