Percentiles and the Normal Curve

advertisement
Percentiles and
the Normal Curve
A Normal Curve
The Normal Approximation
Father’s Height
• The height of the men in the sample was on
average 69 inches; the SD was 3 inches.
• Estimate the percentage of these men with heights
between 63 inches and 72 inches.
• 63 inch = –2 SD; 72 inch = +1 SD.
• The area between –2 and +1 is 47.5%, between 0
and +1 is 34.5%.
• Approximately 82% of the men will be between
63 and 72 inches.
Financial Hardship
• The average income for families was about
$45,000.
• The standard deviation was about $32,000.
• The normal table tells us that 8% of the area
lies to the left of –1.4 standard deviation.
• This suggests that about 8% of the families
have a negative income.
An Income Histogram
Percentiles
• When the normal curve is not a good
approximation to a histogram, we often use
percentiles.
• We have seen this idea before: the median
is the 50th percentile.
• The kth percentile is that number such that
k percent of the data is less than that
number.
Percentiles for Family Income
1
$1,300
10
$10,200
25
$20,100
50
$36,800
75
$58,100
90
$85,000
99
$151,800
Quartiles
• The 25th percentile is often called the lower
quartile.
• The 75th percentile is often called the upper
quartile.
• The interquartile range is the
75th quartile – 25th quartile.
• A ‘five number summary’ of a data set: minimum,
lower quartile, median, upper quartile, maximum.
Percentiles and the
Normal Curve
• When a histogram does not follow the
normal curve, we often use percentiles
instead.
• When a histogram does follow a normal
curve, we can use the normal table to
estimate the percentiles.
Example
• Among all applicants to a university one
year, the Math SAT scores averaged 535,
the SD was 100, and the scores followed the
normal curve.
• Estimate the 95th percentile.
Example
• This score is above average, by some
number of standard deviations, z.
• We cannot use the normal table directly,
because this only gives us values for the
area between –z and +z.
• The area to the right of the z is 5%. Hence
the area to the left of –z is 5%, too. The area
between –z and +z is 90%.
Example
• We can look this up in the normal table: z =
1.65.
• You have to score 1.65 SD above average to
be in the 95th percentile.
• This is a score 1.65 x 100 = 165 points
above average.
• The 95th percentile of the score distribution
is 535 + 165 = 700.
Percentile vs. Percentile Rank
• A percentile on a test is a score:
the 95th percentile is a score of 700.
• A percentile rank is a percentile:
if you score 700, then your percentile rank
is 95%.
Freshman GPA
• Suppose that the average freshmen GPA is around
3.0, and the SD is about 0.5. Assume that the
histogram follows the normal curve. Estimate the
GPA of someone in the 30th percentile.
• This GPA is below average, say -z SD.
• The area to the left is 30%. Thus, the area outside
–z and z is 60%, inside 40%.
• Look up in the table: z is between 0.5 and 0.55.
• GPA is about 2.75.
Download