Statistics 101L – Homework 3 Solution

advertisement
Statistics 101L – Homework 3
Solution
Assignment:
Weights of adult males and females have increased over the past 40 years but so have
heights. The body mass index (BMI) combines information about an individual’s height
and weight and can be used to classify individuals into a Weight Status category.
BMI
Below 18.5
18.5  BMI  25.0
25.0  BMI  30.0
30.0 and above
Weight Status
Underweight
Normal weight
Overweight
Obese
Below is information on BMI for males and females age 18 and 19 in 1980, 1990 and
2000.
BMI
mean, μ
std. dev., σ
1980
males
females
22.8
22.4
4.4
4.4
1990
males
females
23.1
23.3
5.2
5.4
2000
males
females
24.5
24.9
5.9
6.0
1. Use the 68-95-99.7 rule to find …
a) the percentage of males in 1980 with BMI less than 14.0.
2.5%
95% of BMIs will be between 14.0 and 31.6, therefore
2.5% will be less than 14.0.
b) the percentage of females in 1980 with BMI greater than 18.0.
84%
68% of BMIs will be between 18.0 and 26.8, therefore
68 + 16 = 84% will be greater than 18.0.
c) the BMI values that correspond to the central 95% of the distribution of
males in 2000?
24.5 – 2(5.9) = 12.7 to 24.5 + 2(5.9) = 36.3
d) the value such that 0.15% of females in 2000 have BMIs greater than the
value.
42.9
99.7% of BMI values will be between 6.9 and 42.9,
therefore 0.15% will be greater than 42.9.
1
2. Use Table Z in the text to answer the following questions about 18 and 19 year
olds.
a) What percentage of females in 1980 would be classified as underweight?
18.7%
18.5  22.4  3.9

 0.89
4.4
4.4
0.1867
z
b) What percentage of females in 1990 would be classified as underweight?
18.7%
18.5  23.3  4.8

 0.89
5.4
5.4
0.1867
z
c) What percentage of females in 2000 would be classified as underweight?
14.2%
18.5  24.9  6.4

 1.07
5.9
6.0
0.1423
z
d) Comment on the change from 1980 to 2000 in the percentages of females
classified as underweight.
The percentage underweight females stayed the same between 1980
and 1990 and then dropped in 2000.
e) What percentage of males in 1980 would be classified as obese?
5.1%
30  22.8 7.2

 1.64
4.4
4.4
1  0.9495  0.0505
z
f) What percentage of males in 1990 would be classified as obese?
9.2%
30  23.1 6.9

 1.33
5.2
5.2
1  0.9082  0.0918
z
g) What percentage of males in 2000 would be classified as obese?
17.6%
30  24.5 5.5

 0.93
5.9
5.9
1  0.8238  0.1762
z
2
h) Comment on the change from 1980 to 2000 in the percentages of males
classified as obese.
The percentage of obese males has increased, almost doubling every
10 years.
i) We wish to identify the cutoff value in 2000 such that 15% of the
population has BMI greater than this cutoff. Find the cutoff value for
males and the cutoff value for females.
15%  z  1.04
15%  z  1.04
cutoff males  24.5  1.04( 5.9 )
cutoff females  24.9  1.04( 6.0 )
cutoff males  24.5  6.136
cutoff females  24.9  6.24
cutoff males  30.64
cutoff females  31.14
3. Use the JAVA applet found on the web to answer the following questions. Some
sites are:
http://davidmlane.com/hyperstat/z_table.html
a) What percentage of males in 1980 would be classified as overweight?
0.2576561 or 25.8%
b) What percentage of males in 1990 would be classified as overweight?
0.265145 or 26.5%
c) What percentage of males in 2000 would be classified as overweight?
0.290616 or 29.1%
d) What percentage of females in 1980 would be classified as normal
weight?
0.534999 or 53.5%
e) What percentage of females in 1980 would be classified as normal
weight?
0.436517 or 43.7%
f) What percentage of females in 2000 would be classified as normal
weight?
0.363588 or 36.4%
g) We want to make a new weight status category that identifies the 2% of
females in 2000 with the highest BMIs. What BMIs should be included in
this new category?
above 37.2225
3
h) Repeat part g) but this time for males.
above 36.6171
4. Stat 101 students in the past answered questions on a Stat 101 Survey at the
beginning of the semester. The anonymous responses are in the JMP data table
StatSurvey.JMP on the course web page.
a) Go to the Stat 101 course webpage and open the StatSurvey file.
b) Go to the Tables pull-down menu and select Subset. Click on the circle
next to Random Sample and enter 50 in the box for Sampling Ratio or
Sample Size. Click on OK. This will create a new JMP data table
consisting of a random selection of 50 cases from the StatSurvey. Use this
new JMP data table for parts c) and d).
Each student will have a different random selection and so you should
compare the student’s description against the JMP output. Below is a
sample description.
c) Use Analyze – Distribution to create a Histogram, Box plot and Normal
Quantile Plot for the variable height. Describe each of the plots and use
your descriptions to comment on whether the random sample of heights
could have come from a population with a distribution that follows a
normal model. Turn in the JMP output with your assignment.
Histogram: Bi-modal (some samples will give a symmetric mounded
in the middle shape). The center is around 67.5 inches with values
spread from 58 to 77 inches.
Box plot: Fairly symmetric (note that the mean and median are fairly
close) with no apparent outliers.
Normal Quantile Plot: The heights generally follow the straight line
that indicates a normal distribution. There is no apparent curvature
in the plot.
The sample could have come from a population whose distribution
follows a normal model.
Note: Even with the bi-modal shape in the histogram, the box plot and
the normal quantile plot indicate the sample could have come from a
normal distribution. For some random samples the bi-modal shape in
the histogram may show up in the normal quantile plot. Students
may then conclude that the sample did not come from a population
whose distribution follows a normal model.
4
d) Use Analyze – Distribution to create a Histogram, Box plot and Normal
Quantile Plot for the variable musiccds. Describe each of the plots and
use your descriptions to comment on whether the random sample of
heights could have come from a population with a distribution that follows
a normal model. Turn in the JMP output with your assignment.
Histogram: Mounded on the left and skewed to the right. The center
is around 100 with values spread from 0 to 1000.
Box plot: Skewed right (note that the mean is much larger than the
median) with several potential outliers, values above 250.
Normal Quantile Plot: The number of music cds start out following
straight line but not the one that indicates a normal distribution.
There is a considerable amount of curvature in the plot.
The sample probably did not come from a population whose
distribution follows a normal model. The population distribution is
most likely skewed right.
5
3
.99
2
.95
.90
.75
.50
1
0
.25
.10
.05
.01
Normal Quantile Plot
Distribution: height
-1
-2
-3
10
Count
15
5
55
60
65
70
75
80
Quantiles
100.0%
75.0%
50.0%
25.0%
0.0%
Moments
Mean
Std Dev
N
maximum
quartile
median
quartile
minimum
77.000
70.250
67.500
65.000
58.000
67.56
3.8869746
50
6
3
.99
2
.95
.90
.75
.50
1
0
.25
.10
.05
.01
Normal Quantile Plot
Distribution: musiccds
-1
-2
-3
40
20
Count
30
10
0
200
400
600
800
1000 1200
Quantiles
100.0%
75.0%
50.0%
25.0%
0.0%
Moments
Mean
Std Dev
N
maximum
quartile
median
quartile
minimum
1000.0
105.0
55.0
20.0
0.0
113.4
177.83838
50
7
Download