Descriptive Statistics & the Empirical Rule

advertisement
Name: ___________________________________
STT1600 LAB 3: Descriptive Statistics & the Empirical Rule
Purpose: In this lab, we will use Excel to generate summary statistics of a data set consisting of a sample
of birth weights of newborn babies and compute some descriptive statistics for this data set. We will
also examine the Empirical Rule using this data set. Fill in the answers to the questions on both pages
of this lab and turn this in.
For your lab report, fill in the blanks and answer questions on this sheet and turn it in.
1. Open the excel data set “babywts” which contains the birth weights of n=1200 newborn babies (in
pounds).
2. Click your mouse at the top of column A and then click the “data” tab in the ribbon. Click on “Sort”
and choose “Smallest to Largest”. Notice that the birth weights are now sorted from smallest to
largest.
3. Compute the mean birth weight by clicking in an empty cell and typing “=average(A1:A1200)”.
4. Compute the standard deviation of birth weights by clicking in an empty cell and typing:
“=STDEV(A1:A1200)”.
a. What is the mean weight (round to two digits)? _________________
b. What is the standard deviation of the weights (round to 3 digits)? __________________
c. Are the mean and standard deviations you computed STATISTICS or PARAMETERS? (circle
one)
STATISTICS
PARAMETERS
5. Empirical Rule: In this step, we will compute an interval of weights: the mean ± 2 standard
deviations. To get the lower end of this interval, click in an empty cell in your excel spreadsheet and
type “=AVERAGE(A1:A1200) - 2*STDEV(A1:A1200)”. Click in an empty cell next to this cell and type
the same thing except change the “-“ to “+”. You have just computed the mean ± two standard
deviations.
a. Write down the interval of weights you just computed ____________________________.
b. According to the empirical rule, approximately what percentage of birth weights should fall
in this range (if the distribution is bell-shaped)? ________________________
c. Look through the 1200 sorted birth weights in Column A and compute the actual percentage
of observations that fall in this range and record the value? (Note: it may be easier to count
how many of the 1200 weights do NOT lie in this interval)________________________
d. Does the actual percentage you computed in part (c) match closely with the Empirical Rule
value? (circle one) YES
NO
e. Repeat steps (a)-(d) for the interval of the mean plus/minus 3 standard deviations and fill in
the blanks:
The interval: __________________
Percentage expected in this interval by Empirical Rule: _________
Actual Percentage of birth weights in this interval: __________________
6. In this step of the lab, we will use excel to compute median and also various percentiles of the birth
weight data set using the “PERCENTILE” function in excel.
a. Compute the median birth weight by clicking in an empty cell and typing
“=MEDIAN(A1:A1200)”.
What is the median birth weight? _________________
b. Compute the 50th percentile of the birth weights by clicking in an empty cell and typing
“=PERCENTILE(A1:A1200, 0.50)”. Note that the 0.50 tell excel to find the 50th percentile.
What is the 50th percentile? ___________________
What is the relation between the 50th percentile and the median from part (a)?
_________________________________________________________
c. Repeat part (b) for the 20th percentile of the birth weights.
What is the 20th percentile? _________________
d. Repeat part (b) for the 95th percentile. What is the 95th percentile? ___________________
e. In what row can you find the 95th percentile of the baby weights? (Hint: compute 95% of
1200) ___________________________
f. Suppose a parent has a baby whose weight is at the 95th percentile (the value you found in
part (d)), write a sentence explaining to the parent how their baby’s weight compares to the
weights of other babies.
___________________________________________________________________________
Download