Document 17881823

advertisement
MATH 1342. Chapters 1 and 2. (2 days)
Mary Parker. January 2013
page 1 of 3
Day 2 of Chapters 1 and 2.
Activity 1. Do this activity with a partner. A study of 74 seventeen-year-old girls asked them how many
servings of fruit a day they eat, on the average. Here is a histogram of the data. (Moore text, exercise 1.30)
1.a. Estimate the center (the score that half
the girls score below and half score above.)
1.b. Count and find precisely what score that
center is.
1.c. What percentage of the girls at fewer
than two servings a day?
1.d. What percentage of the girls ate two or
fewer servings a day?
Do any of these scores look as if they are
outliers?
1.e. Explain what numbers you would have to type into the software to compute summary statistics of these
scores. (You do not have to actually type them in – just explain how you know what the numbers are.)
1.f. Find the quartiles and the InterQuartile Range of this distribution without typing them in – just by counting
as you did when you found the median.
1.g. According to our text’s definition of “Suspected Outliers” in Chapter 2, are any of these scores outliers?
(Show your work as well as answer the question.)
Activity 2. Standard deviation. Download the Excel spreadsheet provided by the teacher:
http://www.austincc.edu/mparker/stat-general/ named stdev.xls. Make up six numbers and guess their
standard deviation. Then type in the six numbers into the yellow cells of the “six” worksheet. How close was
your guess to that standard deviation? (It doesn’t have to be very close – just “in the ballpark.”) See what
numbers your partner used. Discuss. (It doesn’t have to be very close – just “in the ballpark.” I won’t really
test you on this guessing. This is about understanding the concept.)
Activity 3. With a partner, answer 2.12 at the end of the section “Choosing a measure of center and spread.”
Activity 4. With a partner, do Exercise 2.13 in “Organizing a Statistical Problem”.
MATH 1342. Chapters 1 and 2. (2 days)
Mary Parker. January 2013
page 2 of 3
After the second, and final, class on Chs 1 and 2:
1. From any frequency graph (not a relative frequency graph) be able to “reconstruct” the dataset, as
much as the information allows.
2. By hand and with software, compute all the numerical summaries indicated in the given table in this
handout. When computing median and quartiles by hand,
a. in the first step, sort the data. (If the dataset has more than about six values, making a first
draft of a stemplot by hand is helpful to begin.)
b. correctly use
n 1
several times.
2
c. consider the values below, not including, the median when finding Q1, etc.
d. recognize that software may use a more sophisticated formula for medians and quartiles and
so the software values for these may differ from what you found by hand, or from what
another software package obtained.
3. Use the IQR rule to identify “suspected outliers.”
4. Boxplots. Sometimes software does not extend the “whiskers” all the way out to the maximum and
minimum. When it does not, and puts extra dots beyond the “whiskers” that means those are
suspected outliers, according to some rule like the 1.5*IQR rule. (Example: Minitab, Parker dataset
Fish_prices.txt .)
5. Standard deviation:
a. Recognize that Example 2.7 which shows how to compute a standard deviation, is given to
show you what we mean by “the amount a typical score differs from the mean” and is NOT to
show how you are supposed to compute standard deviation. DO NOT compute standard
deviations in that way. It takes too long and you need your time for learning more important
things.
b. When given a frequency graph which is not strongly skewed, estimate the mean and make
some reasonable statements about what values would be unreasonably small or unreasonably
large as estimates of the standard deviation. (The mean is approximately ___ and the
standard deviation surely must be between ___ and ____. (This doesn’t have to be a very
“tight” interval – just enough to show that you know that the standard deviation is “the
amount a typical score differs from the mean.”
6. Identify and use the four steps of the four-step process to answer questions about comparing the
distributions of a variable on two or more groups, including discussing the meaning of any differences
you see.
MATH 1342. Chapters 1 and 2. (2 days)
Mary Parker. January 2013
page 3 of 3
Quiz 3. Due Wed. Feb. 6 at the beginning of class.
Exercise 1.32. Do parts a-d using just the information given in the exercise and the graph, without looking at
the full dataset. Then, using the full dataset and software to find the quartiles, answer part e as given just
below here.
e. Find the quartiles and use our rule to determine whether data points they called outliers are
identified as “suspected outliers”. Show your work. (It is not necessary to provide printed output
here. Just say what relevant values you obtained from the software and use them.)
Exercise 2.46. Write something for each part of the four-step process. It is not necessary to re-write the entire
problem in the “State” part. Just say what you are asked to do (two sentences here.) Use Minitab for graphs
and computations and provide a printout of your results, as well as summarize them in your written answers.
Exercise 4.26. Before you make a graph, identify the response variable and the explanatory variable. Use
Minitab for graphs and computations and include a printout of the graph in your results. You may just say
what you found with software for the correlation coefficient without providing a printout of the results from
the software.
Exercise 4.38. No computation. Answer this question in writing.
Download