Box and Whisker Plots (Day 2) - MELT-Institute

advertisement
Box and Whisker Plots
Name: ___________________
1. Just the Math: Box – and – Whisker Plots A box-and-whisker plot (also known as a box
plot) is used to represent a large data set. Box-and-whisker plots allow you to easily make
comparisons of data sets. The box-and-whisker plot below represents a set of data.
What do you think is the lower extreme of the data set? What do you think is the upper
extreme? Use complete sentences in your answer.
The vertical line inside the box represents the second quartile Q2 of the data. What is the value
of Q2? Use a complete sentence in your answer.
The point in the box directly to the left of the second quartile represents the first quartile Q 1.
What is the value of Q1? Use a complete sentence in your answer.
Similarly, the point in the box directly to the right of the second quartile represents the third
quartile Q3? What is Q3? Use a complete sentence in your answer.
The horizontal lines on both ends of the box are called whiskers. What do the dots at the end of
the whiskers represent? Use a complete sentence in your answer.
2. To create a box-and-whisker plot of the cereal data use the extremes and quartiles that you
found. Complete the information below.
Draw a number line below and label it to represent the full
range of data values. Locate the second quartile on the
number line. About an inch above the number line, draw a dot
for Q2 and label its value. Repeat this process to draw dots for
the first and third quartiles and for the upper and lower
extremes. Then draw a box with sides at the first and third
quartiles. Draw a vertical line through the median. Draw two whiskers from the sides of the box
to the extremes.
Lower Extreme
First Quartile, Q1
Second Quartile, Q2
Third Quartile, Q3
Upper Extreme
3. From the box and whisker plot above, (the one created in #2) why is one whisker longer than
the other? What does this mean? Use a complete sentence in your answer.
4. An outlier is a data value that is much greater than or much less than the other values in the
set. What is the outlier in the breakfast cereal data? Make a conjecture, or an educated guess,
about which quartile would be the most affected by removing the outlier. Write your answer
and your conjecture using complete sentences.
5. Remove the outlier and
find the values of the
quartiles. Complete the
table to see if you were
correct.
Original Data
Data with outlier removed
First Quartile
Second Quartile
Third Quartile
6. Quartiles divide the data set into four parts. Percentiles also divide the data set into parts.
Based on the root of this word, into how many parts do you think percentiles divide the data
set? Use a complete sentence in your answer.
The nth percentile for the data set is the value for which n percent of the numbers in the set are
less than that value. For example, the median in a data set often represents the 50 th percentile
because 50% of the data are less than the median. In the original cereal data set, is the median
the 50th percentile? Use a complete sentence in your answer.
What percentile ranking is the first quartile in this data set? Use a complete sentence to explain
your reasoning.
The third quartile represents what percentile in this data set? Explain your answer using a
complete sentence.
7. The interquartile range, or IQR is the difference between the upper and lower quartiles
(Q3 – Q1) and represents the range of approximately the middle 50% of the data. The IQR
indicates the spread between the lower and upper quartiles. If it is a small number, then the
middle 50% of the data are consistent. If it is a large number, then the middle 50% of the data
are spread apart. Find the IQR for the original cereal data.
Download