Chapter 4 Describing Data: Displaying and Exploring Data

True/False

1. A dot plot and a scatter diagram are different names for the same graph.
Answer: False

2. A dot plot is an easy way to represent the relationship between two variables.
Answer: False

3. A dot plot is useful for quickly graphing frequencies in a small data set.
Answer: True

4. A stem-and-leaf diagram shows the actual data values.
Answer: True

5. There is some loss of information when raw data is tallied into a stem-and-leaf display.
Answer: False

6. For a stem-and-leaf display, the leaf for the value 98 is 9.
Answer: False

7. The stem in a stem-and-leaf display is the leading digit.
Answer: True

8. In a stem-and-leaf display, the leaf represents a class of a frequency distribution.
Answer: False

9. In a stem-and-leaf display, the leaf represents a member of a class in a frequency distribution.
Answer: True

10. In a stem-and-leaf display, for each class, the leaves are arranged or sorted from smallest to largest.
Answer: True

11. In a stem-and-leaf display, it is easy to find the range for a data set.
Answer: True

12. Quartiles divide a distribution into four equal parts.
Answer: True

13. A percentile divides a distribution into one hundred equal parts.
Answer: True

14. A student scored in the 85th percentile on a standardized test. This means that the student scored lower than 85% of all students who took the test.
Answer: False

15. Quartiles are another way to describe the central location of a distribution.
Answer: False

16. Quartiles are another way to describe the dispersion of a distribution.
Answer: True

17. The 50th percentile of a distribution is the same as the distribution mean.
Answer: False

18. A percentile can be a decile, but a decile cannot be a quartile.
Answer: True

19. A quartile can be a decile, but a decile cannot be a percentile.
Answer: False

20. For a distribution, the 2nd quartile, the 5th decile, and the 50th percentile are the same as the median.
Answer: True

21. The interquartile range is the difference between the values of the first and third quartile, indicating the range of the middle fifty percent of the observations.
Answer: True

22. A box plot graphically shows data that are in percentiles.
Answer: False

23. The "box" in a box plot shows the interquartile range.
Answer: True

24. An outlier is a data point that occurs in the first quartile.
Answer: False

25. An outlier is a value in a data set that is inconsistent with the rest of the data.
Answer: True

26. A box plot shows the relative symmetry of a distribution.
Answer: True

27. A box plot shows a distribution's mean and mode.
Answer: False

28. A box plot shows the range of values that correspond to the upper 25% of the distribution.
Answer: True

29. In a box plot, if a value is more than 1.5 times the standard deviation from the first or third quartile, the value is an outlier.
Answer: False

30. In a box plot, if a value is more than 1.5 times the interquartile range from the first or third quartile, the value is an outlier.
Answer: True

31. The coefficient of variation is a measure of relative dispersion that expresses the standard deviation as a percent of the mean.
Answer: True

32. Pearson's coefficient of skewness is a measure of a distribution's symmetry.
Answer: True

33. The coefficient of variation is useful for comparing distributions with different units.
Answer: True

34. The coefficient of variation is computed by dividing the standard deviation by the median and multiplying the quotient by 100.
Answer: False

35. Negatively skewed indicates that a distribution is not symmetrical. The long tail is to the left or in the negative direction.
Answer: True

36. In a negatively skewed distribution, the mean is smaller than the median or mode and the mode occurs at the peak of the curve.
Answer: True

37. If Pearson's coefficient of skewness is equal to 0, then the mean and median are equal.
Answer: True

38. If Pearson's coefficient of skewness is negative, then the mean is greater than the median.
Answer: False

39. If Pearson's coefficient of skewness is negative, then the distribution is skewed to the left.
Answer: True

40. If Pearson's coefficient of skewness is negative, then the distribution is skewed to the right.
Answer: False

41. A scatter diagram of sales versus production may be constructed by plotting the data on a graph labeled with sales on the Y-axis and production on the X-axis.
Answer: True

42. A relationship between gender and preference for Coke or Pepsi can be best represented by a scatter diagram.
Answer: False

43. A relationship between gender and preference for Coke or Pepsi can be best represented by a contingency table.
Answer: True

Multiple Choice

44. A dot plot shows
A) The general shape of a distribution
B) The mean, median, and mode
C) The relationship between two variables
D) The interquartile range.
Answer: A

45. A row of a stem-and-leaf chart appears as follows: 3 | 0 1 3 5 7 9. Assume that the data is rounded to the nearest unit.
A) The frequency of the class is seven.
B) The minimum value in the class is 0.
C) The maximum value in the class is 39.
D) The class interval is 5.
Answer: C

46. The test scores for a class of 147 students are computed. What is the location of the test score associated with the third quartile?
A) 111
B) 37
C) 74
D) 75%
Answer: A

47. What statistics are needed to draw a box plot?
A) Minimum, maximum, median, first and third quartiles
B) Median, mean and standard deviation
C) A median and an interquartile range
D) A mean and a standard deviation.
Answer: A

48. A box plot shows
A) The mean and variance
B) The relative symmetry of a distribution for a set of data
C) The percentiles of a distribution
D) The deciles of a distribution
Answer: B

49. What does the interquartile range describe?
A) The lower 50% of the observations
B) The middle 50% of the observations
C) The upper 50% of the observations
D) The lower 25% and the upper 25% of the observations
E) None of the above
Answer: B

50. The coefficient of variation for a set of annual incomes is 18%; the coefficient of variation for the length of service with the company is 29%. What does this indicate?
A) More dispersion in the distribution of the incomes compared with the dispersion of their length of service
B) More dispersion in the lengths of service compared with incomes
C) Dispersion in the two distributions (income and service) cannot be compared using percents
D) Dispersions are equal
Answer: B

51. Mr. and Mrs. Jones live in a neighborhood where the mean family income is $45,000 with a standard deviation of $9,000. Mr. and Mrs. Smith live in a neighborhood where the mean is $100,000 and the standard deviation is $30,000. What is the relative dispersion of the family incomes in the two neighborhoods?
A) Jones 40%, Smith 20%
B) Jones 20%, Smith 30%
C) Jones 30%, Smith 20%
D) Jones 50%, Smith 33%
E) None of the above
Answer: B

52. A large oil company is studying the number of gallons of gasoline purchased per customer at self-service pumps. The mean number of gallons is 10.0 with a standard deviation of 3.0 gallons. The median is 10.75 gallons. What is the Pearson's coefficient of skewness?
A) -1.00
B) -0.75
C) +0.75
D) +1.00
Answer: B

53. What is the value of the Pearson coefficient of skewness for a distribution with a mean of 17, median of 12 and standard deviation of 6?
A) +2.5
B) -2.5
C) +0.83
D) -0.83
Answer: A

54. A study of business faculty at state-supported institutions in Ohio revealed that the arithmetic mean salary for nine months is $52,000 with a standard deviation of $3,000. The study also showed that the faculty had been employed an average (arithmetic mean) of 15 years with a standard deviation of 4 years. How does the relative dispersion in the distribution of salaries compare with that of the lengths of service?
A) Salaries about 100%, service about 50%
B) Salaries about 6%, service about 27%
C) Salaries about 42%, service about 81%
D) Salaries about 2%, service about 6%
Answer: B

55. What is the possible range of values for the coefficient of variation?
A) -1 and +1
B) -3 and +3
C) 0% and 100%
D) Unlimited values
Answer: C

56. A research analyst wants to compare the dispersion in the price-to-earnings ratios for a group of common stocks with their return on investment (ROI). For the price-to-earnings ratios, the mean is 10.9 and the standard deviation is 1.8. The mean return on investment is 25 percent and the standard deviation 5.2 percent. What is the relative dispersion for the price-to-earnings ratios and return on investment?
A) Price-to-earnings = 32.0 percent, ROI = 19.0 percent
B) Price-to-earnings = 16.5 percent, ROI = 20.8 percent
C) Price-to-earnings = 132.0 percent, ROI = 190.0 percent
D) Price-to-earnings = 50.0 percent, ROI = 10.0 percent
Answer: B

57. A study of the scores on an in-plant course in management principles and the years of service of the employees enrolled in the course resulted in these statistics:
- Mean test score was 200 with a standard deviation of 40
- Mean number of years of service was 20 years with a standard deviation of 2 years.
In comparing the relative dispersion of the two distributions, what are the coefficients of variation?
A) Test 50%, service 60%
B) Test 100%, service 400%
C) Test 20%, service 10%
D) Test 35%, service 45%
Answer: C

58. A large group of inductees was given a mechanical aptitude and a finger dexterity test. The arithmetic mean score on the mechanical aptitude test was 200, with a standard deviation of 10. The mean and standard deviation for the finger dexterity test were 30 and 6 respectively. What is the relative dispersion in the two groups?
A) Mechanical aptitude 5 percent, finger dexterity 20 percent
B) Mechanical aptitude 20 percent, finger dexterity 10 percent
C) Mechanical aptitude 500 percent, finger dexterity 200 percent
D) Mechanical aptitude 50 percent, finger dexterity 200 percent
Answer: A

59. A sample of experienced typists revealed that their mean typing speed is 87 words per minute and the median is 73. The standard deviation is 16.9 words per minute. What is the Pearson's coefficient of skewness?
A) -2.5
B) -4.2
C) +4.2
D) +2.5
Answer: D

60. A study of the net sales of a sample of small corporations revealed that the mean net sales is $2.1 million, the median $2.4 million, the modal sales $2.6 million and the standard deviation of the distribution is $500,000. What is the Pearson's coefficient of skewness?
A) -9.1
B) +6.3
C) -3.9
D) +2.4
E) None of the above
Answer: E

61. In a scatter diagram, we describe the relationship between
A) two variables measured at the ordinal level
B) two variables, one measured as an ordinal variable and the other as a ratio variable
C) two variables measured at the interval or ratio level
D) a variable measured on the interval or ratio level and time.
Answer: C

62. In a contingency table, we describe the relationship between
A) two variables measured at the ordinal or nominal level.
B) two variables, one measured as an ordinal variable and the other as a ratio variable
C) two variables measured at the interval or ratio level
D) a variable measured on the interval or ratio level and time.
Answer: A

Fill-in-the-Blank

63. What chart or graph is useful for illustrating frequencies? _______________________.
Answer: dot plot

64. For a stem-and-leaf display, what is the stem for the value 67? ____.
Answer: 6

Essay

65. Construct a stem-and-leaf display for the following data:
29 33 46 22 69 32 57 35 30 19 37 21 54 38 58 34 65 26 39 42 38 35 55 31 50 22 52 59 20 51
Answer:
1| 9
2| 0 1 2 2 6 9
3| 0 1 2 3 4 5 5 7 8 8 9
4| 2 6
5| 0 1 2 4 5 7 8 9
6| 5 9

66. From the following stem-and-leaf display, find the minimum value, the 1st quartile, the median, the 3rd quartile, and the maximum value. List and interpret the interquartile range.
1| 9
2| 0 1 2 2 6 9
3| 0 1 2 3 4 5 5 7 8 8 9
4| 2 6
5| 0 1 2 4 5 7 8 9
6| 5 9
Answer: Minimum = 19, 1st quartile = 29.75, median = 37.5, 3rd quartile = 52.5, maximum = 69. The interquartile range is 52.5 - 29.75 = 22.75. It means that about 50%, or 15 of the 30 observations, are between 29.75 and 52.5.

Fill-in-the-Blank

67. For a stem-and-leaf display, what is the leaf for the value 123? ____.
Answer: 3

68. If you are constructing a stem-and-leaf display, the "3" in 19.3 would be the ____________.
Answer: leaf

69. If you are constructing a stem-and-leaf display, the "20" in 20.5 would be the _____________.
Answer: stem

70. What is the best way to display the relationship between two variables measured on an interval or ratio level?
Answer: scatter diagram

71. What is the main advantage of a stem-and-leaf chart over a histogram? ___________________
Answer: The identity of each observation is not lost

72. The percentile range is the distance between any two _______________.
Answer: percentiles

73. In a symmetric distribution, where is the 99th percentile located? _______________
Answer: In the far right tail

74. In a positively skewed distribution, where is the 99th percentile located? _______________
Answer: In the far right tail

75. In a negatively skewed distribution, where is the 1st percentile located? _______________
Answer: In the far left tail

76. If the mean of a distribution is smaller than the median and mode, what is the sign of Pearson's coefficient of skewness? _______________
Answer: negative

77. A frequency distribution may be divided into how many percentiles? ___
Answer: 99

78. For a set of data, how many quartiles are there? _____
Answer: three

79. If two sets of data are measured in different units, what statistic can be used to compare their dispersions?
___________________________________
Answer: coefficient of variation

80. What unit of measurement is used to express the coefficient of variation? _________
Answer: percent

81. The coefficient of variation is a measure of _______________.
Answer: relative dispersion

82. The research director of a large oil company conducted a study of the buying habits of consumers with respect to the amount of gasoline purchased at full-service pumps. The arithmetic mean amount is 11.5 gallons and the median amount is 11.95 gallons. The standard deviation of the sample is 4.5 gallons. What is the Pearson's coefficient of skewness? ________
Answer: -0.30

83. Rainbow Trout, Inc. feeds fingerling trout in special ponds and markets them when they attain a certain weight. A group of 9 trout (considered the population) were isolated in a pond and fed a special food mixture called Grow Em Fast. At the end of the experimental period, the weights of the trout were (in grams): 124, 125, 123, 120, 124, 127, 125, 126 and 121. Another special mixture, Fatso 1B, was used in another pond. The mean of the population was computed to be 126.9 grams and the standard deviation was 1.20 grams. Which food results in a more uniform weight? ____________
Answer: Fatso 1B

84. The annual incomes of the five vice presidents of Elly's Industries are: $41,000, $38,000, $32,000, $33,000 and $50,000. The annual incomes of Unique, another firm similar to Elly's Industries, were also studied and found to have a mean of $38,900 and a standard deviation of $6,612. What company has the greater coefficient of variation? ______________
Answer: Elly's Industries (19.0) > Unique (17.0)

85. The spread in the annual prices of stocks selling under $10 and those selling over $60 are to be compared. The mean price of the stocks selling under $10 is $5.25 and the standard deviation is $1.52. The mean price of those stocks selling over $60 is $92.50 and the standard deviation is $5.28. Why should the coefficient of variation be used to compare the dispersion in the prices? ______________________
Answer: means differ vastly

86. The lengths of stay on the cancer floor of Community Hospital were organized into a frequency distribution. The mean length was 28 days, the median 25 days and the modal length 23 days. The standard deviation was computed to be 4.2 days. What is the Pearson's coefficient of skewness? __________
Answer: 2.14

87. A sample of the homes currently offered for sale revealed that the mean asking price is $75,900, the median $70,100 and the modal price is $67,200. The standard deviation of the distribution is $5,900. What is the Pearson's coefficient of skewness? __________
Answer: 2.95

88. Pearson's coefficient of skewness (Sk) measures the amount of skewness and may range from -3.0 to +3.0. It is computed by subtracting the median from the mean, multiplying the result by 3, and dividing by what? ________________
Answer: standard deviation

Essay

89. Given the sample information in the following table regarding public opinion on gun control, who is more likely to favor gun control?

                              Party Affiliation
Opinion on Gun Control    Democrat    Republican    Total
Favor                         90          90         180
Oppose                        98          54         152
No opinion                    46          10          56
Total                        234         154         388

Answer: Republicans are more likely to favor gun control, with 58% favoring gun control. Only 38% of Democrats favor gun control.

Fill-in-the-Blank

Use the following to answer questions 90-94:
A telemarketing firm is monitoring the performance of its employees based on the number of sales per hour. One employee had the following sales for the last 20 hours:
9 4 5 4 2 7 6 8 5 4 6 4 4 5 4 5 4 4 7 8

90. What is the median for the distribution of number of sales per hour? ____________
Answer: Median = 5 sales per hour

91. What is the first quartile for the distribution of number of sales per hour? ________________
Answer: Q1 = 4 sales per hour

92. What is the third quartile for the distribution of number of sales per hour?
_____________
Answer: Q3 = 6.5 sales per hour

93. For the distribution of number of sales per hour, 50% are greater than ____________
Answer: The median or 5 sales per hour

94. For the distribution of number of sales per hour, 50% of the observations are between __________ and ____________.
Answer: Q1 (4) and Q3 (6.5)

Use the following to answer questions 95-101:
The following stem-and-leaf display reports the number of boat shipments per week by Ottertail Boats, Inc.
11| 1 5 9
12| 0 1 2 2 6 9
13| 0 1 2 3 4 5 5 7 8 8 9
14| 2 6 8
15| 0 1 2 4 5 7 8 9
16| 1 5 7 9

95. How many weeks were included in the study? ____________
Answer: 35 weeks

96. How many observations are in the third class? __________
Answer: 11 weeks

97. What are the smallest and largest values? ___________
Answer: 111 and 169 orders

98. List the actual values in the fourth class. _______________
Answer: 142, 146, and 148 orders

99. How often did the company complete 111 shipments? ____________
Answer: 1 or once

100. How often did the company complete more than 140 shipments? __________
Answer: 15 times

101. What is the median value? _______________
Answer: 138 shipments

Essay

102. What is the common purpose of a scatter diagram and a contingency table?
Answer: Both are used to summarize two variables.

103. What is the difference between a scatter diagram and a contingency table?
Answer: A scatter diagram requires interval or ratio scaled variables; a contingency table requires nominal or ordinal variables.

104. Draw a negatively or positively skewed distribution and show the relative locations of the mean, median, and mode.
Answer: See Text:
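
Checking the computational answers: the numeric questions in this chapter rest on three formulas, and a short script can confirm the arithmetic. The sketch below is illustrative only. It assumes the coefficient of variation as defined in question 31 (standard deviation as a percent of the mean), Pearson's coefficient of skewness as described in question 88, and a quartile location rule of L = (n + 1)P/100 with linear interpolation between neighbouring ordered values. That location rule is an assumption inferred from the answers to questions 46 and 66; other quartile conventions exist and can give slightly different values. The function names are hypothetical and do not come from the text.

```python
def coefficient_of_variation(mean, std_dev):
    """Standard deviation expressed as a percent of the mean (question 31)."""
    return std_dev / mean * 100


def pearson_skewness(mean, median, std_dev):
    """Pearson's coefficient of skewness: 3 * (mean - median) / s (question 88)."""
    return 3 * (mean - median) / std_dev


def percentile_value(data, percent):
    """Value at location L = (n + 1) * percent / 100 in the ordered data.

    ASSUMPTION: this location rule, with linear interpolation between
    the two neighbouring ordered observations, is inferred from the
    answer key and is only one of several quartile conventions.
    """
    ordered = sorted(data)
    n = len(ordered)
    location = (n + 1) * percent / 100
    lower = int(location)            # 1-based position at or below L
    fraction = location - lower      # how far L sits past that position
    if lower < 1:
        return ordered[0]
    if lower >= n:
        return ordered[-1]
    below, above = ordered[lower - 1], ordered[lower]
    return below + fraction * (above - below)


# Data from question 65.
scores = [29, 33, 46, 22, 69, 32, 57, 35, 30, 19, 37, 21, 54, 38, 58,
          34, 65, 26, 39, 42, 38, 35, 55, 31, 50, 22, 52, 59, 20, 51]

if __name__ == "__main__":
    # Question 51: relative dispersion of the two neighbourhoods.
    print(coefficient_of_variation(45_000, 9_000))
    print(coefficient_of_variation(100_000, 30_000))
    # Question 52: skewness of gallons purchased.
    print(pearson_skewness(10.0, 10.75, 3.0))
    # Question 66: first quartile, median, third quartile.
    print([percentile_value(scores, p) for p in (25, 50, 75)])
```

Under these assumptions the script reproduces the keyed answers for questions 51, 52, and 66. The answer to question 92 (Q3 = 6.5) appears to follow a different quartile convention, since the location rule assumed here gives 6.75 for that data set.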