PRACTICE EXAM 1 Business Statistics I Examination I, Chapters 1-3 Payne Part 1. Problem Solving. Please do all work on your own paper; do not write your work on this test form. 75 points (includes 5 bonus); point values are in [ ]. For each problem: • Write the formula, where applicable. • Set up the problem IN SYMBOLS and show all calculations. SHOW ALL WORK. • Answer the problem. You may round to ≥3 decimals. Please circle your answers. YOU MUST ARRANGE YOUR WORK IN LOGICAL ORDER TO RECEIVE CREDIT! 1. Consider the following hypothetical sample of 6 Statistics students. COLLSEM is the number of semesters of college completed. (i) (x) ID COLLSEM Here is the solution to part d, the Skewness coefficient. 1 5 3 ⎛ x − x⎞ 2 3 ⎟ ⎜ 3 n 3 2 ⎛ x − x⎞ ⎝ s ⎠ n x− x ID COLLSEM ⎜ ⎟ Sk = 4 6 (n − 1)(n − 2) i = 1 ⎜⎝ s ⎟⎠ 1 5 0.3333 0.0057 5 5 2 3 -1.6667 -0.7173 6 6 7 = ( −13083 . ) 3 2 -2.6667 -2.9379 ∑ 4 5 6 6 5 7 28 1.3333 0.3333 2.3333 0.3672 0.0057 1.9682 -1.3083 (6 − 1)(6 − 2) FOR THE VARIABLE COLLSEM: [10] a. Calculate the coefficient of = -0.392 variation (CV). [6] b. Find P70. [4] c. What is the mode? Interpretation: The data show a slight negative skewness. [x] d. What is the skewness coefficient? Interpret this result. (NEW) 2. A hypothetical sample of public school administrators provided information on the dollar value of their contributions to the Democratic Party for 2004. The data were summarized in the following frequency distribution. Class CONTRIB 1 $0 up to 250 2 $250 up to 500 3 $500 up to 750 4 $750 up to 1000 SUMS f 4 7 5 12 CF m (midpt) 4 124.5 11 374.5 16 624.5 28 874.5 f*m 498.00 2,621.50 3,122.50 10,494.00 $16,736.00 m^2 15,500.25 140,250.25 390,000.25 764,750.25 f*m^2 62,001.00 981,751.75 1,950,001.25 9,177,003.00 $12,170,757.00 FOR THE VARIABLE CONTRIB: [4] a. What is the value of n, the number of observations? [6] b. Estimate the sample mean contribution. [5] c. Estimate the sample standard deviation. [4] d. What is the relative frequency of class 3 ($500-$750)? 3. [8] Bob completed 5 courses at Calhoun last semester. Using the data shown below, and using credit hours as the weight variable and quality points as the x variable, calculate his grade point average (GPA) for the semester as a weighted arithmetic mean. Course Western Civ II Economics I Biology I College Algebra English Lit I 4. Grade B A B C A Quality points 3 4 3 2 4 Credit hours 3 3 4 4 3 [10] Consider these inflation rates for 5 selected western European countries: 1.5% 0.4% 2.4% 1.5% 0.3% Calculate the geometric mean inflation rate. 5. [8] Consider these parameters for data set x: Calculate the population variance of x. 6. [6] Refer to the box-and-whisker plot below. a. What is the (approximate) value of Q3, the third quartile? b. Does the distribution appear to be negatively skewed, positively skewed, or symmetrical? 10 0 7. [4] 20 30 40 50 60 70 80 90 Consider the following 4 frequency polygons. Which best represents a skewness coefficient of +1.95? f f f x x Diagram A f Diagram x Diagram x Diagram D -----END OF PART 1 Part 2. True or False. Write a T or F on your own paper. 1 point each, 10 points total. 1. Carrie observes the dogs at a dog show and organizes the data into a frequency distribution, classifying the observations by breed of dog. This is an example of ordinal data. 2. An example of ratio level data would be the gasoline consumption per mile of a set of automobiles. 3. Barbara studies a distribution of variable R which contains 1,000 observations. According to Chebyshev’s Theorem, at least 850 of the observations must lie within 2 standard deviations of the mean. 4. A statistic measures some characteristic of a sample. 5. The median of a group of data is the value which represents the center of the "weight" of the distribution, or the "balance point." 6. For a given distribution, mode < median < mean. The distribution is positively skewed. 7. The standard deviation shows the average difference between the observations in a data set and the mean of that data set. 8. A parameter is a quantity that has a variable definition and a variable value. 9. Say that you wish to compare the dispersion between two data sets. Variable x consists of the weights of a sample of Mack trucks. Variable y consists of the number of hours required to make each of the trucks. Of the measures of dispersion we have studied, the best one to make the comparison is the coefficient of variation (CV). 10. Say that you study a variable, T. You find that, no matter what two possible values you select, an infinity of other possible values will lie between them. The variable T must be a continuous variable. 11. John completes a study which shows that heavy use (2 bottles or more per day) of hand lotion made with radish pulp decreases the risk of skin cancer by 50%. Women around the country begin using large quantities of radish-pulp hand lotion, but with no effects on skin cancer rates. John’s study apparently lacks external validity. 12. Say that a properly designed and properly executed sample of the shrimp population living off the coast of Crab Island finds a mean weight of 2.143 ounces. The true population mean is actually 2.259 ounces. The difference is sampling error. 13. A sample can be more accurate than a census only if the sample is itself very large. END OF IN-CLASS PORTION OF EXAM ******Please submit your homework before you leave. (10 points) Part 3. [10] Frequency distribution and graph construction. DUE: BEGINNING OF THE NEXT CLASS. Answers may vary. DATA: School life expectancy (expected number of years of formal schooling), from a sample of 54 countries, most recent available data. (Source: Statistics Division, United Nations. http://unstats.un.org/unsd/demographic/products/socind/education.htm) 11 11 16 11 13 20 15 10 13 8 14 14 19 7 15 7 14 12 15 15 13 12 3 5 9 9 16 12 5 14 10 14 15 11 7 8 11 6 13 13 13 15 17 4 12 12 11 9 5 16 5 18 15 12 A. FREQUENCY DISTRIBUTION: Construct a frequency distribution with the following characteristics: a. A "reasonable" number of classes. Describe briefly how you arrived at this number. Mathematically defensible methods are preferred to "ad hoc" methods. b. Give the frequency, relative frequency, cumulative frequency, and relative cumulative frequency for each class. (I.e., a standard complete frequency distribution) c. Make a complete stem and leaf diagram for these data B. GRAPH: i. ii. iii. iv. v. vi. Draw any one of the following graphs to represent the data: Use pencil and paper, not a computer. histogram frequency polygon less-than-cumulative frequency ogive more-than-cumulative frequency ogive pie graph box-and-whisker plot Briefly explain (a) why you chose to use this type of graph and (b) what it shows about the data and the datagenerating process.