Statistics Midterm Review 2007 Name __________________ Chapter 1 1. The Coca Cola Company has 366,000 stockholders and a poll is conducted by randomly selecting 30 stockholders from each of the 50 states. The number of shares held by each sampled stockholder is recorded. a. Are the values obtained discrete or continuous? DISCRETE b. Identify the level of measurement (nominal, ordinal, interval, rational) for the sample data. RATIO c. Which type of sampling (random, systematic, convenience, stratified, cluster) is being used? STRATIFIED d. If the average (mean) number of shares is calculated from the sample group, is the result a statistic or a parameter? STATISTIC e. What is wrong with gauging stockholder views by mailing a questionnaire that stockholders could complete and mail back? The sample will be self-selected and thus probably biased. 2. Identify the level of measurement used in each of following. a. The amounts of tar (in milligrams) in a sample of cigarettes. RATIO b. A movie critic’s ratings of “must see, recommended, not recommended, don’t even think about going.” ORDINAL c. A movie critic’s classification of “drama, comedy, adventure” NOMINAL 3. Determine whether the following values are discrete or continuous. a. A statistics professor counts 3 absent students. DISCRETE b. A statistics professor finds that on the first test, the first paper is turned in 39.627 minutes after the test began. CONTINUOUS 4. The Hartford Insurance Company has hired you to poll a sample of adults about their car purchases. What is wrong with using telephone directories as the population from which the sample is drawn? It excludes people with unlisted numbers and people without phones 5. Decide whether each description is an observational study or an experiment. a. Cans of Coke are opened, and the volume (in ounces) of the contents are measured. Observational study b. The new drug Statisticzene is tested by recording its effects on the students who are given the drug. Experiment c. The effectiveness of multimedia teaching is tested with a sample of students who complete a course of study using the multimedia approach. Experiment d. Much controversy arose over a study of patients with syphilis who were not given a treatment who could have cured them. Observational study 6. Identify which type of sampling is used: random, systematic, convenience, stratified, or cluster. a. The Gallup Organization plans to conduct a poll of New York City residents with the 212 area code. Computers are used to randomly generate telephone numbers that are automatically called. Random b. A marketing expert for MTV is planning a survey in which 500 people will be randomly selected from each age group of 10-19, 20-29, and so on. Stratified c. An ABC news reporter polls people as they pass him on the street. Convenience d. A Johns Hopkins University researcher surveys all cardiac patients in each of 30 randomly selected hospitals. Cluster 7. Athens Academy is conducting a survey of its students to determine if they prefer having break before mid-morning meeting or after. It will randomly select 5 students from each advisory to complete the survey. a. What is the population in this case? The AA student body b. What is the sample in this case? The students selected to take the survey. c. What type of sampling is AA using? Stratified. Chapter 2 Directions: A survey was conducted of men’s heights. The results are summarized below. Use the frequency table to answer the questions that follow. Height of Frequency Men (inches) 55-59 1 60-64 3 65-69 49 70-74 46 75-79 1 1. Identify the class width, class midpoints, and class boundaries for the frequency table. Class width: 5 Class midpoints: 57, 62, 67, 72, 77 Class boundaries: 54.5 – 59.5 59.5 – 64.5 64.5 – 69.5 69.5 – 74.5 74.5 – 79.5 2. Construct a relative frequency table using the data in the frequency table. 55-59 60-64 65-69 70-74 75-79 1% 3% 49% 46% 1% 3. Construct the cumulative frequency table that corresponds to the given frequency table. 55-59 1 60-64 4 65-69 53 70-74 99 75-79 100 4. Construct a relative frequency histogram using the relative frequency table you constructed in #2. A relative frequency histogram should have class boundaries on the x-axis and relative frequencies (percentages) on the y-axis. The x-axis should be labeled “height (in)” and the y-axis should be labeled “relative frequency.” The bars of the histogram should be adjacent to one another with no spaces. Directions: Fifteen students took an Algebra test. Their grades are listed below. Use their grades to answer the questions that follow 48 99 75 82 61 44 96 83 92 67 50 75 79 71 91 5. Construct a frequency table for the data given above. You may determine the class width and the class boundaries. 40-49 2 50-59 1 60-69 2 70-79 4 80-89 2 90-99 4 6. Construct a stem-and-leaf plot for the data above. 4 4 8 5 0 6 7 8 9 1 1 2 1 7 5 3 2 5 9 6 9 8. Construct an ogive for the data given above. An ogive should have class boundaries on the x-axis and frequencies on the y-axis. The x-axis should be labeled “test scores” and the y-axis should be labeled “cumulative frequencies.” The ogive should show cumulative frequencies. An insurance company is conducting research on how often people see a doctor each year. Twelve people were surveyed and asked how many times they had seen a doctor in the past year. Their answers are listed below. Use this data for #9-11 0 5 2 14 12 7 3 2 7 9. Find the mean, median, mode, and midrange of this data. μ =6 median = 6 mode = 2, 7 10. Find the range and standard deviation of this data. Range = 14 σ = 4.4 10 1 midrange = 7 9 11. The accompanying frequency table summarizes a sample of human body temperatures. Find the mean of this frequency table. μ = 98.17 Temp 0 F 96.5-96.8 Frequency 96.9 – 97.2 8 97.3 – 97.6 14 97.7 – 98.0 22 98.1 – 98.4 19 98.5 – 98.8 32 98.9 – 99.2 6 99.3 – 99.6 4 1 12. If data are normally distributed, what percent of values fall within two standard deviations of the mean? 95% 13. Heights of women have a normal-distribution with a mean of 63.6 inches and a standard deviation of 2.5 inches. Using the empirical rule, what is the approximate percentage of women with heights between a) 61.1 inches and 66.1 inches? 68% b) 56.1 inches and 71.1 inches? 99.7% c) Use the range rule of thumb to identify the lowest and highest unusual heights for women. Min = 58.6 inches Max = 68.6 inches 14. What do you know about the values in a data set if the data have a standard deviation of 0? All the values in the data set are equal. Directions: For the following data sets, find the mean, standard deviation, and maximum and minimum usual values. Then, find the z-score of the highest value in the set and the lowest value in the set. 15. The given values are the measurements of distances (in millimeters) between the pupils of adult patients being fitted for eyeglasses. 67 66 59 62 x = 64.25 mm 63 66 66 s = 2.7 mm 65 z59 = -1.94 z67 = 1.02 (Remember to use x and s when you are determining the mean and standard deviation of a sample rather than a population. This is a sample because it does not include all adult eyeglass wearers.) *Note that this is different than what the original, handwritten answer key said. The same applies for #16. 16. The given values are the service times (in seconds) of McDonald’s drive-through customers. 88 107 35 93 x = 81 sec 65 55 s = 26.4 sec 119 83 z35 = -1.74 99 74 46 108 z119 = 1.44 17. What is the z-score of the lowest “usual” value in any data set (according to the range rule of thumb)? -2 18. What is the z-score of the highest “usual” value in any data set(according to the range rule of thumb)? 2 19. Would it be unusual for a customer who went to the McDonald’s mentioned in #16 to wait for only 35 seconds? Why or why not? No, because 35 seconds is within two standard deviations of the mean. 20. Look at Table 2-16 on page 87 of your book. What percentile is the weight .8126? 17th percentile 21. Look at Table 2-16 on page 87 of your book. Find the value that marks P60, the 60th percentile. .8189 Chapter 3 Directions: Which method of determining probability is described in each situation below? 1. Based on today’s temperature, the location of two cold fronts, and the current air pressure, a meteorologist says there is a 30% chance of rain tomorrow. Subjective 2. Victor Niederhoffer is a professional investor. He tries to predict what the stock market will do and he buys and sells stock accordingly. In the following excerpt from an article in the New Yorker, Niederhoffer notices that, for the past three days, the stock market has opened down. “Niederhoffer turned to Alex Castaldo, a thin, bespectacled fifty-three-year-old Italian who has a degree in electrical engineering from M.I.T. and a Ph.D. in finance from CUNY, and asked him to compile some data. “Doc,” he said to Castaldo, “what does the market do when it opens down a lot three days in a row?” A few minutes later, Castaldo handed Niederhoffer a computer printout, which showed that since the start of 2003 there had been just ten occasions on which, for three consecutive days, the S. & P. 500 had fallen sharply in the first hour and a half of trading. On eight of those occasions, stocks had bounced back.” What method of determining probability is Niederhoffer using to predict what the market will do? Relative frequency approximation 3. The probability of rolling a three is 1/6. Classical approach 4. If a person is randomly selected, find the probability that his or her birthday is October 18th, which is National Statistics Day in Japan. Ignore leap years. 1 365 5. If a person is randomly selected, find the probability that he or she was NOT born on National Statistics Day. Ignore leap years. 364 365 Directions: Use the diagram below to answer questions… If one of the shapes above is selected at random, what is the probability that.. 6. it will be a rectangle? 6 14 7. it will have exactly three sides? 8 14 9 14 8. it will have four sides or have the letter B? 9. it will be a have either three or four sides? 1 10. it will have five sides or the letter A? 7 14 11. it will have three sides or the letter A? 10 14 If two of the shapes above are selected at random, with replacement, what is the probability that… 12. the first shape will be a rectangle and the second will be a triangle? 13. the first will have the letter B and the second will be a rectangle? 48 196 42 196 14. they will both have the letter B? 15. neither will have the letter B? 49 196 49 196 16. the first will be a rectangle or have the letter A and the second will have the letter A 22 and be a rectangle? 196 If two of the shapes above are selected at random without replacement, what is the probability that… 17. they will both have four sides? 30 182 18. the first will have an A and the second will have a B? 19. they will both be triangles with As? 49 182 20 182 If one shape is picked at random… 20. Are the following events mutually exclusive? Picking a triangle Picking a shape with an A No, because there are triangles with As. 21. Are the following events mutually exclusive? Picking a square Picking a triangle? Yes, no shape is both a triangle and a rectangle. 22. If one shape is picked at random, find the probability that it is a rectangle given that it 4 has a B. 7 23. If one shape is picked at random, find the probability that has an A on it, given that it 2 is rectangle. 6 24. If one shape is picked at random, find the probability that it is a triangle, given that it is a rectangle. 0 If three shapes are picked at random, with replacement, find the probability that.. 25. none of the shapes is a triangle. 216 2744 26. at least one of them is a triangle 1 27. at least one of them has a B. 216 2528 2744 2744 2401 2744 28. If P(A) = .34, find P(the complement of A) .66 29. Evaluate 5! 120 30. How would you evaluate aPb ? 31. How would you evaluate aCb ? a! (a b)! a! (a b)!b ! 32. A small class of five students is instructed to form a straight line. In how many possible orders can the children line up? 5! = 120 33. A class of ten children is putting on a play. There are four speaking roles. How many different assignments of speaking roles are there? 10P4 = 5040 34. A class of ten children is putting on a concert. During one song, there is a section of music that only requires four singers. How many different groups of four can be selected? 10C4 = 210 35. You have just started your own airline company called Air America. You have one plane for a route connecting Austin, Boise, and Chicago. How many routes are possible? 3!= 6 35. Walton Pharmaceuticals wants to test the effectiveness of a new drug designed to relieve allergy symptoms. The initial test will be conducted by treating six people chosen from a pool of 15 volunteers. If the treatment group is randomly selected, what is the probability that it consists of the six youngest people in the pool? 1 1 5005 15 C6 36. Each social security number is a sequence of nine digits. If you randomly generate nine digits, what is the probability of getting your own social security number? 1 1 9 (10) 1, 000, 000, 000 Chapter 4 Directions: Determine if a probability distribution is given. If it is, find μ and σ. If it isn’t a probability distribution, explain why not. 1. x P(x) 0 .125 1 .375 2 .375 3 .125 2. x 0 1 2 3 4 P(x) .522 .368 .098 .011 .001 Yes, it is a probability distribution. μ = 1.5 σ = .9 Yes, it is a probability distribution. μ = .6 σ = .7 3. Focus groups are often used for finding detailed information about a product. A focus group of 12 people is randomly selected to discuss products of the Coca Cola Company. In such groups of 12 people, the mean number who recognize the coca Cola brand name is 11.4 and the standard deviation is .75. Would it be unusual to randomly select 12 people and find that fewer than eight of them recognize the Coca Cola brand name? Yes, because 8 is more than two standard deviations below the mean. 4. Look at Table 4-1 on page 181 in your book. Find the probability of getting 10 or more girls among 14 births. If you are testing a gender-selection technique and you get 10 girls among 14 births, does it appear that the method is effective? Why or why not? P(10 or more) = .09 Since the probability of getting 10 or more girls naturally is greater than .05, the technique does not seem to be effective. 5. A scratch-off lottery ticket costs $3.00. Out of every 10,000 scratch-off tickets, 10 of them pay out a prize of 50, and 2 of them pay out a prize of $1,000. What is the expected value of one ticket? Possible Outcome Win $50 Win $1,000 Lose X (Value for that outcome) $47 $997 -$3 P(x) .001 .0002 .9988 Expected value = μ = -$2.75 6. What four conditions must a procedure meet in order to yield a binomial probability distribution? 1. There must be exactly two possible outcomes. 2. There must be a fixed number of trials (n). 3. The events must be independent. 4. The probabilities of success and failure must remain constant. 7. Bryan flips a coin 10 times and records the number of times heads comes up. Does this yield a binomial probability distribution or not? Why or why not? If so, define S, F, n, p, and q. Then find P(3) and P(8 or more). Yes, it does yield a binomial probability distribution. 1. Two outcomes – heads or tails 2. n = 10 3. Independent – one coin flip does not affect the next. 4. The probability of getting heads is always ½ . S= heads F = tails n = 10 p = .5 q = .5 P(3) = .117 P(8 or more) = P(8) + P(9) +P(10) = .055 8. You are testing telephone directory assistance. You request 10 numbers. Assume that director assistance gives out the wrong number 15% of the time. a) Find the probability of getting one wrong number. This is a binomial probability distribution. 1. Two outcomes – correct telephone number or incorrect telephone number. 2. n =10 3. Getting a wrong number one time does not affect the next number you receive from directory assistance. 4. The probability of getting a wrong number is always .15. S = getting the wrong number p = .15 q = .85 F = getting the right number n = 10 P(1) = .347 b) Find the probability of getting at most one wrong number. P(at most 1) = P(0) + P(1) = .544 c) If you do get at most one wrong number, does it appear that the rate of wrong numbers is not 15%? No. Given a wrong # rate of 15%, it is very likely (> .05) that you will get either 0 or 1 wrong numbers when you request 10 numbers. 9. Several students are unprepared for a true/false test with 20 questions and all of their answers are guesses. a) Find the mean and standard deviation for the number of correct answers for such students.(Hint: Determine whether or not this is a binomial probability distribution. If it is, explain why, define S, F, n, x, p, and q and then use the shortcut formulas for μ and σ.) This is a binomial distribution. 1. Two outcomes – choose the right answer or the wrong answer 2. n = 20 3. If you are guessing, getting one question right does not affect whether you get the next question correct. 4. If you are guessing on a true/false test, the probability of choosing the right answer is always ½ . S = Choosing the right answer F = Choosing the wrong answer n = 20 p = .50 q = .50 When you have a binomial probability distribution, μ = np = 20(.5) = 10 When you have a binomial probability distribution, σ = npq = 2.24 b) Would it be unusual for a student to pass by guessing and getting at least 12 corect answers? Why or why not? No, because 12 is within two standard deviations of the mean. Or, No, because P(12 or more) is > .05 10. There are 15 particularly winding miles along Highway 1 where 30 accidents happened last year. a. If I randomly select one 1-mile stretch from that treacherous section, what is the probability that 4 accidents happened in that 1-mile stretch? (Round answers to three decimal places.) Use the Poisson distribution here. Interval: 1-mile stretch of highway Event: accident 30accidents μ= = 2 accidents/section 15sec tions x=4 P(4) = .09 b. How many 1-mile sections would you expect to have been the site of 0 accidents? 1 accident? 2 accidents? 3 accidents? (Round probabilities to three decimal places and round expected number of sections to one decimal place.) # of # of accidents sections 0 2.0 1 4.1 2 4.1 3 2.7 Chapter 5 1. The readings on thermometers are normally distributed with μ = 00 C and σ = 10 C. a. If you randomly select one thermometer, find the probability that its reading is between .5 and 1.3. .2117 b. If you randomly select one thermometer, find the probability that its reading is between -.4 and 2.0. .6326 c. If you randomly select one thermometer, find the probability that its reading is less than -1.7. .0446 d. Find the temperature that separates the top 7% of thermometers from the rest. 1.480 e. Find the temperature that separates the bottom 11% of thermometers from the rest. -1.230 f. Quality control will eliminate the thermometers in the top and bottom 10% . Find the temperatures that separate the good thermometers from the bad. -1.280 and 1.280 g. If thermometers must have readings between -1.00 C and 1.00 C to pass quality control inspections, what percentage of thermometers is discarded for failing inspection? 31.74% 2. Women’s weights are normally distributed with a mean of 143 lbs and a standard deviation of 29 lbs. If one woman is randomly selected, find the probability that her weight is… a) greater than 158 lbs. . 3015 b) between 120 lbs and 150 lbs. .4469 c) less than 125 lbs. .2676 Find the weight that separates… a) the bottom 15% of weights from the rest of the weights. 112.84 pounds b) the top 22% of weights from the rest. 159. 82 pounds If a group of 25 women is randomly selected, find the probability that… a) the average of their weights is greater than 170. .0001 b) the average of their weights is between 128 and 149. .8484