Final Exam Review Name: 1. Association of measurement variables 1. 2. 3. Describe the relationship between two variables (measurement v measurement) Create and interpret scatter plots identify outliers Open the dataset “SAT by State 06.xls.” Construct a scatterplot of “math” versus “percent taking” (a.) positive, strong, linear (b.) negative, moderate, linear (c.) negative, strong, non linear (d.) no relationship Open the dataset “Governors05.xls.” Construct a scatterplot of “Governor Salary” versus “Median house value.” Which best describes the relationship? (a.) positive, strong, linear, with no outliers (b.) negative, moderate, linear, with an outlier (c.) positive, moderate, linear with an outlier (d.) no relationship In the scatterplot of “Governor Salary” versus “Median house value,” which state is an outlier? (a.) Pennsylvania (b.) Hawaii (c.) California (d.) Alaska 2. Correlation Coefficient Calculate and interpret the correlation coefficient 4. Calculate the correlation between “Median Housing Value” and “Governor Salary.” (a) -.02 (b) .55 (c) .33 (d) -.55 5. Which of the following is true about correlation? 6. (a.) it is only appropriate to use correlation when the relationship is linear (b.) if the correlation coefficient is close to 1 or -1, there is a cause-and-effect relationship (c.) correlation is resistant to outliers (d.) all of the above are true If a correlation coefficient between Age and Coolness Rating is -0.70, then: (a) A person’s Age is usually less than the person’s Coolness Rating. (b) A person’s Age is usually more than the person’s Coolness Rating. (c) Below average Ages tend to be associated with below average values of Coolness. (d) Below average Ages tend to go with above average values of Coolness. 3. Calculate and Apply Regression Equation 7. Calculate the regression equation Graph and make predictions based on the LSR line Understand the difference between an outlier and an influential observation in the context of regression – level 1 only “SAT by State 06.xls.” Use StatCrunch to calculate the linear regression equation of “Math” versus “Percent Taking” Equation: _____________________________ 8. Use the regression equation to predict the Math SAT score for a state with 50% of students taking the SAT Answer: 9. _____________ What is the y-intercept of the equation? Answer: ______________ 4. Interpret Regression Equation Interpret the slope and intercept of the regression equation Understand the Least Squares criterion for line of best fit 10. Explain what the slope of the regression equation (in 8.) means in context. 11. Below is the Statcrunch output for a regression run on “Airfare” (in dollars) versus “Distance” (in miles) for a set of data for flights between 12 cities. Simple linear regression results: Dependent Variable: Airfare Independent Variable: Distance Airfare = 83.26736 + 0.11737509 Distance Sample size: 12 R (correlation coefficient) = 0.795 R-sq = 0.63200194 Estimate of error standard deviation: 37.827023 The slope of the regression equation indicates that… 12. (a) There is a strong negative linear relationship (b) An additional mile tends to cost about 11.7 cents (c) A trip of 0 miles costs .117 dollars (d) None of the above The “line of best fit” under the least squares criterion is the one which… (a) minimizes the sum of all the perpendicular distances from the points to the line. (b) has the greatest sum of squared errors in predictions. (c) has the smallest sum of squared residuals (d) has the smallest average residual 6. Basic counting rules Use the multiplication and addition rules of counting Compute and apply factorials Construct and apply tree diagrams and tables 16. If Mr. Mellor has 4 shirts and 3 pairs of pants, how many different outfits can he put together? Make a tree diagram to show all the possible outfits. 17. How many 3-digit numbers contain no 4’s? How many 3-digit numbers contain at least one 4? 18. How many ways can 5 T/F questions be answered? 19. How many ways can the letters in the word “COMPUTER” be arranged? 7. Advanced counting rules 20. Use combinations and permutations to solve counting problems Distinguish between situations where combinations, permutations, or other counting rules apply Three door prizes are to be given to 3 lucky people in a crowd of 100. a. If the three prizes are identical, in how many ways can this be done? b. If the three prizes are different (1st , 2nd, and 3rd), in how many ways can this be done? 21. Baskin Robbins has 31 flavors. How many three scoop cones can be made if order is important and flavors cannot be repeated? 22. How many three scoop cones are possible if flavors can be repeated? 8. Law of Large Numbers 23. Understand the relative frequency interpretation of probability and the Law of Large Numbers Conduct simulations with physical models and random number generators to make empirical estimates of probability Which is a correct interpretation of the Law of Large Numbers? (a.) If you flip a coin 10 times and get 9 heads, we expect to get tails on the next toss. (b.) If you flip a coin a small number of times you are more likely to get close to 50% “heads” than if you flip a coin a large number of times. (c.) If you flip a coin twice and get “heads” both times, the next flip will most likely come up “tails” (d.) If you flip a coin 50 times, you probably won’t get exactly 25 “heads” but you are likely to get somewhat close to 50% “heads” Standard 9. Computing Probabilities 24. Compute probabilities using an appropriate sample space Compute probabilities using counting rules including permutations and combinations Which of the following is NOT a possible probability? (a.) 25/100 (b.) 1.25 (c.) 1 (d.) 0 25. When rolling a pair of 6-sided dice, what is the probability of rolling a sum of 6 or 7? (a.) 1/6 (b.) 3/14 (c.) 5/12 (d.) 11/36 26. Tina has 5 red, 6 blue, 3 white, and 4 orange marbles. All marbles are put in a sack and one marble is selected at random. Compute the probability of drawing a red marble and the probability of drawing a blue or white marble. (a.) 4/19; 9/20 (b.) 3/10; 11/20 (c.) 5/18; 1/2 (d.) 9/21; 7/18 27. A statistics project was assigned in a class of 20 students. Each student will present his or her work to the class. Three of the 20 students will be selected at random each day to present to the class, and presentations will continue over several days. What is the probability that you and your two best friends in the class will be the 3 students selected to present on the very first day of project presentations? (a.) 1/1140 (b.) 3/20 (c.) 1/2130 (d.) 2/101 Standard 10. Compound Events 28. Solve compound probability problems using tree diagrams and area diagrams Use conditional probabilities – level 1 In a basketball one-and-one situation. If a player makes the first foul shot, she gets another foul shot. If she misses it , she does not get to shoot again. (So a player can score 0,1, or 2 points in a one-and-one situation.) Suppose Liz has a 70% chance of making each foul shot. What is the probability that she scores exactly one point in a one-and-one situation? (a.) 70% (b.) 9% (c.) 49% (d.) 21% 29. A drawer contains 5 black socks and 3 blue socks. If you reach into the drawer without looking and pull out two socks, what is the probability that you get a matching pair? (a.) 21/56 (b.) 13/28 (c.) 4/56 (d.) 1/2 30. What is the probability that 3 students selected at random were all born on different days of the week? What is the probability that at least two of them were born on the same day of the week? (a.) 30/49; 19/49 (b.) 36/49; 2/49 (c.) 30/56; 25/36 (d.) 2/3; 1/3 Standard 11. Expected Value 31. Compute expected value Make and defend decisions based on expected value calculations Use the table to compute the expected value for X. X P(X) 32. 33. (a.) 1.2 (b.) 2.5 (c.) -.8 (d.) -2 1 .3 2 .2 -3 .5 A die is rolled. If the number rolled is odd, Player A wins $2 from Player B. If it is a 6, A wins $4 from B. Otherwise B wins $5 from A. Is this a fair game? (a.) Yes, because the expected value is 0. (b.) Yes, because the expected value is positive for player A (c.) No, because the expected value is negative for player A (d.) No, because the expected value is 0 Refer to Liz’s one-and-one situation in number 5. What is Liz’s expected value for the average number of points she scores in a one-and-one situation? (a.) 1.2 (b.) 2.5 (c.) -.8 (d.) -2 Standard 12. Using the binomial distribution 34. Graph a binomial distribution Compute probabilities based on a binomial distribution Which of the following sets of values when substituted into the formula above gives the probability of getting 6 sixes in 10 rolls of a die? (a) n = 4, p = 10, k = 1/6 (b) n = 10, p = 1/6, k = 4 (c) n = 10, p = 6, k = 1/6 (d) n = 4, p = 1/6, k = 10 35. 36. Which of the following is not a situation where the binomial probability formula should be applied? (a.) A coin is flipped 20 times. What is the probability of getting 5 or more heads? (b.) What is the probability that the first 6 occurs on the 5th roll of a six-sided die? (c.) What is the probability that Liz will make exactly 3 out of 5 shots if she has a 70% chance of making each shot and we assume independence? (d.) A bag contains 6 marbles of which two are green. A marble is selected at random from the bag, the color is noted, and the marble is returned to the bag. Repeating this action 15 times, what is the probability of getting 5 or fewer green marbles? The Good Guys have won 60% of their games over a long season. With seven games to go, what is the probability that the Good Guys will win at least five of them? Assume that the Good Guys have a .6 probability of winning each game. (a.) .6 (b.) .36 (c.) .42 (d.) .48 Standard 13. Using Normal Distributions 37. Sketch the graph of a normal distribution calculate z-scores Calculate probabilities/proportions pertaining to a normal distribution Calculate values of a variable corresponding to given probabilities or proportions Apply the normal approximation to the binomial distribution - level 1 only The distribution of lifetimes for a certain type of light bulb is normally distributed with a mean of 1000 hours and a standard deviation of 100 hours. Find the 33rd percentile of the distribution of lifetimes. (a.) 560 (b.) 330 (c.) 1044 (d.) 1440 (e.) none of these 38. Compute the z-score for a score of 70 on a test with the following summary statistics: min = 20, Q1 =70, med = 75, Q3 = 76, max = 100, s = 4, x = 70 (a) z = 2 (b) z = 2.5 (c) z = 0 (d) z = 7 39. A negative z-score indicates… (a) that someone made a computational error (b) that the mean is a negative number (c) that the distribution is skewed toward lower values (d) that the item in question is below the mean 40. A person with a Z-Score of -2.00 has performed below approximately what percent of the students taking the test? (a) 2 percent (b) 15 percent (c) 50 percent (d) 84 percent 41. In a population there are 60 scores, the mean is 45 and a standard deviation is 25. A score of 85 from this population: (a.) has a Z-score of 1.60. (b.) has a Z-score of 1.00. (c.) has a Z-score of -1.00. (d.) it is impossible to compute Z without additional information 42. One hundred students took a test on which the mean score was 73 with a standard deviation of 8. A grade of A was given to all who scored 85 or better. Approximately how many A's were there, assuming scores were normally distributed? (Choose the closest.) (a.) 42 (b.) 7 (c.) 58 (d.) 5 (e.) 22 Standard 16. Carrying out statistical tests 43. Carry out a test of significance (hypotheses, collect and organize data, compute test statistic, compute p-value, make decision, interpret in context) Suppose a medical experiment randomly assigns patients with a certain condition to either receive an “old” treatment or a “new” treatment. The experimenters are interested in whether the new treatment works better than the old treatment. Which of the following would be an appropriate null hypothesis? 44. 45. (a.) The new treatment works better than the old treatment (b.) The new treatment works the same as the old treatment (c.) The old treatment works better than the new treatment (d.) Neither the old or new treatments work. Medical experimenters randomly assigned 164 pregnant HIV-positive women to receive the drug AZT during during pregnancy while another 160 such women were randomly assigned to a control group which received a placebo. Of those in the AZT group, 13 had babies who tested HIV-positive, compared with 40 HIV-positive babies in the placebo group. Contruct two-way tables of observed and expected counts. If there is no difference between AZT and the placebo, what number of babies of mothers in the control group would be expected to be HIV positive? (a.) 26.2 (b.) 19.3 (c.) 40 (d.) 13 In an experiment to study attempting to identify factors which can influence people’s responses to survey questions, subjects were randomly given one of the two following statements and asked whether they agree or disagree: A: “Individuals are more to blame than social conditions for crime and lawlessness in this country.” B: “Social conditions are more responsible than individuals for crime and lawlessness in this country.” The responses are summarized below: Blame Individuals Blame Social Conditions Total A 282 191 473 B 204 268 472 Total 486 459 945 Statistic Chi-square DF Value P-value 1 25.4347 <0.0001 Which of the following is an appropriate conclusion for a chi-square test of the null hypothesis that the wording of the question did not influence the responses: (a.) Accept the null hypothesis. There isn’t sufficient evidence to demonstrate that the wording of the question had an effect. (b.) Reject the null hypothesis. There is strong evidence that people were influenced by a tendency to agree with the researcher (c.) Fail to reject the null hypothesis. People were more likely to disagree than to agree with whichever statement was given. (d.) Reject the alternative hypothesis. There no evidence suggesting that the wording of the question influenced responses. Standard 17. Understanding Statistical Tests 46. Explain p-value as a probability Understand the reasoning for a test of significance Which of the following is the best explanation of a null hypothesis, Ho, in a statistical test. (a) a statement of what we are trying to prove in an experiment (b) it is usually a statement that “there really is a difference” (c) a neutral assumption made for the sake of argument that the researcher tries to disprove based on conflict with the observed data (d) it is the hypothesis that the data cannot be explained by random variability 47. A p-value of 0.11 would indicate… (a) that someone made a computational error (b) strong evidence against the null hypothesis (c) that the alternative hypothesis must be false (d) that we should fail to reject the alternative hypothesis at a .05 significance level 48. Roberto suspects that his brother’s coin is biased towards heads. He flips the coin a large number of times and records the results. Suppose that the number of “heads” he got differs from the expected number of “heads” from a fair coin so much that the pvalue in a Chi-square test is 0.04. Which of the following is not a correct interpretation of this p-value? (a) In only 4 random samples in one hundred would we see such a large difference between the observed and expected numbers of heads when flipping a fair coin (b) The null hypothesis of the test is rejected at the .05 significance level (c) The coin is biased such that it comes up “heads” less than 4% of the time. (d) If Roberto’s brother’s coin is fair, the probability of observing such a large difference between the observed and expected numbers of “heads” is fairly small. 49. It is appropriate to reject the null hypothesis in a test of significance if… (a) it would be very unlikely to get the sort of data that was observed if the null hypothesis were true. (b) the data are consistent with what would be expected if the null hypothesis were true (c) the significance level if .05 and the p-value is greater than .05 (d) the chi-square value is close to 0 Standard 20. Experimental Design Understand principles of control including comparison, replication, randomization, blindness, and blocking Carry out a well-controlled experiment 50. What is the role of randomization in an experiment? (a.) A random sample of subjects is selected from a population to avoid bias (b.) Subjects are randomly assigned to treatments so create experimental groups that are similar (c.) Treatments are applied to every subject at random intervals (d.) Experimenters must randomly decide what the purpose of the study is 51. The control group in an experiment should be designed to receive: (a.) the opposite of the experiences afforded the experimental group. (b.) the same experiences afforded the experimental group except for the treatment under examination. (c.) the experiences afforded the experimental group except for receiving the treatment at random intervals. (d.) the experiences which constitute an absence of the experiences received by the experimental group. 52. In an experiment to see if Smartfood Popcorn makes people smarter, 50 students are randomly assigned to two groups. The first group receives Smartfood popcorn and the second gets regular popcorn. Then each student writes an essay on an assigned topic. The essays are scored by a teacher and the results are compared for the two groups to see if Smartfood had a positive effect on essay writing. Which of the following would ensure blindness and double-blindness in this study? (a.) the students and reseachers should both wear blindfolds (b.) no one should know whether Smartfood was used for either group. (c.) the students should not know whether they got Smartfood or a different brand of popcorn, and the teacher should not know which group the the student was in while grading the essay. (d.) Smartfood should be given to both groups but in random intervals so that the students do not know when they are getting it Standard 21.Studies distinguish between explanatory and response variables distinguish between observational studies and experiments understand when a "cause and effect" conclusion is justified understand and identify lurking (confounding) variables 53.. An experiment is conducted to determine if the use of certain specified amounts of a drug will increase IQ scores. In this study, IQ serves as: (a) an explanatory variable (b) a moderator variable (c) a response variable (d) a control variable 54. In order to test the effects of light level on student performance on tests, subjects are randomly assigned to two rooms to take a test. One room had a lower light level than the other. During the testing it was noted that one room’s temperature was 70 degrees whereas subjects in the other group are simultaneously tested in a nearby identically appointed room with the heat set at 60 degrees. A possible confounding variable in this study is… (a.) test score (b.) lighting level (c.) temperature (d.) random assignment 55. Researchers would like to understand how pet ownership is related to longevity. Which of the following is true? (a.) This study would be considered an observational study if they randomly assigned subjects to either receive a pet or not. (b.) This study would be considered an experiment if they randomly assigned subjects to either receive a pet or not. (c.)This study would be considered an experiment if they took a random sample of pet owners and noted whether or not they owned a pet (d.) none of the above is true 56. If the researchers in a study want to be able to conclude that watching television CAUSES poor academic performance and is not merely ASSOCIATED with school performance they need to… (a.) Conduct a well-controlled experiment (b.) Use an unbiased random sample in an observational study (c.) both (a.) and (b.) would be appropriate studies for the researchers (d.) neither of these types of studies can be used to establish a cause-and-effect relationship between watching TV and academic performance