AP Stats Review Assume that the probability that a baseball player will get a hit in any one at-bat is 0.250. Give an expression for the probability that his first hit will next occur on his 5th at bat? What kind of distribution is this? A symmetric, mound-shaped distribution has a mean of 70 and a standard deviation of 10, find the 16th percentile score. It is known that, for a particular school, math scores are normally distributed. A random sample of the scores of 10 students of each gender yielded the values shown below. Do these data indicate a difference in the mean performance of all the boys and girls in this school at the 5% level of significance? Girls 90 80 70 75 87 92 86 61 94 100 Boys 70 75 96 92 85 72 63 95 68 98 1. Give the formula for this test. 2. Give the assumptions for this test. 3. Hypothesis The table below give the estimated marginal cost for a piece of furniture. Find the residual amount for 400 units. Units 100 200 300 400 500 600 Marginal Cost $300 $250 $220 $200 $180 $175 What’s the difference between blocking & stratifying? Predictor Constant height Coef -4.792 0.6077 SE Coef T 8.521 -0.56 0.1236 4.92 P 0.594 0.003 S = 0.932325 R-Sq = 80.1% R-Sq(adj) = 76.8% Find & interpret the correlation coefficient. A pharmaceutical company claims that 50% of adult males living in a city in the Midwest get at least two colds per year. A random sample of 100 adult males living the city of interest reported that only 42% of them experience two or more colds. Do these data indicate (at the 5% sig. level) that the true proportion of people who get more than 2 colds per year is less than 50%? 1. Give the formula. 2. Give the assumptions. 3. Give the hypothesis Name each type of sampling method: A. Code every member of a population and select 100 randomly chosen members. B. Divide a population by gender and select 50 individuals randomly from each group. C. Select five homerooms at random from all of the homerooms in a large high school. D. Choose every 10th person who enters the school. E. Choose the first 100 people who enters the school. Predictor Constant height Coef -4.792 0.6077 SE Coef T 8.521 -0.56 0.1236 4.92 P 0.594 0.003 S = 0.932325 R-Sq = 80.1% R-Sq(adj) = 76.8% Find an estimate of the population slope. (Use 95%) If I increase the significance level, what happens to the power of the test? Explain. The specifications fro the length of a part in a manufacturing process call for a mean of 11.25 cm. A simple random sample of 50 parts indicates a mean of 11.56 cm with a standard deviation of 0.54 cm. Find the probability that a random sample of 50 of the parts will have a mean of 11.56 cm or more. Predictor Constant height Coef -4.792 0.6077 SE Coef T 8.521 -0.56 0.1236 4.92 P 0.594 0.003 S = 0.932325 R-Sq = 80.1% R-Sq(adj) = 76.8% Find & interpret the coefficient of determination. Students were give a pretest at the beginning of the unit on linear equations and the same exam as a posttest at the end of the unit. Do these data indicate at the 5 % level, that there was an improvement in the scores once the instruction on the unit was completed? Pre 75 82 45 91 65 75 85 82 78 64 Post 78 81 55 93 65 78 81 86 82 66 1. Give the formula. 2. Give the assumptions. 3. Give the hypothesis A preliminary study has indicated that the standard deviation of a population is approximately 7.85 hours. Determine the smallest sample size needed to be within 2 hours of the population mean with 95% confidence. Predictor Constant height Coef -4.792 0.6077 SE Coef T 8.521 -0.56 0.1236 4.92 P 0.594 0.003 S = 0.932325 R-Sq = 80.1% R-Sq(adj) = 76.8% Find & interpret the slope What is the p-value? Explain the power of a test. A midterm exam in Applied Mathematics consist of problems in 8 topical area. One of the teachers believe that the most important of these, and the best indicator of overall performance, is the section on problem solving. She analyzes the scores of 36 randomly chosen students using MINITAB, comparing the total score to the problem-solving subscore. Give the equation for the least squares regression line. Predictor Coef StDev T P Constant 12.96 6.228 2.08 0.045 ProbSolv 4.0162 0.5393 7.45 0.000 s = 11.09 R-Sq = 62.0% R-Sq (adj)= 60.9% Predictor Constant height Coef -4.792 0.6077 SE Coef T 8.521 -0.56 0.1236 4.92 P 0.594 0.003 S = 0.932325 R-Sq = 80.1% R-Sq(adj) = 76.8% Find the residual amount if the observed value was (68,37). She analyzes the scores of 36 randomly chosen students using MINITAB, comparing the total score to the problem-solving subscore. Find and interpret the coefficient of determination. Predictor Coef StDev T P Constant 12.96 6.228 2.08 0.045 ProbSolv 4.0162 0.5393 7.45 0.000 s = 11.09 R-Sq = 62.0% R-Sq (adj)= 60.9% She analyzes the scores of 36 randomly chosen students using MINITAB, comparing the total score to the problem-solving subscore. Find and interpret the slope. Predictor Coef StDev T P Constant 12.96 6.228 2.08 0.045 ProbSolv 4.0162 0.5393 7.45 0.000 s = 11.09 R-Sq = 62.0% R-Sq (adj)= 60.9% She analyzes the scores of 36 randomly chosen students using MINITAB, comparing the total score to the problem-solving subscore. Find an estimate for the slope. Justify your answer. Predictor Coef StDev T P Constant 12.96 6.228 2.08 0.045 ProbSolv 4.0162 0.5393 7.45 0.000 s = 11.09 R-Sq = 62.0% R-Sq (adj)= 60.9% She analyzes the scores of 36 randomly chosen students using MINITAB, comparing the total score to the problem-solving subscore. Can you justify that there is a linear relationship – using statistical justification? Show it! Predictor Coef StDev T P Constant 12.96 6.228 2.08 0.045 ProbSolv 4.0162 0.5393 7.45 0.000 s = 11.09 R-Sq = 62.0% R-Sq (adj)= 60.9% The table below specifies favorite ice cream flavors by gender. Is there a relationship between favorite flavor and gender? Male Female Chocolate 32 16 Vanilla 14 4 Strawberry 3 10 1. What kind of test is this? 2. What is the expected number of males who prefer chocolate? 3. What is the degrees of freedom? 4. What assumptions must be made? A study of 20 teachers in a school district indicated that the 95% confidence interval for the mean salary of all teachers in that school district is ($38,945, $41, 245). What assumptions must be true for this confidence interval to be valid? A. No assumptions are necessary. The Central Limit Theorem applies. B. The sample is randomly selected from a population of salaries that is a t-distribution. C. The distribution of the sample means is approximately normal. D. The distribution of teachers’ salaries in the school district is approximately normal. E. The standard deviation of the distribution of teachers’ salaries in the school district is known. Predictor Constant height Coef -4.792 0.6077 SE Coef T 8.521 -0.56 0.1236 4.92 P 0.594 0.003 S = 0.932325 R-Sq = 80.1% R-Sq(adj) = 76.8% Can you prove that a linear relationship exists? Show it! A study of 20 teachers in a school district indicated that the 95% confidence interval for the mean salary of all teachers in that school district is ($38,945, $41, 245). Explain what is meant by the 95% confidence interval. Explain what is meant by the 95% confidence level. If an NFL quarterback’s pass completion percent is 79%, what is the probability that he will only complete 15 of 30 passes in his next game? If an NFL quarterback’s pass completion percent is 79%, what is the probability that he will only complete 15 of 30 passes in his next game? Give me two other ways of stating the formula for the previous problem. If an NFL quarterback’s pass completion percent is 79%, what is the probability that he will only complete 15 of 30 passes in his next game? Does this problem really meet the criteria for a binomial variable? A candy make coats her candy with one of three colors: red, yellow, or blue, in published proportions of 0.3, 0.3, and 0.4 respectively. A simple random sample of 50 pieces of candy contained 8 red, 20 yellow, and 22 blue pieces. Is the distribution of colors consistent with the published proportions. Give appropriate statistical evidence to justify your answer. The primary air exchange system on a proposed spacecraft has four separate components (A, B, C, D) that all must work properly for the system to operate well. Assume that the probability of any one component working is independent of the other components. It has been shown that the probabilities of each component working are P(A) = 0.95, P(B) = 0.90, P(C )= 0.99, and P(D) = 0.90. Find the probability that the entire system works properly. The primary air exchange system on a proposed spacecraft has four separate components (A, B, C, D) that all must work properly for the system to operate well. Assume that the probability of any one component working is independent of the other components. It has been shown that the probabilities of each component working are P(A) = 0.95, P(B) = 0.90, P(C )= 0.99, and P(D) = 0.90. What is the probability that at least one of the four components will work properly?