AP Statistics Probability Review Chapter 15: Probability Rules Addition Rule: When two events A and B are disjoint, we can use the addition rule for disjoint events: P(A B) = P(A) + P(B) If two events are not disjoint, use the General Addition Rule: For any two events A and B, P(A B) = P(A) + P(B) – P(A B) Conditional Probability: To find the probability of the event B given the event A, we restrict our attention to the total outcomes in A. We then find the fraction of those outcomes in B that also occurred. Multiplication Rule: When two events A and B are independent, we can use the multiplication rule for independent events: P(A B) = P(A) x P(B) If the two events are not independent, use the General Multiplication Rule: For any two events A and B, P(A B) = P(A) P(B|A) or P(A B) = P(B) P(A|B) Independence: Events A and B are independent whenever P(B|A) = P(B). (Equivalently, events A and B are independent whenever P(A|B) = P(A).) Tree Diagram: Figure 15.5 is a nice example of a tree diagram and shows how we multiply the probabilities of the branches together. All the final outcomes are disjoint and must add up to one. We can add the final probabilities to find probabilities of compound events. Chapter 16: Random Variables A random variable assumes a value based on the outcome of a random event. o We use a capital letter, like X, to denote a random variable. o A particular value of a random variable will be denoted with the corresponding lower case letter, in this case x. o Two types of random variables: Discrete random variables can take one of a countable number of distinct outcomes. Example: Number of credit hours, amount of people at an event Continuous random variables can take any numeric value within a range of values. Example: Cost of books this term, daily temperature A probability model for a random variable consists of: o The collection of all possible values of a random variable (x), and the probabilities P(x) that the values occur. o Of particular interest is the value we expect a random variable to take on, notated μ (for population mean) or E(X) for expected value. 2 Var X x P x 2 SD X Var X RECALL: Adding or subtracting a constant from data shifts the mean but doesn’t change the variance or standard deviation: E(X ± c) = E(X) ± c Var(X ± c) = Var(X) In general, multiplying each value of a random variable by a constant multiplies the mean by that constant and the variance by the square of the constant (since we multiply the standard deviation by the constant): E(aX) = aE(X) Var(aX) = a2Var(X) In general, o The mean of the sum of two random variables is the sum of the means. o The mean of the difference of two random variables is the difference of the means. E(X ± Y) = E(X) ± E(Y) o If the random variables are independent, the variance of their sum or difference is always the sum of the variances. Var(X ± Y) = Var(X) + Var(Y) Chapter 17: Probability Models We have Bernoulli trials if: o there are two possible outcomes (success and failure). o the probability of success, p, is constant. o the trials are independent. A Geometric probability model tells us the probability for a random variable that counts the number of Bernoulli trials until the first success. o Geometric probability model for Bernoulli trials: Geom(p) p = probability of success q = 1 – p = probability of failure X = number of trials until the first success occurs P(X = x) = qx-1p Calculator Shortcuts o 2nd DISTR geometpdf( o Note the pdf for Probability Density Function o Used to find any individual outcome o Format: geometpdf(p,x) o o o o 2nd DISTR geometcdf( Note the cdf for Cumulative Density Function Used to find the first success on or before the xth trial Format: geometcdf(p,x) A Binomial model tells us the probability for a random variable that counts the number of successes in a fixed number of Bernoulli trials. n! Ck n o In n trials, there are k ! n k ! ways to have k successes. o Binomial probability model for Bernoulli trials: Binom(n,p) n = number of trials p = probability of success q = 1 – p = probability of failure X = # of successes in n trials P(X = x) = nCx px qn–x np o npq Calculator Shortcuts 2nd DISTR binompdf( Note the pdf for Probability Density Function Used to find any individual outcome Format: binompdf(n,p,x) 2nd DISTR binomcdf( Note the cdf for Cumulative Density Function Used for getting x or fewer successes among n trials Format: binomcdf(n,p,x) Practice: 1. Suppose that 40% of cars in your area are manufactured in the United States, 30% in Japan, 10% in Germany, and 20% in other countries. If cars are selected at random, find the probability that: a) A car is not U.S.-made. b) It is made in Japan or Germany. c) You see two in a row from Japan. d) None of three cars came from Germany. e) At least one of three cars is U.S.-made. f) The first Japanese car is the fourth one you choose. 2. One card is drawn from a standard deck of cards. What is the probability it is an ace or red? 3. Two cards are drawn without replacement. What is the probability they are both aces? Extend to the probability of getting 5 hearts in a row. 4. I draw one card and look at it. I tell you it is red. What is the probability it is a heart? And what is the probability it is red, given that it is a heart? (Conditional probability) 5. Are “red card” and “spade” independent? Mutually exclusive? 6. Are “red card” and “ace” independent? Mutually exclusive? 7. Are “face card” and “king” independent? Mutually exclusive? 8. A sample of automobiles dealerships found that 19% of automobiles sold are silver, 22% of automobiles sold are SUVs. 16% of automobiles are silver SUVs. a. Are the two events disjoint/mutually exclusive? b. What is the probability that a randomly chosen sold automobile is silver or an SUV? 9. The following table shows the number of males and females wearing jeans to class on a certain day. c. What is the probability that a male wears jeans? d. What is the probability that someone wearing jeans is a male? e. Are being male and wearing jeans disjoint? f. Are sex and attire independent? 10. Using the chart below: Education majors Non-Education Majors Total Males 95 1015 1110 Females 700 1727 2427 Total 795 2742 3537 a. Find the probability the student is male or an education major. b. Find the probability the student is female or not an education major. c. Find the probability the student is not female or an education major. d. Find the probability the student is a female and education major. e. Given a student is male, what is the probability he is an education major? 11. April 2003, Science magazine reported on a new computer-based test for ovarian cancer, “clinical proteomics,” that examines a blood sample for the presence of certain patterns of proteins. Ovarian cancer, though dangerous, is very rare, afflicting only 1 of every 5000 women. The test is highly sensitive, able to correctly detect the presence of ovarian cancer in 99.97% of women who have the disease. However, it is unlikely to be used as a screening test in the general population because the test gave false positives 5% of the time. Why are false positives such a big problem? Draw a tree diagram and determine the probability that a woman who tests positive using this method actually has ovarian cancer. 12. Consider a dice game: no points for rolling a 1, 2, or 3; 5 points for a 4 or 5; 50 points for a 6. (b) Find the expected value and the standard deviation. (c) Imagine doubling the points awarded. What are the new mean and standard deviation? (d) Now imagine just playing the game twice. What are the mean and the standard deviation of your total points? (e) Suppose you and a friend both play the dice game. What are the mean and standard deviation of the difference in your winnings? 13. Suppose a used car dealer runs autos through a two-stage process to get them ready to sell. The mechanical checkup costs $50 per hour and takes an average of 90 minutes, with a standard deviation of 15 minutes. The appearance prep (wash, polish, etc.) costs $6 per hour and takes an average of 60 minutes, with a standard deviation of 5 minutes. (f) What are the mean and standard deviation of the total time spent preparing a car? (Note that we cannot find the standard deviation if we do not believe that the two phases of the process are independent, an important assumption to check.) (g) What are the mean and standard deviation of the total expense to prepare a car? (h) What are the mean and standard deviation of the difference in costs for the two phases of the operation? (i) What is the probability that it will take longer to do the appearance prep than the mechanical checkup? (Note that we cannot answer this question unless we believe each phase of the process can be described by a Normal model.) Practice Answers: 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. (a) 0.6 (b) 0.4 (c) 0.09 7/13 1/221 ; 0.0005 ½;1 No; Yes Yes; No No; No (a) No – 16% are both (b) 25% (a) 12/17 (b) 3/5 (c) No (a) 1810/3537 (b) 3442/3537 12. (a) µ = 10; 𝜎 2 = 325; 𝜎 = 18.03 (b) µ = 20; 𝜎 = 36.06 (c) µ = 20; 𝜎 = 25.5 (d) µ = 0; 𝜎 = 25.5 13. (a) µ = 150 min ; 𝜎 = 15.8 min (b) µ = $81; 𝜎 = $12.51 (c) µ = $69; 𝜎 = $12.51 (d) µ = 30 min ; 𝜎 = 15.8 min (d) 0.729 (d) No (c) 1810/3537 (e) 0.784 (f) 0.1029 (d) 700/3537 (e) 95/1110