Mrs. Lerner Charlotte Catholic High School With some updates by Dr. Davidson Mallard Creek High School 1. Are uncertain in the short run 2. Exhibit a consistent pattern in the long run Note the dual aspect Note: This is the Condition for the Probability we study An event is an outcome or a set of outcomes of random phenomenon (RP). Outcome(s) of interest are “Success(es)” [even if they are bad news!!!!] (note dual aspect) Probability of an event = 1. proportion of times a success occurs in the long run (measure by experiment or simulation) 2. [number of ways for a success to occur]/[all possible events (specified by a model of the RP) If there are n ways to do a first event & m ways to do a second event Then the number of all possible outcomes=nm This is called the multiplication rule Consider the sum of two dice that are rolled Let’s say we are interested in the Probability the sum is a 4, so SUCCESS = ? Find # of ways to get a sum = 4 [a success] is 3: 1+3, 2+2, 3+1 [3 ways] The probability of a specific outcome is [# of ways to get a success]/ [all possible outcomes Probability of getting a sum =4 is: 3/36 = 1/12 ~ .0833 = 8.33 % e.g. 1. This is the chance the next outcome is a success 2. This is the proportion of times that successes occur in a large repetitive # of identical trials The probability P(A) of any event A satisfies 0≤P(A)≤1 Thus the probability of any event is between 0 & 1 The Sample Space is the set of all possible outcomes If S is the sample space in a probability model, then P(S)=1 The probabilities of all possible outcomes must add up to 1 The complement of any event A is 1. the event that A does not occur 2. written as Ac The complement rule states that P(Ac)=1-P(A) The probability that an event does not occur is 1 minus the probability that it does occur Two events A and B are disjoint 1. if they have no outcomes in common 2. cannot occur at the same time. If A and B are disjoint, then P(A or B) = P(A) + P(B). If two events have no outcomes in common, then the probability of either one occurring is the sum of their individual probabilities For any two events A and B: P(A or B) = P(A) + P(B) – P(A and B) The probability of either A or B occurring is the sum of their individual probabilities minus the probability that they occur at the same time Two events A and B are independent if knowing that one occurs does not change the probability that the other occurs. If A and B are independent, then P(A and B) = P(A)P(B). If two events are independent, then the probability that they both occur is the product of their individual probabilities Note – Disjoint events are always NOT INDEPENDENT, for if one occurs, then we know the other can not occur! And = Joint = Intersection Or = Both (1, or other, or 2) = Union P A and B P A B P A or B P A B When P(A)>0, the conditional probability of B given A is: The probability of B occurring, given that A occurs, is the probability of A & B jointly occurring divided by the probability of the stated condition (A) occurring For any two events A and B, P(A and B) = P(B|A) P(A) P(A and B) = P(A|B) P(B) The probability of A and B occurring jointly is the probability that B occurs given that A has already occurred multiplied by the probability that A occurs Example - 2: Suppose that for a certain Caribbean island in any 3-year period: 1. the probability of a major hurricane is .25 2. the probability of water damage is .44 3. and the probability of both a hurricane and water damage is .22. What is the probability of water damage given that there is a hurricane? Suppose that for a certain Caribbean island in any 3-year period the probability of a major hurricane is .25, the probability of water damage is .44 and the probability of both a hurricane and water damage is .22. What is the probability of water damage given that there is a hurricane? P water damage|hurricane P water damage and hurricane .22 .25 .88 P hurricane If three people, Joe, Betsy, and Sue, play a game in which Joe has a 25% chance of winning Betsy has a 40% chance of winning What is the probability that Sue will win? 2. If three people, Joe, Betsy, and Sue, play a game in which Joe has a 25% chance of winning and Betsy has a 40% chance of winning, what is the probability that Sue will win? P Sue 1 .25 .4 1 .65 .35 4. A summer resort rents rowboats to customers but does not allow more than four people to a boat. Each boat is designed to hold no more than 800 pounds. Suppose the distribution of the weight of adult males who rent boats, including their clothes and gear, is normal with a mean of 190 pounds and standard deviation of 10 pounds. If the weights of individual passengers are independent, what is the probability that a group of four adult male passengers will exceed the acceptable weight limit of 800 pounds? X 190 X 10 T 4 X 4 190 760 X 190 X 10 T 4 X T2 X2 X2 X2 X2 4 190 102 102 102 102 760 400 T 20 X 190 X 10 T 4 X T2 X2 X2 X2 X2 4 190 102 102 102 102 760 400 T 20 normalcdf 800,1E99,760, 20 P exceed 800 lbs 0.023 The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. 18-30 31-40 41-50 Over 50 Dem. 25 32 17 14 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 5. What proportion of the Republicans are over 50? a. b. c. d. e. 61/238 32/96 96/238 32/61 Cannot be determined. The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. 18-30 31-40 41-50 Over 50 Dem. 25 32 17 14 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 5. What proportion of the Republicans are over 50? a. b. 32/96 c. d. e. The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. Dem. 25 32 17 14 18-30 31-40 41-50 Over 50 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 6. If one adult citizen is chosen at random, what is the probability that this person is a Democrat between the ages of 41 and 50? a. b. c. d. e. 17/238 17/88 61/238 17/61 88/238 The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. 18-30 31-40 41-50 Over 50 Dem. 25 32 17 14 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 6. If one adult citizen is chosen at random, what is the probability that this person is a Democrat between the ages of 41 and 50? a. 17/238 b. c. d. e. The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. Dem. 25 32 17 14 18-30 31-40 41-50 Over 50 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 7. Given that a person chosen at random is between 31 and 40, what is the probability that this person is an Independent? a. b. c. d. e. 10/238 10/63 10/54 54/238 63/238 The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. 18-30 31-40 41-50 Over 50 Dem. 25 32 17 14 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 7. Given that a person chosen at random is between 31 and 40, what is the probability that this person is an Independent? a. b. 10/63 c. d. e. The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. 18-30 31-40 41-50 Over 50 Dem. 25 32 17 14 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 8. What proportion of the citizens sampled are over 50 or Independent? a. b. c. d. e. 54/238 61/238 100/238 115/238 Cannot be determined The following table shows the frequencies of political affiliations in the age ranges listed from a random sample of adult citizens in a particular city. 18-30 31-40 41-50 Over 50 Dem. 25 32 17 14 Repub. Indep. 18 12 21 10 25 17 32 15 _____________________________________________________________ 8. What proportion of the citizens sampled are over 50 or Independent? a. b. c. 100/238 d. e. Example Assume an multiple-choice examination consists of questions, each having five possible answers. Linda estimates that she has probability 0.75 of knowing the answer to any question that may be asked. If she does not know the answer, she will guess, with conditional probability 1/5 of being correct. What is the probability that Linda gives the correct answer to a question? (Draw a tree diagram to guide the calculations.) An examination consists of multiple-choice questions, each having five possible answers. Linda estimates that she has probability 0.75 of knowing the answer to any question that may be asked. If she does not know the answer, she will guess, with conditional probability 1/5 of being correct. What is the probability that Linda gives the correct answer to a question? (Draw a tree diagram to guide the calculations.) P(correct) = .75 +.25*.2 = .8 A random variable assumes any of several different values as a result of some random phenomenon A Discrete RV – has a countable number of possible values Value of X x1 x2 x3 … xk Probability p1 p2 p3 … pk ◦ The probabilities must satisfy two requirements Every probability is between 0 and 1 The sum of all probabilities is 1 ◦ We can use a probability histogram to look at the probability distribution. Mean of a Discrete R. V. – (also called expected value) – X x1 p1 x 2 p 2 x3 p3 x k p k x p i i Variance of a Discrete R. V. – X2 x1 X 2 p1 x 2 X 2 p 2 x3 X 2 p 3 x k X 2 p k x i X 2 p i Continuous R. V. – takes all values in an interval of numbers ◦We look at its distribution using a density curve ◦The probability of any event is the area under the density curve in that interval. 1. If X is an R. V. and a & b are fixed numbers, then the mean μa+bX = a +bμX 2. If X and Y are R. V.‘s, then μX±Y = μX ± μY If X is an R. V. and a and b are fixed numbers, then σ2a+bX = b2σX2 ◦ Note that multiplying by a constant changes the variance but adding a constant does not. If X and Y are independent R. V.’s, then σ2X±Y =σX2+σY2 “Pythagorean Theorem of Statistics” For STANDARD DEVIATION: square ‘em, add ‘em, take the square root 11. Suppose X and Y are random variables with μX = 10, σX = 3, μY = 15, and σY = 4. Given that X and Y are independent, what are the mean and standard deviation of the random variable X+Y? 11. Suppose X and Y are random variables with μX = 10, σX = 3, μY = 15, and σY = 4. Given that X and Y are independent, what are the mean and standard deviation of the random variable X+Y? μX+Y = μX + μY =10 + 15 = 25 σX+Y = √σ2X + σ2Y = √9+16 = √25 =5 12. You roll a die. If it comes up a 6, you win $100. If not, you get to roll again. If you get a 6 the second time, you win $50. If not, you lose. a. Create a probability model for the amount you will win at this game. 12. You roll a die. If it comes up a 6, you win $100. If not, you get to roll again. If you get a 6 the second time, you win $50. If not, you lose. a. Create a probability model for the amount you will win at this game. Winnings $100 $50 $0 Probability 1/6 (5/6)(1/6) 5/36 (5/6)(5/6) 25/36 12. You roll a die. If it comes up a 6, you win $100. If not, you get to roll again. If you get a 6 the second time, you win $50. If not, you lose. b. Find the expected amount you’ll win. 12. You roll a die. If it comes up a 6, you win $100. If not, you get to roll again. If you get a 6 the second time, you win $50. If not, you lose. b. Find the expected amount you’ll win. X E X 1 5 25 100 50 0 6 36 26 $23.61 Law of Large Numbers – The long run relative frequency of repeated independent trials gets closer and closer to the true relative frequency as the number of trials increases. Binomial Distribution – the distribution of the count X successes in the binomial setting. B(n,p) where n is the number of observations and p is the probability of success Use binompdf(n,p,X) to find the probability of a single value of X, such as P(X = 3). Use binomcdf(n,p,X)to find the probability of at most X successes for example P(X ≤ 3). np np1 p 13. Pepsi is running a sales promotion in which 12% of all bottles have a “FREE” logo under the cap. What is the probability that you find two free cans in a 6-pack? 13. Pepsi is running a sales promotion in which 12% of all bottles have a “FREE” logo under the cap. What is the probability that you find two free cans in a 6-pack? 6 2 4 P X 2 .12 .88 2 13. Pepsi is running a sales promotion in which 12% of all bottles have a “FREE” logo under the cap. What is the probability that you find two free cans in a 6-pack? 6 2 4 P X 2 .12 .88 2 binompdf 6,.12, 2 P X 2 .13 14. The National Association of Retailers reports that 62% of all purchases are now made by credit card; you think this is true at your store as well. On a typical day you make 20 sales. a. Let X represent the number of customers who use a credit card on a typical day. What is the probability model for X? 14. The National Association of Retailers reports that 62% of all purchases are now made by credit card; you think this is true at your store as well. On a typical day you make 20 sales. a. Let X represent the number of customers who use a credit card on a typical day. What is the probability model for X? The model is = B(20, .62) [i.e. B(n,p)] Please Explain Why 14. The National Association of Retailers reports that 62% of all purchases are now made by credit card; you think this is true at your store as well. On a typical day you make 20 sales. b. Find the mean and standard deviation. 14. The National Association of Retailers reports that 62% of all purchases are now made by credit card; you think this is true at your store as well. On a typical day you make 20 sales. b. Find the mean and standard deviation. X np 20 .62 12.4 X np 1 p 12 .62 .38 2.17 14. The National Association of Retailers reports that 62% of all purchases are now made by credit card; you think this is true at your store as well. On a typical day you make 20 sales. c. What is the probability that on a typical day at least half of your customers use a credit card? 14. The National Association of Retailers reports that 62% of all purchases are now made by credit card; you think this is true at your store as well. On a typical day you make 20 sales. c. What is the probability that on a typical day at least half of your customers use a credit card? P X 10 1 P X 9 .9077 1 binomcdf (20,.62,9) 19. The volumes of soda in quart soda bottles can be described by a Normal model with a mean of 32.3 oz and a standard deviation of 1.2 oz. What is the probability that a randomly selected bottle has a volume less than 32 oz? There are typos in the next slide: The z score calculation should read: P(x< 32) = P(z < [32-32.3]/1.2) = P(z< -.25) = .4013 And Normalcdf(-E99, 32, 32.3, 1.2) = .4013 The volumes of soda in quart soda bottles can be described by a Normal model N(32.3, 1.2) a mean of 32.3 oz and a standard deviation of 1.2 oz. What is the probability that the volume of a randomly selected bottle has a less than 32 oz? 19. 32.3 32 P x 32 P z 1.2 P z .1429 .4013 normalcdf (1E 99,32,32.1,1.2) 20. A bank's loan officer rates applicants for credit. The ratings can be described by a Normal model with a mean of 200 and a standard deviation of 50. If an applicant is randomly selected, what is the probability that the rating is between 200 and 275? 20. A bank's loan officer rates applicants for credit. The ratings can be described by a Normal model with a mean of 200 and a standard deviation of 50. If an applicant is randomly selected, what is the probability that the rating is between 200 and 275? P 200 x 275 .4332 Sampling distribution – the distribution of values taken by a statistic in all possible samples of the same size from the same population Provided that the sampled values are independent and the sample size is large enough, the sampling distribution of p̂ is modeled by a Normal model with mean p̂ p and standard deviation SD pˆ p .1 p n Assume that 12% of students at a university wear contact lenses. We randomly pick 200 students. ◦ What is the mean of the proportion of students in this group who may wear contact lenses? ◦ What is the standard deviation of the proportion of students in this group who may wear contact lenses? Assume that 12% of students at a university wear contact lenses. We randomly pick 200 students. ◦ What is the mean of the proportion of students in this group who may wear contact lenses? ◦ What is the standard deviation of the proportion of students in this group who may wear contact lenses? .12 .12 .88 .023 200 Suppose that x-bar is the mean of an SRS of size n drawn from a large population with mean μ and standard deviation σ. Then the ◦ mean of the sampling distribution of xbar is μ (hence xbar is an unbiased indicator of μ) ◦ standard deviation of the sampling distribution of xbar is σ /√n. The scores of individual students on the ACT have a normal distribution with mean 18.6 and standard deviation 5.9. At Northside High, 76 seniors take the test. If the scores at this school have the same distribution as national scores, what are the mean and standard deviation of the distribution of sample means for these 76 students? The scores of individual students on the ACT have a normal distribution with mean 18.6 and standard deviation 5.9. At Northside High, 76 seniors take the test. If the scores at this school have the same distribution as national scores, what are the mean and standard deviation of the average (sample mean) for the 76 students? 18.6 5.9 76 .6768