5 – Binomial Random Variable and Probability Distribution 5.1 - Motivating Example and Definitions: 5.2 - Binomial Distribution and Random Variable A binomial random variable, X, is defined to be the number of “successes” in n independent trials where the P(success) = p is constant for each of trials. In the definition above notice the following conditions need to be satisfied for a binomial experiment: 1. There is a fixed number of n trials carried out. 2. The outcome of a given trial is either a “success” or “failure”. 3. The probability of success (p) remains constant from trial to trial. P( success ) p and P( failure ) (1 p) = q 4. The trials are independent, that is the outcome of a given trial is not affected by the outcome of any other trial. Binomial Probability Function n n! P( X x) p x q n x p x q n x , x 0,1,..., n x x ! ( n x )! where k! 1 2 3 k “k factorial” is the product of the numbers 1 through k and is denoted by k! The factorial expression in front counts the number of ways it is possible to obtain x successes in n trials while the remainder of the formula gives the probability of getting a particular sequence with x “successes” and n – x “failures”. Mean # of successes = np Standard deviation of the # of successes = √𝑛𝑝𝑞 44 The JMP spreadsheet Binomial Table Generator does all the probability calculations for the following: P(X = x) = probability of obtaining x “success” in n trials. P(X < x) = probability of obtaining x or less “successes” in n trials. P(X > x) = probability of obtaining x or more “successes” in n trials. To use the Binomial Table Generator file in JMP you simply need to change the number of independent trials (n) and the probability of success (p) in the table below. To change the n and p values right-click at the top of the column and select Formula. You can then change the number in the formula to your desired values for n and p. The table automatically updates and gives the following probabilities in the last three columns. Be sure to look at the comments on each in the upper left-hand corner of the table. Example 1: Drug Side Effects A drug company claims that 10% of patients taking the drug will experience adverse side effects. To test this claim a researcher administers the drug to a random sample of 20 patients. Let X = the number patients in our sample who experience adverse side effects. Does this study meet the conditions of binomial probability experiment? 45 To set up the binomial probability table for this situation we need to change the number of trials to n = 20 and the probability of success to p = .10, to do this right-click on the top of each column and select Formula. a) What is the probability that exactly 4 patients experience side effects? 46 b) What is the probability that 2 patients or less experience side effects? c) What is the probability that 5 patients or less experience side effects? d) What is the probability that 6 or more patients experience side effects? e) Suppose that in our sample 6 patients experience side effects. What do think about the claim made by the drug company on the basis of this result? f) Suppose we gave the drug to 100 patients. If the drug company’s claim regarding side effects is true, how many side effects do we expect to observe, i.e. what is the mean or expected value of X? Recall: For a binomial random variable np g) Find the standard deviation of the number of side effects and use it to give an interval that is very likely to cover the number of side effects observed when giving the drug to n = 100 patients. Recall: For a binomial random variable npq 47 When n is sufficiently large then X = # of “successes” in the n trials is approximately normally distributed. As an example consider the histogram below which shows the simulated results of 10,000 clinical trials where for each trial n = 100 patients are given a drug which has p P( side effect) .10 or a 10% chance of causing a side effect and the number of patients experiencing side effects is observed. h) How many side effects would you have to observe to be convinced that the 10% side effect is wrong and that the true side effect rate is greater? Binomial Distribution Table for n = 100 and p = .10 48 Example 2: Effect of Togetherness on Heart Rates of Rats (SIGN TEST) A researcher was interested in determining if the heart rate of rats increases when they are in a cage with other rats versus when they are in a cage by themselves. The researcher thought this would be the case but wanted to conduct a study to determine if this hypothesis was supported empirically. The results were obtained: Rat Alone Together Difference Sign of Difference (A) (T) =T-A 1 463 523 2 462 494 3 462 461 4 456 535 79 5 450 476 26 6 426 454 28 7 418 448 30 8 415 408 -7 9 409 470 61 10 402 437 35 What did we conclude regarding the research hypothesis? 49 Example 3: Swain vs. Alabama In 1965, an appeal to the Supreme Court was made by a black man (the petitioner) sentenced to death after being convicted in the Circuit Court of Talladega County, Alabama, of the rape of a 17 year-old white girl by an all-white jury. At the time of the crime, the defendant was 19. The petitioner alleged (among other things) that the entire process of selecting eligible jurors, from the jury pool of eligible jurors to the jury panel (from which the trial jury is chosen), was racially discriminatory. At the time, in the early 1960’s, eligible jurors in Alabama were males over 21. According to Census figures available then, there were 16,406 individuals that fit the profile in Talladega Country. Twenty-six percent of these individuals were AfricanAmericans. When 100 individuals were chosen to serve on the jury panel, only 8 of them were African-Americans*. What is the likelihood of getting 8 blacks in a sample of 100 drawn from the population of males eligible for jury duty in Talladega County? 50