Chapter 3: Probability Distributions and Statistics Section 3.1-3.3 3.1 Random Variables and Histograms A is a rule that assigns precisely one real number to each outcome of an experiment. We usually denote a random variable by X. Notes: • The assignments of numbers to the outcomes of experiments are normally done in a manner that is reasonable and, most importantly, in a manner that permits these numbers to be used for interpretation and comparison. • Unless otherwise specified, when the outcomes of an experiment are themselves numbers, the random variable is the rule that assigns each number to itself. Example 1 The following are examples of random variables: 1.) You flip a coin. Assign the random variable X the value 1 if heads and 2 if tails. 2.) You flip a coin 10 times. X is the number of heads that occur. 3.) You roll a die. X is the number that appears on the uppermost face of the die. Types of Random Variables: if it assumes only a (1) A random variable is finite number of values. (2) A random variable is if it takes on an infinite number of values that can be listed in a sequence, so that there is a first one, a second one, a third one, and so on. (3) A random variable is said to be take any of the infinite number of values in some interval of real numbers. 1 if it can Example 2 Classify the following random variables. a) X = The number of defective batteries in a sample of 8 b) X = The number of times you flip a coin until a heads is shown c) X = The amount of time an Aggie student stands during a home football game A probability distribution of a random variable assigns a probability, p1 , p2 , ..., pn to each value of the random variable, x1 , x2 , ..., xn . Recall that these can be represented by probability distribution tables. The probability distribution of a random variable X satisfies: 1. 0 ≤ pi ≤ 1 2. p1 + p2 + ... + pn = 1 Example 3 Determine whether the table gives the probability distribution of the random variable X. x P (X = x) -2 -1 0 1 2 0.1 0.2 0.2 0.3 0.1 Example 4 The probability distribution of the random variable X is shown in the table. x -5 -3 -2 0 2 3 P (X = x) 0.17 0.13 0.33 0.16 0.11 0.10 Find: a) P (X = −3) 2 b) P (X ≤ 2) c) P (−3 ≤ X ≤ 10) d) P (X > 0) Example 5 Two fair four-sided dice are rolled. Let the random variable X denote the sum of the faces that fall uppermost. a) List the outcomes of the experiment. b) Find the value assigned to each outcome of the experiment by the random variable X. c) Find the probability distribution of X. d) What is P (X ≥ 4)? 3 e) What is P (3 < X ≤ 6)? Example 6 A carton contains a dozen eggs, of which 5 are cracked. An experiment consists of randomly selecting 3 eggs from the carton. Let X be the number of cracked eggs that are selected. a) Find the probability distribution for X. b) What is the probability that more than 1 cracked egg will be selected? 4 Example 7 An unfair die has the property that the probability of rolling a 1 is 0.3. This die is rolled 4 times. Let X be the number of times a 1 is rolled. Find the probability distribution for X. Note: This is a binomial experiment (section 2.4). Therefore, this distribution is referred to as a binomial distribution. We can represent a probability distribution graphically using a . The x-axis of a histogram corresponds to the values of the random variable and the y-axis corresponds to probabilities. Normally, we draw a bar of width one above each value of the random variable and the height of the bar is the probability that the random variable takes on that particular value. Area and Probability: The of a region of a histogram associated with the random variable X is equal to P (X), the probability that X occurs. Furthermore, the probability that X takes on the values in the range Xi ≤ X ≤ Xj is the sum of the areas of the histogram from Xi to Xj . Example 8 Draw a histogram for the probability distribution below. Shade the area representing P (0 < X ≤ 3) and calculate its value. X 0 1 2 3 4 5 Probability 0.2 0.15 0.3 0.1 0.05 0.2 5 3.2 Measures of Central Tendency The by µ or x, is given by or of the n numbers x1 , x2 , ..., xn , denoted Example 9 Find the mean of the following set of numbers: 64, 67, 68, 71 Example 10 Find the mean of the following set of numbers: 2, 64, 67, 68, 71 The mean is not always the best representation of what outcomes are occurring “most of the time” because it is sensitive to extreme values. We will look at some other notions of central tendency. The of a set of numerical data is the middle number when the numbers are arranged in order of size and there is an odd number of entries in the set. In the case that the number of entries in the set is even, the median is the mean of the two middle numbers. of a set of observations is the observation that occurs more frequently The than the others. If the frequency of occurence of two observations is the same and also greater than the frequency of occurrence of all the other observations, then we say that the set is bimodal and has two modes. If no one or two observations occurs more frequently than the others, we say that the set has no mode. Example 11 What are the mean, median, and mode of the set of numbers: 1, 4, 4, 5, 9, 9, 9? (Round your answers to four decimal places.) 6 Example 12 What are the mean, median, and mode of the set of numbers: 5, 9, 4, 1, 5, 1? (Round your answers to four decimal places.) Finding Mean and Median Using Your Calculator Given a List of Data: (1) Press STAT. (2) Select EDIT. (3) In the first column, enter the values. (4) Exit to the home screen. (5) Press STAT. (6) Scroll to the right to CALC. (7) Select 1-Var Stats (Option 1) (8) Press L1 (2nd → 1). (9) Press ENTER. (10) The mean is given by x. To find the median, scroll down until you see Med. Example 13 The daily maximum temperature in degrees Fahrenheit for the month of October in College Station follow: 85, 82, 85, 90, 89, 77, 63, 70, 85, 89, 89, 89, 86, 87, 82, 70, 90, 79, 82, 86, 90, 88, 86, 87, 88, 62, 63, 67, 71, 75, 84 Find the mean temperature and the median temperature. (Round to four decimal places.) 7 Finding Mean and Median Using Your Calculator Given the Random Variable and the Frequency or Probability: (1) Press STAT. (2) Select EDIT. (3) In the first column, enter the values the random variable X takes on. (4) In the second column, enter the frequency or probability for each value of X. (5) Exit to the home screen. (6) Press STAT. (7) Scroll to the right to CALC. (8) Select 1-Var Stats (Option 1) (9) Press L1 (2nd → 1). Press the comma key. Press L2 (2nd → 2). (10) Press ENTER. (11) The mean is given by x. To find the median, scroll down until you see Med. Example 14 A certain puzzle manufacturer makes and sells puzzles. The data below shows how many puzzles this company makes with different numbers of pieces. Number of Puzzles 400 200 350 150 275 300 75 Number of Pieces 500 1000 750 2000 300 1500 3000 a) Determine which row represents the random variable X and which row represents the frequency. b) Calculate the mean. (Round to four decimal places.) Let X denote the random variable that has values x1 , x2 , ... xn , and let the associated probabilities be p1 , p2 , ..., pn . The variable X, denoted by E(X) or µ, is or 8 of the random Example 15 Calculate the mean of the above table using our new definition by hand. (Round to four decimal places.) Example 16 Find the expected value of a random variable X having the following probability distribution. Also find the mode(s). X Probability 2 3 4 5 6 1 6 1 3 1 4 1 12 1 6 5/12 1/3 1/4 1/6 1/12 0 2 3 4 X 9 5 6 Example 17 Two cards are selected at random from a standard deck of cards. What is the expected number of hearts? Expected values are often used in games to determine whether the game is “fair.” In a game situation, we let X be the net winnings (profit) of the player. A game is considered “fair” when the expected net winnings are 0, ie, when E(X) = 0. Example 18 Visitors at a carnival pay $1.00 to play a game. The game consists of pulling a marble out of a bag. If the marble is purple, the participant wins $5.00. If the marble is maroon, the participant wins $2.00. If the marble is teal, the participant wins $1.00. However, if the marble is white, the participant loses. Assume the bag contains 1 purple marble, 2 maroon marbles, 3 teal marbles, and 4 white marbles. a) Draw a histogram of the probability distribution for this game where X represents your net winnings. b) Determine the expected winnings. 10 c) Is this game fair? Example 19 (From Tan) In a lottery, 5000 tickets are sold for $1 each. One first prize of $2000, one second prize of $500, three third prizes of $100 and ten consolation prizes of $25 are to be awarded. What are the expected net earnings of a person who buys a ticket? Example 20 A game consists of rolling a pair of fair 6-sided dice. The game costs $4 to play. If you roll the same number on both dice (a double), you win $a. Otherwise you win nothing. What value of a would make this game fair? Example 21 A life insurance company charges customers a yearly premium of $140 for a $25,000 life insurance policy. If the probability that a given customer will live through the next year is 0.998, what is the life insurance company’s net earnings? 11 If the probability that the man lives drops to 0.98, what is the minimum amount of money he can expect to pay for his policy? The expected value of the binomial distribution with n trials and probability of success p in a single trial and q = 1 − p is Example 22 Given that a binomial trial is repeated 300 times with a probability of success of 0.68, find the expected value of this binomial trial. Example 23 Assume that the expected number of successes in a binomial experiment consisting of 60 trials is 23. What is the probability of success? 12 3.3 Measures of Spread Example 24 The following lists are test scores for 10 students in Math 166. Calculate the mean of each. a) 44,48,55,56,58,95,97,98,99,100 b) 71,72,72,74,75,75,77,77,78,79 Goal: Measure the extent of the dispersion of the data from the mean. Let X denote the random variable that takes on the values x1 , x2 , ..., xn , and let the associated probabilities be p1 , p2 , ..., pn . of the random variable, de- Then if µ = E(X), the noted by V ar(X), is , denoted by σ(X) is The Example 25 Compare the variances of the following distributions. A B 13 Finding Standard Deviation (and Variance) Using Your Calculator: 1. Press STAT, then EDIT 2. In the first column, enter the values the random variable X takes on. 3. In the second column, enter the frequency or probability for each value of X. 4. Exit to the home screen. 5. Press STAT. Scroll to the right to CALC. Select 1-Var Stats (Option 1). Press L1 (2nd → 1). Press the comma key. Press L2 (2nd → 2). Press ENTER. 6. The standard deviation is given by σx. 7. To find the variance, press VARS. Select Statistics... (Option 5). Select σx (Option 4). Press x2 . Press ENTER. Example 26 Find the variance and standard deviation of the random variable X having the following probability distribution. X Probability 2 3 4 5 6 1 6 1 3 1 4 1 12 1 6 Example 27 A certain puzzle manufacturer makes and sells puzzles. The data below shows how many puzzles this company makes with different numbers of pieces. Number of Pieces 500 1000 750 2000 300 1500 3000 Number of Puzzles 400 200 350 150 275 300 75 Find the standard deviation and variance for the number of pieces in a puzzle. 14