Statistics, winter semester 20–21 M. Raux Exercise Set I Probability fundamentals 1. (Airline example from Wooldridge, Appendix B). A flight has 100 available seats, and the airline sells more tickets than available seats. Given that the probability that each person shows up is π = .85, if the airline sells 110 tickets, what is the probability that more than 100 travellers show up? Write the formula, you do not need to compute the exact value of the probability. 2. (Wooldridge, exercise B2) Much is made of the fact that certain mutual funds outperform the market year after year (that is, the return from holding shares in the mutual fund is higher than the return from holding a portfolio such as the S&P 500). For concreteness, consider a 10-year period and let the population be the 4170 mutual funds reported in The Wall Street Journal on January 1, 1995. By saying that performance relative to the market is random, we mean that each fund has a 50–50 chance of outperforming the market in any year and that performance is independent from year to year. (i) If performance relative to the market is truly random, what is the probability that any particular fund outperforms the market in all 10 years? (ii) Find the probability that at least one fund out of 4170 funds outperforms the market in all 10 years. What do you make of your answer? (iii) Find the probability that at least five funds outperform the market in all 10 years (Write the formula, you do not need to compute the exact value of the probability). 3. An airline opens the sale of tickets for a flight. Before starting to sell the tickets, the airline does not know the proportion of passengers that will buy a business class ticket, so this is also a random variable. The following tables gives the probabilities that a passenger actually shows up after having bought a flight ticket, and whether he is an economy or business traveler. The definitions of the variables are π = 0 1 if the passenger shows up, if the passenger does not shows up, π = 0 1 buys economy class, buys business class. The following two tables give the probability density functions ππ |π (π¦|π₯) and πππ (π₯, π¦) : ππ |π (π¦|π₯) Y (i) 0 1 0 0.3 0.7 X πππ (π₯, π¦) 1 0.2 0.8 0 0.03 0.07 0 1 Y Compute ππ (π₯) . (ii) Can you compute ππ |π (π₯ |π¦) ? (iii) Let ππ (π₯) be the ones computed in point. (i) Let ππ |π (π¦|π₯) now be: ππ |π (π¦|π₯) Y 0 1 0 0.2 0.8 X 1 0.2 0.8 X 1 0.18 0.72 Can you compute ππ |π (π₯ |π¦) ? 4. (Wooldridge, exercise B4) For a randomly selected county in the United States, let π represent the proportion of adults over age 65 who are employed, or the elderly employment rate. Then, π is restricted to a value between zero and one. Suppose that the cumulative distribution function for π is given by πΉ (π₯) = 3π₯ 2 − 2π₯ 3 , for 0 ≤ π₯ ≤ 1. Find the probability that the elderly employment rate is at least 0.6 ( 60%) . 5. (Wooldridge, exercise B6) Let π denote the prison sentence, in years, for people convicted of auto theft in a particular state in the United States. Suppose that the pdf of π is given by 1 9 ππ (π₯) = π₯ 2, 0 < π₯ < 3. What is the expected prison sentence. 6. You observe a stock price for 3 days in a row. Each day the stock price can go up by an amount π’ or down by an amount π . (i) Write the sample space π for this experiment. How many elements does it contain? (ii) Assume that π’ = −π . Does the sample space change? How many elements does it have? (iii) Assume that the price of the stock at day 0 is 10, and that π’ = 1 and π = −2. Let π be the value of the stock at the end of the 3 days. What is the sample space given by the outcomes of π ? (iv) I bought the stock at day 0 and I am selling it on day 3 only if I make a profit, that is if the price is strictly larger than 10. If each day the probabilities of the stock moving up or down are equal, what is the probability that I sell the stock on day 3? 7. (Wooldridge, exercise B7) If a basketball player is a 74% free throw shooter, then, on average, how many free throws will he or she make in a game with eight free throw attempts? 8. (Based on Example 1.5.4. De Groot Schervish) Demands for Utilities. A contractor is building an office complex and needs to plan for water and electricity demand (sizes of pipes, conduit, and wires). After consulting with prospective tenants and examining historical data, the contractor decides that the demand for electricity will range somewhere between 1 million and 150 million kilowatt-hours per day and water demand will be between 4 and 200 (in thousands of gallons per day). All combinations of electrical and water demand are considered possible. Let π be the demand of water and π the demand of electricity. The contractor is interested in these two events: high water demand, πΈ 1 : {(π₯, π¦) | π₯ ≥ 100}, and high electricity demand, πΈ 2 : {(π₯, π¦) | π¦ ≥ 115}. Assume that the relative probabilities of the events are given by the ratios of the surfaces of the ππ plane defined by the events. (i) What is the sample space π ? What is π (π) ? (ii) What is the probability of πΈ 1 ? (iii) What is the probability of πΈ 2 ? (iv) What is the probability that the contractor has to face both a high demand in electricity and a high demand in water? (v) What is the probability that a the contractor will have to assure a high level of water supply conditional to tenants asking for a high electricity supply, that is ππ |π (π = 1 |π = 1) ? (vi) Are the variables π and π independent? Explain. 9. Consider the picture below. Variables and interpretations are the same as in exercise 8. The hatched rectangle denotes commercial tenancies. Define the variable π , where π = 0 if the tenant is residential and π = 1 if he is commercial. If we restrain our attention to residential tenancies, that is, if we condition the distributions on π = 0, are π and π conditionally independent? Formally, this is written as π and π are independent given π if πππ |π (π₯, π¦|π§) = ππ |π (π₯ |π§)ππ |π (π¦|π§) . Electricity (Y) 150 115 1 4 100 200 Water (X) 10. Suppose that at a large university, college grade point average, πΊππ΄, and SAT (a college admission test) score, ππ΄π , are related by the conditional expectation πΈ (πΊππ΄|ππ΄π ) = .70 + 0.002 ππ΄π . (i) Find the expected πΊππ΄ when ππ΄π = 800. Find πΈ (πΊππ΄|ππ΄π = 1400) . Comment on the difference. (ii) If the average ππ΄π in the university is 1100, what is the average GPA? (iii) If a student’s ππ΄π score is 1100, does this mean he or she will have the πΊππ΄ found in part (ii)? Explain. 11. (Advanced) Show that if π· is a Bernoulli variable, and π a generic variable defined on the same sample space as π· , then: πΈ (π |π· = 1) = πΈ (π π·) πΈ (π π·) = . πΈ (π·) π (π· = 1) (Hint. π and π· are defined on the same sample space π , so π· is a partition of π . Split the integral of the definition of πΈ (π π·) according to the regions given by this partition, and the result follows). 12. Let π and π be random variables such that π 3 1 −1 ∼N , . π 1 −1 4 (i) Define what it means for π and π to be jointly normal. (ii) Find the joint distribution of π + 2π and 2π − π . (iii) Calculate π (π + 21 π ≤ 0) . (iv) Find πΈ [π |π ] and Var (π |π ) . (v) Compare Var (π |π = 1) with Var (π |π = −1/2) . Is there any difference? Explain why. 13. π and π are standard normal random variables, and Corr (π, π ) = 0.5. (i) 2π Write the variance-covariance matrix of 3π (ii) Choose π 1, π 2 such that π = π 1π + π 2π and π are independent. (iii) Choose π 1, π 2 such that π = π 1π 2 + π 2π 2 follows a π 22 distribution.