Name:_______________________________ Student #:____________________________ Section #: ____________________________ BUEC 333 MIDTERM There is a formula sheet attached at the end of the exam and some blank pages for your rough work. Use the bubble sheet to record your answers for the multiple choice questions and write your answers to the short answer questions in the space provided. Multiple Choice (1 points each) 1.) Suppose that on any given trading day, a stock’s price can increase, decrease, or remain unchanged. Likewise, on any given day, the weather can be sunny or cloudy. You have the following information about the joint distribution of weather and stock price changes. It is sunny today. What is the probability of a price increase? Sunny Cloudy Decrease 3/20 1/10 Unchanged 3/10 1/5 Increase 1/10 a) 1/4 b) 3/10 c) 3/16 d) 3/20 e) none of the above 2.) If the covariance between two random variables X and Y is zero then a) X and Y are independent b) Knowing the value of X provides no information about the value of Y c) E(X) = E(Y) = 0 d) a and b are true e) none of the above 3.) The value of X1 is an unbiased estimator of the sample mean. Now consider the estimator (X1 + X2). Is this another unbiased estimator of the population mean? a) Yes b) No c) Depends on the value of X1 d) Depends on the value of (X1 + X2) e) none of the above 4.) Suppose you have a random sample of 20 observations from a normal distribution with mean = 10 and variance = 2. The sample mean (x-bar) is 8 and the sample variance is 3. The sampling distribution of x-bar has a) mean 8 and variance 3 b) mean 8 and variance 0.3 c) mean 10 and variance 0.2 d) mean 10 and variance 0.1 e) none of the above 1 5.) The law of large numbers says that: a) the sample mean is a biased estimator of the population mean in small samples b) the sampling distribution of the sample mean approaches a normal distribution as the sample size approaches infinity c) the behaviour of large populations is well approximated by the average d) the sample mean is an unbiased estimator of the population mean in large samples e) none of the above 6.) The power of a test is the probability that you: a) reject the null when it is true b) fail to reject the null when it is false c) reject the null when it is false d) fail to reject the null when it is true e) none of the above 7.) Which of the following is not a linear regression model: a) Yi X i X i2 i b) Yi 0 1 X i 2 i c) log( Yi ) 0 1 log( X i ) i d) Yi 0 1 log( X i ) i e) all of the above 8.) In the linear regression model, the least squares estimator a) minimizes the sum of squared residuals b) is unbiased c) is most efficient among the class of linear estimators d) maximizes the value of R2 e) all of the above 9.) In the linear regression model, the degrees of freedom a) is equal to the number of observations (n) minus k minus 1 b) affects the precision of the coefficient estimates c) does not affect the value of the coefficient estimates d) all of the above e) none of the above 10.) In the Capital Asset Pricing Model (CAPM), a) R2 is meaningless b) β is greater than one c) α is less than zero d) β measures the sensitivity of the expected return of a portfolio to specific risk e) β measures the sensitivity of the expected return of a portfolio to systematic risk 2 Short Answer (10 points each – show your work!) 1.) Consider the case of a uniformly distributed random variable where each outcome (1, 2, 3, 4) has an equal chance of occurring. a) What is the population mean and variance of this random variable? b) Suppose that a random number generator provides the following sequence of numbers, 2-1-4-1. What is the mean and variance of this sample? c) What is the sampling distribution of the sample mean calculated above? Provide a verbal interpretation of the sampling distribution of the sample mean. d) Compute the value of the t-statistic for testing the null hypothesis that μ = 2.5. e) The critical value for a t distribution with 3 degrees of freedom and a 0.20 level of significance in the presence of a two-sided alternative is equal to 1.638. Can you reject the null hypothesis that μ = 2.5 at the 20% level of significance? What about at the 10% level of significance? a) f(1) = 0.25, f(2) = 0.25, f(3) = 0.25, f(4) = 0.25 µx = 1 * (0.25) + 2 * (0.25) + 3 * (0.25) + 4 * (0.25) µx = 0.25 + 0.50 + 0.75 + 1.00 = 2.50 Outcome Pr[X = x] Xi – E(X) = Xi – 2.50 [Xi – E(X)]2 Pr[Xi] * [Xi – E(X)]2 X=1 0.25 -1.50 2.25 0.5625 X=2 0.25 -0.50 0.25 0.0625 X=3 0.25 0.50 0.25 0.0625 X=4 0.25 1.50 2.25 0.5625 Var(X) = 0.5625 + 0.0625 + 0.0625 + 0.5625 Var(X) = 1.25 1 n (2 1 4 1) Xi 2 n i 1 4 b) X s2 1 n (2 2) 2 (1 2) 2 (4 2) 2 (1 2) 2 2 X X i n 1 i 1 3 (0) 2 (1) 2 (2) 2 (1) 2 0 1 4 1 6 s 2 3 3 3 2 c) The mean is equal to the population mean of 2.50. The sampling variance is equal to the population variance divided by the number of observations or 1.25/4 = 0.3125. This represents the set of possible values that the statistic might take, and the probabilities associated with each of them. It measures uncertainty over the possible value that the statistic might take in repeated samples from the same population. d) T 2.00 2.50 0.50 0.7071 ~ t 2/ 4 2/2 3 e) In this case, you will fail to reject the null that μ = 2.5 at the 20% level of significance. And as the critical value will only increase with the level of significance, we will fail to reject at the 10% level as well. 3 2.) Suppose that on any given day, the length of time spent waiting to board the gondola at Whistler Mountain can be short, medium, or long. Likewise, on any given day, there can be fresh snow on the mountain or not. You have the following information about the joint distribution of snowfall and waiting times: Fresh Snow No Fresh Snow Short Wait 1/20 1/5 Medium Wait 1/10 3/10 Long Wait 5/20 a) What is the joint probability that there is no fresh snow and the wait time is long? b) If a short wait is one minute, a medium wait is ten minutes, and a long wait is twenty minutes, what is the expected waiting time on any given day? c) There is NO fresh snow today. What is the probability of a long wait? d) Explain the difference between your answer to part a) and your answer to part c). e) Are wait times and weather independent? Explain. a) Pr(no fresh snow, long wait) = 1 – 1/20 – 1/10 – 5/20 – 1/5 – 3/10 Pr(no fresh snow, long wait) = 20/20 – 1/20 – 2/20 – 5/20 – 4/20 – 6/20 = 2/20 b) Pr(short wait) = 1/20 + 1/5 = 1/20 + 4/20 = 5/20 Pr(medium wait) = 1/10 + 3/10 = 2/20 + 6/20 = 8/20 Pr(long wait) = 5/20 + 2/20 = 7/20 E(waiting time) = 5/20 * 1 minute + 8/20 * 10 minutes + 7/20 * 20 minutes E(waiting time) = 5/20 minutes + 80/20 minutes + 140/20 minutes = 11.25 minutes c) Pr(long wait|no fresh snow) = Pr(no fresh snow, long wait)/Pr(no fresh snow) Pr(no fresh snow) = 1/5 + 3/10 + 2/20 = 4/20 + 6/20 + 2/20 = 12/20 Pr(long wait|no fresh snow) =(2/20)/(12/20) = 2/12 = 1/6 d) Inherently, the answer in part a is the joint probability of both events happening simultaneously. The answer in part c is the conditional probability that you will experience a long wait given that there is no fresh snow today. The fundamental difference is that the conditional probability is based on more information, that is, we already know that there is no fresh snow. e) For independence, we require that Pr(long wait|no fresh snow) = Pr(long wait). Clearly, this is not the case as 1/6 does not equal 2/20 and so we conclude that wait times and weather are not independent. 4 3.) Suppose we have a linear regression model with two independent variables and a constant: Yi = β0 + β1X1i + β2X2i + εi Verbally, explain the steps necessary to derive the least squares estimator, providing as much intuition as possible for each step. The least squares estimator seeks to minimize the sum of the squared residuals from the estimating equation. 1.) Thus, we first have to define our residual as the difference between that which is observed and that which is predicted by the regression. In this way, the residual is best thought of as a prediction error, that is, something we would like to make as small as possible. 2.) Next, we need to define a minimization problem. Because our residuals will likely be both positive and negative, simply considering their sum is unsatisfactory as these will tend to cancel one another out. Additionally, minimizing the sum of residuals does not generally yield a unique answer. A better way forward is to minimize the sum of the squared “prediction errors” which will definitely yield a unique answer and which will penalize us for making big errors. 3.) We must take the derivatives of the sum of squared residuals with respect to the beta-hats and set them equal to zero. These first order conditions establish the values of beta-hat for which the sum of squared residuals “bottoms out” and is, thus, minimized. 4.) Finally, we must solve for the values of the beta-hats which are consistent with these first order conditions, thus, yielding our least squares estimators. 5 4.) Suppose that in a random sample of 100 SFU graduates, you estimate the following regression equation: Yi 0 1 X 1i 2 X 2i 3 X 3i i Yi = Hourly wage (in dollars) at first full-time job X1i = Cumulative GPA at graduation X2i = Number of years to complete SFU degree X3i = Usual number of hours worked off-campus per week while a student You estimate the model in EViews and obtain the following: ˆ0 10 ˆ1 5 ˆ2 1 ˆ3 0.2 R 2 0.3 a) Interpret the estimates ˆ and ˆ . 0 1 b) Compute the predicted wage of a graduate with a GPA of 3.0, who completes their degree in 5 years, and who worked 10 hours per week off-campus. c) What is the value of this student’s residual if their hourly wage is $25? What does the value of this student’s residual tell you (that is, interpret its value)? d.) If a student from the same graduating class had a GPA of 4.0 and took only four years to complete their degree, how many hours per week do you predict they would need to have worked as a student in order to make that same $25 hourly wage? e.) Are there any important variables missing from the regression above? If so, name one which it should be easy for you and/or the university to collect data on. a) The estimated value of the intercept tells us that when the value of all our explanatory variables (GPA, years at SFU, hours worked) are all zero, then the hourly wage will be $10. Since such an observation cannot occur (in particular, no graduate of SFU can do so in zero years), the constant is of no intrinsic interest to us. On the other hand, achieving one more grade point and holding years at SFU and hours worked constant is associated with an additional $5 for a graduate’s hourly wage. Yˆi 10 5 X 1i 1X 2i 0.2 X 3i b) Yˆi 10 15 5 2 $18 c) ei Yi Yˆi $25 $18 $7 This suggests that this particular student makes $7 more than the regression model predicts, given their GPA, years at SFU, and hours worked as a student. Yˆi 10 5 X 1i 1X 2i 0.2 X 3i 25 10 20 4 0.2 X 3i d) 0.2 X 26 25 1 3i X 3i 5 e) One obvious observable missing variable would be a student’s major. 6