Page 1 Instructor : Donglei Du ADM 2623 Midterm I (Nov 4-2015) Student Name:_______________________ Student ID________ Duration: 80 minutes ________________ 1. Problem 1. Short Explanations (8 marks). a. (2 points) What is a random variable? ANS: a function from the sample space to numbers b. (2 points) What is the difference between the general and special addition rule? ANS: general rule can be applied to any events and special rule is only good for mutually exclusive evetns c. (2 points) What is the difference between independence and mutually exclusiveness between two events? ANS: Independence means the joint probability of two events is equal to the product of the two event probabilities (or one event has no impact on the other in terms, or the conditional probability is equal to the unconditional probability), while mutually exclusiveness means two events are disjoint. d. (2 points) What is the difference between Chebyshev Theorem and the empirical rule? ANS: Chebyshev Theorem is applicable to any data set while the empirical rule only works for normally distributed data (bell-shaped symmetric data) 2. Problem 2. (4 marks). Thirty AA batteries were tested to determine how long they would last. The results, to the nearest minute, were recorded as follows: 423, 369, 387, 411, 393, 394, 371, 377, 389, 409, 392, 408, 431, 401, 363, 391, 405, 382, 400, 381, 399, 415, 428, 422, 396, 372, 410, 419, 386, 390 Generate a frequency table for this raw data i. (1 point) Calculate the number of classes: ANS: k=5 ii. (1 point) Calculate the class width ANS: w=14 iii. (1 point) Calculate the lower limit of the first class ANS: l=362 iv. (1 point) draw a frequency distribution table below: class freq [362,376) 4 [376,390) 6 [390,404) 9 [404,418) 6 Page 2 [418,432] 5 3. Problem 3. (6 marks). Given the following grouped data: Class Limits Frequency Class fx (f) midpoint (x) [40, 50) 1 45 45 [50, 60) 2 55 110 [60, 70) 3 195 65 [70 80) 4 300 75 [80, 90] 6 510 85 [90,100] 4 380 95 fx^2 2025 6050 12675 22500 43350 36100 20 1540 122700 a. (2 points) Calculate the mean for this grouped data ANS: mean = 1540/20 = 77 b. (2 points) Calculate the median for this grouped dat. ANS: =70+10*(20/2-6)/4=80 c. (2 points) Calculate the standard deviation for this grouped data assuming it is a sample. ANS: Var = (122700-1540^2/20)/(20-1) 216.8421053 and SD=14.72555959 Problem 4. (4 marks) Most people who are used to receiving e-mail containing a word "replica" know that this message is likely to be spam, more precisely a proposal to sell counterfeit copies of well-known brands of watches. The spam detection software, however, does not "know" such facts; all it can do is compute probabilities. Recent statistics show that the probability of any message being spam is 80%; the probability that a spam message contains the word "replica" is 90%, while the probability that a non-spam message contains the word "replica" is 5%. Suppose that now you receive a message with a “replica” in it, what is the chance that this message is a spam? (hint: Bayes Theorem)? ANS: Let S be the event representing all spams, and R be the event represent all messages containing the word “Replica”. Then P(S)=0.8, P(R|S)=0.9, P(R|~S)=0.05 (here ~ represents complement) Based on Bayes theorem 𝑃(𝑆|𝑅) = 𝑃(𝑅|𝑆) ∗ 𝑃(𝑆) 0.9 ∗ 0.8 0.72 = = ≈ 0.9863014 𝑃(𝑅|𝑆) ∗ 𝑃(𝑆) + 𝑃(𝑅|~𝑆) ∗ 𝑃(~𝑆) 0.9 ∗ 0.8 + 0.05 ∗ 0.2 0.73 Problem 5. (8 marks) Short calculations a. (2 points). Given a random variable with probability distribution as follows X P(X) 2 0.4 -2 0.1 3 0.5 Page 3 Find the probability distribution of the new random variable Y=2X-3 ANS: Y 1 -7 3 b. P(Y) 0.4 0.1 0.5 (2 points) Given a joint probability distribution as follows, please explain if X and Y are independent or not? (X,Y) -1 -2 -3 1 0.01 0.02 0.15 0.18 2 0.19 0.18 0.03 0.40 3 0.20 0.20 0.02 0.42 0.4 0.4 0.2 ANS: They are not independent because some joint probability is not equal to the product of the two corresponding marginal probabilities, such as P(X=-1, Y=1)=0.01 is not equal to P(X=-1)*P(Y=1)=0.4(0.18 c. (2 points)? In a group of 101 students, 49 are juniors, 50 are female, and 22 are female juniors. Find the probability that a student picked from this group at random is either a junior or a female. ANS: Note that P(junior) = 49/101 and P(female) = 50/101, and P(junior and female) = 22/101. Thus P(junior or female) = 49101 + 50/101 – 22/101 = 77/101 d. (2 points) Let's try a new probability experiment. This time, consider a bag of marbles, containing 10 red, 20 blue, and 15 green marbles. Suppose that two marbles are drawn without replacement. (The first marble is not put back in the bag before drawing the second.) What is the probability that both marbles drawn are red ANS: Let's define a couple events: E = first marble is red F = second marble is red We want P(E and F). Using the General Multiplication Rule, we see P(E and F) = P(E) • P(F|E) = (10/45) • (9/44) ≈ 0.0455