Stats Exam Prep. Dr. Lin Lin WARNING • The goal of this workshop is to go over some basic concepts in probability and statistic theories required for IS 665 • It is NOT to help you pass the exam • NO EXAM QUESTIONS will be covered here Probability • Probability is the measure of how likely something will occur. • It is the ratio of desired outcomes to total outcomes. – P(event) = (# desired events) / (# total events) • Probabilities of all outcomes sums to 1. BEFORE Probability • We need to learn to count the number of possible events • Exercise I: How many different five-digit numbers exist? • How did you get the answer? BEFORE Probability • Exercise II: How many different five-digit numbers WITHOUT 0 exist? • How did you get the answer? BEFORE Probability • Exercise III: US phone number is in the format of (###) – ### - #### – The first digit cannot be zero – There cannot be a “000-0000” number – How many numbers are possible? • How did you get the answer? BEFORE Probability • Exercise IV: let’s make a three-digit number. There is only one rule: no two digits could be identical. How many numbers could we make? • How did you get the answer? BEFORE Probability • Exercise V: • You could rearrange these shapes anyway you want. However, cannot be on either side. How many different ways could we have? • How did you get the answer? Probability Example • If I roll a number cube, there are six total possibilities. (1,2,3,4,5,6) • Each possibility only has one outcome, so each has a PROBABILITY of 1/6. • For instance, the probability I roll a 2 is 1/6, since there is only a single 2 on the number cube. Practice • If I flip a coin, what is the probability I get heads? • What is the probability I get tails? • Remember the equation? Number of desired outcomes divided by number of possible outcomes Answer • P(heads) = 1/2 • P(tails) = 1/2 • If you add these two up, you will get 1, which means the answers are probably right. Answer • Let’s make it harder – assuming that the coin is not fair, and P(H) = 0.6 • What is the chance of getting a tail in one flip? Bernoulli Trial • In the theory of probability and statistics, a Bernoulli trial (or binomial trial) is a random experiment with exactly two possible outcomes, "success" and "failure", in which the probability of success is the same every time the experiment is conducted. • So it is basically coin-flipping with a p of not necessarily 0.5 Two or more independent events • If there are two or more independent events, you need to consider if it is happening at the same time (and) or one after the other (or). And • If the two events are happening at the same time, you need to multiply the two probabilities together. This probability is called joint probability • Usually, the questions use the word “and” when describing the outcomes. • P(A & B) = P(A)*P(B) Joint Probability Just a fancy way of saying “AND” ◦ p(I will listen to Backstreet Boys Today) = 0.8 ◦ p(I will eat at Subway today) = 0.7 What is the probability that I will listen to Backstreet Boys AND eat at Subway? ◦ 0.7 * 0.8 = 0.56? WHEN EVENT A AND B ARE INDEPENDENT: ◦ P(A&B) = P(A) * P(B) Or • If the two events are happening one after the other, you need to add the two probabilities. • Usually, the questions use the word “or” when describing the outcomes. • P(A or B) = P(A) + P(B) Practice • If I roll a number cube and flip a coin: – What is the probability I will get a heads and a 6? – What is the probability I will get a tails or a 3? • How did you get them? Answers • P(heads and 6) = 1/2 x 1/6 =1/12 • P(tails or a 5) = 1/2 + 1/6 = 8/12 = 2/3 Summary: Independent Events • One event has no influence on the outcome of another event • If events A & B are independent then P(A&B) = P(A)*P(B) P(A or B) = P(A) + P(B) • Coin flipping if P(H) = P(T) = .5 then P(HTHTH) = P(HHHHH) = .5*.5*.5*.5*.5 = .55 = .03 Summary: Independent Events • Coin flipping if P(H) = P(T) = .5 then P(HTHTH) = P(HHHHH) = .5*.5*.5*.5*.5 = .55 = .03 • What if P(H) = 0.6 (Bernoulli trial)? – What is P(HTHH)? – What is P(at least one head in five trials)? • if you are flipping a coin and it has already come up heads 6 times in a row, what are the odds of an 7th head? .5 • Is P(10H) = P(4H,6T)? Next year the economy will experience one of three states: a downturn, stable state, or growth. The following probability matrix displays joint probabilities of a bond default and the economic state: For example, the joint probability that the economy is stable and the bond defaults is 1.0%; the unconditional probability that the economy will be stable is 50.0% = 49.0% + 1.0%. Next year the economy will experience one of three states: a downturn, stable state, or growth. The following probability matrix displays joint probabilities of a bond default and the economic state: For example, the joint probability that the economy is stable and the bond defaults is 1.0%; the unconditional probability that the economy will be stable is 50.0% = 49.0% + 1.0%. Conditional Probability • Concern the odds of one event occurring, given that another event has occurred • P(A|B)=Prob. of A, given B Examples • P(Professor Lin walks in without a Pepsi in his hand) = 0.1 • HOWEVER… • P(Professor Lin walks in without a Pepsi in his hand | you promise to give me $1,000,000 if I do so) = 1 !! • What changed my behavior? Conditional Probability (cont.) • P(B|A) = P(A&B)/P(A) • if A and B are independent, then P(B|A) = P(A)*P(B)/P(A) = P(B) The Chain Rule What if A and B ARE dependent of each other? ◦ p (I am teaching IS 665 today) = 1/7 ◦ p (I am eating at Subway today) = 0.7 What is the chance that I am teaching 665 today and eating at Subway? ◦ p (I am teaching IS 665 today & I am eating at Subway today) = 0! WHY? ◦ Because to teach 665, I have NO TIME to eat at Subway! ◦ In other words, these two events are dependent The Chain Rule What is the chance that I am teaching 665 today and eating at Subway? ◦ p (I am teaching IS 665 today & I am eating at Subway today) = p (I am eating at Subway today | I am teaching 665) * p (I am teaching 665) = 0 * 1/7 = 0 To put it (semi) formally: ◦ P(A & B) = P (A | B) * P (B) = P(B | A) * P(A) The Bayes Rule The Chain Rule Shows us: ◦ P(A & B) = P (A | B) * P (B) = P(B | A) * P(A) P (A | B) = P(B | A) * P(A) / P(B) !!! This is the Bayes Rule The Bayes Rule P(B | A) = P(B) * P (A | B) / P (A) PB P A | B P B | A PB P A | B P~ B P A |~ B Exercise If we observe that the bond has defaulted, what is the (posterior) probability that the economy experienced a downturn? a. 0.60% b. 19.40% c. 26.33% d. 31.58% Exercise If it is snowing, there is a 80% chance that class will be canceled. If it is not snowing, there is a 95% chance that class will go on. Generally, there is a 5% chance that it snows in NJ in the winter. If we are having class today, what is the chance that it is snowing? PB P A | B P B | A PB P A | B P~ B P A |~ B Distribution • How to Read a Histogram Normal Distribution • Watch the demo Regression • Predict the target value 𝑦 with the attributes 𝑋 by a function: 𝑦 = 𝛽0 + 𝛽1 ∗ 𝑥1 + 𝛽2 ∗ 𝑥2 + ⋯ + 𝛽𝑚 ∗ 𝑥𝑚 + 𝜀 • Handle numeric attributes and predict numeric value. Regression • Goal: minimize the error 𝜀 • An example – 𝑆𝑎𝑙𝑎𝑟𝑦 = 𝛽0 + 𝛽1 ∗ 𝑒𝑑𝑢𝑙𝑒𝑣𝑒𝑙 + 𝛽2 ∗ 𝐼𝑄 + 𝛽3 ∗ 𝑒𝑥𝑝𝑒𝑟𝑖𝑒𝑛𝑐𝑒 + 𝛽4 ∗ 𝑔𝑒𝑛𝑑𝑒𝑟 Variable Estimate Std. Error t value Pr(>|t|) (Intercept) 5000 XXX XXX 0.020 edu_level 1000 XXX XXX 0.001 IQ 50 XXX XXX 0.814 experience 300 XXX XXX 0.004 gender -2000 XXX XXX 0.300 – 𝑆𝑎𝑙𝑎𝑟𝑦 = 5000 + 1000 ∗ 𝑒𝑑𝑢_𝑙𝑒𝑣𝑒𝑙 + 50 ∗ 𝐼𝑄 + 300 ∗ 𝑒𝑥𝑝𝑒𝑟𝑖𝑒𝑛𝑐𝑒 − 2000 ∗ 𝑔𝑒𝑛𝑑𝑒𝑟 What is Regression, anyway? Number of nights I illegally parked Chance that I will get a ticket 0 3 1 21 2 36 3 44 4 66 5 81 y = 15.229x + 3.7619 Coefficient Intercept If I parked illegally 6 nights in a row, how likely am I to get a ticket? P value What is Regression, anyway? • You now know how to interpret a regression model • But how do we build one? – That will be covered in IS 665