i INF397C Introduction to Research in Information Studies Fall, 2009 Week 12 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 1 From Blink, by Malcolm Gladwell i • P. 32. The author is talking about negative emotions that tend to predict subsequent break-ups of marriages: “. . . there is one emotion that [Dr. Gottman] considers the most important of all: contempt. If Gottman observes one or both partners in a marriage showing contempt for one another, he considers it the single most important sign that the marriage is in trouble.” R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 2 p. 33 i “Gottman has found, in fact, that the presence of contempt in a marriage can even predict such things as how many colds a husband or wife gets; in other words, having someone you love express contempt toward you is so stressful that it begins to affect the functioning of your immune system.” R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 3 More resources – Assorted things i • http://www.stat.berkeley.edu/~stark/Java/Html/ NormHiLite.htm Use the slider bars! Man, don’t you wish you had had access to this tool for the last question on the midterm?! • http://www.stat.berkeley.edu/~stark/Java/Html/ StandardNormal.htm Play with the standard deviation slider bar! • http://wwwstat.stanford.edu/~naras/jsm/NormalDensity/N ormalDensity.html R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 4 More resources -- SE • i Dr. Phil Doty’s 26-minute online tutorial on standard error of the mean. Very helpful: http://cobra.gslis.utexas.edu:8080/ramgen/Content2/faculty/doty/research/dsmbb.r m – – – Note, I had not talked of “expected value.” When you hear that “M is the expected value of µ,” you can substitute “M is used to estimate µ.” Note also we have not talked about “CV,” though last week I did say to expect that S is kinda of the same general order of magnitude, but smaller than, M. Same idea. Don’t forget Dr. Doty’s page of tutorials, http://www.gslis.utexas.edu/~lis397pd/fa2002/tutorials.html where you will find also an eight-minute introduction to inferential statistics, two tutorials on confidence intervals, and one on Chi squared. I think one thing you’ll find interesting in these tutorials is that here is a second professor, using a different text book (Spatz), who studied at a different school, who’s never heard me lecture (nor I him) . . . and we use much the same language to describe things. The point is, this stuff (descriptive and inferential statistics) is universal. • Two pages of explanation of standard error of the mean: http://davidmlane.com/hyperstat/A103735.html R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 5 More resources - Probability i • A visual demonstration of probability and outliers (or some such). Here’s a good one: http://www.ms.uky.edu/~mai/java/stat/GaltonM achine.html • Here’s another: http://www.stattucino.com/berrie/dsl/Galton.ht ml • http://www.mathgoodies.com/lessons/vol6/intr o_probability.html R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 6 i • http://highered.mcgrawhill.com/sites/0072494468/student_view0 /statistics_primer.html R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 7 t tests i • Go to the McGraw-Hill statistics primer and read the subsections on Inferential Statistics. Not a lot of meat, there, but it will help you to hear it stated in a slightly different way. • For some examples of the use of t tests . . . – http://www.yogapoint.com/info/research.htm for an example of some t tests. – http://www.main.nc.us/bcsc/Chess_Research_Study_I.htm Notice how they continually say “p>.05” rather than “p<.05”! Do NOT, as they suggest at the end, “send a check for $39.95 payable to the American Chess School.” Go find more examples, just for yourself. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 8 Confidence Intervals i • We calculate a confidence interval for a population parameter. • The mean of a random sample from a population is a point estimate of the population mean. • But there’s variability! (SE tells us how much.) • What is the range of scores between which we’re 95% confident that the population mean falls? • Think about it – the larger the interval we select, the larger the likelihood it will “capture” the true (population) mean. • CI = M +/- (t.05)(SE) • See Box 12.2 on “margin of error.” NOTE: In the box they arrive at a 95% confidence that the poll has a margin of error of 5%. It is just coincidence that these two numbers add up to 100%. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 9 CI about a mean -- example • • • • i CI = M +/- (t.05)(SE) Establish the level of α (two-tailed) for the CI. (.05) M=15.0 s=5.0 N=25 Use Table A.2 to find the critical value associated with the df. – t.05(24) = 2.064 • CI = 15.0 +/- 2.064(5.0/SQRT 25) = 15.0 +/- 2.064 = 12.935 – 17.064 “The odds are 95 out of 100 that the population mean falls between 12.935 and 17.064.” (NOTE: This is NOT the same as “95% of the scores fall within this range!!!) R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 10 Another CI example i • Hinton, p. 89. • t test not sig. • What if we did this via confidence intervals? R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 11 Type I and Type II Errors World Our decision Reject the null hypothesis i Null Null hypothesis is hypothesis is false true Correct decision Type I error (α) Fail to reject Type II error Correct the null (β) decision hypothesis (1-β) R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 12 Limitations of t tests • • • i Can compare only two samples at a time Only one IV at a time (with two levels) But you say, “Why don’t I just run a bunch of t tests”? a) It’s a pain. b) You multiply your chances of making a Type I error. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 13 ANOVA i • Analysis of variance, or ANOVA, or F tests, were designed to overcome these shortcomings of the t test. • An ANOVA with ONE IV with only two levels is the same as a t test. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 14 ANOVA (cont’d.) i • Remember back to when we first busted out some scary formulas, and we calculated the standard deviation. • We subtracted the mean from each score, to get a feel for how spread out a distribution was – how DEVIANT each score was from the mean. How VARIABLE the distribution was. • Then we realized if we added up all these deviation scores, they necessarily added up to zero. • So we had two choices: we coulda taken the absolute value, or we coulda squared ‘em. And we squared ‘em. Σ(X – M)2 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 15 ANOVA (cont’d.) i • Σ(X – M)2 • This is called the Sum of the Squares (SS). And when we add ‘em all up and average them (well – divide by N-1), we get S2 (the “variance”). • We take the square root of that and we have S (the “standard deviation”). R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 16 ANOVA (cont’d.) i • Let’s work through the Hinton example on p. 119. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 17 i M= R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 18 i Total sum of squares = 328 (X-M)2 X 5 10 -5 25 11 10 1 1 14 10 4 16 6 10 -4 16 10 10 0 0 15 10 5 25 7 10 -3 9 9 1 17 10 7 49 5 10 0 25 11 10 1 1 13 10 3 9 3 10 -7 49 9 1 17 10 7 49 4 10 -6 36 10 10 0 0 14 10 4 16 X M X-M 160 M X-M (X-M)2 10 -1 10 -1 X M 4 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu XM (X-M)2 164 19 i Within conditions sum of squares = 28 X-M (X-M)2 X 5 5 0 0 11 10 1 1 14 15 -1 1 6 5 1 1 10 10 0 0 15 15 0 0 7 5 2 4 9 1 17 15 2 4 5 5 0 0 11 10 1 1 13 15 -2 4 3 5 -2 4 9 1 17 15 2 4 4 5 -1 1 10 10 0 0 14 15 -1 1 X M 10 M X-M (X-M)2 10 -1 10 -1 X M 4 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu X-M (X-M)2 14 20 Between conditions sum of squares = 300 X M X-M (X-M)2 times 6 5 10 5 25 150 10 10 0 0 0 15 10 5 25 150 50 300 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu i 21 i So . . . • Total sum of squares • Within conditions SS • Between conditions SS (Magic!) • How about df? = 328 = 28 = 300 – Total = 17 – Within conditions df = 15 – Between conditions df = 2 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 22 From Hinton p. 127 i • Variance ratio (F) = Between condition variance/Error variance Or Variance ratio (F) = Systematic differences + Error variance Error variance • Think about it – why is the “within condition variance” called “error variance”? • Note, what happens where there are no systematic differences? R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 23 Anova summary table, p. 128 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu i 24 So, from our example . . . Source of df variation SS MS F p Between 2 conditions 300 150 80.21 < .01 Within 15 conditions 28 1.87 Total 328 17 R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu i 25 Reading the F table R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu i 26 Check out . . . i • ANOVA summary table on p. 128. This is for a ONE FACTOR anova (i.e., one IV). (Maybe MANY levels.) • Sample ANOVA summary table on p. 132. • The only thing you need to realize in Chapter 13 is that for repeated measures ANOVA, we also tease out the between subjects variation from the error variance. (See p. 154 and 158.) R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 27 Check out, also . . . i • Note, in Chapter 15, that as factors (IVs) increase, the comparisons (the number of F ratios) multiply. See p. 172, 179. • What happens when you have 3 levels of an IV, and you get a significant F? (As we did in our worked example.) • Memorize the table on p. 182. (No, I’m only kidding.) R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 28 i R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 29 Interaction effects i • Here’s what I want you to understand about interaction effects: – They’re WHY we run studies with multiple IVs. – A significant interaction effect means different levels of one IV have different influences on the other IV. – You can have significant main effects and insignificant interactions, or vice versa (or both sig., or both not sig.) (See p. 164, 166.) R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 30 An Experiment i • First, let’s divide into two groups – who has an ODD SSN (last digit is odd). • Group 1 – You’ll do List A first, then List B. (Group 2 will keep their eyes closed during List A.) (I’m just sure of it.) Then we’ll all do List B. Then Group B will go back and do List A. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 31 List A i I’ll present 10 words, one at a time. Presented visually. After the 10th I’ll say “go” and you’ll write down as many as you can. Don’t have to remember them in order. Pencils down. (Group 2 – please close your eyes.) Ready? R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 32 i balloon doorknob minivan meatloaf teacher zebra pillow barn sidewalk coffin R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 33 List B i •Now, 10 new words. •Everyone (groups 1 and 2). •Same task -- recall them. •After the 10th one I’ll say “Go,” write down as many of the 10 words as you can. •Again, don’t have to remember them in order. •Pencils down. •Ready? R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 34 i forget interest anger imagine fortitude smart peace effort hunt focus R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 35 List A again i Now, for Group 2. (Group 1 can keep their eyes open – just don’t participate.) I’ll present 10 words, one at a time. Presented visually. After the 10th I’ll say “go” and you’ll write down as many as you can. Don’t have to remember them in order. Pencils down. Ready? R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 36 i balloon doorknob minivan meatloaf teacher zebra pillow barn sidewalk coffin R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 37 List A balloon doorknob minivan meatloaf teacher zebra pillow barn sidewalk coffin List B i forget interest anger imagine fortitude smart peace effort hunt focus R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 38 Calculate your difference score i • How many did you get right from List A? • How many did you get right from List B? • I’ll collect the data via a show of hands. R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 39 i • Was there a true difference? R. G. Bias | School of Information | UTA 5.424 | Phone: 512 471 7046 | rbias@ischool.utexas.edu 40