Name that tune. Song title? Performer(s)? R.G. Bias | rbias@ischool.utexas.edu | 1 Scientific Method and Statistics (continued) “Finding New Information” 3/28/2010 R.G. Bias | rbias@ischool.utexas.edu | 2 Objectives Continue experimental design and statistics. - Randolph – remember to take roll. - Who thinks there are two people in this classroom with the same birthday? (Month and day – not year.) http://www.nytimes.com/2009/08/06/technology/06 stats.html R.G. Bias | rbias@ischool.utexas.edu | To summarize Mode Range -Easy to calculate. -May be misleading. Median SIQR Mean (µ) SD (σ) -Capture the center. -Not influenced by extreme scores. -Take every score into account. -Allow later manipulations. R.G. Bias | rbias@ischool.utexas.edu | 4 Let me work an example all the way through. R.G. Bias | rbias@ischool.utexas.edu | 5 Summary An experiment is “personkind’s way of asking nature a question.” I manipulate some variables (IVs), control other variables, count on random assignment to wash out the effects of all the rest of the variables, and look for an effect on the DVs. Independent Groups = Between-subjects design Repeated Measures = Within-subjects design Natural Groups Design = Pseudoexperiment = NO inference of causality I start with a null hypothesis and an alternate hypothesis. If I find a difference (in my DV), I infer it was caused by the IV (the treatment) and I reject my H0 and accept my HA. If I find no difference I do NOT accept the H0, rather I say that we do not have any evidence to reject it at this time. (NOTE how hard it is to prove the absence of something!) 6 R.G. Bias | rbias@ischool.utexas.edu | Role of Data Analysis in Exps. Primary goal of data analysis is to determine if our observations support a claim about behavior. Is that difference really different? We want to draw conclusions about populations, not just the sample. Two different ways – statistics and replication. 7 R.G. Bias | rbias@ischool.utexas.edu | Scales The data we collect can be represented on one of FOUR types of scales: – Nominal – Ordinal – Interval – Ratio “Scale” in the sense that an individual score is placed at some point along a continuum. 8 R.G. Bias | rbias@ischool.utexas.edu | Nominal Scale Describe something by giving it a name. (Name – Nominal. Get it?) Mutually exclusive categories. For example: – Gender: 1 = Female, 2 = Male – Marital status: 1 = single, 2 = married, 3 = divorced, 4 = widowed – Make of car: 1 = Ford, 2 = Chevy . . . – Thinks sex is a good thing: 1 = Yes, 2 = No The numbers are just names. 9 R.G. Bias | rbias@ischool.utexas.edu | Ordinal Scale An ordered set of objects. But no implication about the relative SIZE of the steps. Example: – The 50 states in order of population: • • • • – People “Sue” has ever wanted to have sex with, by height: • • • • 1 = Vern 2 = Joe 3 = Mick 4 = Tim 1 = California 2 = Texas 3 = New York . . . 50 = Wyoming R.G. Bias | rbias@ischool.utexas.edu | 10 Interval Scale Ordered, like an ordinal scale. Plus there are equal intervals between each pair of scores. With Interval data, we can calculate means (averages). However, the zero point is arbitrary. Examples: – Temperature in Fahrenheit or Centigrade (of the room where someone had sex) – IQ scores 11 R.G. Bias | rbias@ischool.utexas.edu | Ratio Scale Interval scale, plus an absolute zero. Sample: – Distance, weight, height, time (but not years – e.g., the year 2002 isn’t “twice” 1001). – Number of times “Mary” thought about sex. 12 R.G. Bias | rbias@ischool.utexas.edu | Scales (cont’d.) It’s possible to measure the same attribute on different scales. Say, for instance, your midterm test. I could: Give you a “1” if you don’t finish, and a “2” if you finish. “1” for highest grade in class, “2” for second highest grade, . . . . “1” for first quarter of the class, “2” for second quarter of the class,” . . . Raw test score (100, 99, . . . .). – (NOTE: A score of 100 doesn’t mean the person “knows” twice as much as a person who scores 50, he/she just gets twice the score.) 13 R.G. Bias | rbias@ischool.utexas.edu | Scales (cont’d.) Nominal Ordinal Interval Ratio Name = = = Mutuallyexclusive = = = Ordered = = Equal interval = + abs. 0 Days of wk., Temp. Inches, Dollars Gender, Yes/No Class rank, Survey ans. R.G. Bias | rbias@ischool.utexas.edu | 14 Critical Skepticism Remember the Rabbit Pie example from earlier? The “critical consumer” of statistics asked “what do you mean by ’50/50’”? Let’s look at some other situations and claims. – From How to Lie with Statistics. 15 R.G. Bias | rbias@ischool.utexas.edu | Company is hurting. We’d like to ask you to take a 50% cut in pay. But if you do, we’ll give you a 75% raise next month. OK? Problem: Base rate. 16 R.G. Bias | rbias@ischool.utexas.edu | Sale! “Save 100%” I doubt it. 17 R.G. Bias | rbias@ischool.utexas.edu | Probabilities “It’s safer to drive in the fog than in the sunshine.” (Kinda like “Most accidents occur within 25 miles of home.” Doesn’t mean it gets safer once you get to San Marcos.) Navy literature around WWI: – “The death rate in the Navy during the SpanishAmerican war was 9/1000. For civilians in NYC during the same period it was 16/1000. So . . . Join the Navy. It’s safer.” 18 R.G. Bias | rbias@ischool.utexas.edu | Are all results reported? “In an independent study [ooh, magic words], people who used Doakes toothpaste had 23% fewer cavities.” How many studies showed MORE cavities for Doakes users? 19 R.G. Bias | rbias@ischool.utexas.edu | Sampling problems “Average salary of 1999 UT grads – “$41,000.” How did they find this? I’ll bet it was average salary of THOSE WHO RESPONDED to a survey. Who’s inclined to respond? 20 R.G. Bias | rbias@ischool.utexas.edu | Correlation ≠ Causation Around the turn of the 20th century, there were relatively MANY deaths of tuberculosis in Arizona. What’s up with that? 21 R.G. Bias | rbias@ischool.utexas.edu | Remember . . . 22 I do NOT want you to become cynical. Not all “media bias” is intentional. Just be sensible, critical, skeptical. As you “consume” statistics, ask some questions . . . R.G. Bias | rbias@ischool.utexas.edu | Ask yourself. . . Who says so? (A Zest commercial is unlikely to tell you that Irish Spring is best.) How does he/she know? (That Zest is “the best soap for you.”) What’s missing? (One year, 33% of female grad students at Johns Hopkins married faculty.) Did somebody change the subject? (“Camrys are bigger than Accords.” “Accords are bigger than Camrys.”) Does it make sense? (“Study in NYC: Working woman with family needed $40.13/week for adequate support.”) 23 R.G. Bias | rbias@ischool.utexas.edu | Claims “Better chance of being struck by lightening than being bitten by a shark.” Tom Brokaw – Tranquilizers. 24 R.G. Bias | rbias@ischool.utexas.edu | The Normal Distribution (From Jaisingh [2000]) 25 R.G. Bias | rbias@ischool.utexas.edu | 26 R.G. Bias | rbias@ischool.utexas.edu | Notice Many things influence how easy or hard it is to discover a difference. – How big the real difference is. – How much variability there is in the population distribution(s). – How much error variance there is. – Let’s talk about variance. 27 R.G. Bias | rbias@ischool.utexas.edu | Sources of variance Systematic vs. Error – Real differences – Error variance What would happen to the DV if our measurement apparatus was a little inconsistent? There are OTHER sources of error variance, and the whole point of experimental design is to try to minimize ‘em. Get this: The more error variance, the harder for real differences to “shine through.” 28 R.G. Bias | rbias@ischool.utexas.edu | Z distribution 29 R.G. Bias | rbias@ischool.utexas.edu | Probability VERY little of life is certain. It is PROBABILISTIC. (That is, something might happen, or it might not.) 30 R.G. Bias | rbias@ischool.utexas.edu | Prob. (cont’d.) Life’s a gamble! Just about every decision is based on probable outcomes. None of you is a “statistical wizard.” Yet every one of you does a pretty good job of navigating an uncertain world. – None of you touched a hot stove (on purpose.) – All of you made it to class. 31 R.G. Bias | rbias@ischool.utexas.edu | Probabilities Always between one and zero. Something with a probability of “one” will happen. (e.g., Death, Taxes). Something with a probability of “zero” will not happen. (e.g., My becoming a Major League Baseball player). Something that’s unlikely has a small, but still positive, probability. (e.g., probability of someone else having the same birthday as you is 1/365 = .0027, or .27%.) 32 R.G. Bias | rbias@ischool.utexas.edu | Just because . . . . . . There are two possible outcomes, doesn’t mean there’s a “50/50 chance” of each happening. When driving to school today, I could have arrived alive, or been killed in a fiery car crash. (Two possible outcomes, as I’ve defined them.) Not equally likely. But the odds of a flipped coin being “heads,” . . . . 33 R.G. Bias | rbias@ischool.utexas.edu | Let’s talk about socks 34 R.G. Bias | rbias@ischool.utexas.edu | Prob (cont’d.) Probability of something happening is – – – – # of “successes” / # of all events P(one flip of a coin landing heads) = ½ = .5 P(one die landing as a “2”) = 1/6 = .167 P(some score in a distribution of scores is greater than the median) = ½ = .5 – P(some score in a normal distribution of scores is greater than the mean but has a z score of 1 or less is ...? – P(drawing a diamond from a complete deck of cards) =? 35 R.G. Bias | rbias@ischool.utexas.edu | Yet more prob. http://www.midcoast.com.au/~turfacts/maths.htm l – The product or multiplication rule. "If two chances are mutually exclusive the chances of getting both together, or one immediately after the other, is the product of their respective probabilities.“ – the addition rule. "If two or more chances are mutually exclusive, the probability of making ONE OR OTHER of them is the sum of their separate probabilities." 36 R.G. Bias | rbias@ischool.utexas.edu | What’s the probability . . . 37 That the next card is a king? That the next card is a heart? That the next card is a spade? That the next card is a club and a king? That the next card is a spade OR a heart? That the next two cards are kings? R.G. Bias | rbias@ischool.utexas.edu | Think this through. What are the odds (“what are the chances”) (“what is the probability”) of getting two “heads” in a row? Three heads in a row? Three flips the same (heads or tails) in a row? 38 R.G. Bias | rbias@ischool.utexas.edu | So then . . . WHY were the odds in favor of having two people in our class with the same birthday? Think about the problem! What if there were 367 people in the class. – P(2 people with same b’day) = 1.00 39 R.G. Bias | rbias@ischool.utexas.edu | Happy B’day to Us But we had 50. Probability that the first person has a birthday: 1.00. Prob of the second person having the same b’day: 1/365 Prob of the third person having the same b’day as Person 1 and Person 2 is 1/365 + 1/365 – the chances of all three of them having the same birthday. 40 R.G. Bias | rbias@ischool.utexas.edu | Sooooo . . . http://en.wikipedia.org/wiki/Birthday_parad ox 41 R.G. Bias | rbias@ischool.utexas.edu | References Hinton, P. R. Statistics explained. Shaughnessy, Zechmeister, and Zechmeister. Experimental methods in psychology. R.G. Bias | rbias@ischool.utexas.edu | 42 http://highered.mcgrawhill.com/sites/0072494468/student_view0/ statistics_primer.html Click on Statistics Primer. 43 R.G. Bias | rbias@ischool.utexas.edu |