1 Click on the speaker to hear the audio for each slide…here first… Statistics in Applied Science & Technology Dr. Pete Smith McCormick 265G 438-3553 peter.smith@ilstu.edu 2 …and here second. 1 Getting started… Relation of this course to 497; values, variables, measurement scales 2 A quick word on purpose… Research methods and stats (497 and 445) Here’s what I believe about them Not everyone likes them But nobody can do or critique quantitative research well without understanding their content 1 2 I don’t want to argue, but these courses are not going away, so you might as well accept them. There are, honestly, very good reasons for keeping them. 497 & 445 Our way of teaching research methods 1 497 & 445 The KNR way of teaching research methods 497 dealt with internal, external and construct validity Stats deals with conclusion validity 1 Statistics are ways of representing large collections of numbers These numbers can be used to tell a story 2 Conclusion validity is the extent to which this story is true Conclusion Validity From Trochim (the 497 text from last semester for most of you [I think]): 1 “Conclusion validity is the degree to which conclusions we reach about relationships in our data are reasonable.” 2 Stats is largely about answering that question. There is an issue here with descriptive vs. inferential stats – that will follow Read about Trochim’s description here: http://www.socialresearchmethods.net/kb/concval.htm 2 Main Branches of Statistics 1 3 Inferential... Descriptive... reasoning from particulars organize & summarize to generals to help understanding infer (generalize) to a population frequency average variability relationships 2 4 from studying a sample drawn from the population margin of error evaluating experiments random sample observed differences expected variability relationships Population & Samples Population 1 2 Complete set of observations on a particular variable E.g. height & weight ==> 2 populations Can be all from same subject (height over lifespan) Defined by investigator Sample 3 Part of a population any subset of population this stats class is a sample of students taking stats in CAST Random sample: each case of the population has equal chance of being included in the sample this year’s stats class Parameters Statistics 4 Conclusion validity & … 1 Descriptive research Does not attempt to generalize, so conclusion validity is [relatively] simple: Are your measurements and computations accurate and do 2 they fully represent the patterns that are in the data? 3 4 Conclusion validity & … 1 Inferential research As for descriptive, plus the notion that your inference from the sample to the population is reasonable (Non-) violation of assumptions (if you violate assumptions of 2 3 the statistical procedures, the tests simply don’t work the same way – they are quite intricate) Effect size Type I and type II error Power What’s covered in 445? For an overall picture, see inside cover of Cronk – 1 2 it’s a nice summary As we proceed, I’ll note what sections of Cronk the slides, assignments, and applets refer to Java programs that run in a web browser (netscape, internet explorer, firefox, safari, etc…) that give a dynamic graphical interpretation of the concepts we are trying to learn What’s covered in 445? 1 Applets… Example: applet for mean, median, mode (measures of something called central tendency – we’ll cover them 2 weeks from now]) http://www.ratrat.com/histogram_explorer/he.html 2 3 First week objectives 1 Get started with SPSS statistics Need to open the program and do a few things just to make sure you can get things going…nice and easy start so that you don’t get despondent too soon Get started with the conclusion validity Many assumptions of statistical tests depend on levels or scales of measurement…so we need to familiarize ourselves with them Levels of Measurement 1 2 Assign value (number or name) to an observation or characteristic (qualitative vs quantitative) What does a particular value mean? 40 pounds vs 20 pounds 1st place vs 2nd place Male vs Female S.S. Stevens (1946): Four Scales of Measurement to facilitate interpretation and analysis of measured values in order of complexity… Nominal “In nominal measurement the numerical values just "name" the attribute uniquely. No ordering of the cases is implied. For example, jersey numbers in basketball are measures at the nominal level. A player with number 30 is not more of anything than a player with number 15, and is certainly not twice whatever number 15 is.” (Trochim) 1 Qualitative or Categorical variables (names) Mutually exclusive: only belong to one Exhaustive: enough categories for all cases 2 Ethnicity sex single-married Ordinal 1 “In ordinal measurement the attributes can be rank-ordered. Here, distances between attributes do not have any meaning. For example, on a survey you might code Educational Attainment as 0=less than H.S.; 1=some H.S.; 2=H.S. degree; 3=some college; 4=college degree; 5=post college. In this measure, higher numbers mean more education. But is distance from 0 to 1 same as 3 to 4? Of course not. The interval between values is not interpretable in an ordinal measure.” (Trochim) Exhaustive: enough categories for all cases Mutually exclusive: only belong to one Nothing implied about the magnitude of difference between the ranks 2 military / business rankings first place, second place, third place Interval 1 In interval measurement the distance between attributes does have meaning. For example, when we measure temperature (in Fahrenheit), the distance from 30-40 is same as distance from 70-80. The interval between values is interpretable. Because of this, it makes sense to compute an average of an interval variable, where it doesn't make sense to do so for ordinal scales. But note that in interval measurement ratios don't make any sense - 80 degrees is not twice as hot as 40 degrees (although the attribute value is twice as large). 2 Mutually exclusive 4 Exhaustive Indicates order but interval between scores has the same meaning anywhere on the scale aka Equal Interval Scale value of 0 is some arbitrary reference point (set by the investigator) 3 E.g. temperature in Degrees Celsius or Fahrenheit 0 and 100 degrees are set in Celsius as freezing & boiling point of water Why is 0 f set there? Zero Fahrenheit was the coldest temperature that the German-born scientist Gabriel Daniel Fahrenheit could create with a mixture of ice and ordinary salt (may be apocryphal – see Wikipedia) 1 Ratio “Finally, in ratio measurement there is always an absolute zero that is meaningful. This means that you can construct a meaningful fraction (or ratio) with a ratio variable. Weight is a ratio variable. In applied social research most "count" variables are ratio, for example, the number of clients in past six months. Why? Because you can have zero clients and because it is meaningful to say that "...we had twice as many clients in the past six months as we did in the previous six months.”” (Trochim) Mutually exclusive Exhaustive Indicates order but scale has an absolute 0 point reflecting absence of the characteristic being measured temperature in Degrees Kelvin (0 is Absence of heat) distance and derivatives (position, velocity, acceleration) Weight Interval & Ratio Measurements Easy way of telling if scale is interval or ratio: If you divide a score on the scale by two, is the amount half as 1 much as it was? Temperature – 25 degrees C is not half as hot as 50 C (interval) Weight – 25lbs is half as heavy as 50lbs (ratio) Other important definitions 1 Variable: characteristic that can take on different values Discrete variables: can only take on certain values # correct answers, Likert scales, # reps Continuous variables: can take any value within the range. Accuracy limited by instrumentation, data collection method height, weight, time, temperature Measurement turns continuous variable into discrete one (rounding) Things you should know: Independent variable, dependent variable For next time 1 A quiz on this stuff is posted in reggienet Just measurement scales, and identifying values, variables, independent variables and dependent variables Complete the practice exercises in Cronk, chapters 1 and 2. Let me know if you have problems All computer labs in CAST should have SPSS installed on the computers Listen to lots of slides for 1.25.15… central tendency, spread, z-scores, graphing.