Variability Variability: provides an objective, quantitative measure of the degree to which scores in a distribution are spread out or clustered together. Uses: > Describes the distribution: clustered/spread; how much distance is typical between scores, or between an individual score and the mean. > Measures how well an individual score (or group of scores) represents the entire distribution (important for inferential statistics). > Helps determine cutoffs for abnormal behavior, extreme scores (e.g., genius on an IQ test) (SD in particular does this). Range Definition: Difference between the largest score and the smallest score (taking into account real limits). (Semi-)Interquartile Range Distance between the 1st quartile (25th percentile) and the 3rd quartile (75th percentile). Or half that amount. In other words: Formulas: (sample formulas for variance and SD) Xmax – Xmin Highest X – Lowest X (know concept of real limits here, but okay to disregard in calculating) IQR: Q3 – Q1 SIQR: Q3 – Q1 2 Variance Standard Deviation Mean squared deviation (distance from the mean). Square root of the mean squared deviation. Mean of the squared deviation scores. Square of the standard deviation. I.e., square root of the variance. Conceptually: Typical distance of scores from the mean. Def: Comp: s² = SS = (X - M)² n–1 n–1 . s = / SS √ n–1 Deviation score: X – M SS: X²– (X)² (computational) Sum of Squares (SS): (X - M)² n Other useful formulas: Quick and easy way to assess variance. Relatively unaffected by extreme scores and sample size. Stable under sampling (p. 129). Only measure of variability for open-ended distributions (p. 129). SD is most common measure of variability, and goes hand-in-hand with the mean. Var and SD take each score into account, so representative of every score in the sample. ~ Consider distance between scores, so give information on how scattered or clustered scores are (not just overall range). ~ Stable under sampling (p. 129). Does not take entire population/sample into account. Drawbacks: Most affected by extreme scores. Only uses most extreme score on either end; Does not take other scores into account. Affected by sample size and unstable under sampling (p. 128-129). Influenced by extreme scores. Deviation scores are squared, so if one score has a large deviation from the mean very large (disproportionate) contribution to variance and SD. Other notes: For the sake of simplicity, you can disregard real limits when computing the range. I.e.: Range = Xhighest - Xlowest Generally goes together with the median (see end of p. 129) Samples tend to have less variability than the population they come from, so sample formulas adjust for this bias by using (n-1) rather than n (the population formulas use N). Uses, Advantages: Psyc 281 / Conley