STATISTICS David Pieper, Ph.D. dpieper@med.wayne.edu Types of Variables Categorical Variables Organized into category No necessary order No quantitative measure Examples male, female race marital status treatment A and treatment B Types of Variables Continuous Variables Have specific order Examples: weight temperature blood pressure Age Test score May be converted to categorical or ordinal Descriptive Statistics Measures of central tendency mean (average) Measures of variability range standard deviation Results of Memory Test Age Gender Age Group Student or Parent Total Score 17 M HS S 52 16 M HS S 49 30 F Adult P 50 16 M HS S 47 43 F Adult P 41 36 M Adult P 51 16 F HS S 43 43 F Adult P 41 36 F Adult P 33 Descriptive Statistics for Memory Test Age Total Score 196 196 Minimum 7 12 Maximum 72 54 24.9 37.4 16 8 Number of Cases Mean SD Research Hypothesis Null hypothesis: relationship among phenomena does not exist Example: Age does not have an influence on memory Probability and p Values p < 0.05 1 in 20 or 5% chance groups are not different when we say groups are significantly different p < 0.01 1 in 100 or 1% chance of error p < 0.001 1 in 1000 or .1% chance of error Type of Statistical Test to Use Continuous variable as end point 2 groups: t-test 3 or more groups: ANOVA Relation between 2 categorical variables: Chi-square Fisher’s test Exact test (2 x 2) Relation between 2 continuous variables: Regression analysis or correlation T-test When comparing 2 groups and endpoint variable is continuous Purpose is determine if the difference between the 2 groups is unlikely due to chance T-test Examples: Blood pressure before and after exercise program Would parents do better on a memory test than students Results of Memory Test Age Gender Age Group Student or Parent Total Score 17 M HS S 52 16 M HS S 49 30 F Adult P 50 16 M HS S 47 43 F Adult P 41 36 M Adult P 51 16 F HS S 43 43 F Adult P 41 36 F Adult P 33 T-test results comparing Parents and Students Total Score Number Mean SD Students 117 36.3 7.5 Parents 79 39.1 8.1 p < 0.02 Parents had higher scores than students Analysis of Variance (ANOVA) When comparing 3 or more groups and end-point is continuous Example: Compare score on memory test among: Grade school students Middle school students High school students Parents Total Score Analysis of Variance p < 0.03 High School Students and Adults scored better than Grade School or Middle School Students Chi-square Test When comparing 2 or more groups and the end point is categorical Chi-square Gender and Parent vs Student Student Parent Total Female 65 48 113 Male 52 31 83 Total 117 79 139 p = 0.5 There was no significant gender difference between students and parents Correlation or Regression When determining if there is a linear relationship between 2 continuous variables Ranges from -1 to 1 Pearson’s Correlation Coefficient Diastolic BP (mm) Weight (kg) 90 82 140 114 68 56 110 62 100 83 95 110 Is Diastolic BP related to Weight? r = 0.805 p < 0.01 Correlation of Age and Score on Memory Test r = 0.6 No correlation of age and score on memory test Illustrations: Use Graphs p < 0.01 Figure 1: Patients that failed the exercise test had a higher mortality than patients that passed • Label axes • Include brief description Free Statistics Software Mystat: http://www.systat.com/MystatProducts.aspx List of Free Statistics Software: http://statpages.org/javasta2.html