Using the Humble Crosstab to Partner with Parametrics Jack Williamsen Office of Institutional Effectiveness St. Norbert College De Pere, Wisconsin Sidebar: Some info about St. Norbert & the sample used in this presentation St. Norbert College (“SNC”) is a Catholic Liberal Arts College near Green Bay, WI with an undergraduate population of ~ 2000 students. The two largest undergraduate majors are Business Administration and Education. Data in this presentation come from a larger study of the role of gender in the educational experiences of SNC men and women conducted by the Office of Institutional Effectiveness. Do parametrics need a partner? Parametric statistics (e.g., means, Pearson r) are central to many quantitative analyses of information. They convey useful information in a compact “package.” But…… Terminating a quantitative analysis after computing summary statistics is like setting a book aside after reading the dust jacket You know something, but there is more to learn— useful knowledge that could deepen understanding or lead to more precise real-world action. Crosstabs to the Rescue Crosstabs provide a convenient, useful method to explore the continuous distribution(s) of variables summarized by means and correlations. Although parametric tools (such as the SD) offer insight into distributions…. Crosstabs convey information in tables that are understandable by non-statisticians. And they lend themselves to transformation into graphic visuals for the “numerically-challenged.” This presentation uses three examples: Example 1: The correlation between HSGPA & 1st semester freshman GPA (= 0.62) is dissected using a dual quintile (HSGPA quintiles by 1st sem. Fr. GPA quintiles) table. Example 2: Robust mean GPA differences (~0.30) between men and women students are analyzed using SPSS EXPLORE’s seven percentile categories. Example 3: Unusually high retention of business majors (vs. all other majors) is explored across the ‘GPA spectrum’ using quintiles. Example 1: How to dissect a Correlation The correlation (0.62) between HSGPA and 1st sem Fr. GPA is both typical and an indicator of a less-than-perfect ordering of case-by-case GPA pairs. We can literally see the nature of this “imperfection” by: (1) identifying quintile break points for HSGPA and for Fr. GPA. (See Appendix for methods.) Then (2) use Transform > “Recode [GPA] into another variable” [quintile] to create two categorical GPAs Finally (3), cross-tab the two “quintiled” GPAs in SPSS, using “Row Percent” to fill in the resulting table. Example 1: Notes Although quintiles are used in this example, any “slice & dice” set of categories can be used. The table in the next slide is “data-dense.” Readers may need some initial guidance (e.g., “Read table from rows, left to right”) and/or an illustration: “The table shows, for example, that 54% of freshmen with HSGPA < = 2.84 have 1st semester GPAs <= 2.38.” Example 1: Dissecting a Correlation with Quintiles Example 2: Mean GPA Differences Women consistently have higher mean GPAs than men 1st Year GPAs of 2007 SNC Freshmen 2007 Freshmen Women Men Difference (W – M) HSGPA 3.56 3.26 0.30 1 Sem GPA 3.07 2.71 0.36 3.15 2.80 0.35 st 2 nd Sem GPA GPAs of all 2007-08 enrolled SNC Students 2007 Enrolled Women Men Difference (W – M) HSGPA 3.48 3.18 0.30 SNC GPA 3.16 2.84 0.32 Example 2: Exploring Mean Differences It is an easily-made assumption that a mean difference is present equally “across the board.” Example: “The mean GPA for women students is ~ 0.30 higher than the mean GPA for men” can easily become: “The GPA for women is ~0.30 higher than for men.” The second statement encourages the assumption that the mean difference is present throughout the range of GPAs (i.e., “across the board”). Let’s see. Example 2: EXPLORE-ing a Mean Difference I used the “Percentiles” table provided by SPSS EXPLORE to get percentile points across the “GPA spectrum” for men and women students. The EXPLORE Percentiles table gives weighted mean GPAs for each of seven (5th, 10th, 25th, 50th, 75th, 90th, & 95th) percentile ranks. I then used EXCEL to create a visually more attractive and useful crosstab table than the one provided in SPSS output. See the next slide. Example 2: Mean GPA Differences by EXPLORE Ranks 2005-2007 Freshmen 2005-07 Freshmen Gender H.S. GPA Percentiles for HSGPA 5 10 25 50 75 90 95 Men (N = 654) Women (N = 879) 2.21 2.56 2.41 2.74 2.76 3.13 3.15 3.53 3.61 3.85 3.89 4.00 3.98 4.05 Diff (M-F) -0.35 -0.34 -0.37 -0.38 -0.24 -0.12 -0.07 2005-07 Freshmen Gender 1st Sem GPA Percentiles for SNC SEM1 GPA 5 10 25 50 75 90 95 Men (N = 654) Women (N = 879) 1.50 1.82 1.85 2.14 2.33 2.67 2.88 3.17 3.38 3.60 3.75 3.88 3.88 4.00 Diff (M-F) -0.32 -0.29 -0.34 -0.29 -0.22 -0.13 -0.12 The mean gender difference in GPA continually shrinks in the above-average ranks. Example 3: Investigating an Unusual Result. Fact: HSGPA and 1st sem. GPA are positively correlated with retention to sophomore year (rpbis = ~0.24 & ~0.35, respectively). Fact: Freshman males in business had lower mean HSGPAs than men in all other majors (3.17 vs. 3.36) and lower 1st semester GPAs (2.75 vs. 3.00) as well. Fact: 2005-2007 freshman males in business retained to 2nd year at a higher rate than men in all other majors combined (89% vs. 80%). Example 3: Question Linear correlations of HSGPA and 1st semester GPA with retention to sophomore year are modest in size but both GPAs are typically two of the strongest preand post-matriculation correlates of retention. Based on GPAs, we would not expect 1st year men majoring in business to retain at a higher rate than men in other majors. But they clearly do (89% vs. 80%). Question: Is this higher retention rate present across the GPA spectrum? What do you think? Example 3: Percent retained, by GPA Quintile—Business vs. all other majors Male 2005-2007 Freshman Business majors, compared to men in all other majors, are retained to sophomore year in greater percentages at every GPA quintile: 2005-07 Freshman Men: Percent Retained to Sophomore Year, by GPA GPA Quintile <=2.38 2.39-2.88 2.89-3.25 3.26-3.63 3.64+ Business All Others 79% 66% 94% 90% 94% 92% 96% 82% 100% 94% Diff (Bu-Others) 13% 4% 2% 14% 6% Conclusions Crosstabs: a convenient and useful way to further explore continuous variables summarized with a single parametric statistic. Crosstabs : (1) improve understanding of variables of interest and… (2) suggest directions for further research and/or “real world” action. Questions?? 1. ?? Etc.? This PowerPoint and the “Gender Matters” monograph can be accessed at : www.snc.edu/oie/ Click on the “Public Access Documents and Resources” Quick Link on right side of page. Thank You for Coming! jack.williamsen@snc.edu Appendix: SPSS methods for creating categories from continuous variables (Analyze > Frequencies > Statistics > Percentile Values) offers a number of user-selected options for generating categories, including percentiles. (Transform > Visual Binning) is also versatile, and provides a visual representation of the distribution of the variable of interest. Make cuts any way you wish. (Analyze > Descriptive Statistics >Explore > Statistics > Percentiles) provides a fixed set of percentile breaks. See Example 2 for an illustration.