Implementing “generalization, example, exception”: Behind-the-scenes work for summarizing a pattern Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Overview • Steps – – – – – – Display the data in a chart Indentify the dimensions of the comparison Choose a representative example Characterize the pattern Identify exceptions Write the description • “Generalization, example, exception” useful for – Three-way patterns – Interaction patterns calculated from multivariate coefficients The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Objectives of a GEE • GEE = “generalization, example, exception” • Aims to describe the forest, not the trees, in a complex pattern involving a lot of numbers – Which aspects can be summarized? – Which aspects deviate from that summary? • In a three-way association, describe the pattern among – Two independent variables (IVs) and a dependent variable (DV) – One IV and a series of DVs The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Display the data in a chart • Create a chart of the association • Charts are the easiest way to see patterns – Which values are highest and by how much compared to other values? – Whether trends are up, down, or level The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Table and corresponding chart Knowledge about AIDS transmission, by language spoken at home and ability to speak English, New Jersey, 1998 English Mode of transmission Likely modes of transmission Language spoken at home/ language used on questionnaire Spanish Spanish /English /Spanish English ques. ques. Spanish /English ques. Spanish /Spanish ques. 100 90 80 70 60 Sexual intercourse with an infected person 93.6 87.5 95.0 50 Shared needles for IV drug use 92.4 90.6 65.0 40 Pregnant mother to baby 89.5 75.0 80.0 30 Blood transfusion from infected person 87.5 81.3 60.0 20 10 0 Sexual Shared Pregnant Blood intercourse needles for mother to transfusion with an IV drug use baby from infected infected person person The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Design issues for your chart • Draw it to scale, so you can assess – Directionality – Size of differences and trends across values • Organize variables in the order you will describe them – Theoretical criteria – Empirical order – See podcast on organizing data in tables and charts The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Use a spreadsheet to create your chart • A spreadsheet can – Test different ways of organizing the variables in your chart – Calculate magnitude of differences across values for use in your description • Difference (subtraction) • Ratios (e.g., relative risks) • Percentage change The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Identify dimensions of the comparison • One dimension for each variable or set of variables in your table or chart • In a table, one dimension for each – Row – Column – Panel • In a chart, one dimension for each – Axis – Legend item The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Dimensions in a three-way table • Moving down the rows within one column • Moving across the columns within one row Knowledge about AIDS transmission, by language spoken at home and ability to speak English, New Jersey, 1998 Language spoken at home/ language used on questionnaire Mode of transmission Likely modes of transmission Sexual intercourse with an infected person Shared needles for IV drug use Pregnant mother to baby Blood transfusion from infected person English 93.6 92.4 89.5 87.5 Spanish Spanish /English ques. /Spanish ques. 87.5 90.6 75.0 81.3 95.0 65.0 80.0 60.0 The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Dimensions in a clustered bar chart • Moving across clusters within one bar color • Moving across bar colors within one cluster 100 90 80 70 60 50 English 40 Spanish /English ques. 30 Spanish /Spanish ques. 20 10 0 Sexual intercourse Shared needles for Pregnant mother Blood transfusion with an infected IV drug use to baby from infected person person The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Dimensions in a multiple-line chart • Moving left to right along one line • Moving vertically across lines at one value of the x variable Median sales price of new one-family homes, by region, US, 1980 to 2000 Median price ($) $250,000 $200,000 Northeast Midwest South West $150,000 $100,000 $50,000 $1980 1985 1990 1995 2000 Year The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Choose a representative example • Identify one representative example as the basis for each generalization about the shape of the pattern – Must characterize many if not most values in the pattern – Not an arbitrary choice! • Find one value of the first independent variable to use to describe the general shape of the association between the second IV and the DV. E.g., – Hold one IV constant while you vary the other – Don’t vary values of both IVs at the same time The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. What does “representative” mean? • A representative example of a general pattern will be characteristic of the association between the second IV and the DV for many if not all subgroups of that first IV. E.g., – A region that shows a time trend in prices that is similar to that observed in several other regions – A topic for which the rank order of language groups in test performance is the same The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Possible starting points for identifying representative examples • Theoretically based – A value of particular interest for your research question – A summary measure that combines results for the various component variables • Useful if comparing across a series of related outcomes • Empirically based – Start at one end of an empirically organized axis, column, or row – Overall sample, for a comparison across groups – Modal (most common) group The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Checking for representativeness of example in a three-way table • Describe the pattern across columns within one row • Test whether the direction and magnitude of that pattern also applies to other rows Knowledge about AIDS transmission, by language spoken at home and ability to speak English, New Jersey, 1998 Language spoken at home/ language used on questionnaire Mode of transmission Likely modes of transmission Sexual intercourse with an infected person Shared needles for IV drug use Pregnant mother to baby Blood transfusion from infected person Mean percentage of “likely” questions correct English 93.6 92.4 89.5 87.5 91.7 Spanish Spanish /English ques. /Spanish ques. 87.5 90.6 75.0 81.3 83.6 95.0 65.0 80.0 60.0 75.0 The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Checking for representativeness of example in a three-way table • Describe the pattern down the rows within one column • Test whether the direction and magnitude of that pattern also applies to other column Knowledge about AIDS transmission, by language spoken at home and ability to speak English, New Jersey, 1998 Language spoken at home/ language used on questionnaire Mode of transmission Likely modes of transmission Sexual intercourse with an infected person Shared needles for IV drug use Pregnant mother to baby Blood transfusion from infected person English 93.6 92.4 89.5 87.5 Spanish Spanish /English ques. /Spanish ques. 87.5 90.6 75.0 81.3 95.0 65.0 80.0 60.0 The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Checking representativeness of an example in a multiple-line chart • Describe the pattern cutting vertically across lines (regions) at one value of the x variable (year) • Check whether that rank order of regions also applies to the other dates Median sales price of new one-family homes, by region, US, 1980 to 2000 Median price ($) $250,000 $200,000 Northeast Midwest South West $150,000 $100,000 $50,000 $1980 1985 1990 1995 2000 Year The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Checking representativeness of an example in a multiple-line chart • Describe the pattern along one line moving left to right • Test whether that pattern applies to the other lines Median sales price of new one-family homes, by region, US, 1980 to 2000 Median price ($) $250,000 $200,000 Northeast Midwest South West $150,000 $100,000 $50,000 $1980 1985 1990 1995 2000 Year The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Trial and error in identifying a representative example • Might have to try several groups or values to find one that has a pattern that is similar to that in other groups • If your initial choice isn’t similar to any of the others in terms of direction or magnitude, it might be an exception • See whether other values have a pattern that can be generalized The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Characterize the pattern • For your representative value of one independent variable, characterize these aspects of the association between the other independent variable and the dependent variable: – Direction – Magnitude – Statistical significance The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Characterizing direction in a cross-sectional comparison • Which categories of the independent variable (IV) have the highest and lowest values of the dependent variable (DV)? • For multiple category IVs, describe the rank order of categories according to values of the DV The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Characterizing direction for trends • Describe whether each line slopes – – – – Upward Downward Is approximately level Follows a standard statistical distribution • e.g., a normal curve – Asymptotically approaches some value The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Characterizing magnitude • Do some behind-the-scenes math to measure the size of differences between values you are comparing. E.g., – Difference (subtraction or OLS coefficient) – Ratio (division, or odds ratio from logit model) – Percentage difference or change • See chapters on types of quantitative comparisons to help you choose the kind of calculation that best suits your objectives The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Take notes about direction and magnitude • On a hardcopy chart, make notes about: – Direction: Use “>,” “<,” “=,” or arrows – Size: Jot results of calculations to compare values • To help you later write a correct, complete description, mention – Units – Type of calculation – Shape: Verbs, adjectives, adverbs, analogies, or metaphors that capture the shape of the pattern – Statistical significance: of specific contrasts, marked with • Symbols • Shading (highlighter) • Don’t worry about full sentences at this point The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Identify exceptions • If parts of your table or chart depart substantially from the general pattern you have identified, they are exceptions • Exceptions can occur in terms of – Direction – Size – Statistical significance The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Exceptions in direction of a trend • The median sales prices dipped in the early 1990s in the West, but continued upward or remained level in each of the other three regions – The West was the exception Median sales price of new one-family homes, by region, US, 1980 to 2000 Median price ($) $250,000 $200,000 Northeast $150,000 Midwest $100,000 South West $50,000 $1980 1985 1990 1995 2000 Year The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Exceptions in magnitude of a cross-sectional difference – Generalize based on the two income groups for which the racial gap in ER use is similar (poor and near poor) – Non-poor are the exception Relative odds of emergency room visits for asthma, by race and income Black Non-black 8 Relative odds • The difference between odds ratios across racial groups is much larger among the non-poor than in the other two income groups (compare brackets 1 and 2 to bracket 3) 2 7 6 5 4 1 3 3 2 1 0 Poor Near poor Non-poor Income group The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Exceptions in statistical significance • Examples of this type of exception – All but one of the coefficients in a regression model are statistically significant at p < 0.05 – Only one of the coefficients in a regression model is statistically significant • The generalization is lack of statistical significance – In stratified models of the same outcome for several groups, the coefficient on one or more specific independent variables are statistically significant only for one group The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Write the description • Organize the material into paragraphs – One paragraph for each major pattern to be described • Write topic sentences for each paragraph – Introduce the substantive concepts to be discussed in that paragraph • Write evidentiary sentences to describe the – General pattern – Any exceptions • In those descriptions, mention – Direction – Magnitude – Statistical significance The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Wording for a generalization • If a pattern characterizes many of the numbers you are summarizing, introduce the generalization with – “In general, . . .” – “Typically, . . .” – “By and large . . .” – If the pattern characterizes all groups, say so! – “In every region studied, . . .” – “Throughout the entire period of observation . . .” – “Regardless of [characteristic], . . .” The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Wording to present the example • To document a pattern – Introduce numeric evidence – Refer to the table or chart that reports all of the numbers • E.g., – “For instance, . . . [numeric contrast, with the W’s specified].” – “For example, as shown in Figure X . . . [numeric fact, with its W’s attached].” • Reminder: “the W’s” refer to when, where, who, and what The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Wording to introduce exception(s) • To introduce exceptions to a general pattern, write – “An exception [to that pattern] is . . . ” – “On the other hand, . . .” – “However, . . .” • E.g., “In seven out of 10 years studied, [general pattern and example]. However, in the other three years, [contrasting pattern = exception].” The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Wording for exceptions in direction • Some exceptions are literally the opposite of the pattern described in the generalization. E.g., – A falling rather than rising trend – A deficit instead of excess for one group compared to another • Having described the direction of the general pattern, introduce such exceptions with phrases such as: – “On the contrary, . . .” – “Conversely, . . .” The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Summary • Use a systematic, step-by-step approach to identify which parts of a broad pattern are – Similar to one another – Different from one another • In terms of direction, magnitude, and statistical significance • Create a chart or table that you refer to in your prose description of that pattern • Organize material into substantively cohesive paragraphs • Use vocabulary and phrasing to differentiate general patterns from exceptions The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Suggested resources • Chapter 2 and appendix A in – Miller, J.E., 2004. The Chicago Guide to Writing about Numbers. (“WA#”) OR – Miller, J.E., 2013. The Chicago Guide to Writing about Multivariate Analysis, 2nd edition. (“WAMA II”) • Chapter 7 in WA# or chapter 6 in WAMA II on creating effective charts • Chapter 5 in WA# or chapters 8 and 9 in WAMA II on types of quantitative comparisons • Chapter 9 of WA#, section on three-way associations • Chapter 16 of WAMA II, on interactions The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Suggested online resources • Podcasts on – Reporting one number – Comparing two numbers or series of numbers – Summarizing a pattern with many numbers – Organizing data in tables and charts – Visualizing shapes of interaction patterns – Writing prose to present results of an interaction The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Suggested practice exercises • Study guide to The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. – Question #9 in problem set for chapter 2 – Question #9 in problem set for chapter 6 – Suggested course extensions for • chapter 2 – “Reviewing” exercise #7 – “Writing” and “revising” exercise #4 • chapter 14 – “Applying statistics and writing” exercises #3 and 4 – “Revising” exercise #3 The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Contact information Jane E. Miller, PhD jmiller@ifh.rutgers.edu Online materials available at http://press.uchicago.edu/books/miller/multivariate/index.html The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition.