DAGs intro with exercises 8h (reordered 4.5+3.5) DAG=Directed Acyclic Graph Hein Stigum http://folk.uio.no/heins/ courses May-16 H.S. 1 Agenda • Background • DAG concepts – Causal thinking, Paths • Analyzing DAGs – Examples • DAGs and stat/epi phenomena Exercises – Mediation, Matching, Mendelian randomization, Selection • More on DAGs • Limitations, problems May-16 H.S. 2 Background • Potential outcomes: Neyman, 1923 • Causal path diagrams: Wright, 1920 • Causal DAGs: Pearl, 2000 May-16 H.S. 3 Why causal graphs? • Estimate effect of exposure on disease* *DAGs not applicable for prediction models • Problem – Association measures are biased • Causal graphs help: – Understanding • Confounding, mediation, selection bias – Analysis • Adjust or not – Discussion • Precise statement of prior assumptions May-16 H.S. 4 Causal versus casual CONCEPTS (Rothman et al. 2008; Veieroed et al. 2012 May-16 H.S. 5 god-DAG Causal Graph: Node = variable Arrow = cause E=exposure, D=disease DAG=Directed Acyclic Graph Read of the DAG: Causality = arrow Association = path Independency = no path Estimations: E-D association has two parts: ED causal effect keep open ECUD bias try to close Conditioning (Adjusting): E[C]UD Time May-16 H.S. 6 Association and Cause Association 3 possible causal Association 3 possible causal structures structures 3 possible causal structure Association 1 1 Yellow Yellow fingers fingers Lung Lung cancer cancer Cause Cause (reverse cause) E Yellow Yellow fingers fingers Smoke Smoke D Lung Lung cancer cancer 2 2 Yellow Yellow fingers fingers Confounder Confounder Lung Lung cancer cancer U U 3 3 Yellow Yellow fingers fingers Collider Collider Lung Lung cancer cancer + more complicated structures May-16 H.S. 7 Confounder idea A common cause Smoking + Adjust for smoking Smoking + Yellow fingers Lung cancer + Yellow fingers + Lung cancer + • A confounder induces an association between its effects • Conditioning on a confounder removes the association • Condition = (restrict, stratify, adjust) • Paths • Simplest form • Causal confounding, (exception: see outcome dependent selection) May-16 H.S. 8 Collider idea Two causes for selection to study Selected + Yellow fingers Selected subjects Selected + Lung cancer + + Yellow fingers Lung cancer - or + and • Conditioning on a collider induces an association between its causes • “And” and “or” selection leads to different bias • Paths • Simplest form May-16 H.S. 9 Mediator M • Have found a cause (E) • How does it work? – Mediator (M) E direct effect D – Paths 𝑇𝑜𝑡𝑎𝑙 𝑒𝑓𝑓𝑒𝑐𝑡 = 𝑖𝑛𝑑𝑖𝑟𝑒𝑐𝑡 + 𝑑𝑖𝑟𝑒𝑐𝑡 𝑖𝑛𝑑𝑖𝑟𝑒𝑐𝑡 𝑀𝑒𝑑𝑖𝑎𝑡𝑒𝑑 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 = 𝑡𝑜𝑡𝑎𝑙 Use ordinary regression methods if: linear model and no E-M interaction. Otherwise, need new methods May-16 H.S. 10 Causal thinking in analyses May-16 H.S. 11 Regression before DAGs Use statistical criteria for variable selection Risk factors for D: Variable OR E 2.0 C 1.2 Comments Surprisingly low association Association Both can be misleading! C E May-16 Report all variables in the model as equals D H.S. 12 Statistical criteria for variable selection C E D - Want the effect of E on D (E precedes D) - Observe the two associations C-E and C-D - Assume statistical criteria dictates adjusting for C (likelihood ratio, Akaike (赤池 弘次) or 10% change in estimate) The undirected graph above is compatible with three DAGs: C C E D Confounder 1. Adjust Conclusion: E C D Mediator 2. Direct: adjust 3. Total: not adjust E D Collider 4. Not adjust The data driven method is correct in 2 out of 4 situations Need information from outside the data to do a proper analysis DAGs variable selection: close all non-causal paths May-16 H.S. 13 Reporting variable as equals: Association versus causation Use statistical criteria for variable selection Risk factors for D: Variable OR E 2.0 C 1.2 Comments Surprisingly low association Association Causation C C E D Symmetrical C is a confounder for E-D E is a confounder for C-D E Report all variables in the model as equals Base adjustments on a DAG Report only the E-effect D or use different models for each variable Directional C is a confounder for ED E is a mediator for CD Westreich & Greenland 2013 May-16 H.S. 14 Exercise: report variables as equals? Risk factors for Fractures Comments (surprises) Interpret as effect of: Physical activity 1.2 Protective in other studies? Obesity 1.0 No effect? Diabetes adjusted for all other vars. Phy. act. adjusted for all other vars. Obesity adjusted for all other vars. Bone density 0.8 Variable OR Diabetes 2 2.0 Bone d. adjusted for all other vars. P physical activity D E fractures diabetes 2 O B obesity bone density May-16 1. P is a confounder for E→D, but is E a confounder for P→D? 2. Which effects are reported correctly in the table? 5 min H.S. 15 Exercise: Stratify or not Want the effect of action(A, exposure/treatment) on disease (D). Have stratified on C. 1. Make a guess at the population effect of A on D 2. Calculate the population effect of A on D 3. What is the correct analysis (and RR)? OBS several answers possible! Population D 1 A 1 0 C=0 D sum risk 210 A 0 1 0 RR= C=1 D 1 0 sum risk 10 70 90 330 100 400 500 0.10 0.18 RR= 0.6 40% less disease if treated Population = crude A May-16 1 A0 1 0 sum risk 200 80 200 20 400 100 500 0.50 0.80 RR= 0.6 40% less disease if treated Stratified = adjusted for C C D 10 min H.S. Hernan et al. 2011 16 Summing up so far • Associations visible in data. Causation from outside the data. • Data driven analyses do not work. Need causal information from outside the data. • Reporting table of adjusted associations is misleading. • Simpson’s paradox: causal information resolves the paradox. May-16 H.S. 17 The Path of the Righteous Paths May-16 H.S. 18 Path definitions Path: any trail from E to D (without repeating itself) Type: causal, non-causal State: open, closed 1 2 3 4 Four paths: Path ED EMD ECD EKD Goal: Keep causal paths of interest open Close all non-causal paths May-16 H.S. 19 Four rules 1. Causal path: ED (all arrows in the same direction) otherwise non-causal Before conditioning: 2. Closed path: K (closed at a collider, otherwise open) Conditioning on: 3. a non-collider closes: [M] or [C] 4. a collider opens: [K] (or a descendant of a collider) May-16 H.S. 20 ANALYZING DAGs May-16 H.S. 21 Confounding examples May-16 H.S. 22 Vitamin and birth defects 1. Is there a bias in the crude E-D effect? 2. Should we adjust for C? 3. What happens if age also has a direct effect on D? Unconditional Path 1 ED 2 ECUD Type Status Causal Open Non-causal Open Conditioning on C Path 1 ED 2 EC]UD 3 EC] D Type Causal Non-causal Non-causal May-16 May-16 Status Open Closed Closed Bias No bias H.S. This is an example of confounding Question: Is U a confounder? 23 Exercise: Physical activity and Coronary Heart Disease (CHD) We want the total effect of Physical Activity on CHD. 1. Write down the paths. 2. Are they causal/non-causal, open/closed? 3. What should we adjust for? 5 minutes May-16 H.S. 24 Direct and indirect effects Intermediate variables May-16 H.S. 25 Exercise: Tea and depression 1. Write down the paths. O coffee E tea 2. You want the total effect of tea on depression. What would you adjust for? C caffeine D depression 3. You want the direct effect of tea on depression. What would you adjust for? 4. Is caffeine an intermediate variable or a variable on a confounder path? 10 minutes Hintikka et al. 2005 May-16 H.S. 26 Exercise: Statin and CHD C cholesterol E U lifestyle D CHD statin 1. Write down the paths. 2. You want the total effect of statin on CHD. What would you adjust for? 3. If lifestyle is unmeasured, can we estimate the direct effect of statin on CHD (not mediated through cholesterol)? 4. Is cholesterol an intermediate variable or a collider? 10 minutes May-16 H.S. 27 Direct and indirect effects So far: Causal interpretation: Controlled (in)direct effect linear model and no E-M interaction New concept: Natural (in)direct effect Causal interpretation also for: non-linear model , E-M interaction Controlled effect = Natural effect if linear model and no E-M interaction Hafeman and Schwartz 2009; Lange and Hansen 2011; Pearl 2012; Robins and Greenland 1992; VanderWeele 2009, 2014 May-16 H.S. 28 Confounder, collider and mediator Mixed May-16 H.S. 29 Diabetes and Fractures We want the total effect of Diabetes (type 2) on fractures Conditional Unconditional Path Path 11 E→D E→D 22 E→F→D E→F→D 33 E→B→D E→B→D 44 E←[V]→B→D E←V→B→D 55 E←[P]→B→D E←P→B→D May-16 Type Type Causal Causal Causal Causal Causal Causal Non-causal Non-causal Non-causal Non-causal Status Status Open Open Open Open Open Open Closed Open Closed Open H.S. Questions: Mediators Paths ←→? More paths? B a collider? V and P ind? Confounders 30 Drawing DAGs May-16 H.S. 31 Technical note on drawing DAGs • Drawing tools in Word (Add>Figure) • Use Dia • Use DAGitty • Hand-drawn figure. May-16 H.S. 32 Direction of arrow C Smoking ? E D Phys. Act. Diabetes 2 H C Health con. Smoking E D Phys. Act. Diabetes 2 May-16 Does physical activity reduce smoking, or does smoking reduce physical activity? Maybe an other variable (health consciousness) is causing both? H.S. 33 Time C Smoking ? E D Phys. Act. Diabetes 2 Does physical activity reduce smoking, or does smoking reduce physical activity? C Smoking -5 E D Phys. Act. -1 Diabetes 2 May-16 Smoking measured 5 years ago Physical activity measured 1 year ago H.S. 34 Drawing a causal DAG Start: E and D add: [S] add: C-s 1 exposure, 1 disease variables conditioned by design all common causes of 2 or more variables in the DAG C V E D C must be included V may be excluded M may be excluded K may be excluded common cause exogenous mediator unless we condition M K May-16 H.S. 35 Exercise: Drawing survivor bias 1. We what to study the effect of exposure early in life (E) on disease (D) later in life. 2. Exposure (E) decreases survival (S) in the period before D (deaths from other causes than D). 3. A risk factor (R) reduces survival (S) in the period before D. 4. The risk factor (R) increases disease (D). 5. Only survivors are available for analysis (look at Collider idea). Draw and analyze the DAG 10 minutes May-16 H.S. 36 Real world examples May-16 H.S. 37 Endurance training and Atrial fibrillation Tobacco Socioeconomic Status ** Endurance training Cardiovascular factors * Alcohol consumption BMI Diabetes Genetic disposition Atrial fibrillation Hyperhyreosis Health *** consciousness Height Gender Long-distance racing Several arrows missing! Age *Hypertension, heart disease, high cholesterol ** Socioeconomic status: Education, marital status *** Unmeasured factors (Blue: Mediators, red: confounders, violet: colliders) Myrstad et al. 2014b METHODS TO REMOVE CONFOUNDING May-16 H.S. 39 Methods to remove confounding Method Action DAG effect C Condition: Restrict, Stratify, Adjust Close path E D C Cohort matching, Propensity Score Inverse Probability Treatment Weighting Remove CE arrow E D C Case-Control matching? Other methods? May-16 Remove CD arrow H.S. E D 40 Matching: Cohort vs Case-Control May-16 H.S. 41 Matching in cohort, binary E Matching: For every exposed person with a value of C, find an unexposed person with the same value of C S E selected based on E and C E independent of C after matching All open paths between C and E must C→E C→[S]←E C D C sum to “null” E Cohort matching removes confounding D Unfaithful DAG Cohort matching is not common, except in propensity score matching May-16 H.S. Mansournia et al. 2013; Shahar and Shahar 2012 42 Matching in Case Control, binary D C Matching: For every case with a value of C, find a control with the same value of C E selected based on D and C D independent of C after matching All open paths between C and D must C→[S]←D D C C→D sum to “null” C→E→D S E S D Case-Control matching does not removes confounding, unless E→D=0 (or C→E=0) must adjust for C in all analyses Case-Control matching common, may improve precision May-16 H.S. (Mansournia et al. 2013; Shahar and Shahar 2012 43 Inverse probability weighting May-16 H.S. 44 Marcov decomposition • DAG implies: Joint probability can be factorized into the product of conditional distributions of each variable given its parents C 𝑃 𝐶, 𝐸, 𝐷 = 𝑃(𝐶) ∙ 𝑃(𝐸|𝐶) ∙ 𝑃(𝐷|𝐸, 𝐶) E D Pearl 2000 May-16 H.S. 45 Inverse Probability Weighting C Have observed data distribution: Factorization: 𝑃(𝐶) ∙ 𝑃(𝐸|𝐶) ∙ 𝑃(𝐷|𝐸, 𝐶) E C Want the RCT distribution: Factorization: D 𝑃(𝐶) ∙ 𝑃(𝐸) ∙ 𝑃(𝐷|𝐸, 𝐶) E D Can reweight the observed data with weights: 𝑃(𝐸)/𝑃(𝐸|𝐶) to obtain the RCT distribution IPW knocks out arrows in the DAG May-16 H.S. 46 Odds of Treatment weighting (OT) Previous weighting targeted the Average Causal Effect Want now the effect among the exposed (treated) instead 𝑃(𝐸 = 1|𝐶) is the probability of treatment Can reweight the observed data with weights: 𝑃(𝐸 = 1|𝐶)/𝑃(𝐸|𝐶) to obtain the RCT distribution among the exposed 𝑃(𝐸 = 1|𝐶)/𝑃 𝐸 𝐶 = 𝑃(𝐸 = 1|𝐶)/𝑃 𝐸 = 1 𝐶 =1 𝑃(𝐸 = 1|𝐶)/𝑃 𝐸 = 0 𝐶 =OT if E=1 if E=0 McCaffrey et al. 2004 May-16 H.S. 47 Marginal Structural Model C DAG for the reweighted pseudo data E D MSM: The expected value of a counterfactual outcome D under a hypothetical exposure e: 𝐸 𝐷𝑒 = 𝛼0 + 𝛼1 𝑒 effect = 𝐸 𝐷1 − 𝐸 𝐷0 May-16 Veieroed, Lydersen et al. 2012. Daniel, Cousens et al. 2013. Rothman, Greenland et al. 2008 HS 48 Outcome versus exposure modeling May-16 H.S. 49 Methods to remove confounding A E C B D How can we remove confounding? What variables should be involved? C- yes A- no B- maybe • Close ECD by conditioning – (ordinary) regression model E(D| E,C) outcome modeling • Remove the EC arrow, binary E – – Propensity score matching Inverse probability weighting P(E|C) exposure modeling Combine exposure and outcome modeling: doubly robust models Outcome vs exposure modeling • Outcome model: E(D|E,C) (and possibly B) A • Ex: Nano-particlesCardioVascularDisease E – Know little about risk for nano-particles – Know a lot about risk factors for CVD A • Ex: SmokingBladder cancer E May-16 H.S. B D Do outcome model • Exposure model: E(E|C) – Know a lot about risk for smoking – Know little about risk factors for bladder cancer C C B D Do exposure model 51 Doubly robust methods • Combine the outcome and the exposure models: Do the regression E(D|E,C) with inverse probability weighting (OT) • Will be unbiased if the outcome- or the exposure-model is correct • Doubly robust methods: Twice as right! May-16 H.S. 52 Randomized experiments Mendelian randomization May-16 H.S. 53 Strength of arrow, randomization E Not deterministic C1 D C2 C1, C2, C3 exogenous C3 C R full compliance deterministic E Full compliance no E-D confounding D U R not full compliance E D Sub analysis conditioning on E may lead to bias May-16 Not full compliance weak E-D confounding but R-D is unconfounded Path 1 RED 2 REUD H.S. Type Status Causal open non-causal closed 54 Randomized experiments C E Observational study D C R E D U R c E IVe ITTe D Randomized experiment with full compliance R= randomized treatment E= actual treatment. R=E Randomized experiment with less than full compliance (c) If linear model: ITTe=c*IVe, c<1 IntentionToTreat effect: effect of R on D (unconfounded) population PerProtocol: crude effect of E on D (confounded by U) InstrumentalVariable effect: adjusted effect of E on D (if c is known, 2SLS) individual May-16 H.S. 55 RCT exercise E R + - + 85 0 85 15 100 115 D R + - + 43 63 106 57 37 94 D E + - + 32 74 106 53 41 94 N Risk 100 100 200 0.85 0.00 N Risk 1. 100 100 0.43 0.63 Calculate the compliance (c) as a risk difference from the table. 2. Calculate the intention to treat effect (ITTe) as a risk difference. N Risk 3. 85 115 0.38 0.64 Calculate the per-protocol effect (PP) as a risk difference. 4. Calculate the instrumental variable effect (IVe). 5. Explain the results in words. R+ means randomized to treatment, E+ means treated and D+ means getting disease. 0.85 is the risk of treatment for R+ subjects, 0.00 is the risk for R- subjects, the risk difference is the difference between these. U 0.22 -0.15 R c E IVe D 10 minutes ITTe May-16 H.S. 56 Mendelian randomization • U Observational study – • Suffers from unmeasured confounding Randomized trial: 3 conditions 1. R affects E: 2. No direct R-D effect: 3. R and D no common causes: • E D U 3 balanced, strong effect R independent of D | E R 1 E R independent of U 2 Medelian randomization: 3 conditions 1. G must affect E: unbalanced, weak large N 2. No direct G-D effect: depends on gene function 3. G and D no common causes: Mendel’s 2. law D U 3 G 1 E D 2 Sheehan et al, 2008 May-16 May-16 H.S. H.S 57 57 Ex: Alcohol and blood pressure • U Observational study – – Alcohol use increases blood pressure Many ”lifestyle” confounders A BP • Gene: ALDH2, 2 alleles – – • 2,2 type suffer nausea, headache after alcohol low alcohol regardless of lifestyle (U) 40 30 20 10 0 1,1 Medelian randomization 1. Gene ALDH2 is highly associated with alcohol 2. OK, gene function is known 3. Mendel’s 2. law, no ass. to obs. confounders • alcohol ml/day Alcohol use Result: – – 1,2 Genotype U 3 G 1 2,2 A BP 2 1,1 type BP +7.4 mmHg Alcohol increases blood pressure Chen et al 2008 May-16 May-16 H.S. H.S 58 58 DAGs and other causal models May-16 H.S. 59 DAGs and causal pies DAG Sufficient causes E1 E1 1. E1 or E2 E2 D E1 E2 2. E1 and E2 E2 E1 E2 E1 E2 3. both DAGs are less specific than causal pies DAGs are scale free, interaction is scale dependent Greenland and Brumback, Causal modeling methods, Int J Epid 2002 May-16 H.S. 60 Exercise: causal pies 1. Write down the causal pies for getting into hospital based on the DAG. Show that the DAG is compatible with at least 3 different combinations of sufficient causes. H hospital E diabetes D fractures 2. Selection bias: Discuss how the different combinations of sufficient causes for getting into hospital might affect the estimate of E on D among hospital patients (perhaps difficult). 10 minutes May-16 H.S. 61 Structural Equation Models, SEM • Causal assumptions + statistical model + data • SEM: parametric DAG May 16 H.S. 62 Causal models compared • DAGs – qualitative population assumptions – sources of bias (not easily seen with other approaches) • Causal Pies (SCC) – specific hypotheses about mechanisms of action • SEM – quantitative analysis of effects May-16 H.S. 63 Two (3) concepts Selection bias May-16 H.S. 64 Selection bias: concept 1 Simple version • “Selected different from unselected” • Prevalence (D) Old have lower prevalence than young Old respond less to survey Selection bias: prevalence overestimated • Effect (E→D) Old have lower effect of E than young Old respond less to survey Selection bias: effect of E overestimated May-16 H.S. 65 Selection bias: concept 1 “Selected different from unselected” Paths smokeCHD S age smoke CHD Age Young Old All Type Causal Status Open Population RRsmoke Selected RRsmoke 50 % 4.0 75 % 4.0 50 % 2.0 25 % 2.0 3.0 3.5 Normally, selection variables unknown Name: interaction based? May-16 • Properties: - Need smoke-age interaction - Cannot be adjusted for, but stratum effects OK - True RR=weighted average of stratum effects - RR in “natural” range (2.0-4.0) - Scale dependent H.S. 66 Selection bias: concept 2 Simple version • “Distorted E-D distributions” • DAG Collider bias • Words Selection by sex and/or age Distorted sex-age distribution Old have more disease Men are more exposed Distorted E - D distribution May-16 H.S. 67 Selection bias: concept 2 “Distorted E-D distributions” S sex age smoke CHD Paths Type Status smokeCHD Causal Open smokesexSageCHD Non-causal Open Properties: Name: Collider stratification bias Open non-causal path (collider) Does not need interaction Can be adjusted for (sex or age) Not in “natural” range (“surprising bias”) Selection bias types: Berkson’s, loss to follow up, nonresponse, self-selection, healthy worker Hernan et al, A structural approach to selection bias, Epidemiology 2004 May-16 H.S. 68 22 1) “Exclusive or” selection S=5% -0.5 0.5 0.0 -1 00 IQ 11 S=95% S=95% -2 S=5% -2 -2 May-16 -1-1 0 0 EMF H.S. 1 1 2 2 69 Exercise: Dust and COPD Chronic Obstructive Pulmonary Disease D0 S cur. worker H health diseases E E0 D COPD prior dust cur. dust COPD risks: Dust low Health good poor high 5 % 10 % 10 % 20 % 1. Calculate the RR of dust on COPD in good and poor health groups. 2. Write down the paths for the effect of E on D. E0 and D0 are unknown (past) measures. 3. What would you adjust for? 4. Suppose the crude effect of dust on COPD is RR=0.7 and the true RR=2. What do you call this bias? 5. Could the concept 1 (interaction based) selection bias work here? 10 minutes May-16 H.S. 70 Convenience sample, homogenous sample H 1. Convenience: Conduct the study among hospital patients? hospital E diabetes Conditional Unconditional Path 1 E→D 2 E→H←D E→[H]←D D 2. Homogeneous sample: Population data, exclude hospital patients? fractures Type Causal Non-causal Non-Causal Status Open Closed Open Collider, selection bias Collider stratification bias: at least on stratum is biased May-16 H.S. 71 Outcome dependent selection Selection into the study based on D. Get bias among selected. S D E U 4 5 Explanation: • Always have exogenous U. 0.6 2 D 3 • D is a collider on E→D←U, S is a descendant of collider D. 1 1.0 0 • Conditioning on (a descendant of) a collider opens the E→D←U path, and U becomes associated with E. 0.6 0 2 3 E • U now acts a confounder for E→D. Selected if D<= 2.5 + 0.0*E Selection depends on: Strength of E→D. Strength of U→D Unmatched Case-Control Example of non-causal confounding May-16 1 H.S. 72 Selection bias summing up Concept 1 Concept 2 S S smoke age sex age CHD smoke CHD Selected differ from unselected in E-D effects Selected differ from unselected in E-D distributions Interaction Collider bias “natural” effects “surprising” effects Report stratum effects Adjust Quite different concepts May-16 H.S. 73 MORE ON DAGs May-16 H.S. 74 Back door, front door, D-separated Paths from E to D, all are “leaving” E Paths open before conditioning: E back door . non-causal open front door . causal open need to close Plus paths closed at a collider If all paths from A to B are closed d-separated Pearl 2000 May-16 H.S. 75 3 strategies for estimating causal effects C • Back-door criterion – Condition to close all no-causal paths E (between E and D) D U • Front-door criterion – Condition an all intermediate variables M1 M2 E (between E and D) • Instrumental Variables – – 1. 2. 3. IV IV must affect E No direct IV-D effect IV and D no common causes U 3 Use an IV to control the effect (of E on D) IV criteria: 1 D E D 2 Pearl 2009, Glymor and Greenland, 2008 May-16 May-16 H.S. H.S 76 76 Example: front–door criterion • Weight and Coronary Heart Disease U lifestyle • Assume: C – adjusted for sex, age and smoke E – lifestyle is unmeasured – no other mediators (between E and D) cholesterol weight B D CHD blood pressure • Can estimate effect of E on D Path 1 E→C→D 2 E→B→D 3 E←U→D • Type Causal Causal Non-causal Status Open Open Open Crude Difference = causal Adjusted for B and C Weight is not a good “action” May-16 H.S. 77 Confounding versus selection bias Path: Any trail from E to D (without repeating itself) Open non-causal path = biasing path Confounding and selection bias not always distinct May use DAG to give distinct definitions: C A E A B D Causal K B A B E D Confounding: E D Selection bias: Non-causal path without colliders Non-causal path open due to conditioning on a collider Note: interaction based selection bias not included Hernan et al, A structural approach to selection bias, Epidemiology 2004 May-16 H.S. 78 Testable implications The DAG implies: C is independent of D given E and O O Regress D on E, C and O, if the C coefficient is different from zero we reject the DAG or rather add the arrow. DAGitty gives a list of testable implications May-16 H.S. Textor et al. 2011 79 DEFINITIONS May-16 H.S. 80 Causal graphs: definitions • Causal graph – Graph showing causal relations and conditional independencies between variables • G={V,E} – Vertices=random variables – Edges=associations or cause • Edges undirected or → directed L A Y U • Path – Sequence of connected edges: – Parent → child – Ancestors → → descendants • Exogenous: variables with no parents May-16 H.S. [(L,A),(A,Y)] U 81 D irected A cyclic G raphs • Ordinary DAG – Arrows = associations L • Causal DAG A Y U 1. Arrows = cause 2. All common causes of any pair of variables in the DAG are included • Two types of variables – Immutable – Mutable sex, age exposure (actions), smoking • Mixing variables in a DAG is OK – All dependence/independence conclusions valid May-16 H.S. 82 Variables and arrows • Variable at least two values • cause, almost any causal definition will work • ED usually on the individual level, “at least one subject with an effect of the exposure” • ? age only possible on group level • ED +/-, the dose response can be linear, threshold, U-shaped or any other (DAGs are non-parametric) May-16 H.S. 83 DAG units DAG units individuals populations C gene (almost) No C D variable can influence a gene in an individual No confounding May-16 gene D A variable can influence gene frequency in a population H.S. 84 D-separation, moralization • Directed graph-separation – two variables d-separated – otherwise d-connected if no open path • 2 DAG analyses – Paths (Pearl) – Moralization (Lauritzen) – equivalent May-16 H.S. 85 DAGs and probability theory May-16 H.S. 86 DAGs rules and statistical independence DAG correct? • Two assumptions 1. Compatibility: – Faithfulness: – = separated independent separated independent connected dependent 2. Weak faithfulness: connected variables may be dependent B A B Y A Y Pearl 2009, Glymor and Greenland, 2008 May-16 H.S. 87 LIMITATIONS, PROBLEMS AND EXTENSIONS OF DAGS May-16 H.S. 88 Limitations and problems of DAGs • New tool relevance debated, focus on causality • Focus on bias precision also important • Bias or not direction and magnitude • Interaction scale dependent • Static may include time varying variables • Simplified infinite causal chain • Simplified do not capture reality May-16 H.S. 89 DAG focus: bias, not precision C Should we adjust for C? DAG: no bias from C, need not adjust E D May include C to improve precision, depends on model! Linear regression Logistic regression crude crude adjusted adjusted .9 1 1.1 Effect of E on D with 95% CI 1.2 1.4 1.6 Effect of E on D with 95% CI Including C: better precision Including C: worse precision OR not collapsible Robinson and Jewell 1991; Xing and Xing 2010 May-16 H.S. 90 Signed DAGs and direction of bias M + + E D X Y Positive or negative bias from confounding by U? Neg True Pos E→D on average Average monotonic effect +- for all Y=y Distributional monotonic effect U To find direction of bias, multiply signs: Need distributional monotonic effects except at each end Positive bias from this confounding May-16 H.S. VanderWeele, Hernan & Robins, 2008 91 Size of bias from unmeasured U C A Y U Assume: Difference in the distribution of U for two levels for A: a1 ,a0 , does not vary with C Assume: Difference in expected value of Y for two levels of U : u1 ,u0 , does not vary with A and C γ = 𝐸 𝑌 𝑢1 − 𝐸 𝑌 𝑢0 if linear model 𝛼 = 𝑃 𝑢 𝑎1 − 𝑃 𝑢 𝑎0 γ = 𝐸 𝑌 𝑢1 /𝐸 𝑌 𝑢0 Bias = 𝛼 ∗ 𝛾 Bias = 1+ 𝛾−1 𝑃 1+ 𝛾−1 𝑃 if linear model 𝑢 𝑎1 𝑢 𝑎0 Stata: episens May-16 if RR model H.S. if RR model VanderWeele & Arah 2011 92 Interaction in DAGs DAG Causal pie Extended DAG Mechanisms C C D + C = E C E E,C E E C D E Red arrow = interaction Specify scale VanderWeele and Robins 2007 May-16 H.S. 93 DAGs and time processes DAGs often static, but may have time varying A1, A2,… Want total effect of A-s, Time Dependent Confounding DAG Process graph HDL A1 A2 HDL CHD Alcohol CHD The process graph is simpler but less specific The process graph allows feedback loops and has a clear time component Aalen et al. 2012 May-16 HS 94 Infinite causal chain U E D the Most paths involving variables back in the chain (U) will be closed May-16 H.S. 95 DAGs are simplified DAGs are models of reality must be large enough to be realistic, small enough to be useful May-16 H.S. 96 Summing up • Data driven analyses do not work. Need causal information from outside the data. • DAGs are intuitive and accurate tools to display that information. • Paths show the flow of causality and of bias and guide the analysis. • DAGs clarify concepts like confounding and selection bias, and show that we can adjust for both. Better discussion based on DAGs Draw your assumptions before your conclusions May-16 H.S. 97 Recommended reading • Books – – – – – Hernan, M. A. and J. Robins. Causal Inference. Web: Rothman, K. J., S. Greenland, and T. L. Lash. Modern Epidemiology, 2008. Morgan and Winship, Counterfactuals and Causal Inference, 2009 Pearl J, Causality – Models, Reasoning and Inference, 2009 Veierød, M.B., Lydersen, S. Laake,P. Medical Statistics. 2012 • Papers – Greenland, S., J. Pearl, and J. M. Robins. Causal diagrams for epidemiologic research, Epidemiology 1999 – Hernandez-Diaz, S., E. F. Schisterman, and M. A. Hernan. The birth weight "paradox" uncovered? Am J Epidemiol 2006 – Hernan, M. A., S. Hernandez-Diaz, and J. M. Robins. A structural approach to selection bias, Epidemiology 2004 – Berk, R.A. An introduction to selection bias in sociological data, Am Soc R 1983 – Greenland, S. and B. Brumback. An overview of relations among causal modeling methods, Int J Epidemiol 2002 – Weinberg, C. R. Can DAGs clarify effect modification? Epidemiology 2007 May-16 H.S. 98 References 1 • Aalen OO, Roysland K, Gran JM, Ledergerber B. 2012. Causality, mediation and time: A dynamic viewpoint. Journal of the Royal Statistical Society Series A 175:831-861. • Chen L, Davey SG, Harbord RM, Lewis SJ. 2008. Alcohol intake and blood pressure: A systematic review implementing a mendelian randomization approach. PLoS Med 5:e52. • Daniel RM, Cousens SN, De Stavola BL, Kenward MG, Sterne JAC. 2013. Methods for dealing with time-dependent confounding. Statistics in Medicine 32:1584-1618. • Greenland S, Schlesselman JJ, Criqui MH. 1987. Re: "The fallacy of employing standardized regression coefficients and correlations as measures of effect". AJE 125:349-350. • Greenland S, Robins JM, Pearl J. 1999. Confounding and collapsibility in causal inference. Statistical Science 14:29-46. • Greenland S, Brumback B. 2002. An overview of relations among causal modelling methods. Int J Epidemiol 31:1030-1037. • Greenland S, Mansournia MA. 2015. Limitations of individual causal models, causal graphs, and ignorability assumptions, as illustrated by random confounding and design unfaithfulness. Eur J Epidemiol. • Greenland SM, Malcolm; Schlesselman, James J.; Poole, Charles; Morgenstern, Hal. 1991. Standardized regression coefficients: A further critique and review of some alternatives. Epidemiology 2:6. • Hafeman DM, Schwartz S. 2009. Opening the black box: A motivation for the assessment of mediation. International Journal of Epidemiology 38:838-845. • Hernan MA, Hernandez-Diaz S, Werler MM, Mitchell AA. 2002. Causal knowledge as a prerequisite for confounding evaluation: An application to birth defects epidemiology. AJE 155:176-184. • Hernan MA, Hernandez-Diaz S, Robins JM. 2004. A structural approach to selection bias. Epidemiology 15:615-625. • Hernan MA, Cole SR. 2009. Causal diagrams and measurement bias. AJE 170:959-962. • Hernan MA, Clayton D, Keiding N. 2011. The simpson's paradox unraveled. Int J Epidemiol. • Hintikka J, Tolmunen T, Honkalampi K, Haatainen K, Koivumaa-Honkanen H, Tanskanen A, et al. 2005. Daily tea drinking is associated with a low level of depressive symptoms in the finnish general population. European Journal of Epidemiology 20:359-363. • Lange T, Hansen JV. 2011. Direct and indirect effects in a survival context. Epidemiology 22:575-581. • Mansournia MA, Hernan MA, Greenland S. 2013. Matched designs and causal diagrams. International Journal of Epidemiology 42:860-869. • McCaffrey DF, Ridgeway G, Morral AR. 2004. Propensity score estimation with boosted regression for evaluating causal effects in observational studies. Psychological Methods 9:403-425. • Myrstad M, Lochen ML, Graff-Iversen S, Gulsvik AK, Thelle DS, Stigum H, et al. 2014a. Increased risk of atrial fibrillation among elderly norwegian men with a history of long- term endurance sport practice. Scand J Med Sci Spor 24:E238-E244. May-16 H.S. 99 References 2 • Pearl J. 2000. Causality: Models, reasoning, and inference. Cambridge:Cambridge Univeristy Press. • Pearl J. 2012. The causal mediation formula-a guide to the assessment of pathways and mechanisms. Prev Sci 13:426-436. • Robins JM, Greenland S. 1992. Identifiability and exchangeability for direct and indirect effects. Epidemiology 3:143-155. • Robins JM. 2001. Data, design, and background knowledge in etiologic inference. Epidemiology 12:313-320. • Robinson LD, Jewell NP. 1991. Some surprising results about covariate adjustment in logistic-regression models. Int Stat Rev 59:227-240. • Rothman KJ, Greenland S, Lash TL. 2008. Modern epidemiology. Philadelphia:Lippincott Williams & Wilkins. • Shahar E. 2009. Causal diagrams for encoding and evaluation of information bias. Journal of evaluation in clinical practice 15:436-440. • Shahar E, Shahar DJ. 2012. Causal diagrams and the logic of matched case-control studies. Clinical epidemiology 4:137-144. • Sheehan NA, Didelez V, Burton PR, Tobin MD. 2008. Mendelian randomisation and causal inference in observational epidemiology. PLoS Med 5:e177. • Textor J, Hardt J, Knuppel S. 2011. Dagitty a graphical tool for analyzing causal diagrams. Epidemiology 22:745-745. • VanderWeele TJ, Robins JM. 2007. Directed acyclic graphs, sufficient causes, and the properties of conditioning on a common effect. AJE 166:1096-1104. • VanderWeele TJ, Hernan MA, Robins JM. 2008. Causal directed acyclic graphs and the direction of unmeasured confounding bias. Epidemiology 19:720-728. • VanderWeele TJ. 2009. Mediation and mechanism. Eur J Epidemiol 24:217-224. • VanderWeele TJ, Arah OA. 2011. Bias formulas for sensitivity analysis of unmeasured confounding for general outcomes, treatments, and confounders. Epidemiology 22:42-52. • VanderWeele TJ, Hernan MA. 2012. Results on differential and dependent measurement error of the exposure and the outcome using signed directed acyclic graphs. AJE 175:1303-1310. • VanderWeele TJ. 2014. A unification of mediation and interaction: A 4-way decomposition. Epidemiology 25:749-761. • Veieroed M, Lydersen S, Laake P. 2012. Medical statistics in clinical and epidemiological research. Oslo:Gyldendal Akademisk. • Westreich D, Greenland S. 2013. The table 2 fallacy: Presenting and interpreting confounder and modifier coefficients. AJE 177:292-298. • Xing C, Xing GA. 2010. Adjusting for covariates in logistic regression models. Genet Epidemiol 34:937-937. May-16 H.S. 100 May-16 H.S. 101 Extra material May-16 H.S. 102 MEDIATION ANALYSIS Hafeman and Schwartz 2009; Lange and Hansen 2011; Pearl 2012; Robins and Greenland 1992; VanderWeele 2009, 2014 May-16 H.S. 103 Why mediation analysis? • Have found a cause • How does it work? M A May-16 direct effect 𝑇𝑜𝑡𝑎𝑙 𝑒𝑓𝑓𝑒𝑐𝑡 = 𝑖𝑛𝑑𝑖𝑟𝑒𝑐𝑡 + 𝑑𝑖𝑟𝑒𝑐𝑡 Y 𝑖𝑛𝑑𝑖𝑟𝑒𝑐𝑡 𝑀𝑒𝑑𝑖𝑎𝑡𝑒𝑑 𝑝𝑟𝑜𝑝𝑜𝑟𝑡𝑖𝑜𝑛 = 𝑡𝑜𝑡𝑎𝑙 H.S. 104 Classic approach: controlling May-16 H.S. 105 Controlled Direct effect Direct effect: Effect of statin on CHD “for the same cholesterol” Fixed M Fixed M: controlled direct effect CDE=E(Y|A=1,M=m) - E(Y|A=0,M=m) m M cholesterol A statin Y CHD Problems 1. Conceptual: Can we fix cholesterol levels? 2. Technical: A*M Interaction? 3. (Technical: non-linear models?) 0/1 Solution? Robins and Greenland 1992; VanderWeele 2009 May-16 H.S. 106 New approach: counterfactuals May-16 H.S. 107 Counterfactual causal effect • Two possible outcome variables – Outcome if treated: – Outcome if untreated: Y1 Y0 Counterfactuals Potential outcomes • Causal effect – Individual: – Average: Y1i-Y0i E(Y1)-E(Y0) or other effect measures Fundamental problem: either Y1 or Y0 is missing Hernan 2004 May-16 H.S. 108 Total causal effect, TCE Potential (counterfactual) outcomes: Effect of statin on CHD Y1 is the outcome if A is set to 1 M1 is the mediator if A is set to 1 M1 M0 M cholesterol A statin A set to 0 M0 Y1 Y0 𝑇𝐶𝐸 = 𝐸 𝑌1 Y CHD − 𝐸 𝑌0 = 𝐸 𝑌1,𝑀1 − 𝐸 𝑌0,𝑀0 0/1 May-16 A set to 1 M1 H.S. 109 Natural Direct effect Direct effect: Effect of statin on CHD “for the same cholesterol” M1 M0 M statin 0/1 May-16 A set to 0 M0 Natural Direct Effect: Keep M at M0 cholesterol A A set to 1 M0 Y CHD 𝑁𝐷𝐸 = 𝐸 𝑌1,𝑀0 − 𝐸 𝑌0,𝑀0 Takes care of the 3 earlier problems: 1. Don’t need to fix M=m 2. OK for interactions 3. (OK for non-linear models) in 4 slides H.S. 110 Exercise: nested counterfactuals • Write counterfactual outcomes for: – All are treated (A set to 1), the mediator is fixed at m – All are untreated (A set to 0), the mediator is fixed at m – All are treated, the mediator is at its natural distribution if all are untreated – All are untreated, the mediator is at its natural distribution if all are untreated 5 minutes May-16 H.S. 111 Natural Indirect effect Indirect effect: Effect of statin via cholesterol on CHD “for the same statin” M1 M set to M1 A=1 M0 Natural Indirect Effect: Keep A at 1 M cholesterol A statin 0/1 M set to M0 A=1 𝑁𝐷𝐸 = 𝐸 𝑌1,𝑀1 − 𝐸 𝑌1,𝑀0 Y CHD Why keep A at 1? then direct + indirect = total May-16 H.S. 112 Natural direct and indirect effects 1 Natural Direct Effect: 𝑁𝐷𝐸 = 𝐸 𝑌1,𝑀0 − 𝐸 𝑌0,𝑀0 A=1 vs. 0 for M=M0 Natural Indirect Effect: 𝑁𝐼𝐸 = 𝐸 𝑌1,𝑀1 − 𝐸 𝑌1,𝑀0 M=M1 vs. M0 for A=1 Total Causal Effect: 𝑇𝐶𝐸 = 𝐸 𝑌1,𝑀1 − 𝐸 𝑌0,𝑀0 = 𝑁𝐷𝐸 + 𝑁𝐼𝐸 May-16 H.S. 113 Binary outcome, RR effect measure Natural Direct Effect: 𝑁𝐷𝐸𝑅𝑅 = 𝑃 𝑌1,𝑀0 =1 𝑃 𝑌0,𝑀0 =1 A=1 vs. 0 for M=M0 Natural Indirect Effect: 𝑁𝐼𝐸𝑅𝑅 = 𝑃 𝑌1,𝑀1 =1 𝑃 𝑌1,𝑀0 =1 M=M1 vs. M0 for A=1 Total Causal Effect: 𝑃 𝑌1,𝑀1 = 1 𝑇𝐶𝐸𝑅𝑅 = = 𝑁𝐷𝐸𝑅𝑅 ∙ 𝑁𝐼𝐸𝑅𝑅 𝑃 𝑌0,𝑀0 = 1 May-16 H.S. 114 Example: Labor marked discrimination Are Emily and Greg more employable than Lakisha and Jamal? Names CV White: Emely Walsh, Greg Baker Af.-am.: Lakisha Washington, Jamal Jones CV: Job name low/high quality Job: callback Experiment: 5000 CV with random name type Boston and Chicago, 2002? Result: White names: African-am: 10 CV-s before callback 15 CV-s before callback Bertrand and Mullainathan 2004 May-16 H.S. 115 Exercise: Labor marked discrimination 1. Describe the experiment for estimating the controlled direct effect 2. Describe the experiment for estimating the natural direct effect CV Job name 5 minutes May-16 H.S. 116 Assumptions U2 Classic method: Linear models No A-M interaction M U3 P Y A Classic and new: No unmeasured confounders (U1, U2, U3,) No treatment dependent confounders (P) U1 VanderWeele 2009 May-16 H.S. 117 Summing up (so far) M • Classic method – Linear models, no A-M interaction Y A • New approach – Linear/non-linear, with/without A-M interaction – Natural Direct Effect • effect of A keeping M at its natural distribution – More assumptions • Mediation > Total • Natural > Controlled May-16 H.S. 118 Mediation analysis May-16 H.S. 119 Highlights • Classic decomposing into direct and indirect effects fail: – Do not account for interaction – Not meaningful decomposition in non-linear models • New methods define natural direct and indirect effects – Need special (often limited) software for estimation • A 4-way decomposition of mediation and interaction – Theoretical insight – Estimate with standard regression models* – * use delta or bootstrap for confidence intervals – All types of exposures and mediators May-16 H.S. 120 Classic method Classic method (Baron & Kenny): Total effect: regress Y on A Direct effect: regress Y on A and M Indirect effect: Total – Direct M Y A • Only OK for – Linear models – No A*M interaction – Strong no-confounding assumptions May-16 H.S. 121 New methods Estimate natural direct and indirect effects (Stata) D M Multiple M Sensitivity Paramed continuous binary count continuous binary no no Idecomp binary any yes no continuous binary continuous binary no yes any any ? ? (Lange-2011) Time-to-event continuous no no (Lange-2012) any any no no Medeff Gformula* * Only for treatment dependent confounding May-16 H.S. 122 Mediation analysis: 4-WAY DECOMPOSITION May-16 H.S. 123 Notation Counterfactuals if A is set to a, M is set to m: 𝑌𝑎 M 𝑀𝑎 𝑌𝑎𝑚 A Y For simplicity assume binary A and M: 𝑌10 (general results available) a May-16 H.S. m 124 Interaction on additive scale M No additive interaction: 𝑅𝐷11 = 𝑅𝐷10 + 𝑅𝐷01 Y A 𝑌11 − 𝑌00 = 𝑌10 − 𝑌00 + 𝑌01 − 𝑌00 𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 = 0 Measure of additive interaction: 𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 May-16 H.S. 125 Interaction in DAGs DAG Causal pie Extended DAG Mechanisms M M Y + M = A M A A,M A A M Y A Red arrow = interaction Specify scale VanderWeele and Robins 2007 May-16 H.S. 126 4 way decomposition TE CDE INTref INTmed PIE M = total effect = controlled direct effect = interaction effect at reference = interaction*mediator effect = pure indirect effect Y A Can always decompose the total effect into 4: 𝑌1 − 𝑌0 = Mediation Interaction TE (𝑌10 −𝑌00 ) CDE - - +𝑀0 (𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 ) INTref - + +(𝑀1 − 𝑀0 )(𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 ) INTmed + + +(𝑀1 − 𝑀0 )(𝑌01 − 𝑌00 ) PIE + - May-16 H.S. 127 Individual 4 mechanisms TE CDE INTref INTmed PIE M = total effect = controlled direct effect = interaction effect at reference = interaction*mediator effect = pure indirect effect Y A If AY for one subject then one of 4 mechanisms will work: M I (𝑌10 −𝑌00 ) AY|M=0 - 𝑀0 (𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 ) M0=1 & M*A inter. - + (𝑀1 − 𝑀0 )(𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 ) AM & M*A inter. + + (𝑀1 − 𝑀0 )(𝑌01 − 𝑌00 ) AM & MY|A=0 + - May-16 H.S. 128 - Interaction in DAGs Extended DAG Mechanisms M M A,M A Y A VanderWeele and Robins 2007 May-16 H.S. 129 Individual 4 mechanisms 1 M 2 M A,M A A M Y 3 M A,M A M Y A 4 M A,M A M Y A M A,M A Y A If AY for one subject then one of 4 mechanisms will work: M I (𝑌10 −𝑌00 ) AY|M=0 - 𝑀0 (𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 ) M0=1 & M*A inter. - + (𝑀1 − 𝑀0 )(𝑌11 − 𝑌10 − 𝑌01 + 𝑌00 ) AM & M*A inter. + + (𝑀1 − 𝑀0 )(𝑌01 − 𝑌00 ) AM & MY|A=0 + - May-16 H.S. 130 - 4 and 2 way decompositions Mediation: - - + + TE = CDE + INTref + INTmed + PIE TE =Natural Direct + Natural Indirect May-16 H.S. 131 Notation: Identification (estimate from data) Counterfactuals if A is set to a, M is set to m: M 𝑌𝑎 𝑀𝑎 For simplicity assume binary A and M: (general results available) 𝑌𝑎𝑚 𝑌10 Y A Consistency: Consistency assumption: Composition assumption: 𝐼𝑓 𝐴 = 𝑎 𝑎𝑛𝑑 𝑀 = 𝑚 𝑡ℎ𝑒𝑛 𝑌𝑎𝑚 = 𝑌 𝑌𝑎 = 𝑌𝑎𝑀𝑎 U2 Confounding: 4 assumptions needed for estimation: No unmeasured confounding (U1-U3) No confounders affected by A (P) M U3 P Y A U1 May-16 H.S. 132 Y and M in linear regression models Assume Y and M continuous and following: M 𝐸 𝑌 𝑎, 𝑚, 𝑐 = 𝜃0 + 𝜃1 𝑎 + 𝜃2 𝑚 + 𝜃3 𝑎𝑚 + 𝜃4′ 𝑐 𝐸 𝑀 𝑎, 𝑐 = 𝛽0 + 𝛽1 𝑎 + 𝛽2′ 𝑐 Y A then 𝐸 𝐶𝐷𝐸 𝑐 = 𝜃1 (𝑎 − 𝑎∗ ) 𝐸 𝐼𝑁𝑇𝑟𝑒𝑓 𝑐 = 𝜃3 (𝛽0 + 𝛽1 𝑎∗ + 𝛽2′ 𝑐)(𝑎 − 𝑎∗ ) 𝐸 𝐼𝑁𝑇𝑚𝑒𝑑 𝑐 = 𝜃3 𝛽1 (𝑎 − 𝑎∗ )2 𝐸 𝑃𝐼𝐸 𝑐 = (𝜃2 𝛽1 + 𝜃3 𝛽1 𝑎∗ ) (𝑎 − 𝑎∗ ) a=1 and a*=0 would simplify further May-16 H.S. 133 Binary outcomes and ratio scale M Similar results based on RR (with a scaling factor) Y A Example: Smoking Gene Lung Cancer 15q25.1 rs8034191 C alleles May-16 H.S. 134 Non-collapsibility of the odds ratio May-16 H.S. 135 Non-collapsibility of the OR Population D C OR=10 E A D 1 0 1 0 sum 470 1 775 530 7 225 1 000 9 000 10 000 OR= 3.6 C=0 A 0 D 1 0 sum 105 225 395 4 275 500 4 500 5 000 OR= 5.1 May-16 0.89 0.25 C=1 D 1 odds odds 0.27 0.05 1 A0 H.S. 1 0 sum 365 1 550 135 2 950 500 4 500 5 000 OR= 5.1 odds 2.70 0.53 136 Non-collapsibility of the OR C OR=10 E D Non-collapsibility depends on frequency of D Logistic regression crude adjusted D=22% : Not collapsible crude adjusted D=6% : Appr. collapsible crude adjusted D=1% : Collapsible 2 May-16 3 4 5 OR for E on D H.S. 6 7 137 Information bias Hernan and Cole 2009; Shahar 2009; VanderWeele and Hernan 2012 May-16 H.S. 138 Depicting measurement error a) • UE – E=true exposure – E*=measured exposure – UE=process giving error in E E* b) Error in E E D UE UD E* D* E D • Error in E and D – D=true disease – D*=diagnosed disease – UD=process giving error in D a) and b) Can test H0 b) shows independent non-differential errors May-16 H.S. 139 Dependent errors, differential errors a)a)a) b)b)b) c)c)c) UED UED UED UEUEUE UDUDUD UEUEUE UDUDUD UEUEUE UDUDUD E*E*E* D*D* D* E*E*E* D*D* D* E*E*E* D*D* D* EEE EEE EEE DDD Dependent errors: Temp. in lab DDD Differential error: Recall bias in Case-Control study DDD Differential error: Investigator bias in cohort study Hernan and Cole, Causal Diagrams and Measurement Bias, AJE 2009 May-16 H.S. 140 Exercise: Hair dye and congenital malformations We study the effect of hair dye (E) during pregnancy on malformations (D) in the baby in a traditional case–control study. Mothers are asked after birth how often they dyed their hair during pregnancy. 1. Draw a DAG of the situation were mothers do not recall exactly how often they dyed their hair, and were the recall is different for mothers with malformed babies. Use E for the correct amount of hair dye, and E* for the reported. Malformations are assumed to be without misclassification. 2. Show the paths for the effect of E on D. Will there be a bias? 3. Show the paths for the effect of E* on D. Will there be a bias? 4. Can E* be associated with D even if E→D is zero? 10 minutes May-16 H.S. 141 Selection bias depicted May-16 H.S. 142 Simplified example Selection S IQ • (common understanding for continuous and binary variables) • Focus: selection types and bias patterns IQ • Selection in quadrants 1 • Stratification Selection 0 EMF -1 D -2 E 2 E and D continuous, Z, normal True effect of E on D: =0 -2 0 EMF 2 • (Clarity over realism) May-16 HS 143 22 1) “Exclusive or” selection S=5% -0.5 0.5 0.0 -1 00 IQ 11 S=95% S=95% -2 S=5% -2 -2 May-16 -1-1 0 0 EMF H.S. 1 1 2 2 144 2 2) “Inclusive or” selection S=95% 1 S=95% 0.0 0.1 -1 0 IQ -0.2 S=95% -2 S=5% -2 May-16 -1 0 Parent income H.S. 1 2 145 2 3) “And” selection S=95% 0.2 0.0 -0.2 -1 0 1 S=5% S=4% -2 S=5% -2 May-16 -1 0 Fluoride in water HS 1 2 146 2 4) “Gradient” selection S=95% 1 S=50% 0.0 0 IQ -0.1 -1 -0.1 S=50% -2 S=4% -2 May-16 -1 0 EMF H.S. 1 2 147 Z-scores May-16 H.S. 148 Birth weight paradox • Results: – – • Possible explanation: – • Maternal smoking increases neonatal mortality overall Maternal smoking decreases neonatal mortality among low birth weight M birth weight E smoke U D neonatal mort conditioning on M opens collider path via U Some advocate standardizing birth weight with respect to smoking, i.e. Z-scores May-16 H.S. 149 Z-scores 𝑍𝑖𝑗 = 𝑏𝑖𝑟𝑡ℎ 𝑤𝑒𝑖𝑔𝑡ℎ𝑖𝑗 −𝑚𝑗 𝑠𝑑𝑗 𝑗 = 0,1 (𝑠𝑚𝑜𝑘𝑒) 0 E(smoke) independent of Z All open paths between E and Z must E→Z E→M→Z birth weight -4 -2 0 2 4 Z-scores of birth weight by smoke U Adjusting for Z estimates the total effect of E on D Z E smoke No gain, crude model also estimates the total effect of E on D D neonatal mort “unfaithful” May-16 6000 sum to “null” when we condition on Z M 2000 4000 Birth weight by smoke H.S. 150 Z-scores • Should not adjust for gestational age: – – • G gest. age removes part of the effect of exposure opens up a collider path involving U E toxicant U D birth weight Some advocate standardizing birth weight with respect to gestational age, i.e. Z-scores – but this also represents some type of adjustment for gestational age May-16 H.S. 151