Propensity Scores in Clinical Research: Laurel Beckett, PhD Daniel Tancredi, PhD

advertisement
Outline
Setting
Controlling for selection bias
Examples
Propensity
scores
Profs. Beckett &
Tancredi
Propensity Scores in Clinical Research:
Outline
Setting
Ideal case
Real world
Laurel Beckett, PhD
Daniel Tancredi, PhD
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
University of California, Davis
July 14, 2014
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
Outline
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Controlling for selection bias
Regression adjustment
Propensity scores
Examples
Examples
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Introduction to propensity scores
Propensity
scores
Profs. Beckett &
Tancredi
Propensity scores are a tool for helping to control for bias
due to heterogeneity and imbalance in comparative
clinical studies.
Propensity scores have been used extensively in studies
in anesthesiology and critical care, but the quality of
reporting of these studies needs improvement (Gayat
2010 PMID: 20689924).
The goal today is to provide an introduction and show
some examples of propensity scores as used in
anesthesiology and critical care studies.
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Educational objectives for talk
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
You should be able to:
Ideal case
Real world
I
Describe the study setting in which propensity scores
are most helpful.
Controlling for
selection bias
I
Explain how propensity scores differ from regression
adjustment.
Examples
I
List several ways propensity scores may be used to
correct for bias.
Profs. Beckett & Tancredi
Propensity scores
Regression adjustment
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
The ideal: direct comparison
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Suppose a physician wants to compare Treatment A to
Treatment B.
The question for patient care is: would the patient’s
outcome be better with A or B?
Ideally, we would answer this question by comparing the
two treatments directly in the same patients, and seeing
which worked better.
Profs. Beckett & Tancredi
Propensity scores
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Example of direct comparison
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Sometimes you can actually do direct comparisons in the
same patient:
I
Topical treatments (left vs. right arm)
I
Treatment for chronic pain (crossover design)
Example: high-flow O2 vs. placebo (air) for treating
cluster headaches.
Crossover design (Cohen, JAMA 2009 PMID: 19996400).
Profs. Beckett & Tancredi
Propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Study results (simplified)
Propensity
scores
Profs. Beckett &
Tancredi
A total of 76 patients had both treatments.
I
24% or n=18 had pain relief at 30 min with air
I
72% or n=55 had pain relief at 30 min with O2
I
They don’t say how many had relief with both.
Suppose it was 10 (14%).
I
That means 8 had relief only with air and 45 only with
O2 .
I
This imbalance is very unlikely to be due to chance.
The study presents strong evidence that O2 works better
in most people.
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Simulated data: differences in pain relief
Propensity
scores
Profs. Beckett &
Tancredi
Outline
This plot shows differences
in pain relief in a hypothetical
study where each patient
tried both A and B. A lot of
scatter here, but the mean
level of pain relief is better
with Treatment A (dashed
line).
Profs. Beckett & Tancredi
Propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Simulated data: adding a confidence interval
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
A 95% confidence interval for
the difference shows that the
mean favors A more than
chance alone.
Profs. Beckett & Tancredi
Propensity scores
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
You can’t always compare directly
Propensity
scores
Profs. Beckett &
Tancredi
Outline
What if you can’t compare directly?
Setting
I
Sometimes you can only treat once (post-op pain).
I
Sometimes you can only measure the outcome once
(time to death).
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
I
Then you have to compare a group of patients
treated just with A to a group treated just with B.
The problem: Patients vary in how they respond to A and
B! (Some patients actually respond to placebo but not to
O2 , some to both, some to neither).
Profs. Beckett & Tancredi
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Approach 1: randomized clinical trials
Propensity
scores
Profs. Beckett &
Tancredi
If you can’t treat each person with both A and B, you can
randomly assign to treatment groups.
I
I
Every patient had same chance to get treatment A.
Ensures that differences between groups are either
due to treatment or to chance.
I
We can calculate how likely it is that differences are
just chance.
I
Very unlikely that a post-op O2 group would get all
the easy-to-treat people and the placebo group all
the hard-to-treat ones.
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
A key feature of randomized trials
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
With randomization, we can obtain unbiased estimates of
the mean difference between outcome for a patient given
A and the same patient given B.
Thus randomization makes up for our inability to test both
treatments in the same patient.
Profs. Beckett & Tancredi
Propensity scores
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
You can’t always get what you want
Propensity
scores
Profs. Beckett &
Tancredi
Outline
You can’t always do a randomized clinical trial! It may be
difficult or impossible:
In emergencies
I
If a control group would be unethical
Real world
Regression adjustment
if informed consent is difficult
If you can’t do a randomized trial, you can still compare
observational data, but with caution.
Profs. Beckett & Tancredi
Ideal case
Controlling for
selection bias
I
I
Setting
Propensity scores
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
When treatment selection is not randomized
Propensity
scores
Profs. Beckett &
Tancredi
Outline
In observational studies, investigators do not control
treatment assignment.
I
I
You lose the protection given by randomization.
Study arms may differ considerably:
I
I
I
in observed characteristics of patients
and also in unmeasured characteristics.
Such imbalance can lead to selection bias.
Profs. Beckett & Tancredi
Propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Ideal case
Real world
Selection bias
Propensity
scores
Profs. Beckett &
Tancredi
Outline
If certain kinds of patients are differentially assigned to
Treatment A compared to B, we have selection bias.
Suppose, for example, pain patients in ER who had poor
response to other treatment were more likely to be
assigned to O2 .
Our estimate of the difference in outcome for Treatment A
vs B would be biased.
What can we do?
Profs. Beckett & Tancredi
Propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Controlling for selection bias
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Two approaches are commonly used to control for
selection bias:
I
I
Adjustment via regression modeling of outcome
Assessing probability of different treatments and
controlling via propensity scores.
Profs. Beckett & Tancredi
Propensity scores
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Regression adjustment for selection bias
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Basic assumptions in regression adjustment include that
I
the treatment groups differ in known prognostic
factors,
I
outcome is related to those factors, and
I
there are not unmeasured confounders.
We then use regression models to estimate the average
effect of treatment on outcome “conditional on" the values
of the prognostic factors.
Profs. Beckett & Tancredi
Propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Regression models used for adjusting for bias
Propensity
scores
Profs. Beckett &
Tancredi
The type of regression model we use depends on the
outcome variable.
I
I
I
If the outcome is quantitative, like a pain scale, we
usually use linear regression.
If the outcome is dichotomous, like breakthrough
pain, we use logistic regression.
If the outcome is censored survival data, like time to
needing surgery, we use proportional hazards
models or other survival models.
We usually include prognostic factors through a linear
combination of variables.
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Interpretation of regression models
Propensity
scores
Profs. Beckett &
Tancredi
The regression model constructs a prognostic scale X
(like the Apache score). We assume:
Outline
Setting
Ideal case
Real world
I
Average outcome gets worse as you move from one
end of scale to other.
I
The observed difference in mean outcome between
treatments for people who are at the same place on
the prognostic scale is unbiased.
I
The difference doesn’t depend on where you are on
the prognostic scale.
Profs. Beckett & Tancredi
Propensity scores
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Simulated data: regression model for pain
relief
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
This plot shows association
between prognostic score (X
axis) and pain relief (Y axis).
A lot of scatter but Treatment
A seems a little better at all
scores.
Profs. Beckett & Tancredi
Propensity scores
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Simulated data: regression estimates shown
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
This plot shows regression
lines for each treatment.
Prognostic score predicts
outcome. Treatment A has
about 3 points greater relief
at all scores.
Profs. Beckett & Tancredi
Propensity scores
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Propensity score methods
Propensity
scores
Profs. Beckett &
Tancredi
The propensity score approach assumes that the
treatment groups differ in known covariates, and develops
a score Z predicting how likely a person is to get
Treatment A. We assume:
I
The likelihood of getting treatment A is very similar
for all people with the same value of the propensity
score Z.
I
People with the same propensity score Z are like a
little tiny randomized trial of A vs B.
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Successful propensity scoring
Propensity
scores
Profs. Beckett &
Tancredi
In a randomized trial, the difference in prognostic factors
between treatment groups is very small, on average.
Outline
Setting
Ideal case
Real world
A good propensity score would have small differences in
prognostic factors, on average, if we compared just for
people with similar propensity scores.
Each little subgroup with similar scores would look
similar, even if there is imbalance overall.
The propensity score also provides computational
advantages over large multiple regression models in
small sample size settings
Profs. Beckett & Tancredi
Propensity scores
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Effectiveness of correction for bias
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Gayat shows a nice example of correction for selection
bias.
Data are from a study on effect of epidural anesthesia on
mortality after non-cardiac surgery.
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Differences between treatments were much larger before
propensity score correction (black circles) than after (red
squares).
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Correction for selection bias (Gayat)
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Using propensity scores
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Four ways to use propensity scores to correct estimates
of treatment effect for selection bias:
I
Match patients on propensity scores
I
Stratify patients on propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
I
Adjust for propensity score via regression modeling
I
Estimate treatment differences using weights based
on propensity scores
Profs. Beckett & Tancredi
Propensity scores
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Regression adjustment
Propensity scores
Propensity score weights refer to "potential
outcome" distributions
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
In survey sampling, the basic weight for unit i is the
inverse of being selected into the sample and refers back
to the full population: wi = 1/Pr {i ∈ s}
Similarly, the basic propensity score weight refers to
either of the two populations of potential outcomes,
according to observed treatment assignment:
wi = 1/πi , if individual i received treatment
wi = 1/(1 − πi ), if individual i received comparator
Profs. Beckett & Tancredi
Propensity scores
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Two examples from the clinical literature
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
I
One example from surgery literature
I
One on anesthesia choice in prostatectomy
Profs. Beckett & Tancredi
Propensity scores
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Impact of change of surgical teams
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Brown et al (Annals Surg Feb 2011 PMID: 21173693)
report a cohort analysis of non-emergent cardiovascular
surgery outcomes. They compared:
I
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Day team (7 am to 3 pm)
Propensity scores
Examples
I
Evening team (3 pm to 11 pm)
I
Changeover team (start in day, end in evening)
They used propensity scores to correct for selection bias.
Profs. Beckett & Tancredi
Setting
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
Results from change of team study
Propensity
scores
Profs. Beckett &
Tancredi
Comparison of Day, Evening and Changeover teams:
Outline
Setting
I
I
I
Unadjusted analyses showed major outcome
differences.
But patients differed in age, ejection fraction,
comorbidity, type of procedure.
After regression adjustment for propensity score:
I
I
C teams took 20+ min longer than D and E.
C had 1.85 fold greater risk of septicemia than D.
Explanation of propensity methods was skimpy!
Profs. Beckett & Tancredi
Propensity scores
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Anesthetic choice for prostatectomy
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Wuethrich et al compared outcomes for open radical
prostatectomy using general anesthesia with or without
epidural (Anesthesiology 2010 PMID: 20683253).
Outcomes considered:
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
I
Clinical progression-free survival
I
Biochemical progression-free survival
I
Cancer-specific survival
I
Overall survival.
Profs. Beckett & Tancredi
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Analysis methods for prostatectomy study
Propensity
scores
Profs. Beckett &
Tancredi
The investigators used Cox proportional hazard models
for all outcomes.
Outline
Setting
Ideal case
They used propensity scores because patients were not
randomized to treatment.
Real world
Controlling for
selection bias
Regression adjustment
I
Distributions of scores too different in the groups to
allow matching or stratification.
I
One analysis used regression adjustment for
propensity score.
I
A second analysis adjusted by inverse probability
weights.
Profs. Beckett & Tancredi
Propensity scores
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Weighted results for prostatectomy study
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
PSA PFS: P=0.40
Clin PFS: P=0.002
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Controlling for selection bias
Examples
A cautionary note from prostatectomy study
Propensity
scores
Profs. Beckett &
Tancredi
Outline
Setting
Although the overall conclusions were similar between
the two methods used in this study, the point estimates
and p values differed a fair amount.
For example, regression adjustment gave a hazard ratio
for cancer-specific mortality of 0.95 (P=0.95) but inverse
probability weights gave a hazard ratio of 0.45 (P=0.09).
Profs. Beckett & Tancredi
Propensity scores
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Some concluding remarks
Propensity
scores
Profs. Beckett &
Tancredi
I
Read the articles carefully; Gayat gives advice on
reporting.
I
Results may be sensitive to how propensity models
are employed.
I
I
Developing propensity scores is delicate and time
consuming.
Computer software for some aspects (matching,
weighting) is not exactly push-button.
Despite the cautions, propensity scores remain a very
useful tool for addressing selection bias problem, and far
better than ignoring it!
Profs. Beckett & Tancredi
Propensity scores
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
Propensity
scores
Thank You!
Profs. Beckett &
Tancredi
Outline
Disclosures: Professor Beckett’s recent funding comes from UC
Davis, NIH grants (various), and honoraria from talks at CSUS,
University of Rochester, Modesto Community College, Albert Einstein
School of Medicine and to Roche. She serves on external advisory
boards for the Alzheimer’s Disease Centers of Albert Einstein, UC
San Diego, and Washington University, and for the Alzheimer’s
Prevention Initiative.
Prof. Tancredi’s recent funding comes from UC Davis and various
research grants sponsored by the NIH, AHRQ, the CDC and the
California Air Resources Board. In addition, he has received
consulting fees and travel support from the International Scientific
Association of Probiotics and Prebiotics.
Profs. Beckett & Tancredi
Propensity scores
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Outline
Setting
Controlling for selection bias
Examples
References
Propensity
scores
I Brown ML, Parker SE, Quiñonez LG, Li Z, Sundt TM. Can the
impact of change of surgical teams in cardiovascular surgery be
measured by operative mortality or morbidity. A propensity
adjusted cohort comparison. Ann Surg. 2011
Feb;253(2):385-92. PMID: 21173693.
I Cohen AS, Burns B, Goadsby PJ. High-flow oxygen for
treatment of cluster headache: a randomized trial. JAMA. 2009
Dec 9;302(22):2451-7. PMID: 19996400.
I Gayat E, Pirracchio R, Resche-Rigon M, Mebazaa A, Mary JY,
Porcher R. Propensity scores in intensive care and
anaesthesiology literature: a systematic review. Intensive Care
Med. 2010 Dec;36(12):1993-2003. PMID: 20689924.
I Wuethrich PY, Hsu Schmitz SF, Kessler TM, Thalmann GN,
Studer UE, Stueber F, Burkhard FC. Potential influence of the
anesthetic technique used during open radical prostatectomy on
prostate cancer-related outcome: a retrospective study.
Anesthesiology. 2010 Sep;113(3):570-6. PMID: 20683253.
Profs. Beckett & Tancredi
Propensity scores
Profs. Beckett &
Tancredi
Outline
Setting
Ideal case
Real world
Controlling for
selection bias
Regression adjustment
Propensity scores
Examples
Download