Agenda z Introduction z Example: Analysis of Non-commensurate Outcomes HRQOL after intensive care z Common approach to multiple outcomes z The latent variable model z HRQOL results z Discussion and summary Armando Teixeira-Pinto AcademyHealth, Orlando ‘07 A. Teixeira-Pinto AcademyHealth, Orlando 2007 The city of PORTO The city of PORTO A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 The city of PORTO Introduction z Multiple outcomes are often collected in health studies z z z Typically these outcomes are correlated. For outcomes measured in the same scale there are several multivariate methods implemented in commercial software z A. Teixeira-Pinto AcademyHealth, Orlando 2007 Longitudinal data, repeated measurements, multiple informants, multi-dimension outcome (health related quality of life), multiple surrogates for an outcome of interest Generalized linear mixed model, GEE, GLM, MANOVA… A. Teixeira-Pinto AcademyHealth, Orlando 2007 Introduction z z Often the outcomes are non-commensurate (mixed type) as for example a binary and a continuous outcome Common approach: z z Motivation example Analyze each outcome separately (univariate framework) ignoring the correlation Quality of life after Intensive Care Objective: evaluate health related quality of life (HRQOL) of patients 6 months after ICU discharge. z Study the association with: z z A multivariate approach will: z z z z z Use the additional information contained in the correlation between outcomes Permit better control over Type I error rates Answer intrinsically multivariate questions Be helpful in some situations of missing data Age Previous health state z z z z Non-chronic disease Chronic disease with no disability Chronic disease with disability Apache II score z Severity score at ICU admission A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 Instrument EQ-5D Measuring HRQOL z EQ-5D is a standardized instrument for use as a measure of health outcome. z Applicable to a wide range of health conditions and treatments, it provides a simple descriptive profile and a single index value for health status based on 5 health related dimensions. z Includes a question about patient’s perception of his/hers HRQOL A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 Instrument EQ-5D z We’ll z Common approach consider two outcomes z EQ-5D index z z z D-VAS (visual analogue scale) z VAS Dichotomized <=50 and >50 z Binary outcome z z z And z Data for the HRQOL after ICU stay: z Summarizes the 5 dimensions of the EQ5D z Continuous outcome z the three covariates: Age ; Previous health state; Apache II A. Teixeira-Pinto AcademyHealth, Orlando 2007 z 4 years of data collection One intensive care unit from a tertiary hospital in Portugal 485 patients participated in the study The EQ-5D index was available for all the patients Only 366 patients answered the question associated with the D-VAS Common approach: z z Linear model for the EQ-5D index Logistic or probit regression for D-VAS A. Teixeira-Pinto AcademyHealth, Orlando 2007 Multiple outcomes Multiple outcomes age age EQ-5D EQ-5D index index previous health state EQ-5D EQ-5D index index n=485 Apache II age previous health state D-VAS D-VAS n=366 Apache II previous health state n=366 Apache II A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 Instrument EQ-5D Instrument EQ-5D A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 Multiple outcomes Why should we use a multivariate method? z Missing values of D-VAS are associated with lower HRQOL z For a separate model for D-VAS we have missing not a random (MNAR) and the regression estimates might be biased z n=485 Apache II age D-VAS D-VAS previous health state Because the two outcomes are correlated, in a joint model, we can ‘borrow’ information from the EQ-5d index and reduce the bias for the estimates associated with D-VAS A. Teixeira-Pinto AcademyHealth, Orlando 2007 z If the outcomes are of the same type, we could assume a multivariate distribution for the outcomes z For example, two continuous outcomes ⎛ ⎛ µ ⎞ ⎛ σ 12 MVN ⎜ ⎜⎜ 1 ⎟⎟, ⎜⎜ ⎜ ⎝ µ 2 ⎠ ρσ σ ⎝ 1 2 ⎝ ρσ 1 σ 2 ⎞ ⎞⎟ ⎟ σ 22 ⎟⎠ ⎟⎠ A. Teixeira-Pinto AcademyHealth, Orlando 2007 Binary and continuous outcomes z For mixed type of outcomes there is no obvious multivariate distribution z z Binary and continuous outcomes z Latent variable model ∫ f(yb |u) f(yc| u) f(u) du Strategy: Avoid direct specification of the joint distribution z We can specify separate equations for the outcomes conditional on u. z The latent variable is modeling the correlation between the outcomes Latent variable model for yb, yc z Introduce a latent variable, u, and assume that conditional on u the outcomes are independent f(yb, yc)= ∫ f(yb, yc ,u) du = = ∫ f(yb, yc |u) f(u) du = ∫ f(yb |u) f(yc| u) f(u) du A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 Latent model z Latent model z Mathematically speaking: z probit (P( yb = 1) ) = β bT X b + λbu However this models has parameters that are nonidentifiable and we have to fix some of them It can be shown that the correct way to fix some of the parameters is: yc = β cT X c + λc u + ε c u ~ N (0, σ u2 ), z probit (P( yb = 1) ) = β bT X b + u ε c ~ N (0, σ c2 ) yc = β cT X c + σ cu + ε c λb and λc are scale factors “adjusting” the latent variable ui ~ N (0, σ u2 ), to the different scales of the outcomes ε c ~ N (0, σ c2 ) A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 Latent model z Latent model IMPORTANT NOTE: The models are for yb |u and yc |u . I omit the conditional from the equations for simplification. z The same is true for βc ’s, but because of the linear link the interpretation is the same for yc |u and yc probit (P( yb = 1) ) = β bT X b + u z z probit (P ( yb = 1) ) = β bT X b + u yc = β cT X c + σ cu + ε c yc = β cT X c + σ c u + ε c ui ~ N (0, σ u2 ), ui ~ N (0, σ u2 ), ε c ~ N (0, σ c2 ) The interpretation of βb ’s referring to the effect of the covariates on the outcome yb is conditional on u, i.e., yb |u The ‘marginal’ effect can be obtained: βb z ε c ~ N (0, σ c2 ) A nice feature of this model is that it can be easily implemented in commercial stats software z With SAS, use PROC NLMIXED 1 + σ u2 A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007 SAS code to fit the Latent Model Results of the HRQOL study #SAS code to maximize the likelihood resulting from the latent variable model for the HRQOL example; Age -0.24 #likelihood; part1=a1 + b1*age + c1*apache +d1*pstate+ u; part2=eq5d - (a2 + b2*age +c2*apache + d2*pstate) - u*sigma2; if missing(dvas) then loglik=-log(sigma2)-.5*1/(sigma2**2)*(part2)**2; else loglik =dvas*log(PROBNORM (part1))+(1-dvas)*log(PROBNORM (-part1))-log(sigma2) 5*1/(sigma2**2)*(part2)**2; Apache II Previous state Latent model Coefficient P-value <0.01 -8.12 -0.46 ~1 ~0 0.01 -0.01 <0.01 -0.49 -0.46 <0.01 -0.49 <0.01 (0.11) -0.018 0.09 -0.027 <0.01 (0.010) z This effect would not be noticed with univariate analysis z Taking into account the correlation between the two outcomes (crude ρ = 0.42) helped to reduce the bias of the effects estimates <0.01 ~1 0.03 (0.005) (0.11) -0.018 0.03 (0.005) The analysis suggests that the severity of the episode leading to the ICU admission is associated with the patients perception of his/hers HRQOL but not with the EQ-5D index <0.01 (0.16) (0.005) Apache II -0.01 z D-VAS (n=366) -0.01 0.01 (1.53) (0.15) Previous state ~1 A. Teixeira-Pinto AcademyHealth, Orlando 2007 (0.06) (1.53) ~0 Results of the HRQOL study Univariate Age -0.01 A. Teixeira-Pinto AcademyHealth, Orlando 2007 -0.24 <0.01 (0.16) (0.005) Coefficient P-value ~0 ~1 (0.011) <0.01 -8.12 (1.53) ~0 Age Apache II (0.06) <0.01 D-VAS (n=366) EQ-5D Index (n=485) Apache II <0.01 (0.15) Results of the HRQOL study -8.12 -8.12 (0.11) #computes the ‘marginalized’ parameters for the probit model; estimate ‘intercept' a1/sqrt(1+sigmau**2); estimate 'age_marg' b1/sqrt(1+sigmau**2); estimate 'apache_marg' c1/sqrt(1+sigmau**2); estimate ‘pstate_marg’ d1/sqrt(1+sigmau**2); run; -0.24 (0.06) (1.53) #model (actually you can put any variable other than eq5d with complete observations; model eq5d ~ general(loglik) ; random u ~ normal(0,sigmau**2) subject=idnumb; Previous state <0.01 (0.06) Previous state -0.24 Latent model Coefficient P-value EQ-5D Index (n=485) proc nlmixed data=Icu.Euroqolreduced technique=newrap; #initial values; parms a1=-0.9 b1=.02 c1=-1 d1=0 a2=104 b2=-.2 c2=-9 d2=-4 sigmau=1 sigma2=15 ; bounds sigma2>0, sigmau>0; Age Univariate Coefficient P-value <0.01 (0.11) 0.09 (0.011) -0.027 <0.01 (0.010) A. Teixeira-Pinto AcademyHealth, Orlando 2007 Other approaches A. Teixeira-Pinto AcademyHealth, Orlando 2007 Extention to more than two outcomes Other strategies presented in the literature: z Factorization method: f(yb, yc) = f(yb)f(yc| yb) or z f(yb, yc) = f(yc)f(yb| yc) z For k outcomes: g1 (E ( y1 ) ) = β1T X 1 + λ1u g 2 (E ( y2 ) ) = β 2T X 2 + λ2u g 3 (E ( y3 ) ) = β 3T X 3 + λ3u z Extension of weighted GEEs to noncommensurate outcomes z Other strategies for the missing data can also be used, e.g., multiple imputation A. Teixeira-Pinto AcademyHealth, Orlando 2007 M g k (E ( yk ) ) = β kT X k + λk u u ~ N (0, σ u2 ) A. Teixeira-Pinto AcademyHealth, Orlando 2007 “Take home” message Complete cases Univariate approach + ≈ Same covariates for all the outcomes Multivariate approach Complete cases + Different covariates for the the outcomes Missing data on the outcomes Thank you for your attention! Univariate approach less efficient (larger std. errors) Slides available at: http://users.med.up.pt/tpinto/ahealth.ppt Multivariate approach more efficient (smaller std. errors) Univariate approach may lead to biased estimates Multivariate approach may reduce the bias A. Teixeira-Pinto AcademyHealth, Orlando 2007 A. Teixeira-Pinto AcademyHealth, Orlando 2007