APPENDIX A Markov Chain Model for Lifetime Risk of Diabetes by BMI The Markov chain governing the movement of people among the three agespecific statesno diabetes, diabetes, and deathwith age = a, a+1,…, A, A+1 years is defined by the transition matrix Q R P O I where O and I are conformable zero and identity matrices and O B a O O Q. . O O O O with Bi i 0 O B a 1 . O O O . . O BA O ; R a O R . O O O O R a 1 O . . . O RA O O O O . O I 2x2 i i 0 , R i 0 . The parameters are defined by the following. i i i = the probability of being non-diabetic a year later if non-diabetic at age = i years. i = the probability of being diabetic a year later if non-diabetic at age = i years. These are the age-specific incidence rates for diagnosed diabetes. i = the probability of being dead a year later if non-diabetic at age = i years. i = the probability of being diabetic a year later if diabetic at age = i years. i = the probability of being dead a year later if diabetic at age = i years. Clearly, the relations i i i 1 and i i 1 must hold for all ages i. The zero entries in the B matrices reflect the assumption that a person diagnosed with diabetes remains diabetic until death. Note that this specification ignores ages less than a minimum age = a years (a = 18 in our application), and “kills” people off at age = A+2 years (A+2 = 85). Hence, for this truncated model people are “born” at age = 18 and “death” is either dying before age = 85 or reaching age = 85. These transition rates are, for each of the eight race-sex groups, functions of age only. To extend this model so that lifetime risk and other related quantities are sensitive to BMI, modeling of the transition rates as functions of age and BMI category for each of the eight groups is required. Thus, the following specifications. (i,BMIj(i)) = the probability of being non-diabetic a year later if non-diabetic at age i years in BMI category j. (i,BMIj(i)) = the probability of being diabetic a year later if non-diabetic at age i years in BMI category j. These are the age- and BMI-specific incidence rates for diagnosed diabetes. (i,BMIj(i)) = the probability of being dead a year later if non-diabetic at age i years in BMI category j. (i,BMIj(i)) = the probability of being diabetic a year later if diabetic at age i years in BMI category j. (i,BMIj(i)) = the probability of being dead a year later if diabetic at age i years in BMI category j. From the theory of Markov chains1, the expected years of life remaining for a non-diabetic person whose age = i years is the sum of the elements in the (2i + 1)th row of the matrix W (I - Q) -1 and the expected years of life with diabetes is the sum of the elements in the even columns of this row, where we adopt the convention that, henceforth, age = 0 corresponds to 18 years. For example, an 18-year-old without diabetes has an expected lifetime equal to the sum of the elements in the first row of W with expected years of life with diabetes (duration) equal to the sum of even columns in the first row of W. Also, the element u2i+1,2j+2 of the matrix U (I - Q) -1 R , with j > i, equals the probability a person without diabetes whose age = i years will die with diabetes at age = j + 1 years. Thus, the following quantities are of interest. Residual Lifetime Risk for Diabetes The residual lifetime risk for diabetes for a non-diabetic person at age = i years is defined as the probability of dying with diabetes. Specifically, RLR i Prob(dying diabetic | non - diabetic at age i years) u2i 1,2 j 2 j i from the theory presented above. Expected Remaining Lifetime and Duration of Diabetes The expected remaining lifetime for people not diagnosed with diabetes at age = i years is, from the theory, ExpLifei w2i 1,k k 1 where the w’s are elements of the matrix W. The expected years of life with diabetes, or the duration of diabetes, is Duri w2i 1,2k 2 . k 0 Derivation of Transition Probabilities The following development was applied to each of the eight race-sex combinations for each given age = i years, where, to avoid notational clutter, all indices are suppressed except for BMI category j. For example, for white males age = 40 years we define the following. T total population size in 2004 (from Census) . M overall one year mortality rate (from Census) . b j prevalence of BMI category j (estimated from NHIS, 2001 - 2004) . 5 Note the relation b j 1 j 1. Thus, T j b jT population size of BMI category j . j prevalence of diabetes among those in BMI category j (estimated from NHIS) , 5 and, jT j j 1 T 5 b T j 1 j j T 5 j b j overall prevalence of diabetes . j 1 Set, overall death rate among diabetics . overall death rate among non - diabetics . r overall relative risk, i.e., ε r (estimated from ND death certificat es) . Thus, M T [ T ] [(1 ) T ] r T (1 ) T and, M . r (1 ) The additional estimates that were needed were relative risks of death by age group and BMI category and came from Flegal, et al. Since these were not reported by diabetes status we had to make the assumption they were the same for diabetics and nondiabetics. Specifically, we set r j relative risk of death for people in BMI category j with r2 1 , and make Assumption : r j is unchanged by diabetes status . Let, ε j death rate among diabetics in BMI category j , j death rate among nondiabetics in BMI category j . By the assumption, j r j 2 and j r j 2 ; j 1, ,5 . Thus, implies 2 5 5 j 1 j 1 j j T j rj 2 j b j T T r T r 5 . rj j b j j 1 Also, (1 ) T implies 2 5 5 j 1 j 1 j (1 j ) T j r j 2 (1 j ) b j T (1 ) 5 rj j 1 . (1 j ) b j Finally, with estimates j (from NHIS) αj 1 j j and j 1 j j 1, ,5 . ; We can summarize all of the above. Given the inputs M ; b1 , b2 , b3 , b4 , b5 ; 1 , 2 , 3 , 4 , 5 ; r ; r1 , r2 , r3 , r4 , r5 we can calculate 5 j b j , j 1 M , 2 r (1 ) r 5 rj j b j j 1 and, ε j r j 2 and γ j r j 2 ; j 1, ,5 . The additional estimates j (from NHIS) yield αj 1 j j and j 1 j ; j 1, ,5 . , 2 (1 ) 5 r j (1 j ) b j j 1 Estimation A Bayesian approach was taken to estimate all transition probabilities and other quantities of interest in the Markov chain models. Markov Chain Monte Carlo (MCMC) methods were used to generate 5000 draws from the posterior distributions of prevalences, incidence rates, and relative risks of death attributable to diabetes. Five thousand normal “posterior” draws for the relative risk of death by age group and BMI category were produced using the estimates and standard errors from Flegal, et al. For our analysis we made the simplifying assumption that the overall population mortality rates from Census and the estimates of BMI prevalences from the National Health Interview Survey were estimated without error. Each posterior draw yielded, through the calculations described above, a posterior draw of all transition probabilities in the Markov models. Interestingly, for all 5000 draws, the following restrictions obtained for every age- race-sex combination: 2 2 ; 0 j , j , j , j , j 1 for j 1,,5. This guaranteed 1) all chains were proper, i.e., all transition probabilities in the unit interval, and 2) all diabetes death rates were strictly greater than the corresponding nondiabetes death rate. Thus, none of the posterior draws were thrown out. The Markov chain models were then run 5000 times. The output from these runs provided the posterior distribution of all quantities of interest, e.g., lifetime risks, durations, etc. Posterior means were estimates of these quantities, and the 0.025 and 0.975 quantiles furnished 95 percent confidence intervals.