The life table

advertisement
The life table
• LT statistics: rates, probabilities, life
expectancy (waiting time to event)
• Period life table
• Cohort life table
Life table from
observational data
22 respondents
Namboodiri and Suchindran, 1987, Chapter 4
A. Exact time
Table 1
Hypothetical survey data for construction of life table (1986)
CASE ID
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
Date
of entry
in observation
Jan-02
Jan-17
Jan-18
Jan-22
Feb-10
Jan-30
Apr-04
Apr-29
May-18
May-20
May-15
Feb-05
Feb-05
Feb-06
Feb-26
Mar-10
Mar-11
Mar-28
Mar-15
Apr-13
Apr-04
Apr-25
- indicates censoring
Date
Date
of event of interview
Feb-11
May-04
Feb-28
May-17
Feb-12
Feb-25
Apr-18
May-18
May-08
Mar-23
May-09
May-16
May-25
May-17
May-10
May-13
May-23
May-15
May-06
May-27
May-29
May-31
May-18
May-19
May-10
May-28
May-22
May-25
May-12
May-29
May-10
May-20
May-11
May-31
Duration
Duration
Duration
to event to interview Exposure
Days
Days
Days
40
143
40
107
120
107
112
112
37
111
37
96
102
96
13
105
13
32
32
28
28
11
11
11
11
3
3
20
103
20
72
94
72
101
111
101
85
85
76
76
58
62
58
62
62
8
56
8
37
37
35
37
35
12
36
12
Table 2
Estimation of survival function
CASE ID
11
19
9
10
22
6
12
8
7
21
4
20
1
17
18
13
16
15
5
14
2
3
Date
of entry
in observation
May-15
Mar-15
May-18
May-20
Apr-25
Jan-30
Feb-05
Apr-29
Apr-04
Apr-04
Jan-22
Apr-13
Jan-02
Mar-11
Mar-28
Feb-05
Mar-10
Feb-26
Feb-10
Feb-06
Jan-17
Jan-18
Date
Date
of event of interview
Mar-23
May-16
Feb-12
Feb-25
May-09
Feb-28
Feb-11
May-08
Apr-18
May-17
May-18
May-04
-
May-18
May-10
May-29
May-31
May-31
May-15
May-19
May-27
May-06
May-11
May-13
May-20
May-25
May-12
May-29
May-10
May-25
May-22
May-23
May-28
May-17
May-10
Duration
Duration
Duration
At tisk
Event/
to event to interview Exposure at beginning censoring
Days
Days
Days
of day*
8
12
13
20
35
37
40
58
72
96
101
107
-
3
56
11
11
36
105
103
28
32
37
111
37
143
62
62
94
76
85
102
111
120
112
USE EXCEL DATA/SORT
Number of persons at risk decreases over time because of occurrence of event or censoring
ASS: population is homogeneous and censoring is independent of event of interest
* Assumes censoring NOT at beginning of interval (day)
3
8
11
11
12
13
20
28
32
35
37
37
40
58
62
72
76
85
96
101
107
112
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
0
1
0
0
1
1
1
0
0
1
1
0
1
1
0
1
0
0
1
1
1
0
Table 2
Estimation of survival function, cont
1/21
Day
At tisk
at beginning
of day*
Event Censored
3
8
11
11
12
13
20
28
32
35
37
37
40
58
62
72
76
85
96
101
107
112
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
0
1
0
0
1
1
1
0
0
1
1
0
1
1
0
1
0
0
1
1
1
0
1
0
1
1
0
0
0
1
1
0
0
1
0
0
1
0
1
1
0
0
0
1
Total
12
10
Prob
of event
0.0476
Observed
survival
proportions
0.9524
0.0556
0.0588
0.0625
0.9444
0.9412
0.9375
0.0769
0.0833
0.9231
0.9167
0.1000
0.1111
0.9000
0.8889
0.1429
0.8571
0.2500
0.3333
0.5000
0.7500
0.6667
0.5000
Survival
function
1
0.9524
0.9524
0.9524
0.8995
0.8466
0.7937
0.7937
0.7937
0.7326
0.6716
0.6716
0.6044
0.5372
0.5372
0.4605
0.4605
0.4605
0.3454
0.2302
0.1151
0.1151
0.9524 = 1-1/21
=1 - failure/number at risk
before failure
0.8995 = 0.9524*[1-1/18]
Kaplan-Meier
Discrete time interval (30 days)
B. Discrete time interval: completed months
Duration
Month
0
1
2
3
Duration
Days
Events
Censoring
4
3
1
1
4
2
3
1
0-30
31-59
60-89
90-119
Event+censoring in same month:
Events+
censoring
0
1
0
2
case 21 (event on May 9 and interview on May 11)
Case 5 (event on May 17 and interview on May 23)
Case 14 (event on May 18 and interview on May 28)
A. Censoring at beginning of month
Duration
Risk set
Events
Censored
in months
0
18
4
4
1
11
3
3
2
5
1
3
3
1
1
3
4
NOTE: Case 21 is censored instead of event
NOTE: 11 = 18 - 4- 3
Prob of
event
0.2222
0.2727
0.2000
1.0000
Prob
surviving
0.7778
0.7273
0.8000
0.0000
Survival function
S intermed
1.0000
0.0000
0.7778
0.0123
0.5657
0.0319
0.4525
0.0707
0.0000
0.0000
s.e.
0.0000
0.0864
0.1190
0.1534
0.0000
B. Censoring at end of month
Duration Risk set
Events Censored Prob of
in months
event
0
22
4
4
0.1818
1
14
4
2
0.2857
2
8
1
3
0.1250
3
4
3
1
0.7500
4
NOTE: Case 21 experiences event and is not censored
Prob
surviving
0.8182
0.7143
0.8750
0.2500
Survival function
S intermed
1.0000
0.0000
0.8182
0.0083
0.5844
0.0249
0.5114
0.0267
0.1278
0.0774
s.e.
0.0000
0.4236
0.2979
0.2478
0.0583
C. Censoring in middle of month (half at beginning and half at end of month)
Duration Risk set
Events Censored Prob of
in months
event
0
20
4
4
0.2000
1
13
4
2
0.3077
2
6.5
1
3
0.1538
3
3.5
3
1
0.8571
4
NOTE: Case 21 experiences event and is not censored
Prob
surviving
0.8000
0.6923
0.8462
0.1429
Survival function
S intermed
1.0000
0.0000
0.8000
0.0100
0.5538
0.0296
0.4686
0.0427
0.0669
0.0484
s.e.
0.0000
0.2892
0.1924
0.1415
0.0147
Survival probability
Survival function
1.00
0.80
Censoring begin
0.60
censoring end
0.40
Censoring middle
0.20
0.00
0
1
2
Duration (months)
3
4
Survival probability
Survival function and 95% interval
1.00
0.80
Censoring middle
0.60
S-1.96*s.e.
0.40
S+1.96*s.e.
0.20
0.00
1
2
3
4
Duration (months)
5
Standard error of survival function: Greenwood formula
 i  t  1 qi 
SE ( S )  S  

p
i

1
i Ri 

t
1/ 2
t
With St = survival function
Rt = risk set
qt = probability of event
pt = survival probability (probability of NO event)
Assume 100 respondents (R = risk set)
Probability of event (q): 0.10 => survival probability (p): 0.90
Var(p) = pq/R = 0.9*0.1/100 = 0.0009 SQRT(0.0009) = 0.03
p2q/(pR) = 0.81*0.1/(0.9*100) = 0.0009
SQRT(0.0009) = 0.03
15 survivors experience event. Prob of event is 15/90 = 0.1667
Risk set: 90 (= 100*0.9)
Prob of surviving second interval: S2 = 0.9*0.833=0.75
Var(S2) = 0.752 * [0.1/(0.9*100) + 0.1667/(0.8333*90)]
= 0.00188
Standard error of (hazard) rate:
1/ 2
SE (r t ) 
2

r 1 r t  
q R   2  
t
t
t
With rt = hazard rate
Life table with grouped data
Remarriage of divorced women, aged 25 to 34, USA, 1975
Source: 1975 US Current Population Survey
Namboodiri and Suchindran, 1978, pp. 63 ff
Remarriage life table, Unites States 1975
Number Number of
Divorced
Interval
divorced remarriages at interview
at beginning
(censored)
(n')
(d)
(c)
(1)
(2)
(3)
(4)
[0,1)
1298
238
101
[1,2)
959
177
65
[2,3)
717
107
51
[3,4)
559
64
42
[4.5)
453
50
34
[5,6)
369
34
18
[6,7)
317
27
21
[7,8)
269
21
11
[8,9)
237
14
4
[9,10)
219
15
11
[10,11)
193
16
14
[11,12)
163
8
8
[12,13)
147
6
8
[13,14)
133
6
7
[14,15)
120
5
5
[15,16)
110
2
10
[16,17)
98
5
5
[17,18)
88
5
2
[18,19)
81
2
2
[19,20)
77
2
9
[20,+)
66
12
54
Total
816
482
Probability
Survival
Risk set
of event probability
[3/5]
[1-3/5]
(n)
(q)
(p)
(5)
(6)
(7)
1247.5
0.1908
0.8092
926.5
0.1910
0.8090
691.5
0.1547
0.8453
538.0
0.1190
0.8810
436.0
0.1147
0.8853
360.0
0.0944
0.9056
306.5
0.0881
0.9119
263.5
0.0797
0.9203
235.0
0.0596
0.9404
213.5
0.0703
0.9297
186.0
0.0860
0.9140
159.0
0.0503
0.9497
143.0
0.0420
0.9580
129.5
0.0463
0.9537
117.5
0.0426
0.9574
105.0
0.0190
0.9810
95.5
0.0524
0.9476
87.0
0.0575
0.9425
80.0
0.0250
0.9750
72.5
0.0276
0.9724
39.0
0.3077
0.6923
Survival
function
(S)
(8)
1.0000
0.8092
0.6546
0.5533
0.4875
0.4316
0.3908
0.3564
0.3280
0.3085
0.2868
0.2621
0.2489
0.2385
0.2274
0.2178
0.2136
0.2024
0.1908
0.1860
0.1809
[q/pR]
SE(S)
[6/(7*5)] Greenwood
(9)
0.0000
0.0002
0.0003
0.0003
0.0003
0.0003
0.0003
0.0003
0.0003
0.0003
0.0004
0.0005
0.0003
0.0003
0.0004
0.0004
0.0002
0.0006
0.0007
0.0003
0.0004
Risk set: assuming censoring in MIDDLE of interval
[ T,T+1) Greater or equal to T, but less than T+1
Greenwood formula: see also Blossfeld and Rohwer, 1995, p. 53
SE(rate): see Blossfeld and Rohwer, 1995, p. 54
Conditional density varies with duration
(10)
0.0000
0.0111
0.0138
0.0147
0.0151
0.0153
0.0154
0.0154
0.0154
0.0153
0.0152
0.0151
0.0150
0.0150
0.0150
0.0149
0.0149
0.0150
0.0150
0.0150
0.0150
h(t)
h(t)
Linear Exponential
(11)
0.2109
0.2112
0.1677
0.1265
0.1217
0.0991
0.0922
0.0830
0.0614
0.0728
0.0899
0.0516
0.0429
0.0474
0.0435
0.0192
0.0538
0.0592
0.0253
0.0280
-
(12)
0.2117
0.2120
0.1681
0.1267
0.1218
0.0992
0.0922
0.0831
0.0614
0.0728
0.0899
0.0516
0.0429
0.0474
0.0435
0.0192
0.0538
0.0592
0.0253
0.0280
-
SE[h(t)]
(13)
0.0136
0.0158
0.0162
0.0158
0.0172
0.0170
0.0177
0.0181
0.0164
0.0188
0.0224
0.0182
0.0175
0.0194
0.0194
0.0136
0.0240
0.0265
0.0179
0.0198
-
0.2109 = 238/(0.5*(1298+959))
0.2117 = -ln(0.8092)
1.0000
0.2500
0.8000
0.2000
0.6000
0.1500
0.4000
0.1000
0.2000
0.0500
0.0000
0.0000
0
2
4
6
8
10 12 14 16
Duration since divorce
18 20
Rate
Probability
Estimated survival function and hazard function
S(t)
h(t)
Leaving parental home,
The Netherlands, Birth cohort 1961
Survey Sept. 87 - Febr. 88
Source: W. Dijkstra ed. (1989) Het proces van sociale integratie van jong-volwassenen:
De gegevensverzameling voor de eerste hoofdmeting (The process of social integration of
young adults. Data collection for first measurement). VU Uitgeverij, Amsterdam.
Jong Gierveld, J. de, Liefbroer, A.C. & Beekink, E. (1991a). The effect
of parental resources on patterns of leaving home among young adults
in the Netherlands. European Sociological Review, 7, 55-71.
Thanks to Jenny Gierveld and Aat liefbroer, NIDI, The Hague, for providing the data.
Data
Leaving parental home, 1961 cohort, micro-data, 583 respondents (first 30 respondents)
# Sex Father Month Reason
# Sex Father Month Reason
1
2
2
268
2
16
1
2
251
3
2
1
3
268
2
17
2
2
212
1
3
1
2
202
1
18
2
2
320
2
4
2
2
320
4
19
1
2
221
3
5
1
1
237
1
20
2
2
322
4
6
1
1
295
2
21
2
1
221
2
7
1
1
272
2
22
2
3
308
2
8
2
1
231
1
23
1
2
233
1
9
2
1
312
3
24
1
1
273
2
10
1
2
289
2
25
1
1
208
1
11
1
1
316
2
26
1
2
219
1
12
2
1
321
4
27
1
1
261
2
13
2
1
260
1
28
2
2
270
3
14
2
2
281
2
29
2
2
277
2
15
2
1
273
2
30
1
1
290
3
Sex
1 Female
2 Male
Father status
1 Low
2 Middle
3 High
Reason
1
2
3
4
Educ/work
Marriage/cohabit
Freedom
Censored
at interview
Data
Females, including early censoring
Females, excluding early censoring
Living
Attrition
Exposure
Living
Attrition
at home
Total Censored
Leave (months) at home
Total Censored
Leave
15
291
4
0
4
3480
288
4
0
4
16
287
7
0
7
3412
284
7
0
7
17
280
23
0
23
3279
277
23
0
23
18
257
53
0
53
2761
254
53
0
53
19
204
48
0
48
2181
201
48
0
48
20
156
41
1
40
1652
153
40
0
40
21
115
31
0
31
1223
113
31
0
31
22
84
26
0
26
855
82
26
0
26
23
58
23
0
23
596
56
23
0
23
24
35
16
1
15
325
33
15
0
15
25
19
8
2
6
183
18
7
1
6
26
11
11
9
2
42
11
11
9
2
27
0
0
0
0
0
0
291
13
278
19989
288
10
278
Early censoring: Person born at the end of December 1961 and interviewed in September 1987 reaches 25
in December 1986 and is 25 + 9 months = 309 months early September 1987
There are females censored in months 244, 297 and 301. IMPOSSIBLE (see HOME.XLS)
Age
Age
15
16
17
18
19
20
21
22
23
24
25
26
27
Males, including early censoring
Living
Attrition
Exposure
at home
Total Censored
Leave (months)
292
2
0
2
3495
290
5
0
5
3457
285
15
0
15
3347
270
24
1
23
3112
246
30
1
29
2791
216
35
2
33
2408
181
28
0
28
2043
153
33
0
33
1643
120
30
0
30
1283
90
30
0
30
915
60
25
5
20
626
35
35
31
4
196
0
0
0
0
0
292
40
252
25316
The first male censored is in month 310. Hence no early censoring.
d:\s\teach\98\lessn\lt_obs\jenny
Age
At home
Column 1
15
16
17
18
19
20
21
22
23
24
25
26
27
2
291
287
280
257
204
156
115
84
58
35
19
11
0
Females
Leave Censored
3
4
7
23
53
48
40
31
26
23
15
6
2
0
278
4
0
0
0
0
0
1
0
0
0
1
2
9
0
13
Risk set Exposure
(months)
5
291.0
287.0
280.0
257.0
204.0
155.5
115.0
84.0
58.0
34.5
18.0
6.5
0.0
6
3480
3412
3279
2761
2181
1652
1223
855
596
325
183
42
0
19989
Probab
leaving
(3/5)
7
0.0137
0.0244
0.0821
0.2062
0.2353
0.2572
0.2696
0.3095
0.3966
0.4348
0.3333
0.3077
Probab
surviving
1-(7)
8
0.9863
0.9756
0.9179
0.7938
0.7647
0.7428
0.7304
0.6905
0.6034
0.5652
0.6667
0.6923
Survival
q/(p*R)
function (9/(10*5)
9
1.0000
0.9863
0.9622
0.8832
0.7010
0.5361
0.3982
0.2908
0.2008
0.1212
0.0685
0.0457
0.0316
0.0000
0.0001
0.0003
0.0010
0.0015
0.0022
0.0032
0.0053
0.0113
0.0223
0.0278
0.0684
SE(S(t))
10
0.0000
0.0069
0.0115
0.0205
0.0338
0.0382
0.0387
0.0365
0.0341
0.0318
0.0264
0.0188
0.0173
Risk set = number initially at home - 0.5*censoring during year. Assume UNIFORM distribution
Exposure in YEARS = risk set - 0.5*number leaving home during year. Assume: UNIFORM distribution
d:\s\teach\98\lessn\lt_obs\jenny
Probability of living at home, Females, (+95% interval)
Probability
1.0000
0.8000
0.6000
0.4000
0.2000
0.0000
15
16
17
18
19
20
21
22
23
24
25
26
27
Age
d:\s\teach\98\lessn\lt_obs\jenny
Age
At home
Leave
Column 1
15
16
17
18
19
20
21
22
23
24
25
26
27
2
291
287
280
257
204
156
115
84
58
35
19
11
0
3
4
7
23
53
48
40
31
26
23
15
6
2
0
278
Risk set Exposure
in years
5
291.0
287.0
280.0
257.0
204.0
155.5
115.0
84.0
58.0
34.5
18.0
6.5
0.0
289.0
283.5
268.5
230.5
180.0
135.5
99.5
71.0
46.5
27.0
15.0
5.5
0.0
1651.5
Event
Rate
SE(rate)
11
0.0138
0.0247
0.0857
0.2299
0.2667
0.2952
0.3116
0.3662
0.4946
0.5556
0.4000
0.3636
12
0.0069
0.0093
0.0178
0.0314
0.0381
0.0462
0.0553
0.0706
0.0999
0.1378
0.1600
0.2528
Waiting Expected
time
waiting
(T) time (yrs)
13
14
1651.5
5.68
1362.5
4.75
1079.0
3.85
810.5
3.15
580.0
2.84
400.0
2.57
264.5
2.30
165.0
1.96
94.0
1.62
47.5
1.38
20.5
1.14
5.5
0.85
d:\s\teach\98\lessn\lt_obs\jenny
Leaving home: occurrences and exposures
A. Events and exposure including censored observations
Sex
Event count
Females
Males
Timing
Early (LT 20)
135
74
Late (GE 20)
143
178
Censored at interview
13
40
Total
291
292
Timing
16202
9114
25316
31315
13990
45305
B. Events and exposure excluding censored observations
Sex
Event count
Females
Males
Timing
Early (LT 20)
135
74
Late (GE 20)
143
178
Total
278
252
Total
209
321
530
Timing
Early (LT 20)
Late (GE 20)
Total
Exposure
15113
4876
19989
Total
209
321
53
583
Early (LT 20)
Late (GE 20)
Total
Exposure
14333
3998
18331
13826
6339
20165
28159
10337
38496
d:\s\teach\98\lessn\lt_obs\jenny
d:\s\teach\99pisa\leaveh\expos1.xls
SPSS output
Intrvl
Start
Time
-----15
16
17
18
19
20
21
22
23
24
25
26
SPSS SURVIVAL
Females
Number Number Number
Entrng Wdrawn
Exposd
this
During
to
Intrvl
Intrvl
Risk
---------------291
0
291.0
287
0
287.0
280
0
280.0
257
0
257.0
204
0
204.0
156
1
155.5
115
0
115.0
84
0
84.0
58
0
58.0
35
1
34.5
19
2
18.0
11
9
6.5
583 observations
Number
of
Termnl
Events
-----4
7
23
53
48
40
31
26
23
15
6
2
Propn
Terminating
-----0.0137
0.0244
0.0821
0.2062
0.2353
0.2572
0.2696
0.3095
0.3966
0.4348
0.3333
0.3077
Propn
Surviving
-----0.9863
0.9756
0.9179
0.7938
0.7647
0.7428
0.7304
0.6905
0.6034
0.5652
0.6667
0.6923
Cumul
Propn
Surv
at End
-----0.9863
0.9622
0.8832
0.7010
0.5361
0.3982
0.2908
0.2008
0.1212
0.0685
0.0457
0.0316
Probability
Densty
-----0.0137
0.0241
0.0790
0.1821
0.1649
0.1379
0.1073
0.0900
0.0796
0.0527
0.0228
0.0141
Hazard
Rate
-----0.0138
0.0247
0.0857
0.2299
0.2667
0.2952
0.3116
0.3662
0.4946
0.5556
0.4000
0.3636
The median survival time for these data is 20.26
d:\s\teach\98\lt_obs\jenny\spss.xls
d:\s\teach\98\lt_obs\jenny\lt.doc
SPSS output
Intrvl
Start
Time
-----15
16
17
18
19
20
21
22
23
24
25
26
SPSS SURVIVAL
Males
Number Number Number
Entrng Wdrawn
Exposd
this
During
to
Intrvl
Intrvl
Risk
---------------292
0
292.0
290
0.0
290.0
285
0.0
285.0
270
1.0
269.5
246
1.0
245.5
216
2.0
215.0
181
0.0
181.0
153
0.0
153.0
120
0.0
120.0
90
0.0
90.0
60
5.0
57.5
35
31.0
19.5
583 observations
Number
of
Termnl
Events
-----2
5
15
23
29
33
28
33
30
30
20
4
Propn
Terminating
-----0.0068
0.0172
0.0526
0.0853
0.1181
0.1535
0.1547
0.2157
0.2500
0.3333
0.3478
0.2051
Propn
Surviving
-----0.9932
0.9828
0.9474
0.9147
0.8819
0.8465
0.8453
0.7843
0.7500
0.6667
0.6522
0.7949
The median survival time for these data is 22.29
Cumul
Propn
Surv
at End
-----0.9932
0.9760
0.9247
0.8457
0.7458
0.6314
0.5337
0.4186
0.3139
0.2093
0.1365
0.1085
Probability
Densty
-----0.0068
0.0171
0.0514
0.0789
0.0999
0.1145
0.0977
0.1151
0.1046
0.1046
0.0728
0.0280
Hazard
Rate
-----0.0069
0.0174
0.0541
0.0891
0.1255
0.1662
0.1677
0.2418
0.2857
0.4000
0.4211
0.2286
The Kaplan-Meier estimator
• A method for the nonparametric estimation
of the survival function (1958)
• Also called product-limit estimator
• The risk set is calculated at every point in
time where at least one event occurred.
• Hence all episodes must be sorted according to their ending
times.
• It is a staircase function with
a. Location of drop is random (time at event)
b. Size of drop depends on censoring
Kaplan-Meier estimator
Time from diagnosis to death from melanoma
or loss to follow-up, 50 subjects (Clayton and Hills, 1993, p. 36)
Conditional probability
Survival Aalen-Nelson
Month
N
Death Censored of death of survival function
(Cum. Rate)
1.0000
0
50
2
0.0400
0.9600
0.9600
0.0400
1
48
1
0.0208
0.9792
0.9400
0.0608
2
47
2
0.0426
0.9574
0.9000
0.1034
3
45
1
1
0.0222
0.9778
0.8800
0.1256
8
43
1
0.0233
0.9767
0.8595
0.1489
10
42
1
0.0238
0.9762
0.8391
0.1727
12
41
1
1
0.0244
0.9756
0.8186
0.1971
13
39
1
0.0256
0.9744
0.7976
0.2227
15
38
1
0.0263
0.9737
0.7766
0.2490
18
37
1
0.0000
1.0000
0.7766
0.2490
19
36
1
0.0278
0.9722
0.7551
0.2768
21
35
1
0.0000
1.0000
0.7551
0.2768
27
34
2
0.0000
1.0000
0.7551
0.2768
30
32
1
0.0000
1.0000
0.7551
0.2768
33
31
1
1
0.0323
0.9677
0.7307
0.3091
34
29
1
0.0345
0.9655
0.7055
0.3435
38
28
1
0.0000
1.0000
0.7055
0.3435
40
27
1
0.0000
1.0000
0.7055
0.3435
41
26
1
0.0385
0.9615
0.6784
0.3820
43
25
1
0.0000
1.0000
0.6784
0.3820
44
24
1
0.0000
1.0000
0.6784
0.3820
46
23
1
0.0000
1.0000
0.6784
0.3820
54
22
1
0.0000
1.0000
0.6784
0.3820
55
21
1
0.0476
0.9524
0.6461
0.4296
56
20
1
0.0500
0.9500
0.6138
0.4796
57
19
2
0.0000
1.0000
0.6138
0.4796
60
17
1
0.0000
1.0000
0.6138
0.4796
-->
Kaplan-Meier estimator
Exact time of failure and censoring are known
Time in complete months
Subjects dying during first month, are recorded
as surviving one month
For 2 subjects, diagnosis took place at death,
hence time recorded as zero
In 11th month (10 months completed), 1 death occurs.
Number at risk is 42.
Hence probability of falure is 1/42 = 0.02381
and the probability of surviving is 0.8595*(1-0.02381) = 0.8391
S(t)   1 - Di / Y (Xi)
X t
^
i
Where Y(xi) is the risk set (individuals at risk just before
time t (time at event))and Di is the failure indicator (1 in
case of failure)
Kaplan-Meier estimator
Time from diagnosis to death
Survival function by Kaplan-Meier method
Survival probability
1.00
0.95
0.90
0.85
0.80
0.75
0.70
0.65
0.60
0
5
10
15
20
25
30
35
40
45
50
55
60
Time
Clayton and Hills, 1993, p. 37
References: Kaplan-Meier
• Good introduction: Clayton and Hills, 1993,
pp. 35ff
• Technical: Andersen and Keiding, 1996, pp.
180ff (includes several references)
The Nelson-Aalen estimator
• A method for the nonparametric estimation
of the cumulative hazard function
• The risk set is calculated at every point in
time where at least one event occurred.
• Hence all episodes must be sorted according to their ending
times.
• It is a staircase function with
a. Location of drop is random (time at event)
b. Size of drop is 1/risk set (number at risk: count of persons alive
before the death
• Easier to generalise to multistate situations
The Nelson-Aalen estimator
• Nelson (1969) and Aalen (1978)
• Clayton and Hills, 1993, p. 48
• Andersen and Keiding, 1996, p. 181
^
A(t)   Di / Y (Xi)
X t
i
A(t) = -lnS(t)
Nelson-Aalen estimator (rates)
Time from diagnosis to death
Cumulative rates using Aalen-Nelson method
Cumulative rate
0.60
0.50
0.40
0.30
0.20
0.10
0.00
0
5
10
15
20
25
30
35
40
45
50
55
60
Time (months)
Clayton and Hills, 1993, p. 50
Duration of job episodes (Blossfeld and Rohwer, 1995) (sorted)
ID
33
110
202
23
81
145
175
185
193
203
44
87
100
100
110
110
127
170
194
27
106
161
177
7
8
40
44
44
44
76
84
153
203
Duration of job episodes
Job Starting
Ending
number
time
time
3
981
982
3
672
673
2
845
846
1
855
857
3
604
606
2
980
982
3
981
983
2
817
819
5
969
971
1
935
937
1
802
805
3
915
918
2
579
582
3
583
586
1
664
667
2
668
671
2
869
872
1
552
555
6
939
942
6
978
982
1
832
836
2
732
736
1
700
704
3
730
735
1
838
843
3
772
777
3
826
831
4
832
837
5
856
861
1
838
843
1
634
639
1
849
854
2
938
943
Duration of job episodes
Duration
ID
Job Starting
Ending
Duration
(p. 46)
number
time
time
2
27
5
971
977
7
2
100
4
588
594
7
2
100
5
595
601
7
3
106
2
837
843
7
3
107
1
934
940
7
3
127
3
873
879
7
3
135
2
834
840
7
3
172
3
940
946
7
3
193
6
976
982
7
3
194
4
897
903
7
4
21
1
689
696
8
4
21
2
697
704
8
4
28
3
702
709
8
4
31
4
975
982
8
4
40
2
764
771
8
4
49
1
975
982
8
4
73
3
937
944
8
4
95
4
975
982
8
4
135
1
826
833
8
5
5
5
5
6
201 respondents
6
6
600 job episodes
6
6
6
6
6 Duration: 'To avoid zero durations, we have added
6 one months to the job duration, or the observed
6 duration if the episode is right censored.' (p. 46)
ID
9
27
44
110
161
176
194
194
27
58
59
70
79
96
123
167
171
173
188
194
9
24
127
167
3
3
3
15
15
24
40
Duration of job episodes
Job Starting
Ending
number
time
time
3
634
642
4
773
781
7
974
982
4
673
681
1
625
633
2
565
573
1
844
852
2
853
861
2
703
712
5
942
951
3
973
982
2
973
982
2
973
982
2
772
781
2
877
886
1
532
541
6
831
840
1
654
663
2
834
843
8
973
982
1
591
601
4
938
948
1
839
849
2
542
552
1
688
699
3
730
741
5
817
828
1
820
831
2
832
843
1
700
711
1
752
763
Duration
9
9
9
9
9
9
9
9
10
10
10
10
10
10
10
10
10
10
10
10
11
11
11
11
12
12
12
12
12
12
12
Product-limit estimate of survival function (Kaplan-Meier)
Index
Time
0
1
2
3
4
5
6
7
8
9
10
0
2
3
4
5
6
7
8
9
10
11
Number Number
of events censored
E
0
0
2
0
5
1
9
2
3
0
10
1
9
0
6
1
7
3
8
1
4
4
Exposed
to risk
(episodes)
R
600
600
597
590
581
577
567
557
548
540
528
Survivor
Sum
function E/(R(R-E))
1.00000
0.99667
0.98832
0.97324
0.96822
0.95144
0.93634
0.92625
0.91442
0.90087
0.89405
5.574E-06
1.415E-05
2.626E-05
8.933E-06
3.057E-05
2.845E-05
1.955E-05
2.361E-05
2.785E-05
1.446E-05
Std
error
0.00000
0.00235
0.00439
0.00660
0.00717
0.00880
0.00999
0.01070
0.01146
0.01225
0.01262
Median duration: 43.03 months
Number censored = number of censored episodes
with ENDING TIMES less than time shown in TIME column.
0.98832 = 0.99667 * (1-5/597)
S(t-1)*[1-E/R]
0.00001415 = 5/(597*(597-5)) = E/(R*(R-E)) .
Std error: S(t) * SQRT{sum [ E/R(R-E))]} GREENWOOD FORMULA (Blossfeld and Rohwer, 95, p. 67)
Table: see Blossfeld and Rohwer, p. 69.
Pisa99/blossfeld/rrdat_sort.xls
Product-limit estimate of survival function (Kaplan-Meier)
and 95% interval
Survival function (men+women)
1.00
0.95
0.90
0.85
0.80
0.75
0
2
3
4
5
S
6
7
Lower
8
Upper
9
10
11
12
Plot of survival function, generated by TDA
Duration up to 428 months (shown up to 300 months)
Blossfeld and Rohwer, 1995, p. 70 (ehc6_1.cf => ehc6_1.ps)
Blossfeld and Rohwer, 1995, p. 73 (ehc8.ps)
Product-limit estimate of survival function (Kaplan-Meier)
#
ehc5.cf
nvar(
dfile
ID
SN
TS
TF
SEX
TI
TB
TE
TMAR
PRES
PRESN
EDU
Kap
=
rrda
[3.0]
[2.0]
[3.0]
[3.0]
[2.0]
[3.0]
[3.0]
[3.0]
[3.0]
[3.0]
[3.0]
[2.0]
#
define
add
DES
[1.0]
=
TFP
[3.0]
=
);
edef(
ts
=
0,
tf
=
TFP,
org
=
0,
des
=
DES,
);
ple
=
ehc5.ple;
Product-limit estimate of survival function (Kaplan-Meier): output
D:\S\TEACH\99PISA\BLOSSF\OEF>d:\s\software\tda\62b\tda\tda_nt
cf=ehc5.cf
Creating new single episode data. Max number of transitions: 100.
Definition: org=0, des=DES, ts=0, tf=TFP
Mean
SN Org Des Episodes Weighted Duration TS Min TF Max
Excl
---------------------------------------------------------------------------1 0 0
142
142.00
128.18
0.00
428.00
1 0 1
458
458.00
49.30
0.00
350.00
Sum
600
600.00
Number of episodes: 600
ple=ehc5.ple
Product-limit estimation. Current memory: 367814 bytes.
Sorting episodes according to ending times.
Product-limit estimation.
1 table(s) written to: ehc5.ple
---------------------------------------------------------------------------Current memory: 311232 bytes. Max memory used: 387781 bytes.
End of program. Mon Mar 27 23:49:40 2000
Product-limit estimate of survival function (Kaplan-Meier): output
# SN 1. Transition: 0,1 - Product-Limit Estimation ehc5.ple
#
# ID Index
0
0
0
1
0
2
0
3
0
4
0
5
0
6
0
7
0
8
0
9
0
10
0
11
0
12
0
13
0
14
0
15
0
16
0
17
0
18
0
19
Time
0.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
10.00
11.00
12.00
13.00
14.00
15.00
16.00
17.00
18.00
19.00
Number
Events
0
2
5
9
3
10
9
6
7
8
4
24
8
10
6
4
9
6
8
20.00
9
Number
Censored
0
0
1
2
0
1
0
1
3
1
4
0
1
3
1
0
0
0
0
1
Exposed
to Risk
600
600
597
590
581
577
567
557
548
540
528
524
499
488
477
471
467
458
452
443
Survivor
Std.
Function Error
1.00000 0.00000
0.99667 0.00235
0.98832 0.00439
0.97324 0.00660
0.96822 0.00717
0.95144 0.00880
0.93634 0.00999
0.92625 0.01070
0.91442 0.01146
0.90087 0.01225
0.89405 0.01262
0.85310 0.01455
0.83942 0.01510
0.82222 0.01574
0.81188 0.01610
0.80498 0.01633
0.78947 0.01681
0.77913 0.01711
0.76534 0.01749
Cum.
Rate
0.00000
0.00334
0.01175
0.02712
0.03230
0.04978
0.06578
0.07661
0.08947
0.10439
0.11200
0.15888
0.17504
0.19575
0.20841
0.21694
0.23640
0.24958
0.26744
0.74979 0.01789 0.28797
Blossfeld and Rohwer, 1995, p. 69
Deaths in first years of life, Kerala. Source: NFHS, 1992-93
Table 1. Life table results for deaths during first four years of life, 1988-92 birth cohort, Kerala (S. Padmadas, dissertation)
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------Age
Number Number Number Number Prob of Prob of
Survival
Prob den Hazard Standard Standard Standard
interval
entering censored exposed
of
dying
surviving function
function
rate
error
error
error
in months
to risk
deaths
(x)
(lx)
(cx)
(lx’)
(dx)
(qx)
(px)
(sx)
(fx)
( x)
(sx)
(fx)
( x)
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------0-1
2026.0
8
2022.0
32
0.0158
0.9842
1.0000
0.0158
0.0160
0.0028
0.0028
0.0028
1-2
1986.0
33
1969.5
4
0.0020
0.9980
0.9842
0.0020
0.0020
0.0030
0.0010
0.0010
2-3
1949.0
46
1926.0
2
0.0010
0.9990
0.9822
0.0010
0.0010
0.0030
0.0007
0.0007
3-4
1901.0
33
1884.5
5
0.0027
0.9973
0.9812
0.0026
0.0027
0.0032
0.0012
0.0012
4-5
1863.0
37
1844.5
1
0.0005
0.9995
0.9786
0.0005
0.0005
0.0033
0.0005
0.0005
5-6
1825.0
32
1809.0
0
0.0000
1.0000
0.9780
0.0000
0.0000
0.0033
0.0000
0.0000
6-7
1793.0
28
1779.0
0
0.0000
1.0000
0.9780
0.0000
0.0000
0.0033
0.0000
0.0000
7-8
1765.0
32
1749.0
1
0.0006
0.9994
0.9780
0.0006
0.0006
0.0033
0.0006
0.0006
8-9
1732.0
42
1711.0
1
0.0006
0.9994
0.9775
0.0006
0.0006
0.0034
0.0006
0.0006
9-10
1689.0
35
1671.5
0
0.0000
1.0000
0.9769
0.0000
0.0000
0.0034
0.0000
0.0000
10-11
1654.0
32
1638.0
2
0.0012
0.9988
0.9769
0.0012
0.0012
0.0035
0.0008
0.0009
11-12
1620.0
28
1606.0
0
0.0000
1.0000
0.9757
0.0000
0.0000
0.0035
0.0000
0.0000
12-13
1592.0
30
1577.0
1
0.0006
0.9994
0.9757
0.0006
0.0006
0.0035
0.0006
0.0006
13-14
1561.0
32
1545.0
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
14-15
1529.0
25
1516.5
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
15-16
1504.0
32
1488.0
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
16-17
1472.0
39
1452.5
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
17-18
1433.0
35
1415.5
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
18-19
1398.0
34
1381.0
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
19-20
1364.0
35
1346.5
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
20-21
1329.0
31
1313.5
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
21-22
1298.0
34
1281.0
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
22-23
1264.0
36
1246.0
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
23-24
1228.0
32
1212.0
0
0.0000
1.0000
0.9751
0.0000
0.0000
0.0035
0.0000
0.0000
continued
Deaths in first years of life, Kerala. Source: NFHS, 1992-93
Age
Number Number Number Number Prob of Prob of
Survival
Prob den Hazard Standard Standard Standard
interval
entering censored exposed
of
dying
surviving function
function
rate
error
error
error
in months
to risk
deaths
(x)
(lx)
(cx)
(lx’)
(dx)
(qx)
(px)
(sx)
(fx)
( x)
(sx)
(fx)
( x)
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------24-25
1196.0
37
1177.5
4
0.0034
0.9966
0.9751
0.0033
0.0034
0.0039
0.0017
0.0017
25-26
1155.0
27
1141.5
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
26-27
1128.0
26
1115.0
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
27-28
1102.0
30
1087.0
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
28-29
1072.0
34
1055.0
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
29-30
1038.0
30
1023.0
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
30-31
1008.0
22
997.0
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
31-32
986.0
43
964.5
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
32-33
943.0
29
928.5
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
33-34
914.0
39
894.5
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
34-35
875.0
38
856.0
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
35-36
837.0
37
818.5
0
0.0000
1.0000
0.9718
0.0000
0.0000
0.0039
0.0000
0.0000
36-37
800.0
32
784.0
1
0.0013
0.9987
0.9718
0.0012
0.0013
0.0041
0.0012
0.0012
37-38
767.0
26
754.0
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
38-39
741.0
44
719.0
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
39-40
697.0
38
678.0
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
40-41
659.0
30
644.0
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
41-42
629.0
25
616.5
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
42-43
604.0
23
592.5
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
43-44
581.0
31
565.5
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
44-45
550.0
26
537.0
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
45-46
524.0
45
501.5
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
46-47
479.0
29
464.5
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
47-48
450.0
39
430.5
0
0.0000
1.0000
0.9705
0.0000
0.0000
0.0041
0.0000
0.0000
48+
411.0
411
205.5
0
0.0000
1.0000
-------------------
Table 7. Estimated life table functions for the first year (in months) of life by place of residence
for 1988-92 birth cohort, Kerala.
Interval in
Months
0
1
2
3
4
5
6
7
8
9
10
11
12
Survival function S(x)
All
Urban Rural
1.0000 1.0000 1.0000
0.9842 0.9942 0.9807
0.9822 0.9942 0.9780
0.9812 0.9942 0.9767
0.9786 0.9922 0.9739
0.9780 0.9922 0.9732
0.9780 0.9922 0.9732
0.9780 0.9922 0.9732
0.9775 0.9922 0.9724
0.9769 0.9922 0.9716
0.9769 0.9922 0.9716
0.9757 0.9898 0.9708
0.9757 0.9898 0.9708
Prob density function f(x)
All
Urban Rural
0.0159 0.0058 0.0193
0.0020 0.0000 0.0027
0.0010 0.0000 0.0014
0.0026 0.0020 0.0028
0.0005 0.0000 0.0007
0.0000 0.0000 0.0000
0.0000 0.0000 0.0000
0.0006 0.0000 0.0007
0.0006 0.0000 0.0008
0.0000 0.0000 0.0000
0.0012 0.0024 0.0008
0.0000 0.0000 0.0000
0.0006 0.0000 0.0008
Hazard rate
All
Urban
0.0160 0.0059
0.0020 0.0000
0.0010 0.0000
0.0027 0.0021
0.0005 0.0000
0.0000 0.0000
0.0000 0.0000
0.0006 0.0000
0.0006 0.0000
0.0000 0.0000
0.0012 0.0024
0.0000 0.0000
0.0006 0.0000
Note: Number of children entering the interval 0-1; All: 2026; Urban: 519; Rural: 1507
(x)
Rural
0.0195
0.0027
0.0014
0.0029
0.0007
0.0000
0.0000
0.0008
0.0008
0.0000
0.0008
0.0000
0.0008
Table 8. Estimated life table results for the interval between first and second birth (in months),
Kerala, January 1988-February 1993
Interval
in
months
Women
Women Number Prob.of Propn.
entering Women exposed of
second surviv
the intv censored to risk births births
ing
9-10
10-11
11-12
12-13
13-14
14-15
15-16
16-17
17-18
18-19
19-20
20-21
21-22
22-23
23-24
24-25
25-26
26-27
27-28
28-29
29-30
30-31
31-32
32-33
33-34
34-35
35-36
36-37
37-38
38-39
39-40
40-41
41-42
42-43
43-44
44-45
45-46
46-47
47-48
48+
1408
1389
1368
1350
1327
1296
1274
1249
1210
1175
1152
1113
1081
1046
1005
954
906
869
828
799
765
738
704
674
646
614
585
562
542
516
488
460
444
419
394
375
349
335
324
303
16
15
11
13
11
6
11
18
11
8
15
9
8
10
14
16
9
13
6
9
3
5
12
8
9
12
9
4
7
8
10
5
3
6
7
7
8
5
8
54
1400.0
1381.5
1362.5
1343.5
1321.5
1293.0
1268.5
1240.0
1204.5
1171.0
1144.5
1108.5
1077.0
1041.0
998.0
946.0
901.5
862.5
825.0
794.5
763.5
735.5
698.0
670.0
641.5
608.0
580.5
560.0
538.5
512.0
483.0
457.5
442.5
416.0
390.5
371.5
345.0
332.5
320.0
276.0
3
6
7
10
20
16
14
21
24
15
24
23
27
31
37
32
28
28
23
25
24
29
18
20
23
17
14
16
19
20
18
11
22
19
12
19
6
6
13
249
0.0021
0.0043
0.0051
0.0074
0.0151
0.0124
0.0110
0.0169
0.0199
0.0128
0.0210
0.0207
0.0251
0.0298
0.0371
0.0338
0.0311
0.0325
0.0279
0.0315
0.0314
0.0394
0.0258
0.0299
0.0359
0.0280
0.0241
0.0286
0.0353
0.0391
0.0373
0.0240
0.0497
0.0457
0.0307
0.0511
0.0174
0.0180
0.0406
---
0.9979
0.9957
0.9949
0.9926
0.9849
0.9876
0.9890
0.9831
0.9801
0.9872
0.9790
0.9793
0.9749
0.9702
0.9629
0.9662
0.9689
0.9675
0.9721
0.9685
0.9686
0.9606
0.9742
0.9701
0.9641
0.9720
0.9759
0.9714
0.9647
0.9609
0.9627
0.9760
0.9503
0.9543
0.9693
0.9489
0.9826
0.9820
0.9594
---
Cum.
propn
surv
1.0000
0.9979
0.9935
0.9884
0.9811
0.9662
0.9543
0.9437
0.9277
0.9093
0.8976
0.8788
0.8606
0.8390
0.8140
0.7838
0.7573
0.7338
0.7100
0.6902
0.6685
0.6474
0.6219
0.6059
0.5878
0.5667
0.5509
0.5376
0.5222
0.5038
0.4841
0.4661
0.4549
0.4323
0.4125
0.3998
0.3794
0.3728
0.3661
---
Prob.
density
function
Hazard
rate
0.0021
0.0043
0.0051
0.0074
0.0148
0.0120
0.0105
0.0160
0.0185
0.0116
0.0188
0.0182
0.0216
0.0250
0.0302
0.0265
0.0235
0.0238
0.0198
0.0217
0.0210
0.0255
0.0160
0.0181
0.0211
0.0158
0.0133
0.0154
0.0184
0.0197
0.0180
0.0112
0.0226
0.0197
0.0127
0.0204
0.0066
0.0067
0.0149
---
0.0021
0.0044
0.0052
0.0075
0.0152
0.0125
0.0111
0.0171
0.0201
0.0129
0.0212
0.0210
0.0254
0.0302
0.0378
0.0344
0.0315
0.0330
0.0283
0.0320
0.0319
0.0402
0.0261
0.0303
0.0365
0.0284
0.0244
0.0290
0.0359
0.0398
0.0380
0.0243
0.0510
0.0467
0.0312
0.0525
0.0175
0.0182
0.0415
---
The fetal life table
Goldhaber and Fireman (1991)
• 9564 pregnancies identified retrospectively from urine tests as well as
first prenatal care visits at three Kaiser Permanente clinics in San
Francisco Bay area during 10-month period in 1981-1982. Twin and
triplet pregnancies, pregnancies with less than 2 days follow-up, and
few other pregnancies were omitted => 9055 pregnancies. Of these,
103 withdrew during follow-up (pregnancy outcome not known), 6629
resulted in live births, 549 in spontaneous fetal loss (including 27
ectopic pregnancies), and 1774 induced abortion. 2-day lag was used
to avoid bias arising when women selectively report for medical care
because of threatened abortion. Many of these women miscarry within
2 days (selection!). Inclusion would overestimate the risk of abortion!
• Measurement issues: onset of pregnancy (date of last menstrual period)
and pregnancy outcome. 459 women entered observation in week 5 of
gestation (days 0-6 after last menstrual period = week 0; days 35-41 =
week 5; days 308-314 = week 44)
Fetal life table by gestational weeks (LMP), 1981-1982, California, USA
Gestational Foetuses entering
week s (LMP)
during week
5
459
6
1313
7
1249
8
922
9
792
10
725
11
638
12
517
13
458
14
319
15
246
16
187
17
153
18
102
19
116
20
102
21
78
22
87
23
58
24
53
25
44
26
58
27
38
28
39
29
34
30
38
31
32
32
39
33
34
34
23
35
29
36
27
37
18
38
11
39
10
40
4
41
2
42
1
43
0
44
0
Total
9055
Spontaneous
foetal loss
1
6
16
34
44
62
60
68
42
27
26
24
12
11
16
10
8
7
1
2
8
3
2
4
0
4
2
3
3
1
2
0
2
1
3
5
2
0
0
0
522
Ectopic
Induced
pregnancy abortion
1
3
1
24
4
152
4
359
5
367
3
284
4
244
1
136
0
71
1
43
0
29
0
24
1
8
2
11
0
9
0
6
0
1
0
2
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
27
1774
Live
birth
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
0
1
3
2
3
3
2
5
7
7
14
19
34
43
75
162
304
727
1452
2101
1026
477
134
26
6629
Withdrawal Number of pregnancies still
in progress at end
0
454
0
1736
0
2813
0
3338
2
3712
0
4088
4
4414
1
4725
5
5065
5
5308
3
5496
6
5629
7
5754
5
5827
5
5912
2
5995
6
6058
3
6132
6
6180
3
6225
3
6255
2
6305
6
6333
3
6360
3
6384
3
6408
2
6422
4
6435
3
6429
3
6405
0
6357
2
6220
2
5930
2
5211
0
3766
2
1662
0
636
0
160
0
26
0
0
103
Risk set
459.0
1767.0
2985.0
3735.0
4129.0
4437.0
4724.0
4930.5
5180.5
5381.5
5552.5
5680.0
5778.5
5853.5
5940.5
6013.0
6070.0
6143.5
6187.0
6231.5
6267.5
6312.0
6340.0
6370.5
6392.5
6420.5
6439.0
6459.0
6467.5
6450.5
6434.0
6383.0
6237.0
5940.0
5221.0
3769.0
1664.0
637.0
160.0
26.0
Risk set = foetuses at risk at the beginning of the week minus half of withdrawals during week
Goldhaber, M.K. and B.H. Fireman (1991) The fetal life table revisited: spontaneous abortion rates in three Kaiser Permanente cohorts.
Epidemiology, 2:33-39
Fetal life table by gestational weeks (LMP), 1981-1982, California, USA
Risk set
459.0
1767.0
2985.0
3735.0
4129.0
4437.0
4724.0
4930.5
5180.5
5381.5
5552.5
5680.0
5778.5
5853.5
5940.5
6013.0
6070.0
6143.5
6187.0
6231.5
6267.5
6312.0
6340.0
6370.5
6392.5
6420.5
6439.0
6459.0
6467.5
6450.5
6434.0
6383.0
6237.0
5940.0
5221.0
3769.0
1664.0
637.0
160.0
26.0
prob of
spont. death
0.004
0.004
0.007
0.010
0.012
0.015
0.014
0.014
0.008
0.005
0.005
0.004
0.002
0.002
0.003
0.002
0.001
0.001
0.000
0.000
0.001
0.000
0.000
0.001
0.000
0.001
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.001
0.001
0.001
0.000
0.000
0.000
prob of
induced abortion
0.007
0.014
0.051
0.096
0.089
0.064
0.052
0.028
0.014
0.008
0.005
0.004
0.001
0.002
0.002
0.001
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
prob of
live birth
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.000
0.001
0.001
0.001
0.002
0.003
0.005
0.007
0.012
0.025
0.049
0.122
0.278
0.557
0.617
0.749
0.838
1.000
prob of
survival
0.989
0.982
0.942
0.894
0.899
0.921
0.934
0.958
0.978
0.986
0.990
0.991
0.996
0.995
0.995
0.997
0.998
0.998
0.999
0.999
0.998
0.999
0.999
0.998
0.999
0.998
0.997
0.996
0.994
0.993
0.988
0.974
0.951
0.877
0.721
0.441
0.382
0.251
0.163
0.000
sum of
probs
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
0.999
0.999
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
1.000
Application and confusion (discussion)
Miller and Homan (1994) “Determining transition probabilities: confusion
and suggestions”, Medical Decision Making, 14(1):??? (based on Kleinbaum
et al.). Terminology used in this paper is confusing (and wrong!)
a. Distinguish between rates and risk
Rate (incidence rate): occurrences (incidences; new cases) over
exposure. Exposure is measured by ‘summing each subject’s time exposed to the
possibility of transiting’ (includes censored cases).
. Instantaneous incidence rate (‘also known as the hazard function’)
. Average incidence rate (also known as the ‘incidence density’ [ID])
Density  rate
Application and confusion (discussion)
Risk: Risk used to denote probability.
Three methods for estimating risk:
1. Simple cumulative method: new cases / number of disease-free
individuals at beginning of interval (no censoring or withdrawal): I/N0
where I is number of new cases and N0 is the number of disease-free individuals
at t=0.
2. Actuarial (life-table) method: new cases / number of disease-free
individuals at beginning of interval minus half of the number of
withdrawals: I/[N0-W/2] where W is number of withdrawals
= risk set
3. Density method: uses age-specific incidence densities (e.g. rates) to
estimate the risk for given age or time interval:
P(0,t) = 1 - exp[-ID*t]
where ID is the average rate and t is the elapsed time. Rates are translated into
probabilities.
Download