Title: Survival Analysis: From Square One to Square Two
1Survival Analysis From Square One to Square Two
- Yin Bun Cheung, Ph.D.
- Paul Yip, Ph.D.
Readings
2Lecture structure
- Basic concepts
- Kaplan-Meier analysis
- Cox regression
- Computer practice
3Whats in a name?
- time-to-event data
- failure-time data
- censored data
- (unobserved outcome)
4Types of censoring
- loss to follow-up during the study period
- study closure
5Examples of survival analysis
- 1. Marital status mortality
- 2. Medical treatments tumor recurrence
mortality in cancer patients - 3. Size at birth developmental milestones in
infants
6Why survival analysis ?
- Censoring (time of event not observed)
- Unequal follow-up time
7What is time?What is the origin of time?
- In epidemiology
- Age (birth as time 0) ?
- Calendar time since a baseline survey ?
8What is the origin of time?
- In clinical trials
- Since randomisation ?
- Since treatment begins ?
- Since onset of exposure ?
9The choice of origin of time
- Onset of continuous exposure
- Randomisation to treatment
- Strongest effect on the hazard
10Types of survival analysis
1. Non-parametric method Kaplan-Meier
analysis 2. Semi-parametric method Cox
regression 3. Parametric method
11Square 1 to square 2
- This lecture focuses on two commonly used methods
- Kaplan-Meier method
- Cox regression model
12KM survival curve
ddeath, ccensored, survsurvival
13KM survival curve
14No. of expected deaths
- Expected death in group A at time i, assuming
equality in survival - EAi no. at risk in group A i ? death i
- total no. at risk i
- Total expected death in group A EA ?
EAi
15Log rank test
- A comparison of the number of expected and
observed deaths. - The larger the discrepancy, the less plausible
the null hypothesis of equality.
16An approximation
- The log rank test statistic is often approximated
by - X2 (OA-EA)2/EA (OB-EB)2/EB,
- where OA EA are the observed expected number
of deaths in group A, etc.
17Proportional hazard assumption
Log rank test preferred (PH true )
Breslow test preferred (non-PH)
18Risk, conditional risk, hazard
19Another look of PH
Hazard
Hazard
0
5
10
15
20
0
5
10
15
20
Time
Time
Log rank test preferred (PH true )
Breslow test preferred (non-PH)
20Cox regression model
- Handles ?1 exposure variables.
- Covariate effects given as Hazard Ratios.
- Semi-parametric only assumes proportional hazard.
21Cox model in the case of a single variable
- . hi(t) hB(t) ? exp(BXi)
- . hj(t) hB(t) ? exp(BXj)
- . hi(t)/hj(t) expB(Xi-Xj)
- exp(B) is a Hazard Ratio
22Test of proportional hazard assumption
- Scaled Schoenfeld residuals
- Grambsch-Therneau test
- Test for treatment?period interaction
- Example mortality of widows
23Computer practice
- A clinical trial of
- stage I bladder tumor
- Thiotepa vs Control
- Data from StatLib
24Data structure
- Two most important variables
- Time to recurrence (gt0)
- Indicator of failure/censoring
- (0censored 1recurrence)
- (coding depends on software)
25KM estimates
Thiotepa
Control
26Log rank test
chi2(1) 1.52 Prgtchi2 0.22
27Cox regression models
28Test of PH assumption
- Grambsch-Therneau test
- for PH in model II
- Thiotepa P0.55
- Number of tumor P0.60
29Major References (Examples)
- Ex 1. Cheung. Int J Epidemiol 20002993-99.
- Ex 2. Sauerbrei et al. J Clin Oncol
20001894-101. - Ex 3. Cheung et al. Int J Epidemiol 20013066-74.
30Major References (General)
- Allison. Survival Analysis using the SAS System.
- Collett. Modelling Survival Data in Medical
Research. - Fisher, van Belle. Biostatistics A Methodology
for the Health Sciences.