Fitting Marginal Structural Models - PowerPoint PPT Presentation

About This Presentation

Title:

Fitting Marginal Structural Models

Description:

Robins JM, Hernan MA, Brumback B. Marginal structural models and causal ... Bang H, Robins JM (2005). Doubly Robust Estimation in Missing Data and Causal ... – PowerPoint PPT presentation

Number of Views:236

Avg rating:3.0/5.0

Slides: 44

Provided by: eleanorpul

Category:

more less

Transcript and Presenter's Notes

Title: Fitting Marginal Structural Models

1
Fitting Marginal Structural Models

Eleanor M Pullenayegum
Asst Professor
Dept of Clin. Epi Biostatistics
pullena_at_mcmaster.ca

2
Outline

Causality and observational data
Inverse-Probability weighting and MSMs
Fitting an MSM
Goodness-of-fit
Assumptions/ Interpretation

3
Causality in Medical Research

Often want to establish a causal association
between a treatment/exposure and an event
Difficult to do with observational data due to
confounding
Gold-standard for causal inferences is the
randomized trial
Randomize half the patients to receive the
treatment/exposure, and half to receive usual
care
Deals with measured and unmeasured confounders

4
Randomized trials are not always possible

Sometimes, they are unethical
cannot do a randomized trial on the effects of
second-hand smoke on lung cancer
or a randomized trial of the effects of living
near power stations
Sometimes, they are not feasible
Study of a rare disease (funding is an issue)

5
Observational Studies

Observe rather than experiment (or interfere!)
Recruit some people who are exposed to
second-hand smoke and some who are not
Study communities living close to power lines vs.
those who dont
Confounding is a major concern
For 1st example, are workplace environment, home
environment, age, gender, income similar between
exposed and unexposed?
For 2nd example, are education, family income,
air pollution similar between cases and controls?

6
Handling Confounding

Match exposed and unexposed on key confounders
e.g. for every family living close to a power
station, attempt to find a control family living
in a similar neighbourhood with a similar income
Adjust for confounders
for the smoking example, adjust for age, gender,
level of education, income, type of work, family
history of cancer etc.
Cannot deal with unmeasured confounders

7
Causal Pathways

There are some things we cannot adjust for
When studying the effect of a lipid-lowering drug
on heart disease, we cant adjust for
LDL-cholesterol level

8
Causal Pathways
Drug
LDL-cholesterol
Heart Disease
LDL-cholesterol mediates the effect of the
drug Cannot adjust for variables that are on the
causal pathway between exposure and outcome.
9
Motivating Example

Juvenile Dermatomyositis (JDM) is a rare but
serious skin/muscle disease in children
Standard treatment is with steroids (Prednisone),
however these have unpleasant side-effects
Intravenous immunoglobulin (IVIg) is a possible
alternative treatment
DAS measures disease activity

10
JDM Dataset

81 kids, 7 on IVIg at baseline, 23 on IVIG later
Outcome is time to quiescence
Quiescence happens when DAS0
IVIg tends to be given when the child is doing
particularly badly (high DAS)
DAS is a counfounder

11
Causal Pathway for JDM study

DASt
DASt1

IVIgt
IVIgt1
Time-to-Quiescence

DAS confounds IVIg and outcome
DAS is on the causal pathway

12
A Thought Experiment

Suppose that at each time t, we could create an
identical copy of each child i.
Then if the real child received IVIG, we would
give the copy control and vice versa
We could then compare the child to its copy
Solves confounding by matching the child is
matched with the copy
If treatment varies on a monthly basis and we
follow for 5 years, we would have 260-1 copies

13
Counterfactuals

Clearly, this is impossible.
But we can use the idea
Define the counterfactuals for child i to be the
outcomes for each of the 260-1 imaginary copies
Idea treat the counterfactuals as missing data

14
Inverse-Probability Weighting

Inverse-Probability Weighting (IPW) is a way of
re-weighting the dataset to account for selective
observation
E.g. if we have missing data, then we weight the
observed data by the inverse of the probability
of being observed
Why does this work?
Suppose we have a response Yij, treatment
indicator xij and Rij1 if Yij observed, 0 o/w

15
Inverse-Probability Weighting

Suppose we want to fit the marginal model
Usually, we solve the GEE equation
If we use just the observed data, we solve
LHS does not have mean 0

16
Inverse-Probability Weighting

If we replace D by with
pij, the conditional probability of observing
Yij, then
What to condition on?
Must condition on Yij
If MAR, then conditionally independent given
previous Y

17
Marginal Structural Models

MSMs use inverse-probability weighting to deal
with the unobserved (missing) counterfactuals
We cannot adjust for confounders
but using IPW, can re-weight the dataset so that
treatment and covariates are unconfounded
i.e. mean covariate levels are the sample between
treated and untreated patients
So can do a simple marginal analysis

18
Probability-of-Treatment Model

Weighting is based on the Probability-of-Treatment
model
Treatment is longitudinal
For each child at each time, need probability of
receiving the observed treatment trajectory
Probability is conditional on past responses and
confounders
Assume independent of current response

19
JDM Example

Probability of being on IVIg at baseline
(logistic regression)
Probability of transitioning onto IVIg (Cox PH)
Probability of transitioning off IVIg (Cox PH)
Suppose a child initiates IVIG at 8 months and is
still on IVIG at 12 months.
What is the probability of the observed treatment
pattern?

20
Trratment probability
P(transition at month 8)
P(no transition before month 8)
P(no transition off before month 12)
P(not on IVIg at baseline)
0
8
12
No IVIg
Initiate IVIg
Still on IVIG
21
Model Fitting

First identified covariates univariately
Then entered those that were sig. into model and
refined (by removing those that were no longer
sig.)
IVIg at baseline Functional status (any vs.
none) OR 11.6, 95 CI 1.94 to 69.7 abnormal
swallow/voice OR 6.28, 95 CI 0.983 to 4.02.
IVIg termination no covariates

22
Assessing goodness-of-fit

If the IPT weights are correct, in the
re-weighted population, treatment and covariates
are unconfounded
This property is
crucial
testable
so we should test it!

23
Goodness-of-fit in the JDM study

Biggest concern is that kids are doing badly when
they start IVIg
If inverse-probability weights are correct, then
at each time t, amongst patients previously
IVIg-naïve, IVIg is not associated with
covariates.
Will look at differences in mean covariate values
by current IVIg status amongst patients
previously IVIg-naïve
Data are longitudinal, so use a GEE analysis,
adjusting for time

24
Model 1 HRs for Treatment Initiation
Hazard Ratios and 95 confidence intervals for
initiating treatment
25
(No Transcript)
26
Model 2 -Revised Treatment initiation
Hazard Ratios and 95 confidence intervals for
initiating treatment
27
New goodness-of-fit
28
Back to basics

Some patients start IVIg because they are
steroid-resistant (early-starters)
Others start because they are steroid-dependent
(late-starters)
Repeat model-fitting process separately for early
and late starters

29
(No Transcript)
30
Refined two-stage model
31
(No Transcript)
32
Efficacy Results
33
Other concerns with MSMs

Format of treatment effect (e.g. constant over
time, PH etc.)
Unmeasured counfounders
Lack of efficiency
Experimental Treatment Assignment

34
Efficiency

IPW reduces bias but also reduces efficiency
The further the weights are from 1, the worse the
efficiency
Can stabilise the weights
Estimating equations will still be zero-mean if
we multiply Dijj by a factor depending on j and
treatment
In JDM study, we used
DijjRijP(Rx history)/P(Rx historyconfounders)

35
Efficiency other techniques

Doubly robust methods (Bang Robins)
Could have used a more information-rich outcome
Did a secondary analysis using DAS as the outcome
got far more precise (and more positive) results

36
Experimental Treatment Assignment

In order for MSMs to work, there must be some
experimentality in the way treatment is assigned
Intuitively, if we can predict perfectly who will
get what treatment, then we have complete
confounding
Mathematically, if pij is 0 then were in
trouble!
Actually, we get into trouble if pij 0 or 1

37
Testing the ETA simple checks

At each time j, review the distribution of
covariates amongst those who are on treatment vs.
those who are not.
Review the distribution of the weights
check p bounded away from 0/1
In the JDM example, also check distn of
transition probabilities

38
Testing the ETA more advanced methods

Bootstrapping
Wang Y, Petersen ML, Bangsberg D, van der Laan
MJ. Diagnosing bias in the inverse probability of
treatment weighted estimator resulting from
violation of experimental treatment assignment.
UC Berkeley Division of Biostatistics working
paper series, 2006.

39
Implementing MSMs

For time-to-event outcome, can do weighted PH
regression in R
Used the svycoxph function from the survey
package
For continuous (or binary) outcome, use weighted
GEE
Used proc genmod in SAS with scgwt
Weighted GEEs are not straightforward in R
STATA could probably handle either type of outcome

40
MSMs - potentials

Often good observational databases exist
Should do what we can with them before using
large amounts of money to do trials
Can deal with a time-varying treatment
Conceptually fairly straightforward
Do not have to model correlation structure in
responses

41
MSMs - limitations

There may always be unmeasured confounders
Relies heavily on probability-of-treatment model
being correct
Experimental ETA violations can often occur
(particularly with small sample sizes)
Somewhat inefficient
Doubly robust methods may help
Not a replacement for an RCT

42
Key points

MSMs can help to establish causal associations
from observational data
Make some strong assumptions
Need goodness-of-fit for measured confounders
Will never find the right model
Aim to find good models

43
References

Robins JM, Hernan MA, Brumback B. Marginal
structural models and causal inference in
epidemiology. Epidemiology 2000 11 550-560.
Bang H, Robins JM (2005). Doubly Robust
Estimation in Missing Data and Causal Inference
Models. Biometrics 61 (4), 962973.
Pullenayegum EM, Lam C, Manlhiot C, Feldman BM.
Fitting Marginal Structural Models Estimating
covariate-treatment associations in the
re-weighted dataset can guide model fitting.
Journal of Clinical Epidemiology.
Wang Y, Petersen ML, Bangsberg D, van der Laan
MJ. Diagnosing bias in the inverse probability of
treatment weighted estimator resulting from
violation of experimental treatment assignment.
UC Berkeley Division of Biostatistics working
paper series, 2006.