Title: WHY%20BAYES?%20INNOVATIONS%20IN%20CLINICAL%20TRIAL%20DESIGN%20
1WHY BAYES?INNOVATIONS IN CLINICAL TRIAL DESIGN
ANALYSIS
- Donald A. Berry
- dberry_at_mdanderson.org
2Conclusion These data add to the growing
evidence that supports the regular use of aspirin
and other NSAIDs as effective chemopreventive
agents for breast cancer.
3Results Ever use of aspirin or other NSAIDs
was reported in 301 cases (20.9) and 345
controls (24.3) (odds ratio 0.80, 95 CI
0.66-0.97).
4Bayesian analysis?
- Naïve Bayesian analysis of Results is wrong
- Gives Bayesians a bad name
- Any naïve frequentist analysis is also wrong
5What is Bayesian analysis?
- Bayes' theorem
- ?'( q? X ) ? ?(q) f( X q )
- Assess prior ? (subjective, include available
evidence) - Construct model f for data
6Implication The Likelihood Principle
- Where X is observed data, the likelihood
function - LX(?) f( X ? )
- contains all the information in an experiment
relevant for inferences about ?
7- Short version of LP Take data at face value
- Data
- Among cases 301/1442
- Among controls 345/1420
- But Data is deceptive
- These are not the full data
8The data
- Methods
- Population-based case-control study of breast
cancer - Study design published previously
- Aspirin/NSAIDs? (2.25-hr ?naire)
- Includes superficial data
- Among cases 301/1442
- Among controls 345/1420
- Other studies ( fact published!!)
9Silent multiplicities
- Are the most difficult problems in statistical
inference - Can render what we do irrelevant
- and wrong!
?
10Which city is furthest north?
- Portland, OR
- Portland, ME
- Milan, Italy
- Vladivostok, Russia
11Beating a dead horse . . .
- Piattelli-Palmarini (inevitable illusions) asks
I have just tossed a coin 7 times. Which did I
get? - 1 THHTHTT
- 2 TTTTTTT
- Most people say 1. But the probabilities are
totally even - Most people are right hes totally wrong!
- Data He presented us with 1 2!
- Piattelli-Palmarini (inevitable illusions) asks
I have just tossed a coin 7 times. Which did I
get? - 1 THHTHTT
- 2 TTTTTTT
- Most people say 1. But the probabilities are
totally even - Most people are right hes totally wrong!
- Data He presented us with 1 2!
12THHTHTT or TTTTTTT?
- LR Bayes factor of 1 over 2
- P(Wrote 12 Got 1)
- P(Wrote 12 Got 2)
- LR gt 1 ? P(Got 1 Wrote 12) gt 1/2
- Eg LR (1/2)/(1/42) 21 ?
- P(Got 1 Wrote 12) 21/22 95
- Probs totally even if a coin was used to
generate the alternative sequence
13Marker/dose interaction Marker negative
Marker positive
14Proportional hazards model
- Variable Comp RelRisk P
- PosNodes 10/1 2.7 lt0.001
- MenoStatus pre/post 1.5 0.05
- TumorSize T2/T1 2.6 lt0.001
- Dose NS
- Marker 50/0 4.0 lt0.001
- MarkerxDose lt0.001
This analysis is wrong!
15Data at face value?
- How identified?
- Why am I showing you these results?
- What am I not showing you?
- What related studies show?
16Solutions?
- Short answer I dont know!
- A solution
- Supervise experiment yourself
- Become an expert on substance
- Partial solution
- Supervise supervisors
- Learn as much substance as you can
- Danger You risk projecting yourself as uniquely
scientific
17A consequence
- Statisticians come to believe
- NOTHING!!
18OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
19Bayes in pharma and FDA
20http//www.cfsan.fda.gov/frf/bayesdl.html
http//www.prous.com/bayesian2004/
21(No Transcript)
22(No Transcript)
23BAYES AND PREDICTIVE PROBABILITY
- Critical component of experimental design
- In monitoring trials
24Example calculation
- Data 13 A's and 4 B's
- Likelihood ? p13 (1p)4
25Posterior density of p for uniform prior
Beta(14,5)
26Laplaces rule of succession
P(A wins next pair data) EP(A wins next pair
data, p) E(p data) mean of Beta(14, 5)
14/19
27Updating w/next observation
28Suppose 17 more observations
- P(A wins x of 17 data)
- EP(A wins x data, p)
- E px(1p)17x data, p
( )
17 x
?
29Best fitting binomial vs. predictive probabilities
Binomial, p14/19
Predictive, p beta(14,5)
30Comparison of predictive with posterior
31Example Baxters DCLHb predictive probabilities
- Diaspirin Cross-Linked Hemoglobin
- Blood substitute emergency trauma
- Randomized controlled trial (1996)
- Treatment DCLHb
- Control saline
- N 850 ( 2x425)
- Endpoint death
32- Waiver of informed consent
- Data Monitoring Committee
- First DMC meeting
- DCLHb Saline
- Dead 21 (43) 8 (20)
- Alive 28 33
- Total 49 41
- P-value? No formal interim analysis
33Predictive probability of future results (after n
850)
- Probability of significant survival benefit for
DCLHb after 850 patients 0.00045 - DMC paused trial Covariates?
- No imbalance
- DMC stopped trial
34OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
35BAYES AS A FREQUENTIST TOOL
- Design a Bayesian trial
- Check operating characteristics
- Adjust design to get ? 0.05
- ? frequentist design
- Thats fine!
- We have 50 such trials at MDACC
36OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
37ADAPTIVE DESIGN
- Look at accumulating data without blushing
- Update probabilities
- Find predictive probabilities
- Modify future course of trial
- Give details in protocol
- Simulate to find operating characteristics
38OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
39Giles, et al JCO (2003)
- Troxacitabine (T) in acute myeloid leukemia (AML)
when combined with cytarabine (A) or idarubicin
(I) - Adaptive randomization to IA vs TA vs TI
- Max n 75
- End point CR (time to CR lt 50 days)
40Randomization
- Adaptive
- Assign 1/3 to IA (standard) throughout (unless
only 2 arms) - Adaptive to TA and TI based on current results
- Final results ?
41(No Transcript)
42Drop TI
Compare n 75
43Summary of results
- CR rates
- IA 10/18 56
- TA 3/11 27
- TI 0/5 0
- Criticisms . . .
44OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
45Example Adaptive allocation of therapies
- Design for phase II Many drugs
- Advanced breast cancer (MDA) endpoint is tumor
response - Goals
- Treat effectively
- Learn quickly
46Comparison Standard designs
- One drug (or dose) at a time no drug/dose
comparisons - Typical comparison by null hypothesis response
rate 20 - Progress is slow!
47Standard designs
- One stage, 14 patients
- If 0 responses then stop
- If 1 response then phase III
- Two stages, first stage 20 patients
- If 4 or 9 responses then stop
- Else second set of 20 patients
48An adaptive allocation
- When assigning next patient, find r P(rate
20data) for each drug - Or, r P(drug is bestdata)
- Assign drugs in proportion to r
- Add drugs as become available
- Drop drugs that have small r
- Drugs with large r ? phase III
49Suppose 10 drugs, 200 patients
- 9 drugs have mix of response rates 20 40, 1
(nugget) has 60 - Standard 2-stage design finds nugget with
probability lt 70 (After 110 patients on average) - Adaptive design finds nugget with probability gt
99 (After about 50 patients on average) - Adaptive also better at finding 40
50Suppose 100 drugs, 2000 patients
- 99 drugs have mix of response rates 20 40, 1
(nugget) has 60 - Standard 2-stage design finds nugget with
probability lt 70 (After 1100 patients on
average) - Adaptive design finds nugget with probability gt
99 (After about 500 patients on average) - Adaptive also better at finding 40
51Consequences
- Recall goals
- (1) Treat effectively
- (2) Learn quickly
- Attractive to patients, in and out of the trial
- Better drugs identified faster move through
faster
52OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
53Example Seamless phase II/III
- Drug vs placebo, randomized
- Local control (or biomarker, etc) early endpoint
related to survival? - May depend on treatment
Inoue et al (2002 Biometrics)
54Conventional drug development
Survivaladvantage
Market
Local control
No survivaladvantage
Not
No local control
Stop
Phase III
Phase II
gt 2 yrs
6 mos
9-12 mos
Seamless phase II/III
lt 2 yrs (usually)
55Seamless phases
- Phase II Two centers 10 pts/mo. drug vs
placebo. If predictive probabilities look good,
expand to - Phase III Many centers 40 pts/mo.(Initial
centers accrue during set-up) - Max sample size 900
- Single trial survival data from both phases
combined in final analysis
56Early stopping
- Use predictive probs of stat. signif.
- Frequent analyses (total of 18) using predictive
probabilities - To switch to Phase III
- To stop accrual
- For futility
- For efficacy
- To submit NDA
57Comparisons
- Conventional Phase III designs Conv4 Conv18,
max N 900 - (same power as adaptive design)
58Expected N under H0
59Expected N under H1
60Benefits
- Duration of drug development is greatly shortened
under adaptive design - Fewer patients in trial
- No hiatus for setting up phase III
- Use all patients to assess phase III endpoint and
relationship between local control and survival
61Possibility of large N
- N seldom near 900
- When it is, its necessary!
- This possibility gives Bayesian design its edge
- Other reason for edge is modeling local
control/survival
62OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
63Berry, et al. Case Studies in Bayesian
Statistics 2001
64Example Stroke and adaptive dose-response
- Adaptive doses in Phase II setting learn
efficiently and rapidly about dose-response
relationship - Pfizer trial of a neutrofil inhibitory factor
results recently announced - Endpoint stroke scale at week 13
- Early endpoints weekly stroke scale
65Standard Parallel Group Design
Equal sample sizes at each of k doses.
Doses
66True dose-response curve (unknown)
Response
Doses
67Observe responses (with error) at chosen doses
Response
Doses
68Dose at which 95 max effect
Response
True ED95
Doses
69Uncertainty about ED95
Response
True ED95
?
Dose
70Uncertainty about ED95
Response
?
Dose
71Solution Increase number of doses
ED95
Response
Doses
72But, enormous sample size, and . . . wasted dose
assignmentsalways!
ED95
Response
Doses
73Our adaptive approach
- Observe data continuously
- Select next dose to maximize information about
ED95, given available evidence - Stop dose-ranging trial when know ED95 response
at ED95 sufficiently well
74Our approach (contd)
- Info accrues gradually about each patient
prediction using longitudinal model
Longitudinal Model Copenhagen Stroke Database
Difference from baseline in SSS week 12
-40
-30
-20
-10
0
10
20
30
40
50
Difference from baseline in SSS week 3
75Our approach (contd)
- Model dose-response (borrow strength from
neighboring doses) - Many doses (logistical issues)
76Possible decisions each day
- Stop trial and drugs development
- Stop and set up confirmatory trial
- Continue dose-finding (what dose?)
- Size of confirmatory trial based on info from
dose-ranging phase - Choices by decision analysis (Human safeguard
DSMB)
77Dose-response trial
- Learn efficiently and rapidly about
dose-response if go to Phase III - Assign dose to maximize info about dose-response
parameters given current info - Use predictive probabilities, based on early
endpoints - Doses in continuum, or preset grid
78Dose-response trial (contd)
- Learn about SD on-line
- Halt dose-ranging when know dose sufficiently
well - Seamless switch from dose-ranging to confirmatory
trial2 trials in 1!
79Advantages over standard design
- Fewer patients (generally) faster more
effective learning - Better at finding ED95
- Tends to treat patients in trial more effectively
- Drops duds early
actual trial!
80Dose-assignment simulation
- Assumes particular dose-response curve
- Assumes SD 12
- Shows weekly results, several patients at a time
(green circles)
81Prior
82(No Transcript)
83(No Transcript)
84(No Transcript)
85(No Transcript)
86(No Transcript)
87(No Transcript)
88(No Transcript)
89(No Transcript)
90(No Transcript)
91(No Transcript)
92(No Transcript)
93(No Transcript)
94(No Transcript)
95(No Transcript)
96(No Transcript)
97(No Transcript)
98(No Transcript)
99(No Transcript)
100(No Transcript)
101(No Transcript)
102(No Transcript)
103(No Transcript)
104(No Transcript)
105(No Transcript)
106(No Transcript)
107(No Transcript)
108(No Transcript)
109(No Transcript)
110(No Transcript)
111(No Transcript)
112(No Transcript)
113(No Transcript)
114(No Transcript)
115(No Transcript)
116(No Transcript)
117(No Transcript)
118(No Transcript)
119(No Transcript)
120(No Transcript)
121(No Transcript)
122(No Transcript)
123(No Transcript)
124(No Transcript)
125(No Transcript)
126(No Transcript)
127(No Transcript)
128(No Transcript)
129(No Transcript)
130(No Transcript)
131(No Transcript)
132(No Transcript)
133Estimated ED95
Confirmatory
1340.0 0.5
1.0 1.5
DOSE
DOSE
135(No Transcript)
136(No Transcript)
137(no dose effect)
138(No Transcript)
139Consequences of Using Bayesian Adaptive Approach
- Fundamental change in the way we do medical
research - More rapid progress
- Well get the dose right!
- Better treatment of patients
- . . . at less cost
140Reactions
- FDA Positive. Makes coming to work worthwhile.
In five years all trials may be seamless. - Pfizer management Enthusiastic
- Other companies Cautious
141OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
142Example Extraim analysis
- Endpoint CR (detect 0.42 vs 0.32)
- 80 power N 800
- Two extraim analyses, one at 800
- Another after up to 300 added pts
- Maximum n 1400 (only rarely)
- Accrual 70/month
- Delay in assessing response
143- After 800 patients, have response info on 450
- Find predictive probability of stat significance
when full info on 800 - Also when full info on 1400
- Continue if . . .
- Stop if . . .
- If continue, n via predictive power
- Repeat at second extraim analysis
144vs 0.80
145OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis
146Decision-analytic approach
- For each trial design
- List possible results
- Calculate their predictive probabilities
- Evaluate their utilities
- Average utilities by probabilities to give
utility of trial with that design - Compare utilities of various designs
- Choose design with high utility
147Choosing sample size
- Special case of above
- One utility Effective overall treatment of
patients, both those - after the trial
- in the trial
- Example, dichotomous endpointMaximize expected
number of successes over all patients
Cheng et al (2003 Biometrika)
148Compare Joffe/Weeks JNCI Dec 18, 2002
- Many respondents viewed the main societal
purpose of clinical trials as benefiting the
participants rather than as creating
generalizable knowledge to advance future
therapy. This view, which was more prevalent
among specialists such as pediatric oncologists
that enrolled greater proportions of patients in
trials, conflicts with established principles of
research ethics.
149Maximize effective treatment overall
- What is overall?
- All patients who will be treated with therapies
assessed in trial - Call it N, patient horizon
- Enough to know mean of N
- Enough to know magnitude of N100? 1000?
1,000,000?
150- Goal maximize expected number of successes in N
- Either one- or two-armed trial
- Suppose n 1000 is right for N 1,000,000
- Then for other Ns use n
151Optimal allocations in a two-armed trial
152Knowledge about success rate r
153OUTLINE
- Silent multiplicities
- Bayes and predictive probabilities
- Bayes as a frequentist tool
- Adaptive designs
- Adaptive randomization
- Investigating many phase II drugs
- Seamless Phase II/III trial
- Adaptive dose-response
- Extraim analysis
- Trial design as decision analysis