Title: Algebraic%20Model
1I N N O V A T I O N L E C T U R E S
(I N N O l E C)
Binding and Kinetics for Experimental Biologists
Lecture 1 Numerical Models for Biomolecular
Interactions
Petr Kuzmic, Ph.D.BioKin, Ltd. WATERTOWN,
MASSACHUSETTS, U.S.A.
2Lecture outline
- The problemTraditional equations for fitting
biomolecular binding datarestrict the
experimental design. Typically, at least one
componentmust be present in very large excess. - The solutionAbandon algebraic equations
entirely. Use iterative numerical models,which
can be derived automatically by the computer. - An implementationSoftware DynaFit.
- An exampleKinetics of forked DNA binding to
the protein-protein complex formedby
DNA-polymerase sliding clamp (gp45) and clamp
loader (gp44/62).
3Traditional approach is based on algebraic models
A typical cookbook for experimental biologists
Cold Spring Harbor Laboratory Press Cold Spring
Harbor, NY, 2007 ISBN-10 0879697369
4Algebraic models restrict experiment design
Goodrich Kugel (2006) Binding and Kinetics for
Molecular Biologists
EXAMPLE Determine the association rate constant
for A B ? AB
5Experimental handbooks are full of restrictions
Goodrich Kugel (2007) Binding and Kinetics for
Molecular Biologists
p. 34
p. 79
p. 120
p. 126
etc.
6Numerical vs. algebraic mathematical models
FROM A VARIETY OF ALGEBRAIC EQUATIONS TO A
UNIFORM SYSTEM OF DIFFERENTIAL EQUATIONS
k1
EXAMPLE Determine the rate constant k1 and k-1
for A B AB
k-1
ALGEBRAIC EQUATIONS
DIFFERENTIAL EQUATIONS
dA/dt -k1AB k-1AB
dB/dt -k1AB k-1AB
dAB/dt k1AB k-1AB
7Advantages and disadvantages of numerical models
THERE IS NO SUCH THING AS A FREE LUNCH
ALGEBRAICMODEL
ADVANTAGE
- - -
can be derived for any molecular mechanism can
be derived automatically by computer can be
applied under any experimental conditions can be
evaluated without specialized software requires
very little computation timedoes not require an
initial estimateis resistant to truncation and
round-off errors has a long tradition many
papers published
8Specialized numerical software DynaFit
MORE THAN 600 PAPERS PUBLISHED WITH IT (1996
2009)
2009
FREE for academic users
Kuzmic (2009) Meth. Enzymol., 467, 247-280
9DynaFit software Citation analysis
APPROXIMATELY 850 JOURNAL ARTICLES PUBLISHED
SINCE 1998
Kuzmic, P. (1996) "Program DYNAFIT for the
analysis of enzyme kinetic data Application to
HIV proteinase" Anal. Biochem. 237,
260-273. Kuzmic, P. (2009) "DynaFit - A software
package for enzymology Meth. Enzymol. 467,
247-280.
10Theoretical foundations Mass Action Law
RATE IS PROPORTIONAL TO CONCENTRATION(S)
rate derivative
11Theoretical foundations Mass Conservation Law
PRODUCTS ARE FORMED WITH THE SAME RATE AS
REACTANTS DISAPPEAR
EXAMPLE
12Composition Rule Example
k5 EAB
k2 EA?B
- k-2 EAB
k4 EB?A
- k-4 EAB
- k5 EAB
Similarly for other species...
13A "Kinetic Compiler"
HOW DYNAFIT PROCESSES YOUR BIOCHEMICAL EQUATIONS
- k1 ? E ? S
dE / dt
k2 ? ES
k3 ? ES
dES / dt
Similarly for other species...
14System of Simple, Simultaneous Equations
HOW DYNAFIT PROCESSES YOUR BIOCHEMICAL EQUATIONS
"The LEGO method" of deriving rate equations
15DynaFit can analyze many types of experiments
MASS ACTION LAW AND MASS CONSERVATION LAW IS
APPLIED IN THE SAME WAY
EXPERIMENT
DYNAFIT DERIVES A SYSTEM OF ...
chemistrybiophysics
Ordinary differential equations (ODE)Nonlinear
algebraic equations Nonlinear algebraic
equations
Kinetics (time-course) Equilibrium
binding Initial reaction rates
enzymology
16Example DNA clamp / clamp loader complex
DETERMINE ASSOCIATION AND DISSOCIATION RATE
CONSTANT IN AN A B ? AB SYSTEM
Typical email from a Ph.D. student
Courtesy of Senthil Perumal, Penn State
University (Steven Benkovic lab)
17Example Experimental setup
ALL COMPONENTS PRESENT AT EQUAL CONCENTRATIONS
- pre-mix sliding clamp (C) clamp loader (L) to
form C.L complex - add DNA solution
- observe the formation of C.L.DNA ternary complex
over timefinal concentrations100 nM
clamp100 nM loader100 nM DNA
... gp45 labeled with Cy5 acceptor dye ...
gp44/62 ... primer labeled with Cy3 donor dye
C.L complex has estimated Kd 1 nM, so C.L ? CL
dissociation upon adding DNA should be negligible
Courtesy of Senthil Perumal, Penn State
University (Steven Benkovic lab)
18Example Raw data
JUST BECAUSE THE DATA FIT TO A MODEL DOES NOT
MEAN THAT THE MODEL IS CORRECT!
raw fluorescence F fit to F A0 A1 exp (-k t)
fluorescence
exponential model fits the data well but it is
theoretically invalid!
time
Courtesy of Senthil Perumal, Penn State
University (Steven Benkovic lab)
19Example Anatomy of DynaFit scripts
DYNAFIT SOFTWARE IS DRIVEN BY TEXT SCRIPTS -
MINIATURE COMPUTER PROGRAMS
20Example DynaFit tutorial
YOUR FIRST DYNAFIT DATA-ANALYSIS SESSION
TUTORIAL
- Start DynaFit
- Select menu File ... Open or press CtrlO
- Navigate to file./courses/bkeb/lec-1/ab/fit-001
.txt - Select menu File ... Try or press CtrlTThis
is the initial estimate - Select menu File ... Run or press CtrlUWait
several seconds to finish the analysis - Select menu View ... Results in External
BrowserNavigate in the output files
21Example Detailed explanation
A BIT OF THEORY
- Reaction order
- Units and dimensions (scaling)
- The DynaFit model for biomolecular kinetics
- Initial estimates of model parameters
22Molecularity and reaction order
IN PRACTICE WE ENCOUNTER ONLY ZERO-, FIRST-, AND
SECOND-ORDER REACTIONS
PHYSICALMEANING
DYNAFITNOTATION
ORDER
NOTATION
zero-
constant-rateinflux or efflux
X --gt v
k1
A B
first-
isomerization ordissociation ofone molecule
A --gt B k1C --gt A B k1
k1
A B C
second-
binding(association) of two molecules
A B --gt C k2
k2
A B C
23Reversible reactions and reaction mechanisms
DYNAFIT CAN HANDLE AN ARBITRARY NUMBER OF
ELEMENTARY REACTIONS IN A MECHANISM
DYNAFITNOTATION
REVERSIBLE REACTION
k1
A B ltgt C k1 k2
A B C
k2
A B ltgt AB k1 k2A C ltgt AC k3
k4 AC ---gt X k5
24Dimensions of rate constants
CAREFUL ABOUT DIMENSIONS OF RATE CONSTANTS!
DIMENSIONAL ANALYSIS
quantity v X k1, k2
dimension concentration / time concentration ?
forward and reverse reaction rates
k1
A B AB
v? k1 A B
k2
v? k2 AB
25Dimensions of rate and equilibrium constants
SUMMARY
k1
1/sec 1/(M sec) M 1/M
k2
dissociation rate constant association rate
constant dissociation equilibrium
constant association equilibrium constant
A B AB
k2
k1
Kd k2 /k1
Ka k1 /k2
k3
k3
? rate constant ? rate constant ? equilibrium
constant ? equilibrium constant
1/sec 1/sec -- --
A A
k4
k4
K? k3 /k4
K? k4 /k3
26Example Units (scaling) of rate constants
ALL UNITS ARE ARBITRARY BUT MUST BE IDENTICAL
THROUGHOUT THE ENTIRE SCRIPT!
mechanism DNA Clamp.Loader ltgt Complex
kon koff constants kon 1 ?
koff 1 ? concentrations DNA
0.1 Clamp.Loader 0.1 responses Complex
1 ? data file ./.../d1-edit.txt
offset auto ?
27Example The response coefficient
MOLAR RESPONSE PROPORTIONALITY FACTOR LINKING
CONCENTRATIONS TO SIGNAL
mechanism DNA Clamp.Loader ltgt Complex
kon koff constants kon 1 ?
koff 1 ? concentrations DNA
0.1 Clamp.Loader 0.1 responses Complex
1.00 ? data file ./.../d1-edit.txt
offset auto ?
28Example Initial estimates
NONLINEAR REGRESSION ANALYSIS ALWAYS REQUIRES
INITIAL ESTIMATES OF THE SOLUTION
the initial estimate of rate constants
mechanism DNA Clamp.Loader ltgt Complex
kon koff constants kon 1 ?
koff 1 ? concentrations DNA
0.1 Clamp.Loader 0.1 responses Complex
1 ? data file ./.../d1-edit.txt
offset auto ?
optimized model parameters
A VERY DIFFICULT PROBLEM HOW TO GUESS GOOD
ENOUGH INITIAL ESTIMATES OF RATE CONSTANTS?
29Example Good initial estimate
data
model
30Example Good initial estimate results
31Example Bad initial estimate
model
a hundred times smaller / larger
data
32Example Bad initial estimate results
33Example Good vs. Bad results - comparison
sum of squares
relativesum of sq.
best-fit constants
initial estimate
Kd, nM k2/k1
k1 2.2 0.5 k2 0.030 0.015
k1 1 k2 1
0.002308
1.00
13 nM
good
k1 0.2 3.4 k2 0.2 0.6
1000 nM
k1 100 k2 0.01
0.002354
1.02
bad
34Example Good vs. Bad results - comparison
From good initial estimate
From bad initial estimate
not very encouraging!
35Example Good vs. Bad results - summary
- Initial estimates off by a factor of 100 can
produce misleading results. - The data/model overlay may look good, but the
results may be invalid. - The same applies to the residual sum of squares
(only 2 difference). - The only indication that something went wrong
might bea. huge standard errors of model
parameters andb. various warnings from the
least-squares fitter - The simplest possible safeguard Use several
different initial estimates?Disadvantage how
do we know which multiple estimates?
36Summary and conclusions
NUMERICAL MODELS IN BIOCHEMISTRY AND BIOPHYSICS
BETTER THAN ALGEBRAIC EQUATIONS
- Numerical models are applicable to all
experimental conditions.No more large excess of
this over that. - Numerical models apply uniformly to all types of
experimentsa. reaction progress
(kinetics)b. equilibrium composition
(binding)c. enzyme catalysis. - Numerical models can be automatically derived by
computer.No more looking up algebraic equations
if they exist at all. - Main disadvantage requirement for specialized
software.But DynaFit is free to academic users. - Not a silver bullet! Example the initial
estimate problem.But this is not specific to
numerical models (applies to algebraic models,
too).