Kinetic Modeling with KinTek Global Explorer 3.0 - PowerPoint PPT Presentation

1 / 33

About This Presentation

Title:

Kinetic Modeling with KinTek Global Explorer 3.0

Description:

If the fitter doesn't pay attention to this then numerical imprecision will ... Fully threaded fitter can exploit multi-core machines ... – PowerPoint PPT presentation

Number of Views:169

Avg rating:3.0/5.0

Slides: 34

Provided by: Zack76

Category:

more less

Transcript and Presenter's Notes

Title: Kinetic Modeling with KinTek Global Explorer 3.0

1
Kinetic Modelingwith KinTek Global Explorer 3.0

John H Davis, Zack Booth Simpson, Thomas Blom,
Ken A Johnson

2
Kinetic Simulation 101

Given a model of reactions, build an ODE.
dA/dt -rfAB rbC
dB/dt -rfAB rbC
dC/dt rfAB -rbC
A(t0)3, B(t0)1, C(t0)0
Integration of these equations simulates the
evolution of the system

C
time
3
Kinetic Simulator

KinTek Global Explorer (KGE) 1.0. Fast-response
simulator allows for playing around with the
parameters
Live kin demo

4
Fitting Problem

Given data and a proposed model wed like to find
the parameters, rf and rb
Naïve approach guesstimate the parameters,
simulate, compare results to experiment, tweak
the parameters by hand, find the sweet spot.
Might work with 2 parameters, nearly impossible
with more
Realistically dozens of rate constants plus
initial conditions plus unknown scaling
constants. Might be 30 unknowns even on a
simple model

5
Intuition for fitting Parameter Space

Imagine a space where the axes are the parameter
values (reaction rates)
Let altitude be the error relative to experiment
Easy to visualize for 1 or 2 parameters, very
hard past that.
Live fitspace demo Brute force sampling of
space for AB?C

Real-life example
Cartoon
6
Fitting Skiing in parameter space

Imagine moving in parameter space in a dense fog
while holding an altimeter and our skis are
sticky!
Take a step along rf, note an altimeter change
take a step along rb note altitude, we now have
gradient estimate

By estimating local slope we can head downhill.
Repeat the process until
Finally we find a place where all steps are
uphill. Were done.
Or are we?

rf
rb
7
Global minima vs. local

It is possible that weve found is a local
minima.
Finding the global minima takes an effort that is
much harder we dont even attempt it.
Even finding the local minima is tricky (and is a
pre-requisite for finding the global)

8
Unique fit?

From the bottom, movement along either parameter
is uphill.
Easy to say Tweaks of either parameter are
worse, therefore I have a good fit. Submit
values to publication!
No! No! No!

parameter rb
Parameter rf
9
No unique fit

Theres an infinite number of roughly equivalent
fits the fit is not unique!
These movements along the valley are a mixture of
both parameters i.e. they are not axis
aligned.
Visualizing these non-axis aligned directions is
hard!

parameter rb
Parameter rf
10
Fit not unique, real-life example

Suppose that we believe that our earlier system
AB?C has an intermediary AB?I?C.

Error surface for AB?C
Parameter rb
Color indicates altitude Red high error Blue
low error
Parameter rf
Error surface for AB?I?C with same experimental
design (same observables, only 2 parameters
illustrated)
11
Fit not unique, real-life example

AB?C versus AB?I?C
Which hypothesis is supported?
The more complex model is under-constrained.
In order to support the second hypothesis we need
more independent experiments.

12
Searching Metaphor

A miner searches for a gold in a mountain.
The seam of gold is not axis aligned but there is
a line of it off to infinity
The miner takes axis aligned steps. He could
search forever and only hit gold by luck.
He concludes Gold is rare
even though there is an infinite amount of it.
Say he stumbles upon it he makes a few
axis-aligned movements and loses the gold.
He concludes This small spot is the only gold
around here
even though an infinite line of it is right
there!

13
Moral of the story

Just because
Something is hard to find doesnt mean that it is
in limited supply.
Loss of signal away from a known source doesnt
mean that the source is small or finite in size.
These conclusions seem obvious, but there are
published papers that seem to assume the
opposite!
Why the confusion?

14
Confusing Quality of fitwith Quality of model

Realistically altimeter is noisy can not
determine altitude with perfect precision.
After finding the bottom we take a lot of
measurements with the altimeter, do some fancy
math, and compute the precision
Get chi-square or some other other fancy-sounding
statistic.
But, the altimeter does not tell you where you
are or the shape of the basin!!!
Determining the fitting error with high precision
is not the same thing as determining the bottom
of the basin with high precision! (More on this
later)
Altimeters are not GPSs!

15
Need Better Tools!

In high dimensions, it is so hard to visualize
that anyone can be forgiven for getting confused.
Because it is so easy to confuse a low-error fit
with a high-quality model
We need a way to make it obvious when the model
is under-constrained
KGE to the rescue!

16
Understanding the Fitter

The fitter doesnt ski down the error slope.
It has to take discrete steps downhill.
What direction should the steps be in?
In the direction of the gradient, i.e. downhill
How big should the steps be?
This is a harder question
Time to define some terms

17
Gradient Curvature

Gradient is the change in altitude (fitting
error) per unit change in parameter (1st
derivative)
Intuition a big gradient means it is heading
heading downhill fast
Curvature is the change in gradient per unit
change in parameter(2nd derivative)
Intuition a big curvature means the basin is
narrow
These are measured locally and used to estimate
the contour of the basin. (Never mind the exact
math of finding the gradient and curvature)
I mean this in a laymans sense of the word,
not the mathematical sense

18
Gradient Curvature

In 1 dimension
Gradient is scalar, aka slope
Curvature is scalar, aka acceleration
In more dimensions
Gradient is a vector
Think of 1st element as delta altitude wrt
parameter 1
Think of 2nd element as delta altitude wrt
parameter 2
Curvature is matrix
Think of 1st column as delta gradient with
respect to parameter 1
Think of 2nd column as delta gradient with
respect to parameter 2

19
Using the curvature matrixto leap, not ski

In n-dimensions, the gradient vector and the
curvature matrix define a parabaloid
Pretending that the basin conforms to a
parabaloid, solve for the bottom of said
parabaloid and jump there.
Because the basin is not a parabaloid, we wont
end up at exactly the bottom, but were closer
(we hope).
Repeat until no improvement
This (plus some complications) is the Levenberg
Marquardt Algorithm

Step 1
Step 2
20
The curvature matrix reveals the basin topology

If you could measure the curvature in some
direction youd have a quantifiable description
of how well-conditioned the model is in that
direction.
The directions of maximum and minimum curvature
are particularly interesting.
What we want is a new set of axes that are better
aligned to our basin topology.
How can you find these axes? A wonderful tool
Singular Value Decomposition

21
Singular Value Decomposition

SVD is a well-known factorization which creates a
new set of axes (basis factors) with the
following properties
The direction of strongest action is the first
returned axis all other axes are orthogonal to
this one and to each other.
eigen vectors
A scalar is given as to the strength of the
action in each eigen direction
singular values

22
SVD of the fitting curvature matrix

Large singular values mean the system is well
determined in the respective eigen direction
Small singular values mean the system is poorly
determined, i.e. theres a valley in the
respective eigen direction.

23
Define a valley? How flat is too flat?

When we say that some valley is flat, what do
we mean exactly?
Do we mean flat relative to other valleys?
Or do we mean flat in some absolute sense?

We mean it relative to our ability to measure it.
The Signal to noise ratio

24
Signal and noise

Our altimeter for the fit error surfaceis not
perfect because there are randomerrors in the
observation.
Result of noise in the instruments and other
factors
It is harder to determine the bottom of a shallow
basin if there is a lot of noise.
Conversely for the same noise it is easier to
locate the bottom of a narrow basin than a wide
one.
This can be quantified as the signal to noise
ratio or information content of measurement

25
SNR demo
Fit error altitude
Parameter axis
Noise makes estimate of altitude fuzzy. Above
two basins with the same noise. It is much
easier to locate the bottom of the more curvy
basin on the right. Signalcurviness,
Noisefuzziness, SNR log((SN)/N) the number
of significant digits that establish
curvature. Live errbasin demo
26
How do you estimate the noise?

Chemical kinetic systems are typically much
slower acting than are the instruments used to
measure them.
i.e. you get to take lots of data points on a
curve
Thus, the signal is all in the low frequency
parts while the noise is uniform across the whole
spectrum.
Noise can be estimated by the Fourier Transform
and looking at the only top 1/3

27
What happens when theres a poorly determined
valley?

A poorly constrained system is not just hard on
the experimenter, it is also hard on the fitter
The curvatures of the an ill-conditioned system
can vary by many orders of magnitude.
If the fitter doesnt pay attention to this then
numerical imprecision will result in gigantic
steps in the unconstrained directions
The parabolic approximation is only useful
locally -- if you take giant steps then you enter
lala land and the fitter will fail to converge.
Live fitsurf demo

28
The curvature matrix is independent of the data!

The approximation of the curvature matrix throws
away higher order terms that make reference to
the actual data
This is very counter-intuitive. Remember
Data locates the height of the error surface
Curvature is a derivative (a second derivative),
to a first approximation it doesnt care about
the absolute height of the error surface

Skipping over a bunch of hairy math!
29
Intuition for data independence of curvature
matrix
Given some parameters the simulated observable
falls somewhere relative to some data. (Assume
that the two are reasonably close in parameter
space).
observable
time
The error is the difference between these.
observable
time
30
Intuition for data independence of curvature
matrix
As you adjust a parameter, the simulated trace
moves but of course the data stays put.
observable
time
The rate at which this error area grows has
little to do with the data (especially when the
data is similar to the trace which weve assumed
throughout.) This is the first derivative. The
second derivative cares even less!
observable
time
31
The curvature matrix is independent of the data!

In other words, if you have guesstimates of the
reaction rates, you can approximate the curvature
matrix before you pick up a pipette!
This changes the workflow of experimental
kinetics

32
Work flow,old and new
OLD
NEW

The precision of instruments can be determined
a-priori.
Therefore you can ask Will these experiments be
capable of determining the parameters within said
instrument precision?
The expensive step can be taken out of the loop

33
Live demo of KGE 3.0 prototype