Computational Modeling For Understanding Biological Pathways - PowerPoint PPT Presentation

1 / 71
About This Presentation
Title:

Computational Modeling For Understanding Biological Pathways

Description:

Why we need models for simple scenarios. Much material ... Meg Ryan. Natalie Wood. Jackie Chan. Judy Davis. Bruce Willis. Arnold Schwarzenegger. Lauren Bacall ... – PowerPoint PPT presentation

Number of Views:79
Avg rating:3.0/5.0
Slides: 72
Provided by: dcs4
Category:

less

Transcript and Presenter's Notes

Title: Computational Modeling For Understanding Biological Pathways


1
Computational Modeling For Understanding
Biological Pathways
  • Lisa Tucker-Kellogg
  • CS3108A Computational Thinking
  • 30 Sept 2008

2
-- Pat Philips, Microsoft Research
3
TOPICS and sources
  • Computational Thinking
  • Quotes from Jeanette Wing
  • Modeling and Simulation
  • What is modeling
  • Formalisms must match the data the goal.
  • Much of this material quoted or adapted from
    Wikipedia
  • My Research
  • Modeling Biological Signaling Pathways
  • Fallacies in Probabilisitic Reasoning
  • Why we need models for simple scenarios
  • Much material due to Norman Fenton,Yuval Shahar
  • Additional sources are written on the individual
    slides

4
Computational Thinking
  • doing arithmetic, solving mathematical
    equations by sheer bulldozing power, is not the
    most significant . Computers are thinking aids
    of enormous potentialities. Merely having them
    around is enough to change the way we think, to
    force investigators in all fields to think
    through their problems along new lines. We are at
    the beginning of a trend that is certain to bring
    machines which not only learn, but which will
    accelerate the rate at which we ourselves learn.
    The revolution to come is difficult to appreciate
    fully. We only know that science, government, and
    industry will change swiftly and radically in the
    years ahead. -- The Thinking Machine, 1962.

5
Futurism
  • People tend to over-estimate how much change will
    occur in 1-3 years.
  • People tend to under-estimate how much change
    will occur in 10-30 years.
  • The information you learn now might not be
    useful for long, but the way you learn to think
    will endure.

6
Coursework is an artificial workload
  • Its fantastically beneficial and worth every
    cent youre paying, but its artificial.
  • Real jobs are blah blah blah repetitive,
    boring, gruntwork
  • Real problems often come with longer time frames,
    greater risk, and more broadly delegated roles.
  • Real jobs are sometimes better grounded in the
    philosophical principles of the field.

7
The grounding of different fields
  • Mathematics what can I prove, given my axioms?
  • Engineering what can I build and how does it
    behave?
  • Experimental Natural Science design controlled
    experiments
  • Observational Natural Science choose what to
    observe

8
Computational Thinking
  • Computer science interacts with almost every
    other discipline on campus. Computational
    biology, computational chemistry, computational
    design, computational finance, computational
    linguistics, computational logic, computational
    mechanics, computational neuroscience,
    computational physics, and computational and
    statistical learning . Computer science is not
    just about programming, but about thinking. Our
    long-term vision is to make computational
    thinking commonplace for everyone, not just
    computer scientists.
  • -- Jeannette Wing (CMU)

9
  • Robotics CS Mechanical Engineering
    Electrical Engineering
  • Language Technologies CS Linguistics
  • Human-Computer Interaction CS Design
    Psychology
  • Automated Learning and Discovery CS
    Statistics
  • Software CS Public Policy Management

10
Computational Thinking
  • Computational thinking builds on the power and
    limits of computing processes, whether they are
    executed by a human or by a machine.
    Computational methods and models give us the
    courage to solve problems and design systems that
    no one of us would be capable of tackling alone.

11
Computational Modeling
  • Computer models have some of the characteristics
    of mental modeling as well as some of the
    characteristics of math modeling and the types of
    modeling done in other disciplines. If a problem
    lends itself to computer modeling, then the
    computer may well be able to carry out the steps
    (procedures, symbol manipulations) needed to
    solve the problem.

From www.iae-pedia.org
12
Modeling
  • A model is a physical, mathematical, or
    logical representation of a system of entities,
    phenomena, or processes. Basically a model is a
    simplified abstract view of the complex reality.
    It may focus on particular views, enforcing the
    "divide and conquer" principle for a compound
    problem. 
  • A model is a formalized interpretation which
    deals with empirical entities, phenomena, and
    physical processes in a logical, mathematical, or
    systematic way.
  • A model is also a way in which the human thought
    processes can be amplified.  Models that are
    rendered in software allow scientists to leverage
    computational power to simulate, visualize,
    manipulate and gain intuition about the entity,
    phenomenon or process being represented

13
Modeling
  • Scientific modeling is the process of generating
    abstract, conceptual, graphical, and/or
    mathematical models. Science offers a growing
    collection of methods, techniques and theory
    about all kinds of specialized scientific
    modeling.
  • Modeling is an essential and inseparable part of
    all scientific activity, and many scientific
    disciplines have their own ideas about specific
    types of modeling. 

14
Modeling
  • Modeling is a comparatively new area of activity
    involving the marriage of ideas from various
    disciplines.
  • Modeling techniques include statistical methods,
    computer simulation, system identification, and
    sensitivity analysis. An important issue is to
    understand the underlying dynamics of a complex
    system. (chaos theory)
  • Must assess whether the assumptions of a model
    are correct and complete. Does a model
    reflects reality? Deal with divergences between
    theory and data.

15
Modeling
  • One of the main aims of scientific modelling,
    according to Silvert (2001), is to apply
    quantitative reasoning to observations about the
    world, in the hope of seeing aspects that may
    have escaped the notice of others.
  • Typical Steps
  • characterize the system,
  • make some assumptions about how it works
  • translate these into equations and a simulation.
  • validate the results or check the models
    predictions.

16
Generating a Model
  • Typically a model will refer only to some aspects
    of the phenomenon in question, and two models of
    the same phenomenon may be essentially different.
    This may be due to differing requirements of the
    model's end users or to conceptual or aesthetic
    differences by the modelers and decisions made
    during the modeling process.
  • Users of a model need to understand the model's
    original purpose and the assumptions that it is
    based on.

17
Assumptions and Scope
  • Identify abstract principles that are
    approximately true (or true enough for modeling
    purposes).
  • Remove issues that are higher and bigger than you
    are willing to take on (such as organism-wide
    cancer from medical viewpoint). Remove issues
    that are too low-level (exact details of variants
    forms, or exact details of how the DNA unwinds if
    all you care about is the behavior of a whole
    organ.)
  • Computational thinking what information will
    suffice? what information is missing? What is
    happening that doesnt have a cause?

18
Ways to Validate a Model
  • Ability to explain past observations
  • Ability to predict future observations
  • Ability to control events
  • Cost of use, especially in combination with other
    models
  • Refutability, enabling estimation of the degree
    of confidence in the model
  • Simplicity, or even aesthetic appeal

19
Computational Modeling
  • a computational model is a program that attempts
    to simulate an abstract model of a particular
    system. 
  • Computational models can be mathematical models,
    or can be any otherexecutable form.
  • computational models can also use external
    inputs, with only part of the system being
    modeled, such as flight simulators.

20
Computational Modeling
  • What is the point of modeling and simulation?
  • Dont want to repeat a prediction that is already
    essentially obvious.
  • Dont want to build a model of something thats
    so poorly understood that you dont really know
    anything yet.
  • Need to find a middle ground.
  • In biology thats particularly hard because
    biologists informally have a lot of intuitive
    knowledge that helps them get the individual
    results.
  • Can you formalize the published and unpublished
    knowledge?

21
Choice of Formalism
  • Determines what information you can represent in
    your modeling.
  • Represent encode, capture.
  • Affects input data, output results, and
    everything else between.
  • Key questions for choosing formalisms amount of
    detail, scale(s), dimensions of variation.
  • Scope
  • Should we build complex models than incorporate
    everything we know?
  • Should be build simple models that incorporate
    only what were sure of?

22
Spectrum of Computational Mining / Modeling
Methods
SPECIFIED
ABSTRACTED
differential equations
Markov chains
Bayesian networks
Boolean/fuzzy logic models
mechanisms
statistical mining
(including molecular structure-based computation)
influences and logic
components and relationships
Appropriate approach depends on question and data
23
Adapted from Nature Cell Biology, 8 1195 - 1203
(2006) Aldridge et al.
24
Why we need mathematical models to understand
biological systems
Weng, Bhalla and Iyengar, Science. 284, 92
(1999).
25
(No Transcript)
26
Pathway in diagram form
from Nature 407, 789-795
27
Pathway knowledge
Time-series measurements of concentrations
Reaction equations with unknown parameters
Parameter estimation
Computational model of pathway dynamics
Biological Predictions
28
Semi-precise format
29
To be 100 precise we need each step to have a
reaction equation with defined kinetics
  • Mass action
  • caspase-3 IAP ? caspase-3IAP
  • Irreversible mass action
  • Ligand Receptor ? LigandReceptor
  • Michaelis-Menten
  • procaspase-3 ? caspase-3, catalyzed by
    caspase-8. Two parameters, k and Km.

kforward
kbackward
kLR
30
Reaction equations are interchangeable with
differential equations that calculate
concentrations over time
kLR
  • Receptor Ligand LigandReceptor
  • d LR / dt kLR L R
  • d L / dt - kLR L R
  • d R / dt -kLR L R
  • kLR L R velocity

31
Pathway knowledge
Time-series measurements of concentrations
Reaction equations with unknown parameters
Parameter estimation
Computational model of pathway dynamics
Biological Predictions
32
How much ligand is needed to cause activation of
caspase-8?
33
What part of this research is within a
traditional discipline?
  • Mathematical equations for the pathway
  • Defining the format and parameters of a model
  • Experiments for calibrating the model
  • Parameter Estimation Method
  • Choose the best number for each parameter
  • Simulate pathway behaviors
  • Biological Experiments

Computer Science
Biology
34
Impact of modelingaccording to (Bentele et al.,
2004)
  • Communicate interpretations and causes
  • Focus debate and avoid pointless argument
  • Fewer experiments needed to pinpoint responsible
    regulatory mechanism.
  • Probe different scenarios hypotheses.
  • They feel they couldnt have gotten the result
    without both model and experiment.

35
Benefits of Modeling
  • Larger pathways become very complex
  • Human intuition isnt good at intuiting dynamics
  • Steady states are much easier.
  • Help design more efficient experiments.
  • Such as choosing timepoints.
  • Birds eye view shows possibilities you havent
    considered

36
Benefits of Modeling
  • Once you set up the model, each thing you try
    with it gives instantaneous results, for free
  • And you can try changing ANYTHING.
  • Each wet experiment requires a lot of resources.
  • Explore a huge number of perturbations
  • Each perturbation can be an experimental
    condition.
  • Each condition can be a hypothesis, and each
    simulation shows the implications of that
    hypothesis.

37
Mental benefits of modeling
  • Quantitative awareness
  • For example about amount of uncertainty
  • Visualize all concentration curves
  • Can trace causes, not just correlations
  • Many people swear by it
  • We limit our thinking unconsciously, especially
    about unmeasurable things
  • Complementary to human discussions
  • Competitive advantage?

38
My Research
  • Using differential equations to model the
    dynamics of signaling pathways.

39
Nature Cell Biology, 8 1195 - 1203
(2006) Aldridge et al.
40
My Research
PI3K active
PI3K inact
PI3K membr
Akt cyto
PIP3
PIP2
? ?
Akt membr
PP2A
RedEnz
PDK1 active
PDK1 membr
PDK1 inact
PTEN
PTENox
p-Akt
DDC
DPI
NOX
catalase
SOD
O2-
H2O2
41
O2-
PDK1_inactive
Akt_p
42
Whats my modeling for?
  • A model cannot be proven correct.
  • Modeling cannot disprove anything.
  • A model can suggest a view of reality
  • Suggest experiments
  • Suggest interpretations of datasets
  • Suggest implications
  • Predict the results of experiments never
    performed before

43
Whats my modeling for?
  • Proof of Plausibility
  • Accounting for the effects of a drug
  • Suggesting a missing link
  • Hypothesis Management
  • Suggesting experiments to differentiate
  • Disambiguate Communication
  • Meta-study of Existing Reports
  • Understand complex dynamics

44
Proof of Plausibility
  • Drug D causes some mild, subtle upstream effects
    e, f, g, and a massively important downstream
    effect Z.
  • D causes large changes in the localization of
    protein e, but the data is noisy. Protein f shows
    15 increased expression, not so big. Etc.
  • GOAL QUESTION TO ANSWER Are the effects e,f,g
    sufficient to explain the big downstream effect
    Z?
  • Or should we look for D having other effects?

45
Proof of Plausibility
  • Build a set of differential equations starting
    from reactions D?e, D ? f, D ? g and ending with
    the reactions that create Z.
  • Simulate the system computationally. Are the
    effects e,f,g,h sufficient to explain the
    downstream effect Z?
  • E.g., Modeling can provide mathematical evidence
    that D?e and D ? f are enough to explain 100 of
    Z, without needing g.

46
Hypothesis Management
  • Rather than doing experiments to confirm or
    refute the most likely hypothesis
  • Provide a systematic list of hypotheses
  • According to whatever criteria (e.g. willing to
    measure)
  • Simulate each hypothesis
  • Automatically filter the results, looking for a
    good match to data, a conclusive result that
    depends on the hypothesis.

47
inactiveReducingEnzyme
O2-
Too Presumptious?
ReducingEnzyme(oxidizedactive)
?
Wrong! Whats better?
?
PPaseNHEox
PPaseNHE1(active)
inactiveNHE1
NHE1 (active)
PPaseAkt
PPaseAktox
inactivePDK1
?
Aktcytosol
PIP3
PIP3PDK1(active)
Akt-P
Aktmembrane
48
Hypothetical influences of O2- on Akt lifecycle
expression activation
NHE1
PIP3
via PTEN
Akt_m
PDK1
p-Akt_m
via ERM scaffolding
Akt_cytosol
reduced p-Akt_cytosol
p-Akt_cytosol with disulfide
PP2A
?
S-NO activation
thioredoxin
49
Hypothesis Management
  • Suppose we have a set of several very closely
    related hypotheses about the connectivity in a
    signaling pathway.
  • Experimental measurements seem to give
    approximately the same results for each case.
  • Design a combination of experimental
    perturbations that will maximize the differences
    between the behaviors of the system under the
    different hypotheses.
  • Modeling was successful for suggesting a
    combination of knockout and si-RNA perturbations
    for highlighting the phenomenon we wanted to
    study.

50
Disambiguate Communication
  • Certain types of complex concepts are hard to
    express in text.
  • High-dimensional or dynamic or nested
  • If your data provides evidence for something
    complex happening, how do you communicate this
    interpretation unambiguously to your audience?
  • Even if you can contruct the logic in your head,
    your reviewers and editors and audience need to
    be able to follow your logic, reproducibly.

51
Meta-Study of Existing Studies
  • (Future project, not succeeded yet).
  • Basic idea
  • Everybody accepts that A?B?C?D.
  • 4 papers show that A?X in some cell types.
  • 2 papers show that X?Y in some cell types.
  • 8 papers show that Y?D in some cell types.
  • Use modeling to show what fraction of the A?D
    effect could theoretically be explained by
    A?X?Y?D. (Which might shock people.)

52
Example a study of dynamics
Urokinase
  • AIM 1
  • Characterize dynamic behavior of Urokinase
    mediated plasmin, in silico and in vitro.
  • AIM 2
  • Model the role of plasmin dynamics in liver
    fibrosis
  • AIM 3
  • Experimentally validate model predictions

TSP1
Plasmin
LTGF-ß1
TGF-ß1
53
Detail about first aim
  • CONCEPT The mechanism of activation of plasmin
    from urokinase allows for bistability in plasmin.
  • 1a Model development Define the differential
    equations and the parameters.
  • 1b Model simulation Simulate the model in
    MATLAB, under different initial conditions.
  • Phase plane and Nullcline analysis Steady state
    analysis of the reduced system of ODEs.
  • Bifurcation analysis Analyzing the steady state
    behavior of the system with changes in
    parameters.
  • Experimental validation Experimental validation
    of bistable behavior.

54
Then study larger implications
55
(No Transcript)
56
Common Human Errors in Prob/Stat Reasoning
  • Slide Credits
  • Norman Fentons BBN web site http//www.dcs.qmul.a
    c.uk/norman
  • Yuval Shahars class on medical decision-making
    http//www.ise.bgu.ac.il/courses/mdss/

57
Probabilistic Reasoning
  • Probabilistic reasoning is used extensively in
    computer science, especially in the design of
    algorithms and in artificial intelligence, and
    CompSci is one of the few undergraduate degrees
    outside math that teaches probabilistic
    inference.
  • The following fallacies are to help us appreciate
    the importance of using formal methods and not
    just common sense and gut instincts for solving
    problems.
  • These are the sorts of things where humans tend
    to intuit the wrong conclusions, and where
    automated reasoning about outcomes, or
    simulations of outcomes, can be particularly
    important.

58
Here is a list of names from Hollywood movies
  • Tom Hanks
  • Ruth Gordon
  • George Clooney
  • Meg Ryan
  • Natalie Wood
  • Jackie Chan

Judy Davis Bruce Willis Arnold Schwarzenegger Laur
en Bacall
59
Memory Test
  • How many names were in the list?
  • How many were male
  • How many were female
  • When given an equal number of each gender in the
    list, if the men in the list are more famous than
    the women in the list, then subjects usually
    think the list has more men than women.
  • Increased familiarity makes some names more
    retrievable.

60
Search Effects
  • Think of English words that have at least three
    letters.
  • For a word picked at random that contains the
    letter r,
  • Is it more likely that the word would start with
    letter "r, or more likely that r would be the
    third letter?
  • "r" is more frequently found in the third
    position of English words than in the first
    position. Subjects usually choose the reverse.
  • It is easier to search for words by their first
    letter than by their third, so the first case is
    more available, despite being less numerous.

61
  • You are given the following information
  • Ken is an extremely athletic-looking young man
    who drives a fast car and has an attractive
    girlfriend.
  • Is Ken more likely to be a professional football
    player or a nurse?

62
Representativeness
  • We often judge whether object X belongs to class
    Y by how representative X is of class Y
  • Despite differences in the prior probability of
    each profession, subjects usually order the
    probability of potential occupations by
    similarity to representatives.
  • Prob(famouspopular) ? Prob(popularfamous)
  • Prior probabilities of diseases are often ignored
    when the patient seems to fit the description of
    a rare disease.
  • Eastern Equine Encephalitis

63
  •  Steve is very shy and withdrawn, invariably
    helpful, but with little interest in people, or
    in the world of reality.  A meek and tidy soul,
    he has a need for order and structure, and a
    passion for detail.
  • Is Steve a farmer, a librarian, a physician, an
    airline pilot, or a salesman?
  • Farmers are half the worlds population. How
    many librarians are there?

64
  • Write down the last two digits of your student ID
    number on a piece of paper.
  • Or passport number if you dont have a student ID
    number.
  • Digits only. Please ignore characters.

65
Anchoring
  • Please write down your estimate
  • What percentage of Singapore households own at
    least one iPod?
  • Anchoring describes the phenomenon where
    previous estimates (including random or
    uncorrelated numbers) bias later estimates and
    revisions.
  • Are your ID numbers correlated with your
    predictions of iPod ownership?

66
Insufficient Adjustment  Anchoring
  • Anchoring occurs even when initial estimates
    (e.g., percentage of African nations in the UN)
    were explicitly made at random by spinning a
    wheel!
  • Anchoring may occur due to incomplete
    calculation, such as estimating by two
    high-school student groups
  • the expression  8x7x6x5x4x3x2x1 (median answer
    512)
  • with the expression 1x2x3x4x5x6x7x8 (median
    answer 2250)
  • Anchoring occurs even with outrageously extreme
    anchors (Quattrone et al., 1984)
  • Anchoring occurs even when experts (real-estate
    agents) estimate real-estate prices (Northcraft
    and Neale, 1987)

67
Illusory Correlations
  • The probability that two events will co-occur
  • Is often judged not on the number of
    co-occurrences (which would be correct)
  • But mistakenly its judged on the strength/degree
    of their association, from the instances when
    they do co-occur. Even if the number of
    co-occurrences is small.

68
  • People expect random sequences to be
    representatively random even locally
  • E.g., they consider a coin-toss run of HTHTTH to
    be more likely than HHHTTT or HHHHTH
  • The Gamblers Fallacy
  • the mistaken belief that past events will affect
    future events when dealing with random activities
  • After there has been a run of reds in roulette,
    people think black is more (or less) likely to be
    next.

69
The Illusion of Validity 
  • A good match between input information and
    outcome causes people to over-estimate their
    confidence in a prediction.
  • Internal consistency of input pattern increases
    confidence
  • Redundant, correlated data increases peoples
    confidence in a prediction.
  • A series of Bs seems more predictive of a final
    grade-point average than a set of As and Cs

70
Misconceptions of Regression 
  • People tend to ignore the phenomenon of
    regression towards the mean
  • Correlation between parents and childrens
    heights or IQ.
  • People expect predicted outcomes to be as
    representative of the input as possible.

71
Prep for Exam
  • Suppose performing well on an exam requires that
    I set my alarm properly (95 probable), that I
    wake up when my alarm sounds (90 probable), that
    I dont have a headache (95 probable), that my
    bus isnt late (80 probable), and that Im not
    sitting next to somebody who taps their pencil
    (80 probable).
  • Probability of performing well is 52. People
    tend to overestimate the probability of
    conjunctive events
Write a Comment
User Comments (0)
About PowerShow.com