Title: Bayesian network models of Biological signaling pathways
1 Bayesian network models of Biological signaling
pathways
- karensachs_at_stanford.edu
2From Phospho-molecular profiling to Signaling
pathways
Cell1
Cell2
Cell3
Flow Measurments
Cell4
...
Cell600
Picture John Albeck
Signaling Pathways
High throughput data
3Outline
- What are signaling pathways?
- What kind of data is available study them?
- How do we use Bayesian networks to learn their
structure?
- Two extensions
- Markov neighborhood algorithm
- Bayesian network based cyclic networks (BBCs)
4Outline
- What are signaling pathways?
- What kind of data is available study them?
- How do we use Bayesian networks to learn their
structure?
- Two extensions
- Markov neighborhood algorithm
- Bayesian network based cyclic networks (BBCs)
5Cells respond to their environment
6Central Dogma
DNA
Modified Protein
Delivers instructions for specific gene
Ribosome Protein-production factory
Blueprint- instructions for production of all
proteins
7Signaling Genetic pathways
A
B
C
Cell response
DNA
8Outline
- What are signaling pathways?
- What kind of data is available study them?
- How do we use Bayesian networks to learn their
structure?
- Two extensions
- Markov neighborhood algorithm
- Bayesian network based cyclic networks (BBCs)
9Spectrum of Modeling Tools in Systems Biology
10Bayesian Networks
Protein A
Protein B
Protein E
P(BAOn)
Protein C
Protein D
- Graph
- Node Measured level/activity of protein
- Edge Influence (dependency) between proteins
- Conditional probability distributions
- Each node has a conditional probability given its
parents
10
11How do we use Bayesian Networks to infer pathways?
The Technical Details
? Score candidate models
? Use a heuristic search to find high scoring
models
12Protein data
13Protein data
14Protein data
All of these lysate approaches give 1 measurement
per protein for 103-107 cells
15Flow Cytometry Single Cell Analysis
Thousands of datapoints
16Stimulations and perturbations
LFA-1
CD3
CD28
L A T
RAS
Cytohesin
PI3K
JAB-1
Zap70
10
Lck
PKC
PLCg
Akt
PIP3
Activators 1. a-CD3 2. a-CD28 3. ICAM-2
4. PMA 5. b2cAMP Inhibitors 6. G06976
7. AKT inh 8. Psitect 9. U0126 10.
LY294002
PIP2
PKA
Raf
MAPKKK
Mek1/2
MAPKKK
MEK3/6
Erk1/2
MEK4/7
p38
JNK
17T-Lymphocyte Data
- Datasets
- of cells
- condition a
- condition b
- conditionn
- Primary human T-Cells
- 9 conditions
- (6 Specific interventions)
- 9 phosphoproteins, 2 phospolipids
- 600 cells per condition
- 5400 data-points
Omar Perez
18Statistical Dependencies
Phospho A
Phospho B
19Statistical Dependencies
Phospho A
Phospho B
Edges can be directed (primarily) due to the use
of interventions
20Overview
Influence diagram of measured variables
Bayesian Network Analysis
21Inferred Network
Phospho-Proteins
Phospho-Lipids
PKC
Perturbed in data
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
P44/42
Akt
PIP2
22How well did we do?
Phospho-Proteins
Phospho-Lipids
PKC
Perturbed in data
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
P44/42
Akt
PIP2
Direct phosphorylation
23Features of Approach
Mek
Erk
Difficult to detect using other forms of
high-throughput data -Protein-protein
interaction data -Microarrays
24How well did we do?
Phospho-Proteins
Phospho-Lipids
PKC
Perturbed in data
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
P44/42
Akt
PIP2
25How well did we do?
Phospho-Proteins
Phospho-Lipids
PKC
Perturbed in data
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
P44/42
Akt
PIP2
Indirect Signaling
26Indirect signaling
Indirect connections can be found even when the
intermediate molecule(s) are not measured
27Indirect signaling - Complex example
- Is this a mistake?
- The real picture
- Phoso-protein specific
- More than one pathway of influence
28How well did we do?
Phospho-Proteins
Phospho-Lipids
PKC
Perturbed in data
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
P44/42
Akt
PIP2
29Signaling pathway reconstruction
Phospho-Proteins
Phospho-Lipids
PKC
Perturbed in data
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
Erk
- 15/17 Classic
- 17/17 Reported
- 3 Missed
Akt
PIP2
Sachs et al 2005
30Caveats
- Inhibitor specificity
- Binding site similar across proteins
- Reagent availability and specificity
- Data quality
- These are issues in many biological apps!
I think Ill bind here
31Outline
- What are signaling pathways?
- What kind of data is available study them?
- How do we use Bayesian networks to learn their
structure?
- Two extensions
- Markov neighborhood algorithm
- Bayesian network based cyclic networks (BBCs)
32Markov Neighborhood Algorithm
33Building larger networks
- 12 color capability ? Model 50-100 variables
- 4 color capability ? Model 12 variables
80 proteins involved in MAPK signaling (11- at
the cutting edge- is NOT enough!)
33
34Measured subsets Incomplete dataset (Missing
data)
- Insufficient information for standard approaches
(will perform poorly) - Use a set of biologically motivated assumptions
to constrain search.. - And to reduce the number of experiments
-
34
35Constraining the search
- Using Markov neighborhoods
- (for each variable)
- Plus potential perturbation parents
Identify candidate parents
35
36Approach overview
Bayesian Network Analysis (Constrained search)
37Neighborhood reduction
4?11
C
4 color capability
Conditional independencies in the substructure?
A?B?C
37
38Accurate Reproduction of Model 15 experiments,
4-colors
PKC
PKA
Raf
Plc?
Jnk
P38
Mek
PIP3
Erk
Akt
PIP2
39Active learning approach
39
40Outline
- What are signaling pathways?
- What kind of data is available study them?
- How do we use Bayesian networks to learn their
structure?
- Two extensions
- Markov neighborhood algorithm
- Bayesian network based cyclic networks (BBCs)
41Learning cyclic structures with Bayesian networks
- Biological networks contain many loops
- Bayesian networks are constrained to be acyclic
- So
42Overcoming acyclicity
- Signaling pathways contain many cycles
- Bayesian networks are constrained to be acyclic
- How can we accurately model pathways with cycles?
GRB2/SOS
Ras
Raf
MEK
Develop a new, Bayesian network derived algorithm
that models cycles ?
Erk
43Bayesian Network Based Cyclic Networks (BBNs)
- I. Break loops with molecule inhibitors
- II. Use BN to learn the structure (now not
cyclic!) - III. Close loops
GRB2/SOS
Ras
Raf
Mek inhibitor
Solomon Itani
MEK
Erk
44Bayesian Network Based Cyclic Networks (BBNs)
- I. Break loops with molecule inhibitors
- Detect loops P(A)A P(A)
- II. Use BN to learn the structure (now not
cyclic!) - III. Close loops
- P(BPa(B)) A P(BPa(B))
- A?B
GRB2/SOS
Ras
Raf
MEK
Erk
45Future work
- Larger network from overlapping sets (Markov
neighborhood) - Dynamic models over time
- Differences in signaling (sub-populations,
treatment conditions, cell types, disease states)
46Acknowledgements
Garry Nolan
Dana Peer
Doug Lauffenburger
Omar Perez
Dennis Mitchell
Funding LLS post doctoral fellowship
Shigeru Okumura
Mesrob Ohannessian
Solomon Itani
46
47Extra slides
48Mathematical Intuition
B
C
A
C is independent of A given B.
A
B
C independent of A given B and D
C
D
- No need to introduce time!!!
- When loops are broken, the result is a BN!!!
49Prediction Erk?Akt
- Erk1/2 unperturbed
- Erk ? Akt not well established in literature
- Predictions
- Erk1/2 influences Akt
- While correlated, Erk1/2 does not influence PKA
PKC
PKA
Raf
Mek
Erk1/2
Akt
50Validation
- SiRNA on Erk1/Erk2
- Select transfected cells
- Measure Akt and PKA
P9.4e-5
P0.28