Title: Protein Evolution, Coevolution and Interaction Networks Day 3
1Protein Evolution, Co-evolution and Interaction
Networks(Day 3)
- Matteo Pellegrini
- Rosetta Inpharmatics
2Protein Networks
Matteo Pellegrini Protein Pathways Inc.
3Outline
- Introduction
- Methods to infer protein couplings
- Direct detection of protein-protein interactions
- Analysis of expression data
- Properties of protein networks
- Applications of protein networks
- Identifying genes involved in osteoclast
differentiation
4Cells Contain High Concentrations of Proteins
that Participate in a Multitude of Interactions
Escherichia coli drawn to molecular scale
by David Goodsell
5Types of Biological Molecular Network Models
- Models of molecular interaction networks may be
constructed at varying levels of resolution - Quantitative models use sets of differential
equations to model molecular concentrations - Qualitative models use graphs to represent
functional relationships between molecules
6Quantitative Model EGFR Pathway
Ordinary differential equations include 94
state variables 95 parameters From Schoeberl B,
Eichler-Jonsson C, Gilles ED, Muller G.
Computational modeling of the dynamics of the MAP
kinase cascade activated by surface and
internalized EGF receptors. Nat Biotechnol. 2002
Apr20(4)370-5.
7Qualitative Model 2-hybrid yeast protein map
1548 proteins 2358 interactions From Schwikowski
B, Uetz P, Fields S. A network of
protein-protein interactions in yeast. Nat
Biotechnol. 2000 Dec18(12)1257-61.
8Advantages and Disadvantages of Different Network
Models
Quantitative Advantage allows for
quantitative predictions of perturbations on the
system Disadvantage requires an understanding
of the network connectivity as well as reaction
rates many parameters must be modeled per
interaction. These models are typically
constructed for small networks (lt 10 proteins)
Qualitative Advantage requires only the
understanding of network connectivity (1 bit per
interaction) Disadvantage leads to qualitative
not quantitative hypotheses
9Qualitative Protein Networks May be Reconstructed
From Varied Data
Direct measurements of protein interactions
Analysis of co-evolving genes
Automated analysis of literature
Analysis of expression microarrays
10Experimental Methods for Detection of
Protein-Protein Physical Interactions
- Physical Interactions
- Two hybrid
- Co-purification
- Protein Fragment Complementarity Assays
- Protein Chips
11Database of Interacting Proteins (DIP)
12Properties of Protein Networks
13One Subgraph Typically Includes Most Nodes
Number of clusters
Number of proteins per network
14Most Nodes are Connected by Short Path Lengths
15Networks are Scale-free
16Network Hubs Tend to be Proteins Essential for
Survival
17Protein Networks Tend to have High Local
Connectivity
High Connectivity
Low Connectivity
18Qualitative Protein Networks May be Reconstructed
From Varied Data
Direct measurements of protein interactions
Analysis of co-evolving genes
Automated analysis of literature
Analysis of expression microarrays
19Applications of Expression Data Analysis
- Classification of experiments
- cancer diagnoses
- Classification of gene relationships
- find co-expressed genes
20Degree of Correlation
The degree of correlation between two genes is
computed by the Pearson correlation coefficient
21Conventional Analysis of Expression Data using
Hierarchical clustering
22Challenges for Inferring Gene Relationships from
Expression Data
Extract correlations between genes from
correlated data sets
Experiment 2
Experiment 1
23Correlations Between Experiments may be Removed
by Transforming Coordinates
Experiment 2
Experiment 2
Experiment 1
Experiment 1
24Ergosterol Biosyntehsis Pathway in Yeast
According to KEGG
25Recovery of Ergosterol Pathway using Yeast
Expression Data
26Co-expression Among Ergosterol Genes in Untreated
Expression Data
27Co-expression Among Ergosterol Genes in
Decorrelated Expression Data
28Benchmarking Co-expression Data
29Treating Rheumatoid Arthritis by Discovering
Genes Involved in T cell Activation
30- T Cell Receptor (TCR) Signaling
- The immune response involves the recognition by T
lymphocytes of peptide fragments (antigens)
derived from foreign pathogens - This recognition event is mediated by the T cell
receptor (TCR) - This signaling cascade leads to the induction of
transcription of the IL-2 gene - T cell activation is implicated in autoimmune
diseases -
31T Cell Receptor (TCR) Signaling
32Human Network
- Links are generated from the analysis of
literature, co-evolution and co-expression - 200,000 links between 20,000 human proteins
- Links are 70 accurate in recapitulating known
pathway associations - 30,000 links between 7000 proteins are supported
by multiple methods
33Uncharacterized Receptor linked to T Cell Receptor
34Experimental Validation of Computational
Predictions
T cell
Ionomycin (P/I) CD3/28
RNAi
IL2
35Reduction of Receptor X mRNA leads to
upregulated IL-2 Production
siRNA
36Receptor X Gene is Induced Following T Cell
Activation
Induced Gene of Interest Following T Cell
Activation
Jurkat
Peripheral CD4 Cells
CD3/28 P/I CD3/28 P/I CD3/28 P/I
CD3/28 P/I CD3/28 P/I CD3/28 P/I
CD3/28 P/I
______ 30m
______ 30m
______ 2h
______ 4h
______ 8h
______ 4h
______ 8h
37Receptor X is a Putative Drug Target
- Orphan GPCR in rhodopsin family
- Coexpressed with TCR pathway proteins
- Target is upregulated upon TCR activation
- Reduction of target mRNA leads to upregulated
IL-2 Production - Novel drug would be an agonist to target causing
downregulation of TCR activation
38Acknowledgements
- Michel Thompson
- Peter Bowers
- Kelly Oliner
- Seenu Kothakota
- Bill Boyle
- Steve Wickert
- Joe Fierro
- Darin Taverna
- Taruna Arora
- Marco Vasquez