Identifying Changes in Signaling from HighThroughput Data - PowerPoint PPT Presentation

1 / 38
About This Presentation
Title:

Identifying Changes in Signaling from HighThroughput Data

Description:

ROSETTA DATA. From 5 to 20 patterns were posited in the analysis. ... ROC analysis was performed. Bioinformatics. Fox Chase Cancer Center. ROSETTA DATA ... – PowerPoint PPT presentation

Number of Views:73
Avg rating:3.0/5.0
Slides: 39
Provided by: tri5421
Category:

less

Transcript and Presenter's Notes

Title: Identifying Changes in Signaling from HighThroughput Data


1
Identifying Changes in Signaling from
High-Throughput Data
Michael Ochs Fox Chase Cancer Center
2
The New Paradigm
Group 1
Group 2
Targeted Therapies
Personalized Medicine
Your Chromosomes Here
3
Outline
  • Signaling and Gene Expression
  • Bayesian Decomposition
  • Examples of Analyses

4
Cellular Signaling
Extracellular Signal
Signal Transduction
Metabolic Changes
Transcription
Downward, Nature, 411, 759, 2001
5
Gene Expression
6
Identifying Pathways
A B C D E
7
Goal of Analysis
8
Biological Model
But the Gene Lists are Incomplete as are
the Network Diagrams!
9
Issues to Solve
  • Overlapping Signals
  • Genes are involved in multiple processes
  • Various processes are active simultaneously in
    any observed data
  • Identification of Process Behind Signal
  • If find a signal, what is the cause
  • Do identification without a complete model

10
Outline
  • Signaling and Gene Expression
  • Bayesian Decomposition
  • Examples of Analyses

11
Data

(Spellman et al, Mol Biol Cell, 9, 3273,
1999) (Cho et al, Mol Cell, 2, 65, 1998)
12
BD Identification of Signals
condition 1
condition M
gene 1





pattern k
pattern 1
gene 1


X

gene N
Data
gene N
13
Markov Chain Monte Carlo
We cannot always solve the problem directly, we
can only estimate relative probabilities of
possible solutions
Markov Chain Monte Carlo is used to explore the
possible solutions
14
Bayesian Statistics
p(data model) p(model)
p(model data)
p(data)
condition 1
condition M
pattern 1
pattern k
gene 1





gene 1


pattern 1



X
pattern k
condition M
condition 1
gene N
gene N
15
Outline
  • Signaling and Gene Expression
  • Bayesian Decomposition
  • Examples of Analyses

16
Acknowledgements
  • Tom Moloshok (Cell Cycle, Mouse)
  • Ghislain Bidaut (Yeast Deletion Mutants)
  • Andrew Kossenkov (TFs, YDMs)
  • Bill Speier, DJ Datta, Daniel Chung, Ryan
    Goldstein, Matt Lewandowski

17
Cell Cycle
Tobin and Morel, Asking About Cells, Harcourt
Brace, 1997
18
Data
  • Data Expression data of 788 yeast cell-cycle
    regulated genes Cho, 1998 across 17 different
    time points was taken for analysis.
  • Coregulation 11 groups (from 5 to 17 genes in
    each group 67 genes in total, 18 from 67 genes
    belong to more than one group) were composed,
    based on literature review (not cell cycle
    literature).
  • Analysis with and without coregulation
    information

19
Validation
Cherepinsky et al, PNAS, 100, 9668, 2003
20
ROC Analysis
ROC Receiver Operator Characteristic
Fraction of called positives that are correct
Sensitivity
Fraction of called negatives that are correct
TP true positive TN true negative FP false
positive FN false negative
1 - Specificity
Area under the curve is the measurement of
algorithm efficacy
21
Hierarchical Clustering
ROC Curve
Cherepinsky et al, PNAS, 100, 9668, 2003
22
Bayesian Decomposition
Sensitivity
1 - Specificity
23
Deletion Mutant Data Set
(Hughes et al, Cell, 102, 109, 2000)
  • 300 Deletion Mutants in S. cerevisiae
  • Biological/Technical Replicates with Gene
    Specific Error Model
  • Filter Genes
  • gt25 Data Missing in Ratios or Uncertainties
  • lt 2 Experiments with 3 Fold Change
  • Filter Experiments
  • lt 2 Genes Changing by 3 Fold
  • 228 Experiments/764 Genes

24
BD Matrix Decomposition
Distribution of Patterns (what genes are in
patterns)
Mutant 1
Mutant M
gene 1





pattern k
pattern 1
gene 1


X

Patterns of Behavior (does mutant
contain pattern)
gene N
Data
gene N
25
Analysis
  • Bayesian Decomposition
  • Identify patterns and linked genes
  • Use genes to determine function
  • Interpretation of Functions
  • Gene Ontology
  • Transcription factor data
  • Validation

26
Use of Ontology Pattern 13
13
15
27
The Other Pattern 15
13
15
28
Transcription Factors
Signaling Pathways
29
Genes from Pattern 13
Fig1 Prm6 Fus1 Ste2 Aga1 Fus3 Pes4 Prm1 ORF
Bar1
known to be involved in mating response
known to be regulated by Ste12p
30
Validation
(Posas, et al, Curr Opin Microbiology, 1, 175,
1998)
31
Pattern 13 Mutants
32
Pattern 15 Mutants
33
Conclusions
  • Transcriptional Response Provides Signatures of
    Pathway Activity
  • Ontologies Can Guide Interpretation
  • Bayesian Decomposition Can Dissect Strongly
    Overlapping Signatures

34
Acknowledgements
Fox Chase
  • Tom Moloshok
  • Jeffrey Grant
  • Yue Zhang
  • Elizabeth Goralczyk
  • Liat Shimoni
  • Luke Somers (UPenn)
  • Olga Tchuvatkina
  • Michael Slifker
  • Sinoula Apostolou
  • Brendan Reilly

Ghislain Bidaut (UPenn CBIL) Andrew
Kossenkov Vladimir Minayev (MPEI) Garo Toby (Dana
Farber) Yan Zhou Aidan Petersen Bill Speier
(Johns Hopkins) Daniel Chung (Columbia) DJ Datta
(UCSF) Elizabeth Faulkner (UPenn) Frank
Manion Bob Beck
  • Collaborators
  • A. Godwin (FCCC)
  • A. Favorov (GosNIIGenetika)
  • J.-M. Claverie (CNRS)
  • G. Parmigiani (JHU)
  • O. Favorova (RMSU)

35
Patterns as Basis Vectors
BD
36
MakingProteins(Phenotype)
37
ROSETTA DATA
  • From 5 to 20 patterns were posited in the
    analysis.
  • Results were checked on information about
    Metabolic Pathways taken from Saccharomyces
    Genome Database - 11 groups of 4-6 genes, known
    to be involved in the same metabolic pathways.
  • ROC analysis was performed

38
ROSETTA DATA
Write a Comment
User Comments (0)
About PowerShow.com