Title: Epistasis Analysis Using Microarrays
1Epistasis Analysis Using Microarrays
2Experiments with Microarrays
- Cool technology, but how do we use it? How is it
useful? - Identify marker genes in disease tissues
- Classification, diagnostics
- Toxicology, stress response
- Drug candidate screens, basic science
- Genetic factors
- Measuring interactions (chIP-on-chip)
3Overview
- Expression profiling in single-deletions
- Epistasis analysis using single- and
double-deletions - Epistasis analysis, genetic and environmental
factors - Reconstructing pathways that explain the genetic
relationships between genes
4Expression Profiling in 276 Yeast Single-Gene
Deletion Strains(The Rosetta Compendium)
- Only 19 of yeast genes are essential in rich
media - Giaever, et. al. Nature (2002)
5Clustered Rosetta Compendium Data
6Gene Deletion Profiles Identify Gene Function and
Pathways
7Principle of Epistasis Analysis
8(No Transcript)
9Experimental Design
- Compare single-gene deletions to wild type
- Compare to the double knockout to wild type
10Experimental DesignSingle vs Double-Gene
Deletions
11Classical Epistasis Analysis Using Microarrays to
Determine the Molecular Phenotypes
Time series expression (0-24hrs) every 2hrs
12Mixing Genetic and Environmental Factors
13Expression in Single-Gene Deletions(yeast mec1
and dun1 deletion strains)
14(No Transcript)
15Chen-Hsiang Yeang, PhD Craig Mak MIT
UCSD UC
Santa Cruz Yeang, Jaakkola, Ideker. J Comp Bio
(2004) Yeang, Mak, et. al. Genome Res (2005)
16Measurements
Networks
17Measurements
Networks
18Displaying deletion effects
Published work Epistasis analysis using
expression profiling (2005)
19Relevant Interactions
- Subset of Rosetta compendium used
- 28 deletions were TF (red circles)
- 355 diff. exp. genes (white boxes)
- P lt 0.005
- 755 TF-deletion effects (grey squiggles)
20Network Measurements
- Yeast under normal growth conditions
- Promoter binding
- ChIP-chip / location analysis
- Lee, et. al. Science(2002)
- Protein-protein interaction
- Yeast 2-hybrid
- Database of Interaction Proteins (DIP)
- Deane, et. al. Mol Cell Proteomics (2002)
21ChIP Measurement of Protein-DNA Interactions
(Chromatin Immunoprecipitation)
22Step 1 Network connectivity(chIP-chip analysis)
5k genes (white boxes) 20k interactions
(green lines)
23Step 2 Network annotation(gene expression
analysis)
Measure variables that are a function of the
network (gene expression). Monitor these effects
after perturbing the network (TF knockouts).
What parts are wired together
How and why the parts are wired together the way
they are
24Inferring regulatory paths
Direct
Indirect
25Annotate inducer or repressor
26Annotate inducer or repressor
27Computational methods
- Problem Statement
- Find regulatory paths consisting of physical
interactions that explain functional
relationship - Method
- A probabilistic inference approach
- Yeang, Ideker et. al. J Comp Bio (2004)
- To assign annotations
- Formalize problem using a factor graph
- Solve using max product algorithm
- Kschischang. IEEE Trans. Information Theory
(2001) - Mathematically similar to Bayesian inference,
Markov random fields, belief propagation
28Inferred Network Annotations
A network with ambiguous annotation
29Test Refine
30Which deletion experiments should we do first?
- A mutual information based score
- For each candidate experiment (gene ?)
- Variability of predicted expression profiles
- Predict profile for each possible set of
annotations - More variable more information from experiment
- Reuse network inference algorithm to compute
effect of deletion!
31Ranking candidate experiments
32We target experiments to one region of network
Expression for SOK2?, HAP4 ?, MSN4 ?, YAP6 ?
33Expression of Msn4 targets
Average signed z-score
34Expression of Hap4 targets
35Yap6 targets are unaffected
36Refined Network Model
- Caveats
- Assumes target genes are correct
- Only models linear paths
- Combinatorial effects missed
- Measurements are for rich media growth
37Using this method of choosingthe next experiment
- Is it better than other methods?
- How many experiments?
- Run simulations vs
- Random
- Hubs
38Simulation results
simulated deletions profiles used to learn a
true network
39Current Work
Measurements
Systems level understanding
Treat disease
Networks
Test Refine
Transcriptional response to DNA damage
40Acknowledgments
Trey Ideker
Craig Mak
Chen-Hsiang Yeang Tommi Jaakkola
Scott McCuine Maya Agarwal Mike Daly Ideker lab
members
Tom Begley Leona Samson
Funding grants from NIGMS, NSF, and NIH