Replicated Microarray Data Proportion of Differentially Expressed Genes - PowerPoint PPT Presentation

1 / 18
About This Presentation
Title:

Replicated Microarray Data Proportion of Differentially Expressed Genes

Description:

Replicated Microarray Data. Proportion of Differentially Expressed Genes. Ingrid L nnstedt. Department of Mathematics, Uppsala University ... – PowerPoint PPT presentation

Number of Views:25
Avg rating:3.0/5.0
Slides: 19
Provided by: cen7196
Category:

less

Transcript and Presenter's Notes

Title: Replicated Microarray Data Proportion of Differentially Expressed Genes


1
Replicated Microarray DataProportion of
Differentially Expressed Genes
  • Ingrid Lönnstedt
  • Department of Mathematics, Uppsala University
  • FMS Bayesian Inference in Biostatistics, April
    4 2003,
  • Tom Britton, SU (Terry Speed, UCB)

2
Microarrays-genetic fingerprints comparing two
samples of cells
Green intensity Gene actitity before
treatment Red intensity Gene activity after
treatment
One spot one gene
one logratio Mlog R/G
3
Replicated microarray data
Mijlog Rij/Gij i1...N e.g. N10000
genes j1...n e. g. n4 replicates
Thousands of genes Few replicates Large
variance Which genes are differentially
expressed between the cell samples? i.e. Which
genes have EMij different from 0?
4
Common statistics
5
Bayesian framework
  • Empirical Bayes Model

Each gene i has a (stochastic) mean mi and a
(stochastic) variance si2 Mij mi, si2
N( mi , si2 )
Most genes have mi 0. Small proportion p of
genes have mi si2 N(0, csi2 ) All genes have
si2 G-1(a,b)
6
Bayesian framework
Empirical Bayes Model
Mij mi, si2 N( mi , si2 ) si2
G-1(a,b) mi 0 prob. 1-p mi si2 N(0,
csi2 ) prob. p
7
Lodsratio for differential expression
p fixed
8
Estimate proportion p of differentially
expressed genes
Method of moments (works fine for simulated data)
SRBI data p 0.860
The model does not fit our real data!
9
What is wrong with the model?
  • The connection c between variances

Mij mi, si2 N( mi , si2 ) si2
G-1(a,b) mi 0 prob. 1-p mi si2 N(0,
csi2 ) prob. p
10
Hierarchical Bayes model
Mij mi, si2 N( mi , si2 ) si2
G-1(a,b) mi 0 prob. 1-p mi N(0, v )
prob. p
  • No explicit posterior distributions
  • Use MCMC and Gibbs sampler
  • Sample each parameter realization from
  • P (qk q-k , Data )

11
Marginal posterior for mean mP ( mi p , n ,
Di )
  • Two cases
  • mi0 prob. 1-pi
  • mi N(, ) prob. pi
  • Posterior probability pi P(mi 0 p , n ,
    Di )Probabilities only known up to
    proportionality

12
Estimate proportion p of differentially
expressed genes
Simulated data 8 replicates
SRBI data prior Ep0.5 p 0.9950 prior
Ep0.01 p0.8786
13
What is wrong with the model?
  • The connection c between variances
  • Thick tails in data

14
Hierarchical Bayes with t-distr.
  • No explicit posterior distributions
  • No explicit marginal posteriors
  • P (qk q-k , Data )
  • Use MCMC and Metropolis-Hastings single component
    algorithm

Sample each parameter realization from proposal
distribution qk q (qk q-k , Data
) Accept new realization with probability
15
Estimate proportion p of differentially
expressed genes
2 years
Sample dataset from t-model (true
p0.01) Estimate from N-model
p ?
16
What is wrong with the model?
  • The connection c between variances
  • Thick tails in data
  • If mi N(0, v ) most expressed genes have mean
    around 0

17
Hierarchical Bayes with G-distr.
P ( mi l,g)
mi
0
Sample dataset from this model (true
p0.01) Estimate from N-model
p 0.01227
18
Summary
  • Microarray data many genes, few replicates.
  • Which genes are differentially expressed? (mi 0
    )
  • Empirical Bayes method, p0.86
  • Normal Hierarchical Bayes model
  • Student t HB model ?
  • G HB model ?
Write a Comment
User Comments (0)
About PowerShow.com