Introduction%20into%20Micro%20Array%20Analysis - PowerPoint PPT Presentation

About This Presentation

Title:

Introduction%20into%20Micro%20Array%20Analysis

Description:

Introduction into Micro Array Analysis – PowerPoint PPT presentation

Number of Views:78

Avg rating:3.0/5.0

Slides: 36

Provided by: UCHC7

Learn more at: http://merlot.stat.uconn.edu

Category:

more less

Transcript and Presenter's Notes

Title: Introduction%20into%20Micro%20Array%20Analysis

1
Introduction into Micro Array Analysis

Winfried Krueger, Ph. D.
UCHC M.A.C.

2
Analysis of Single Nucleotide Polymorphisms
3
Genotyping through Oligonucleotide Micro Arrays
Photolitho- graphic mask
0.8 cm
Array Production
gene specific arrayed oligonucleotides
Target synthesis

4
cDNA vs Oligonucleotide Arrays
cDNA arrays Oligonucleotide arrays
Probe Size 400 - gt 2000bp 25 -70 nts
GC Contents among and within probes variable Predetermined
Probe dependence on labeling protocol no yes
Access to tissue specific arrays fast slow
Spotting amounts with simple chemistry 100-200 ng/ml 1 mg/ml
Binding chemistry (low probe amounts) simple complex
Expression profiles for sequenced genes yes yes
Expression profiles for part. seq. genes yes no
Direct detection of SNPs no yes
Enzyme mediated detection of SNPs yes yes
5
eSensor Nanotechnology based Switch
6

Basic Concepts in Molecular Biology
Introduction into Micro Array Methodology
Image and Data Analysis

7
Micro Array Imaging

Micro array imaging methods
Phosphoimaging (33P)
Fluorescence Imaging
Confocal
epifluorescence microscopy
CCD technology)

8
Array Analysis of P2 brain mRNA
Cy3 channel P2 RNA
Cy5 channel P2 RNA
9
Micro Array Analysis

Micro array data analysis consists of two main
procedures
Image analysis
Pattern recognition
- Data normalization
- Data curation and
- Cluster Analysis

10
Generation of a simplified Expression Profile
Profile
Series of transcriptogram
11
Comparative Analysis of cDNA Microarrays

Microarrays contain large amounts of system
specific expression data
Linkage of related experimental systems increases
the information value of each individual
experiment and reduces redundancy
Replacement of one experimental condition with a
unified reference (RNA sample or general
oligonucleotide) provides the standardization
required for comparative analyses through use of
Microarray image databases (AMAD/NOMAD etc.)

12
Image Analysis
13
(No Transcript)
14
(No Transcript)
15
Selected Methods for Background Computation
Standard background determination Radial
Background determination Autocorrecting
Background determination
16
Image analysis - General Flowdiagram
Compute Pixel intensities across ROI (defined
manually or by grid file) gt compute signal area
Compute Pixel intensities across preset square
background area gt compute corrected signal
intensities per pixel
Compute Pixel intensities across preset radial
area 1 gt compute corrected signal intensities
per pixel
Compute Background intensitie from area2 area1
intensites gt compute corrected signal
intensities per pixel
Compute Pixel intensities across preset radial
area 2 gt compute corrected signal intensities
per pixel
17
Autocorrecting Image analysis - Flowdiagram
Compare Pixel intensities across ROI gt
compute signal area Determine spot morphology
parameters of signal area gt center, radius,
circumference Determine intensity distributions
across background and signal areas gt correlate
with optimized distribution Calculate
aberration of experimental vs optimized signal
distribution gt substitute outlying pixels with
anticipated pixel intensities Calculate
corrected background and signal intensities gt
compute corrected signal intensities per pixel
18
(No Transcript)
19
Comparative Analysis of an Array Image
P2VsP2 ImaGene
P2VsP2 Gleams
20
Data Variability
21
Data Curation

Array inherent error sources interfere with
accurate signal
quantitation even if sophisticated image analysis
algorithms are
employed
Target synthesis effects (cDNA length, integrity
and ribosomal cDNA contamination)
Labeling effects (labeling efficiencies)
Dye effects (higher incorporation rates for one
dye)
Substrate effects (DNA binding capacity, coating
homogeneity etc.)
Dye gene effects (preferential labeling of one
gene with one dye)
Data curation requires
Experimental approaches
Statistical Analysis

22
Considerations for Micro Array Experiments

High throughput format/Transcript coverage in
micro array experiments is inversely correlated
to experimental parameters that depend on the
amount of substrate immobilized probe
Sensitivity (function of scanner, probe and
target amounts)
Reproducibility (function of substrate, target
and arrayer)
Signal/noise ratios (function of substrate,
target, hybridization specificity and sequence)
Breadth (function of RNA abundancy)
spot density reduction increases reliability and
forces the development of customized clonesets

23
Experimental Approaches to Data Curation

Replicate and flip dye experiments
Competitive hybridization or simultaneous
hybridization both experimental and control
samples
A second experimental condition
Universal control RNA/Oligonucleotide from a
large mixture of different cell types
Simultaneous hybridization to chip immobilized
sense probes and scrambled sense probes
Internal controls
Arraying of replicate probes within each array
Incorporation of genes with constant expression
levels into the micro array
Arraying of heterologous cDNAs and spiking of
target RNA with respective in vitro transcripts
for positive/negative controls and for
normalization of the fluorescence intensities
Arraying of sense oligonucleotide and
randomized sense oligonucleotides

24
Statistical Methods for Data Curation

Currently implemented statistical methods for
data significance calculation
Holms p-value
T-test Bayesian T-test (2 categories)
Mann-Whitney test (2 categories)
ANOVA (gt2 categories KerrChurchill)
Kruskal-Wallis test (2 categories)

25
Pattern Recognition

Micro Array image data specify genes for further
functional
investigation based on their regulation.
Filtering of gene groups through pattern
recognition -
Expression Profiling
Expression profiles represent Fingerprints
characteristic for the transcriptional state of
cells
The significance of expression profiles is
measured as the
probability for their non random occurrence
specified through a
P (probability) - value.

This value is generally computed through massive
permutation of data points within and across
replicate experiments.
26
Pattern recognition - Cluster Analysis

Cluster analysis is a method of organizing genes
(objects) into groups whose members demonstrate
similar expression profiles (relatedness)
Genes within a group demonstrate expression
profiles more similar among the members of that
group than among members of different groups
clustering procedures are predominantly based on
hierachial clustering or partitioning

27
Interpretation of Primary Data by Cluster Analysis
Primary microarray data

yield near quantitative transcriptional and
translational (indirectly) activation data
specify genes for further functional
characterization on the basis of compelling
expression profiles

but most genes with are functionally
uncharacterized and a targeted research approach
requires a further data analysis

Cluster analysis
genes with similar expression profiles often
participate in related biochemical pathways
(guilt by association principle)

28
Cluster Analysis - Definition

Cluster analysis is a method of organizing genes
(objects) into groups whose members demonstrate
similar expression profiles (relatedness)
Genes within a group demonstrate expression
profiles more similar among the members of that
group than among members of different groups
clustering procedures are predominantly based on
hierachial clustering or partitioning

29
Types of Cluster Algorithms
A) Hierachial clustering

B) Partitional clustering (Traditional Methods)
Nearest Neighbor Clustering
K-means
Self organizing maps(SOM)/unsupervised neural
network

C) Partitional clustering (Hypergraph based
Methods)
Support Vector Machines
C45 Tree Decision Algorithms

30
Hierachial and K- means/SOM Cluster Analysis
Hierachial
K-means/SOM
1
5
16
types

K-means and SOMs calculate a mean of group
members (prototype) for each cluster and and
define a pattern according to the similarity
between prototypes
K-means uses a predefined number of clusters for
pattern definition
SOMs maximize the similarity of group members
within each cluster and then minimize the number
of clusters that create the single partition
Hierachial clustering finds a sequence of groups
by merging two groups at every step according to
some criterion and each node represents a cluster

31
Hypergraph based Clustering Methods
Similarity is determined by association rules in
a n-dimensional hyperspace, not by euclidian
distance between datapoints

implementation complex
number of clusters independent of data set
(multidimensional scaling)
avoid data that are far from mean (K-means) or
binary groups with highly dissimilar data
(hierachial)
maximize information

32
Cluster Analysis generates Molecular Signatures
time, cell types etc.
genes
subgroups
33
Requirement for Improved Data Analysis
What is the biological Significance of Gene
Expression Profiling?

Conventional cluster analysis identifies groups
of genes with similar transcriptional activation
profiles
Significance of cluster association for
individual members only by the guilt by
association principle
Association does not reflect true linkage to
biochemical pathways
Development of a resource that links expression
profiling with Genetic Networks and biochemical
Pathways

34
Databases with Information pertinent to
Microarray Experiments
Clone Repositories
PubMed
Sequence DBs

I.M.A.G.E./dbest

N.H.G.R.I./HGR
Entrez
M.G.S.C./MGR
Gene DBs
Genome Mapping DBs
UniGene
RefSeq
Locuslink
InterPro
HomoloGene
TransCompel/Transfac(BioBase)
TCluster_at_States-UMich
ProTSite_at_CBIL-UPenn

Ms Genome DB (MGD) _at_ Jax.
Ensembl DBs _at_ EBI
Genome DBs _at_ T.I.G.R.
Genome DBs _at_ UCSC
HuGenome DB _at_ UToronto(hGDB)
Pathway DBs
TransPath(BioBase)
K.E.G.G.
BioCarta
35
Association of Array and Annotation Data
upload array data to AIDB
Query Editor

Write a Comment

User Comments (0)