Title: Microarray Data Analysis Using BASE
1Microarray Data Analysis Using BASE
- Danny Park
- MGH Microarray Core
- March 15, 2004
2Youve got data!
- What was I asking? remember your experimental
design - How do I analyze the data?
- How do I find interesting stuff? learn some
analysis tools - How do I trust the results? statistics is key
3What was I asking?
- Typically which genes changed expression levels
when I did ____ - Common ____
- Binary conditions knock out, treatment, etc
- Continuous scales time courses, levels of
treatment, etc - Unordered discrete scales multiple types of
treatment or mutations - This tutorials focus binary experiments
4How do I analyze the data?
- BASE BioArray Software Environment
- Data storage and distribution
- Simple filtering, normalization, averaging, and
statistics - Export/Download results to other tools
- MS Excel
- TIGR Multi Experiment Viewer (TMEV)
- This tutorials focus using BASE
5Todays Presentation
- Demonstrate the most basic analysis techniques
- Using our most frequently used software (BASE)
- For the most common kind of experiments
6Work Flow
analysis
7The Most Common experiment
- Two-sample comparison w/N replicates
- KO vs. WT
- Treated vs. untreated
- Diseased vs. normal
- Etc
- Question of interest which genes are (most)
differentially expressed?
8Experimental Design naïve
From Gary Churchill, Jackson Labs
9Experimental Design tech repl
From Gary Churchill, Jackson Labs
10Experimental Design bio repl
- Treatment
- Biological Replicate
- Technical Replicate
- Dye
- Array
From Gary Churchill, Jackson Labs
11The Most Common Analysis
- Filter out bad spots
- Adjust low intensities
- Normalize correct for non-linearities and dye
inconsistencies - Filter out dim spots
- Calculate average fold ratios and p-values per
gene - Rank, sort, filter, squint, sift data
- Export to other software
12BASE _at_ MGH
- BASE is a microarray data storage and analysis
package - BASE resides on our web server
- Data is stored at our facility
- Computation is performed on our machines
- All you need is a web browser
- https//base.mgh.harvard.edu/
- A Microarray Core technician will provide you
with a username, password, and experiment name
13BASE Login page
14BASE Login page
15BASE Login page
16BASE Login page
17BASE Logged in
18BASE Logged in
19BASE Sidebar
20BASE Sidebar
21BASE Sidebar
22BASE Sidebar
23BASE Sidebar
24BASE Sidebar
25BASE Sidebar
26BASE Sidebar
27BASE Sidebar
28BASE Sidebar
29BASE Sidebar
30BASE Sidebar
31BASE My Account
Change your password and access defaults
32BASE My Account
Change your password and access defaults
33BASE My Account
Change your password and access defaults
34BASE My Account
Change your password and access defaults
35Find your experiment
36Find your experiment
37Find your experiment
38Find your experiment
39Experiment view Four Tabs
40Experiment view Four Tabs
41Experiment view Four Tabs
42Experiment view Four Tabs
43Experiment view Four Tabs
44Experiment view Four Tabs
45Experiment view Four Tabs
46Experiment view Four Tabs
47Group slide data together
48Group slide data together
Select the slides that measure the same thing.
Later in analysis, they will be averaged
together. In this experiment, all ten slides are
replicates, so there is only one grouping.
49Group slide data together
Select the slides that measure the same thing.
Later in analysis, they will be averaged
together. In this experiment, all ten slides are
replicates, so there is only one grouping.
50Group slide data together
Select the slides that measure the same thing.
Later in analysis, they will be averaged
together. In this experiment, all ten slides are
replicates, so there is only one grouping.
51Group slide data together
52Group slide data together
Give your data set a descriptive name to
distinguish it from other slide groupings. In
this Myd88 knockout experiment, there is only one
grouping, so a generic name is fine.
53Group slide data together
Give your data set a descriptive name to
distinguish it from other slide groupings. In
this Myd88 knockout experiment, there is only one
grouping, so a generic name is fine.
54Group slide data together
Give your data set a descriptive name to
distinguish it from other slide groupings. In
this Myd88 knockout experiment, there is only one
grouping, so a generic name is fine.
55Analysis Begin
56Analysis Begin
57Analysis Begin
58Analysis Begin
59Analysis Filter Setup
Bad spots are marked with a negative Flag value.
60Analysis Filter Setup
Bad spots are marked with a negative Flag value.
61Analysis Filter Setup
Bad spots are marked with a negative Flag value.
62Analysis Filter Setup
Bad spots are marked with a negative Flag value.
63Analysis Filter Setup
Bad spots are marked with a negative Flag value.
64Analysis Filter Setup
Bad spots are marked with a negative Flag value.
65Analysis Filter Setup
Bad spots are marked with a negative Flag value.
66Analysis Filter Setup
Bad spots are marked with a negative Flag value.
67Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
68Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
69Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
70Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
71Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
72Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
73Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
74Analysis Filter Setup
Bad spots are marked with a negative Flag value.
Oligos are annotated with species codes, but
control spots are not. Set species to your
two-letter code of choice (Mm, Hs, Dr, Pa, etc)
75Analysis Filter Setup
Naming the filter and the child data set are
essential to reducing confusion later.
76Analysis Filter Setup
Naming the filter and the child data set are
essential to reducing confusion later.
77Analysis Filter Setup
Naming the filter and the child data set are
essential to reducing confusion later.
78Analysis Filter Run
79Analysis Quality Data
80Analysis Quality Data
81Analysis Unfiltered Data
82Analysis Filter Parameters
83Analysis Limit-Int Setup
84Analysis Limit-Int Setup
85Analysis Limit-Int Setup
86Analysis Limit-Int Setup
87Analysis Limit-Int Setup
88Analysis Limit-Int Setup
89Analysis Check job status
90Analysis Check job status
91Analysis Check job status
92Analysis Check job status
93Analysis Check job status
All done indicates the job is complete.
94Analysis Check job status
All done indicates the job is complete.
95Analysis Limit-Int Output
96Analysis Limit-Int Output
97Analysis Limit-Int Output
98Analysis Limit-Int Output
99Analysis Limit-Int Output
100Analysis Limit-Int Output
101Analysis Change data set name
102Analysis Change data set name
103Analysis Change data set name
Change the name of this set to Intensity limited
Data
104Analysis Change data set name
105Analysis Change data set name
106Analysis Change data set name
107Analysis Change data set name
108Analysis LOWESS Setup
109Analysis LOWESS Setup
110Analysis LOWESS Setup
111Analysis LOWESS Setup
112Analysis LOWESS Setup
113Analysis LOWESS Setup
114Analysis Check job status
115Analysis Check job status
116Analysis LOWESS Output
117Analysis LOWESS Output
118Analysis LOWESS Output
Change the name of this set to Normalized Data
using the same steps as before.
119Analysis Change data set name
Change the name of this set to Normalized Data
using the same steps as before.
120Analysis Change data set name
Change the name of this set to Normalized Data
using the same steps as before.
121Analysis Filter Setup
Set up the filter as indicated, hit Add/Update on
the Gene filter, then hit Accept and select the
resulting data set.
122Analysis Useful Data
123Analysis Useful Data
124MA Plots Raw Myd88 Data
125MA Plots Raw Myd88 Data
126MA Plots Raw Myd88 Data
127MA Plots Raw Myd88 Data
128MA Plots Quality Data
129MA Plots Quality Data
130MA Plots Quality Data
131MA Plots Quality Data
132MA Plots Quality Data
133MA Plots Quality Data
134MA Plots Int-limited Data
135MA Plots Int-limited Data
136MA Plots Int-limited Data
137MA Plots Int-limited Data
138MA Plots Int-limited Data
139MA Plots Int-limited Data
140MA Plots Normalized Data
141MA Plots Normalized Data
142MA Plots Normalized Data
143MA Plots Normalized Data
144MA Plots Normalized Data
145MA Plots Normalized Data
146MA Plots Norm. Corr. Factor
147MA Plots Norm. Corr. Factor
148MA Plots Useful Data
149MA Plots Useful Data
150MA Plots Useful Data
151MA Plots Useful Data
152MA Plots Useful Data
153MA Plots Useful Data
154Analysis Useful Data
155Analysis Useful Data
156Analysis Fold Ratio Setup
157Analysis Fold Ratio Setup
158Analysis Fold Ratio Setup
159Analysis Fold Ratio Setup
160Analysis Fold Ratio Output
161Analysis Fold Ratio Output
162Analysis Fold Ratio Output
163Analysis Fold Ratio Output
164Analysis Fold Ratio Output
165Analysis Fold Ratio Output
166Analysis Fold Ratio Output
167Analysis Fold Ratio Output
168Analysis Change list name
169Analysis Change list name
170Analysis Change list name
Change the name of this list as indicated here.
171Analysis Change list name
Change the name of this list as indicated here.
172Analysis Change list name
173Analysis Change list name
174Analysis Fold Ratio Graphs
175Analysis Fold Ratio Graphs
176Analysis Fold Ratio Graphs
177Analysis Fold Ratio Graphs
178Analysis Fold Ratio Graphs
179Analysis Fold Ratio Graphs
180Analysis t-test Setup
181Analysis t-test Setup
182Analysis t-test Setup
183Analysis t-test Setup
184Analysis t-test Output
185Analysis t-test Output
186Analysis t-test Output
187Analysis t-test Output
188Analysis t-test Output
189Analysis t-test Output
190Analysis Change list name
Change the name of this set to myd88 p-value
using the same steps as before.
191Analysis Change list name
Change the name of this set to myd88 p-value
using the same steps as before.
192Analysis Change list name
Change the name of this set to myd88 p-value
using the same steps as before.
193Analysis t-test Graphs
194Analysis t-test Graphs
195Analysis t-test Graphs
196Analysis t-test Graphs
197Analysis t-test Graphs
198Analysis t-test Graphs
199Analysis Experiment Explorer
200Analysis Experiment Explorer
201EExplore Single Gene View
202EExplore Single Gene View
203EExplore Single Gene View
204EExplore Single Gene View
205EExplore Single Gene View
206EExplore Single Gene View
207EExplore Gene List View
208EExplore Gene List View
209EExplore Gene List View
210EExplore Gene List View
Fill out the table as indicated, then hit
Add/Update.
211EExplore Gene List View
212EExplore Gene List View
213EExplore Gene List View
214EExplore Gene List View
215EExplore Gene List View
216EExplore Gene List View
217EExplore Gene List View
218EExplore Gene List View
219EExplore NCBI Links
220EExplore Gene List View
This additional row will restrict hits to P
values of 5 or less.
221EExplore Gene List View
This additional row will restrict hits to P
values of 5 or less.
222EExplore Single Gene View
223EExplore Single Gene View
224EExplore Single Gene View
225EExplore Single Gene View
226EExplore Single Gene View
227EExplore Single Gene View
228EExplore Gene List View
229EExplore Gene List View
Open MS Excel and tell it to open the file you
downloaded (typically called base.tsv).
230EExplore Gene List View
Open MS Excel and tell it to open the file you
downloaded (typically called base.tsv).
231Have Fun!
- The rest of the analysis is largely driven by
your biological understanding of the genes
indicated in these lists. We cannot help much in
the interpretation of this data. - Dont forget to go back to the raw data sets and
repeat this entire analysis for any other slide
groupings.
232Acknowledgements
MGH Microarray Core Glenn Short Jocelyn
Burke Najib El Messadi Jason Frietas Zhiyong Ren
MGH Lipid Metabolism Unit Mason Freeman Harry
Bjorkbacka
LUND (Sweden) Dept. Theoretical Physics Dept.
Oncology Carl Troein Lao H. Saal Johan
Vallon-Christersson Sofia Gruvberger Åke
Borg Carsten Peterson
MGH Molecular Biology Bioinformatics Group Chuck
Cooper Xiaowei Wang Harvard School of Public
Health Biostatistics Xiaoman Li