Title: Biostatistics Bioinformatics Core
1Biostatistics Bioinformatics Core
- Personnel
- Elizabeth Garrett, PhD Biostatistician
- Giovanni Parmigiani, PhD Biostatistician
- Data analysis and System support staff
- Hardware
- DELL server linux OS
- Linux and Windows workstations
- Software
- GeneX Database R-based analysis tools
- Labs Affy Suite, others TBA
-
2Contact Information
- Elizabeth S. Garrett
- esg_at_jhu.edu
- Suite 1103, 550 Building
- 410-614-2588
- Giovanni Parmigiani
- gp_at_jhu.edu
- Suite 1103 550 Building
- 410-614-3426
3Aims of the Biostatistics Core
- Specific Aim 1
- To provide biostatistical consultation and
support to projects in the program. - Special emphasis will be to assist in
visualization, analysis, quantitative modeling
and interpretation of results.
4Aims of the Biostatistics Core
- Specific Aim 2
- To help in identifying the appropriate data
structures ensuring data quality and data
confidentiality and developing efficient data
transferring and interfacing for data analysis
and data visualization under different platforms.
5Two important stages where we get involved
- Planning Stage
- Experimental Design
- How many samples?
- How many replicates?
- Housekeeping genes?
- Dye swapping?
- Whats the big deal? You could spend a lot of
time and money and not able to answer your
questions due to experimental errors, etc.
Before the study How can I best address my
hypothesis using minimal resources to get
maximal information?
After the study Now that I have this enormous
amount of data, how do I summarize it and answer
my questions?
- Analysis Stage
- Visualization
- Data Exploration
- Analytic Tools and Models
6What we do
- One-on-one consultations with investigators for
planning experiments - One-on-one consultations with investigators for
visualization, data exploration, and analysis. - Tutorials for helping investigators use some of
the software for exploration and visualization
independently. - Tutorials on basic statistical concepts,
including experimental design in gene expression
studies and basic analytic tools.
7GeneX
- Web based database, data mining,
- and data analysis tool
- Supports
- multiple users
- multiple species
- multiple microarray platforms
Common Denominator for data analysis
8GeneX Components
- Curation Tool (imports data)
- Database (OpenSource SQL)
- XML Data Exchange Protocol
- Query and analytic routines
- -- mining
- -- biostatistics in R
9Analytical Tools and Applications Included or
Co-developed with GeneX
- Clustering
- Visualization
- Principle Component Analysisand
Multi-Dimensional Scaling - Significance testing with R
- Integration with other databases
10Regulation of extracellular matrix changes and
fibrosis in inflammatory bowel disease.
- Shukti Chakravarti
- Feng Wu
- Department of Medicine
- Johns Hopkins University
11TNBS-colon
12(No Transcript)
13 ECM/fibrosis
activity
inflammation
time
14Analysis Plan
- Expression estimates using dChip
- Additional normalization for scanner effect
- Two-level regression model
- Identification of reliably estimable time trends
in gene expression - Grouping genes by patterns
15Normalization
16Empirical Bayes Ranking versus Statistical
Significance
P-value lt .05
17Patterns of gene expression over time
Red positive slope, low fdr
Orange and Brown low p-value
Green negative slope, low fdr