Title: Bioinformatics for Proteomics
1Bioinformatics for Proteomics Shu-Hui Chen
(???) Department of Chemistry National Cheng Kung
University
2What is Proteomics ?
Systematic analysis of All
protein sequences All protein
expression pattern All protein
interactions This involves
Protein identification Protein
quantification Functional
characterization of all proteins
3MS-based Bioinformatics
- MS instrument is so far not sensitive enough to
resolve proteins in a biological system solely
based on signals measured. - MS, however, is able to acquire sufficient data
for mapping a protein from the database using new
computer algorithms to analyze the data. - MS data could be used for Protein Quantification
-
4The tools of Proteomics
Traditional protein chemistry assay methods
struggle to establish Identity and
quantification Identity requires
Specificity of measurement (Precision)
Mass Spectrometry
MS-based data acquisition algorithm
Database dependent identification-A reference for
comparison Protein sequence
databases Search algorithms
Database-independent
identification-No reference
De-novo sequencing
5MS-based Protein Identification and Quantification
? Mass Mapping Peptide Sequencing
Protein Quantification
6Mass Spectrometry Protein identified by database
mapping
7(No Transcript)
8(No Transcript)
9http//www.expasy.org/tools/
10www.uniprot.org
11(No Transcript)
12(No Transcript)
13(No Transcript)
14(No Transcript)
15(No Transcript)
16Automated Database Search
Number 1 match tumor necrosis factor type 1
receptor associated protein TRAP-1 (Mr) 76030.27
Total coverage 33.4
17History for MS Searching
MOWSE
1993
By Pappin and Bleasby
SEQUEST
1994
By Yates and Eng
MOWSE?
1996
Molecular Weight Search
1997
MOWSE?
1998
MASCOT
By Matrix science
18(No Transcript)
19(No Transcript)
20(No Transcript)
21(No Transcript)
22(No Transcript)
23(No Transcript)
24(No Transcript)
25(No Transcript)
26(No Transcript)
27Scoring algorithm
Final score -10LOG(P), where P is absolute
probability that the observed match is a random
event E value (expected value) describes the
number of hits one can expect to see by chance
when searching a database of a particular size.
A value of zero indicates that no matches would
be expected by chance.
28(No Transcript)
29(No Transcript)
30(No Transcript)
31MS-based Protein Identification and Quantification
- Mass Mapping
- Peptide Sequencing
-
- Protein Quantification
32Tandem Mass Spectrometry- MS/MS
MS/MS acquisition is controlled by software
setting
33Nomenclature used for CID peptide
fragmentation- Low Energy (eV)- Q, TOF, FT
Bioanalytical Chemistry Mikkelsen, S.R.,
published by John Wiley Sons, Inc.
34(No Transcript)
35(No Transcript)
36Protein Identification by Database Search
37(No Transcript)
38(No Transcript)
39(No Transcript)
40(No Transcript)
41(No Transcript)
42(No Transcript)
43(No Transcript)
44(No Transcript)
45(No Transcript)
46PRIDE website
47http//www.ebi.ac.uk/pride
48(No Transcript)
49(No Transcript)
50(No Transcript)
51(No Transcript)
52(No Transcript)
53Database Independent Sequencing- De Novo
Sequencing
http//www.bioinfor.com/peaks/tutorials/denovo.htm
l
54Sequence Tag Approach for Peptide Sequencing
Bioanalytical Chemistry Mikkelsen, S.R.,
published by John Wiley Sons, Inc.
55MS-based Protein Identification and Quantification
- Mass Mapping
- Peptide Sequencing
- Protein Quantification
56MS-based Quantification Methods Using Stable
Isotope Labeling
X.J. Li Institute for systems biology, Seattle
57(No Transcript)
58Exercises
1. Choose one protein you are interested in. 2.
Go to the Bioinformatics portal website
http//www.expasy.org/tools/, link to
Uniprot and search for your protein. 3. List
accession number, molecular weight, and aminoacid
sequence of your protein. Briefly describe
the protein function and its subcellular
localization. 4. Link to PaxDb proteomic
database, show the abundance of the protein. 5.
Link to PRIDE or MaxQB proteomic database, list
the peptides (sequences) that have been
identified by MS for this protein. Include
Mascot peptide score for each identified
sequence. If same peptides were identified
by multiple times, list the highest score.