Dressing up Protein Sequences with Bioinformatics Data - PowerPoint PPT Presentation

About This Presentation
Title:

Dressing up Protein Sequences with Bioinformatics Data

Description:

Dressing up Protein Sequences with Bioinformatics Data – PowerPoint PPT presentation

Number of Views:144
Avg rating:3.0/5.0
Slides: 115
Provided by: tcof
Learn more at: https://tcoffee.org
Category:

less

Transcript and Presenter's Notes

Title: Dressing up Protein Sequences with Bioinformatics Data


1
Dressing up Protein Sequences with
Bioinformatics Data
Cédric Notredame
2
(No Transcript)
3
Before We Start
4
(No Transcript)
5
Important Things We will not Talk about here !!!
6
Finding Out Simple Things About Your Protein
7
Doing Biochemistry in your Computer
8
Finding Out Simple things On Expasy
9
ExpasyWhere it all starts
10
Expasy is a MAJOR service
11
Using ProtParam
12
Using ProtParam
13
Using ProtParam
14
Using ProtParam
15
Using ProtParam What it does not do for you!
16
Comparing Your Sequence With Itself
17
Comparing Your Protein With Itself
-Does my protein contain a repeated domain
? -Does my protein contain low complexity
segments?
18
The EMBnet Server
19
Dotlet Before Starting
20
Dotlet Sequence Input
21
Dotlet Getting it to work!
22
Dotlet Getting it to work!
Disapointing !!!
23
Dotlet Getting it to work!
24
Dotlet Getting it to work!
Tunning Dotlet The Right Knob at The Right
Moment
25
Dotlet Fixing the Zoom Factor!
1
1.Set the ZOOMSo that the ENTIRE Protein
Appears
26
Dotlet Choosing the right Window
2
2Set the Window SizeTo The domain size
-Repeat size 50 Window Size 51
27
Dotlet Using The Threshold
Numberof Dots
Log Curve
Score
Threshold 1
Threshold 2
28
Dotlet Using The Threshold
T2
Numberof Dots
BLACK
GREY
WHITE
Score
T1
29
Dotlet Choosing the right Window
3
3Set the Threshold
30
Dotlet Analyzing the Dotlet
Low ComplexityRegion
31
Dotlet Analyzing your Dotlet
32
Introducing our naked sequence
MALRAGLVLG FHTLMTLLSP QEAGATKADH MGSYGPAFYQ
SYGASGQFTH EFDEEQLFSV DLKKSEAVWR LPEFGDFARF
DPQGGLAGIA AIKAHLDILV ERSNRSRAIN VPPRVTVLPK
SRVELGQPNI LICIVDNIFP PVINITWLRN GQTVTEGVAQ
TSFYSQPDHL FRKFHYLPFV PSAEDVYDCQ VEHWGLDAPL
LRHWELQVPI PPPDAMETLV CALGLAIGLV GFLVGTVLII
MGTYVSSVPR
33
Introducing our naked sequence
34
1-Simple predictions
2-Repeated regions
35
Finding Out about the Secondary Structure of
Your Sequence Trans-Membranedomains
36
Predicting Secondary Structures
-Does My Protein Contain A trans-membrane domain
?
37
protscale sliding Window Methods
Average
Sliding Window
38
protscale sliding Window Methods
39
protscale sliding Window Methods
FREL_CANAL
40
protscale sliding Window Methods
Window Size Tm Domain size
41
protscale sliding Window Methods
42
Protscale Making the right Interpretation
43
Protscale Making the right Interpretation
44
Protscale Making the right Interpretation
45
Protscale Making the right Interpretation
46
tmHMMpred The state of the art
47
tmHMMpred The state of the art
48
tmHMMpred The state of the art
49
1-Simple predictions
2-Repeated regions
3-Secondary Struc
50
Predicting Secondary Structures
-Does My Protein Contain A Coiled-coil domain ?
51
Using the EMBnet COILS server
52
Using the EMBnet COILS server
53
Finding Out about the Secondary Structure of
Your Sequence Helices and Beta-sheet
54
Predicting Secondary Structures
-What is the secondary structure of my protein ?
55
Running PsiPred
Your email
56
Running PsiPred
57
How Trustable Is PsiPred ?
58
How Trustable Is PsiPred on my protein ?
Do you need to see the homologs?
59
Finding Out if your Sequence Is Modified
60
Post Translation Modifications
-Is my protein modified after its
translation -Phosphorilated -Glycolated -
61
What is a Prosite pattern ?
-A PROSITE Pattern is a Protein WORD conserved in
many sequences
PVAILL
-PROSITE Patterns lets you identify protein
families or important features of your sequence
62
What is a Prosite pattern ?
-A PROSITE Pattern Lets You identify a protein
family JUST LIKE the silver lady lets you
identify a certain brand of cars
63
Prosite Patterns can describe complex signatures
RK-x-ST
-Reads as Followsan Arginine or a Lysine,
followed by one random residue, followed by a
Serine or a Threonine
64
Prosite Patterns can describe complex signatures
C-DES-x-C-x(3)-I-x(3)-R-x(4)-P-x(4)-C-x(2)-C
Is a signature for Zn finger proteins that Bind
DNA
65
Using PrositeScan
MALRAGLVLG FHTLMTLLSP QEAGATKADH MGSYGPAFYQ
SYGASGQFTH EFDEEQLFSV DLKKSEAVWR LPEFGDFARF
DPQGGLAGIA AIKAHLDILV ERSNRSRAIN VPPRVTVLPK
SRVELGQPNI LICIVDNIFP PVINITWLRN GQTVTEGVAQ
TSFYSQPDHL FRKFHYLPFV
66
Using PrositeScan Short Patterns
67
Using PrositeScan Short Patterns
68
Using PrositeScan Short Patterns
We cannot do much with this weak but
exciting We can only REMEMBER and WAIT
69
Using PROSITE-Scan Structure
70
Other Means of Prediction Post Translational
Modifications
71
1-Simple predictions
2-Repeated regions
3-Secondary Struc
4-PROSITE motifs
72
Identifying Domain
73
The Idea of Domains
74
The Three Major Resources For Searching Domain
Collections
75
Interpro A Federation of Databases
76
The Three Major Resources For Searching Domain
Collections
Domain servers ( and domain collections) are like
good Italian restaurant they all look similar,
but each of them is a bit special and has its own
recipes
77
Finding Domains
-Is my Protein made of known domains ?
78
Interpro The Idea of Domains
79
Using InterPro Asking a question
80
Using InterPro Asking a question
81
Using CDsearch Asking a question
82
Using CDsearch Asking a question
NCBI Domain
E-Values
83
Finding Domains
-How can I be sure that the domain Prediction of
my Protein is real ?
84
Using EMBNet PFscan
85
Using EMBNet PFscan
Important Position that is Well conserved in our
sequence
86
1-Simple predictions
2-Repeated regions
4-PROSITE motifs
3-Secondary Struc
5-Conserved Domains
87
Finding Homologues
88
(No Transcript)
89
(No Transcript)
90
BLAST A summary
-BLAST compares YOUR SEQUENCE with ALL THE
SEQUENCES in a database -BLAST reports the
sequences that are the more similar to your
sequence -If your sequence is more than 30
identical to another sequence (over more than 100
residues) these two sequences probably have the
same origin, the same structure and related
biochemical functions.
91
Making Sense of it all with the 3D structure
92
Using Structural Information
-What is the 3D structure of my protein ?
93
Using Structural InformationBlasting against
PDB
94
Finding Structural Homologues with BLAST
95
Displaying Your Structure
96
Displaying Your Structure
97
Displaying Your Structure
98
Displaying Your Structure
99
Highlighting Interesting Bits
Concistency!!!
100
1-Simple predictions
2-Repeated regions
4-PROSITE motifs
5-Conserved Domains
3-Secondary Struc
6-3D structures
101
Finding out about the Genetic Environment of your
Protein
102
Using Genomic Information
-Where is my protein in the human genome.-Where
are its homologues ?
103
Using Genomic Information
104
Blasting the Human Genome
105
Blasting the Human Genome
106
Blasting the Human Genome Finding Out about
the Neighborhood Of your Gene
107
Blasting the Human Genome Finding Out about
the Neighborhood Of your Gene
108
Blasting the Human Genome Finding Out about
the Neighborhood Of your Gene
109
Blasting the Human Genome Finding Out about
the Neighborhood Of your Gene
110
Blasting the Human Genome Finding Out about
the Neighborhood Of your Gene
111
Wrapping it up!
112
1-Simple predictions
2-Repeated regions
4-PROSITE motifs
5-Conserved Domains
3-Secondary Struc
6-3D structures
7-Genomic Environment
113
5-Conserved Domains
Two Domains
6-3D structures
Tm Domain
2-Repeated regions
3-Secondary Struc
ImmunologyRelated
5-Conserved Domains
7-Genomic Environment
114
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com