Proteins are long chains of Amino Acid AA - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

Proteins are long chains of Amino Acid AA

Description:

... g1, g2. Output: A threading T. In short: Align A to model T. Given: ... Output: T: t1, t2, t3, ..., tm; start locations for core segments; Threading constraints ... – PowerPoint PPT presentation

Number of Views:18
Avg rating:3.0/5.0
Slides: 29
Provided by: csUal
Category:
Tags: acid | amino | chains | long | proteins

less

Transcript and Presenter's Notes

Title: Proteins are long chains of Amino Acid AA


1
Introduction
  • Proteins are long chains of Amino Acid (AA)
  • There are 20 different AAs

2
(No Transcript)
3
(No Transcript)
4
StructureFunction
5
Experimental methods
  • Crystallography- Performed by X-ray diffraction
    and neutron-diffraction.
  • Nuclear Magnetic Resonance (NMR)
  • Very expensive and time consuming

6
Methods for Protein folding
  • Homology modeling
  • When one can find a known structure protein
    with good sequence similarity (over 30) to the
    protein we wish to fold.
  • Protein threading
  • Less conclusive similarity
  • Ab-initio
  • No homology is available

7
Protein threading
  • Form a database of known folds
  • Given a sequence, find most likely structure from
    database
  • Thread sequence through structure

8
If we have an efficient threading algorithm
  • Form a database of known folds
  • Given a sequence, thread this new sequence
    through all the models in the library
  • See which one does best

9
Protein threading
  • Profile method
  • Core threading method
  • -branch and bound
  • Dynamic Programming
  • -divide and conquer

10
Core threading
11
  • Protein threading problem definition
  •          Input   Given protein sequence A
                Core structural model M
                Score functions g1, g2.            
    Output  A threading T.             In short 
    Align A to model T.
  •         Given             A Protein sequence
    of length n a1, a2, a3, , an             M
    m core segments C1, C2, C3, , Cm            
    c1, c2, c3, , cm length of core segments
                l1, l2, l3, , lm-1 loop regions
    connecting core segments             l1max,
    l2max, l3max, ,  lm-1max maximum lengths of
    loop regions             l1min, l2min, l3min,
    ,  lm-1min minimum lengths of loop regions
                Properties of each amino acid
                F, g1, g2 score functions to
    evaluate threading
  •     

12
Score function
13
Output  T  t1, t2, t3, , tm  start locations
for core segments
where, g1 and g2  are based on the given model
M.  g1 shows how each segment corresponds to core
segment i in the model, and g2 deals with the
interactions between segments.  So to solve the
threading problem, we have to decide on  t1, t2,
t3, , tm, so that the overall score is maximum. 
Thus the threading problem, or alignment problem,
is converted to an optimization problem.  
14
Threading constraints
15
(No Transcript)
16
Branch and Bound
  • Set of all possible threadings defined by initial
    position bounds
  • Divide possible threadings into smaller sets, and
    compute new position bounds for each set
  • Compute a quick score lower bound for each set of
    threadings
  • Keep re-dividing the set with smallest lower
    bound, until set size if 1.

17
Branch and Bound
18
Branch and Bound
  • Given a set of threadings defined by position
    bounds, one possible score lower bound is

19
Branch and Bound-Issues
  • Constructing score function
  • Calculating lower bound
  • Choosing split segment
  • Choosing split point

20
Dynamic Programming
21
Dynamic Programming
  • Detect local region of high similarity among the
    target and the template sequence.
  • Local alignment
  • Exploit sequence as well as structural signals

22
Dynamic Programming
  • Any pair of locally aligned segments divides the
    unmatched region of both protein into two parts.
  • They can be processed independently with the same
    approach. Divide-and-conquer.
  • After dividing, the changed structural features
    of the template are recorded

23
Dynamic Programming
  • The algorithm proceeds recursively, until in the
    local alignment step no more significant similar
    segment pairs are found. E.g. only ONE core
    structure.

24
Dynamic Programming
25
Dynamic Programming
  • We can give more than one candidate while doing
    local alignment.
  • This produces a tree.
  • At the end, we assemble the respective threading
    alignments and compute their scores

26
PROSPECT
  • Use Divide and Conquer
  • References
  • Xu, Y.D., D. Xu, and E.C. Uberbacher, "An
    Efficient Computational Method for Globally
    Optimal Threading", Journal of Computational
    Biology, 5 (3), 597-614, 1998.
  • Xu, Ying and D. Xu, "Protein Threading using
    PROSPECT design and evaluation", Protein
    Structure, Function, Genetics, Vol 40, pp 343 -
    354, 2000.

27
PROSPECT
  • Energy function
  • In the early edition, all ?s are set to be 1

28
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com