CSE182-L6 - PowerPoint PPT Presentation

About This Presentation
Title:

CSE182-L6

Description:

CSE182-L6 Protein structure basics Protein sequencing – PowerPoint PPT presentation

Number of Views:109
Avg rating:3.0/5.0
Slides: 31
Provided by: Vine63
Learn more at: https://cseweb.ucsd.edu
Category:
Tags: cse182 | helix

less

Transcript and Presenter's Notes

Title: CSE182-L6


1
CSE182-L6
  • Protein structure basics
  • Protein sequencing

2
Announcements
  • Midterm 1 Nov 1, in class.
  • Assignment 2 Online, due October 20.

3
Distinguishing between families
4
Distinguishing between families
Assignment 2
5
Profiles
  • Start with an alignment of strings of length m,
    over an alphabet A,
  • Build an A X m matrix F(fki)
  • Each entry fki represents the frequency of symbol
    k in position i

0.71
0.14
0.28
0.14
6
Scoring Profiles
Scoring Matrix
i
k
fki
s
7
Psi-BLAST idea
  • Multiple alignments are important for capturing
    remote homology.
  • Profile based scores are a natural way to handle
    this.
  • Q What if the query is a single sequence.
  • A Iterate
  • Find homologs using Blast on query
  • Discard very similar homologs
  • Align, make a profile, search with profile.

8
Psi-BLAST speed
  • Two time consuming steps.
  • Multiple alignment of homologs
  • Searching with Profiles.
  • Does the keyword search idea work?

9
Protein Domains
  • An important realization (in the last decade) is
    that proteins have a modular architecture of
    domains/folds.
  • Example The zinc finger domain is a DNA-binding
    domain.
  • What is a domain?
  • Part of a sequence that can fold independently,
    and is present in other sequences as well

10
Domain review
  • What is a domain?
  • How are domains expressed
  • Motifs (Regular expression others)
  • Multiple alignments
  • Profiles
  • Profile HMMs

11
Domain databases
Can you speed up HMM search?
12
A structural view of proteins
13
CS view of a protein
  • gtspP00974BPT1_BOVIN Pancreatic trypsin
    inhibitor precursor (Basic protease inhibitor)
    (BPI) (BPTI) (Aprotinin) - Bos taurus (Bovine).
  • MKMSRLCLSVALLVLLGTLAASTPGCDTSNQAKAQRPDFCLEPPYTGPCK
    ARIIRYFYNAKAGLCQTFVYGGCRAKRNNFKSAEDCMRTCGGAIGPWENL

14
Protein structure basics
15
Side chains determine amino-acid type
  • The residues may have different properties.
  • Aspartic acid (D), and Glutamic Acid (E) are
    acidic residues

16
Bond angles form structural constraints
17
Various constraints determine 3d structure
  • Constraints
  • Structural constraints due to physiochemical
    properties
  • Constraints due to bond angles
  • H-bond formation
  • Surprisingly, a few conformations are seen over
    and over again.

18
Alpha-helix
  • 3.6 residues per turn
  • H-bonds between 1st and 4th residue stabilize the
    structure.
  • First discovered by Linus Pauling

19
Beta-sheet
  • Each strand by itself has 2 residues per turn,
    and is not stable.
  • Adjacent strands hydrogen-bond to form stable
    beta-sheets, parallel or anti-parallel.
  • Beta sheets have long range interactions that
    stabilize the structure, while alpha-helices have
    local interactions.

20
Domains
  • The basic structures (helix, strand, loop)
    combine to form complex 3D structures.
  • Certain combinations are popular. Many sequences,
    but only a few folds

21
3D structure
  • Predicting tertiary structure is an important
    problem in Bioinformatics.
  • Premise Clues to structure can be found in the
    sequence.
  • While de novo tertiary structure prediction is
    hard, there are many intermediate, and tractable
    goals.
  • The PDB database is a compendium of structures

PDB
22
Searching structure databases
  • Threading, and other 3d Alignments can be used to
    align structures.
  • Database filtering is possible through geometric
    hashing.

23
Trivia Quiz
  • What research won the Nobel prize in Chemistry in
    2004?
  • In 2002?

24
How are Proteins Sequenced? Mass Spec 101
25
Nobel Citation 2002
26
Nobel Citation, 2002
27
Mass Spectrometry
28
Sample Preparation
29
Single Stage MS
Mass Spectrometry
LC-MS 1 MS spectrum / second
30
Tandem MS
Secondary Fragmentation
Ionized parent peptide
Write a Comment
User Comments (0)
About PowerShow.com