Title: 1-month Practical Course
11-month Practical Course Genome AnalysisProtein
Structure-Function Relationships Centre for
Integrative Bioinformatics VU (IBIVU) Vrije
Universiteit Amsterdam The Netherlands www.ibivu.
cs.vu.nl heringa_at_cs.vu.nl
2Protein function
Genome/DNA Transcriptome/mRNA Proteome Metabolo
me Physiome
Transcription factors
Ribosomal proteins Chaperonins
Enzymes
3Protein function
Not all proteins are enzymes ?-crystallin eye
lens protein needs to stay stable and
transparent for a lifetime (very little turnover
in the eye lens)
4Protein function groups
- Catalysis (enzymes)
- Binding transport (active/passive)
- Protein-DNA/RNA binding (e.g. histones,
transcription factors) - Protein-protein interactions (e.g.
antibody-lysozyme) - Protein-fatty acid binding (e.g. apolipoproteins)
- Protein small molecules (drug interaction,
structure decoding) - Structural component (e.g. ?-crystallin)
- Regulation
- Transcription regulation
- Signalling
- Immune system
- Motor proteins (actin/myosin)
5What can happen to protein function through
evolution
- Proteins can have multiple functions (and
sometimes many -- Ig). - Enzyme function is defined by specificity and
activity - Through evolution
- Function and specificity can stay the same
- Function stays same but specificity changes
- Change to some similar function (e.g. somewhere
else in metabolic system) - Change to completely new function
6How to arrive at a given function
- Divergent evolution homologous proteins
proteins have same structure and same-ish
function - Convergent evolution analogous proteins
different structure but same function - Question can homologous proteins change
structure (and function)?
7Protein function evolution
Chymotrypsin
Active site (combination of ancestral active site
residues)
Putative ancestral barrel structure
Modern 2-barrel structure
Activity 1000-10,000 times enhanced
8How to evolve
- Important distinction
- Orthologues homologous proteins in different
species (all deriving from same ancestor) - Paralogues homologous proteins in same species
(internal gene duplication) - In practice to recognise orthology,
bi-directional best hit is used in conjunction
with database search program (this is called an
operational definition)
9How to evolve
- By addition of domains (at either end of protein
sequence or at loop sites see next slides) - Often through gene duplication followed by
divergence - Multi-domain proteins are a result of gene
fusion (multiple genes ending up in a single
ORF). - Repetitions of the same domain in a single
protein occur frequently (gene duplication
followed by gene fusion)
10Protein structure evolution
- Insertion/deletion of secondary structural
elements can easily be done at loop sites
These sites are normally at the surface of a
protein
11Example -- Flavodoxin fold
5(??) fold
12Flavodoxin family - TOPS diagrams (Flores et
al., 1994)
These are four variations of the same basic
topology (bottom) Do you see what is inserted as
compared to the basic topology?
2
3
4
A TOPS diagram is a schematic representation of a
protein fold
alpha-helix
1
2
3
4
5
beta-strand
5
1
13Protein structure evolution
Insertion/deletion of structural domains can
easily be done at loop sites
N C
14The basic functional unit of a protein is the
domain A domain is a
- Compact, semi-independent unit (Richardson,
1981). - Stable unit of a protein structure that can fold
autonomously (Wetlaufer, 1973). - Recurring functional and evolutionary module
(Bork, 1992). - Nature is a tinkerer and not an inventor
(Jacob, 1977).
15Delineating domains is essential for
- Obtaining high resolution structures (x-ray, NMR)
- Sequence analysis
- Multiple sequence alignment methods
- Prediction algorithms (SS, Class,
secondary/tertiary structure) - Fold recognition and threading
- Elucidating the evolution, structure and function
of a protein family (e.g. Rosetta Stone method
next lecture) - Structural/functional genomics
- Cross genome comparative analysis
16Structural domain organisation can be nasty
Pyruvate kinase Phosphotransferase
b barrel regulatory domain a/b barrel catalytic
substrate binding domain a/b nucleotide binding
domain
1 continuous 2 discontinuous domains
17Complex protein functions are a result of
multiple domains
- An example is the so-called swivelling domain in
pyruvate phosphate dikinase (Herzberg et al.,
1996), which brings an intermediate enzymatic
product over about 45 Å from the active site of
one domain to that of another. - This enhances the enzymatic activity delivery of
intermediate product not by a diffusion process
but by active transport
18- The DEATH Domain
- Present in a variety of Eukaryotic proteins
involved with cell death. - Six helices enclose a tightly packed hydrophobic
core. - Some DEATH domains form homotypic and
heterotypic dimers.
http//www.mshri.on.ca/pawson
19(No Transcript)
20Globin fold ? protein myoglobin PDB 1MBN
21 ? sandwich ? protein immunoglobulin PDB 7FAB
22TIM barrel ? / ? protein Triose phosphate
IsoMerase PDB 1TIM
23A fold in ? ? protein ribonuclease A PDB 7RSA
The red balls represent waters that are bound
to the protein based on polar contacts
24(No Transcript)
25 434 Cro protein complex (phage) PDB 3CRO
26Zinc finger DNA recognition (Drosophila) PDB
2DRP
..YRCKVCSRVY THISNFCRHY VTSH...
27Zinc-finger DNA binding protein family
Characteristics of the family
Function
The DNA-binding motif is found as part of
transcription regulatory proteins.
Structure
One of the most abundant DNA-binding motifs.
Proteins may contain more than one finger in a
single chain. For example Transcription Factor
TF3A was the first zinc-finger protein discovered
to contain 9 C2H2 zinc-finger motifs (tandem
repeats). Each motif consists of 2 antiparallel
beta-strands followed by by an alpha-helix. A
single zinc ion is tetrahedrally coordinated by
conserved histidine and cysteine residues,
stabilising the motif.
28 Zinc-finger DNA binding protein family
Characteristics of the family
Binding
Fingers bind to 3 base-pair subsites and specific
contacts are mediated by amino acids in positions
-1, 2, 3 and 6 relative to the start of the
alpha-helix. Contacts mainly involve one strand
of the DNA. Where proteins contain multiple
fingers, each finger binds to adjacent subsites
within a larger DNA recognition site thus
allowing a relatively simple motif to
specifically bind to a wide range of DNA
sequences. This means that the number and the
type of zinc fingers dictates the specificity of
binding to DNA
29Leucine zipper (yeast) PDB 1YSA
..RA RKLQRMKQLE DKVEE LLSKN YHLENEVARL...