Title: Evolution of bacterial regulatory systems
1Evolution of bacterial regulatory systems
- Mikhail Gelfand
- Research and Training Center Bioinformatics
- Institute for Information Transmission Problems
- Moscow, Russia
ASM, Philadelphia, 18.IV.2009
2Catalog of events
- Expansion and contraction of regulons
- New regulators (where from?)
- Duplications of regulators with or without
regulated loci - Loss of regulators with or without regulated loci
- Re-assortment of regulators and structural genes
- especially in complex systems
- Horizontal transfer
3Trehalose/maltose catabolism in
alpha-proteobacteria
Duplicated LacI-family regulators
lineage-specific post-duplication loss
4The binding motifs are very similar (the blue
branch is somewhat different to avoid
cross-recognition?)
5Utilization of an unknown galactoside in
gamma-proteobacteria
Yersinia and Klebsiella two regulons, GalR and
Laci-X
Erwinia one regulon, GalR
Loss of regulator and merger of regulons It
seems that laci-X was present in the common
ancestor (Klebsiella is an outgroup)
6Utilization of maltose/maltodextrin in Firmicutes
Displacement invasion of a regulator from a
different subfamily (horizontal transfer from a
related species?) blue sites
7Orthologous TFs with completely different
regulons (alpha-proteobaceria and Xanthomonadales)
8Cryptic sites and loss of regulators
Loss of RbsR in Y. pestis (ABC-transporter also
is lost)
RbsR binding site
Start codon of rbsD
9Regulon expansion, or how FruR has become CRA
- CRA (a.k.a. FruR) in Escherichia coli
- global regulator
- well-studied in experiment (many regulated genes
known) - Going back in time looking for candidate
CRA/FruR sites upstream of (orthologs of) genes
known to be regulated in E.coli
10Common ancestor of gamma-proteobacteria
Mannose
Glucose
ptsHI-crr
manXYZ
edd
epd
eda
adhE
aceEF
icdA
ppsA
pykF
mtlD
mtlA
Mannitol
pckA
gpmA
pgk
gapA
fbp
pfkA
aceA
tpiA
fruK
fruBA
Fructose
aceB
Gamma-proteobacteria
11Common ancestor of the Enterobacteriales
Mannose
Glucose
ptsHI-crr
manXYZ
edd
epd
eda
adhE
aceEF
icdA
ppsA
pykF
mtlD
mtlA
Mannitol
pckA
gpmA
pgk
gapA
fbp
pfkA
aceA
tpiA
fruK
fruBA
Fructose
aceB
Gamma-proteobacteria Enterobacteriales
12Common ancestor of Escherichia and Salmonella
Mannose
Glucose
ptsHI-crr
manXYZ
edd
epd
eda
adhE
aceEF
icdA
ppsA
pykF
mtlD
mtlA
Mannitol
pckA
gpmA
pgk
gapA
fbp
pfkA
aceA
tpiA
fruK
fruBA
Fructose
aceB
Gamma-proteobacteria Enterobacteriales E. coli
and Salmonella spp.
13Regulation of amino acid biosynthesis in the
Firmicutes
- Interplay between regulatory RNA elements and
transcription factors - Expansion of T-box systems (normally RNA
structures regulating aminoacyl-tRNA-synthetases)
14Why T-boxes?
- May be easily identified
- In most cases functional specificity may be
reliably predicted by the analysis of the
specifier codons (anti-anti-codons) - Sufficiently long to retain phylogenetic signal
- gt T-boxes are a good model of regulatory
evolution
15Partial alignment of predicted T-boxes
TGG T-box
Aminoacyl-tRNA synthetases
Amino acid biosynthetic genes
Amino acid transporters
16 continued (in the 5 direction)
anti-anti (specifier) codon
Aminoacyl-tRNA synthetases
Amino acid biosynthetic genes
Amino acid transporters
17805 T-boxes in 96 bacteria
- Firmicutes
- aa-tRNA synthetases
- enzymes
- transporters
- all amino acids excluding glutamate
- Actinobacteria (regulation of translation
predicted) - branched chain (ileS)
- aromatic (Atopobium minutum)
- Delta-proteobacteria
- branched chain (leu enzymes)
- Thermus/Deinococcus group (aa-tRNA synthases)
- branched chain (ileS, valS)
- glycine
- Chloroflexi, Dictyoglomi
- aromatic (trp enzymes)
- branched chain (ileS)
- threonine
18Recent duplications and bursts ARG-T-box in
Clostridium difficile
19 caused by loss of transcription factor AhrC
20Duplications and changes in specificity
ASN/ASP/HIS T-boxes
21Blow-up 1
22Blow-up 2. Prediction
- Regulators lost in lineages with expanded
HIS-T-box regulon??
23 and validation
- conserved motifs upstream of HIS biosynthesis
genes - candidate transcription factor yerC co-localized
with the his genes - present only in genomes with the motifs upstream
of the his genes - genomes with neither YerC motif nor HIS-T-boxes
attenuators
Bacillales (his operon)
Clostridiales Thermoanaerobacteriales Halanaerobia
les Bacillales
24The evolutionary history of the his genes
regulation in the Firmicutes
25More duplications THR-T-box in C. difficile and
B. cereus
26T-boxes Summary / History
27Life without Fur
28Regulation of iron homeostasis (the Escherichia
coli paradigm)
- Iron
- essential cofactor (limiting in many
environments) - dangerous at large concentrations
- FUR (responds to iron)
- synthesis of siderophores
- transport (siderophores, heme, Fe2, Fe3)
- storage
- iron-dependent enzymes
- synthesis of heme
- synthesis of Fe-S clusters
- Similar in Bacillus subtilis
29Regulation of iron homeostasis in a-proteobacteria
- Experimental studies
- FUR/MUR Bradyrhizobium, Rhizobium and
Sinorhizobium - RirA (Rrf2 family) Rhizobium and Sinorhizobium
- Irr (FUR family) Bradyrhizobium, Rhizobium and
Brucella
30Distribution of transcription factors in genomes
Search for candidate motifs and binding sites
using standard comparative genomic techniques
31FUR/MUR branch of the FUR family
32FUR and MUR boxes
Erythrobacter litoralis
Caulobacter crescentus
Novosphingobium aromaticivorans
Zymomonas mobilis
Sphinopyxis alaskensis
Oceanicaulis alexandrii
Rhodospirillum rubrum
Gluconobacter oxydans
Magnetospirillum magneticum
Parvularcula bermudensis -
Identified Mur-binding sites
Bacillus subtilis
Sequence logos for the known Fur-binding sites
in Escherichia coli and Bacillus subtilis
Mur
a
of - proteobacteria -
Escherichia coli
33Irr branch of the FUR family
34Irr boxes
- Rhizobiaceae plus
- Bradyrhizobiaceae
- Rhodobacteriaceae
- Rhodospirillales
35RirA/NsrR family (Rhizobiales)
36IscR family
37Regulation of genes in functional subsystems
Rhizobiales
Bradyrhizobiaceae
Rhodobacteriales
The Zoo (likely ancestral state)
38Reconstruction of history
Frequent co-regulation with Irr
Strict division of function with Irr
Appearance of theiron-Rhodo motif
39All logos and Some Very Tempting Hypotheses
2
- Cross-recognition of FUR and IscR motifs in the
ancestor. - When FUR had become MUR, and IscR had been lost
in Rhizobiales, emerging RirA (from the Rrf2
family, with a rather different general
consensus) took over their sites. - Iron-Rhodo boxes are recognized by IscR directly
testable
1
3
40Summary and open problems
- Regulatory systems are very flexible
- easily lost
- easily expanded (in particular, by duplication)
- may change specificity
- rapid turnover of regulatory sites
- With more stories like these, we can start
thinking about a general theory - catalog of elementary events how frequent?
- mechanisms (duplication, birth e.g. from enzymes,
horizontal transfer) - conserved (regulon cores) and non-conserved
(marginal regulon members) genes in relation to
metabolic and functional subsystems/roles - (TF family-specific) protein-DNA recognition code
- distribution of TF families in genomes
distribution of regulon sizes etc.
41People
- Andrei A. Mironov software, algorithms
- Alexandra Rakhmaninova SDP, protein-DNA
correlations - Olga Kalinina (on loan to EMBL) SDP
- Yuri Korostelev protein-DNA correlations
- Olga Laikova LacI
- Dmitry Ravcheev CRA/FruR
- Dmitry Rodionov (on loan to Burnham Institute)
iron etc. - Alexei Vitreschak T-boxes and riboswitches
- Andy Jonson (U. of East Anglia) experimental
validation (iron) - Leonid Mirny (MIT) protein-DNA, SDP
- Andrei Osterman (Burnham Institute)
experimental validation
- Howard Hughes Medical Institute
- Russian Foundation of Basic Research
- Russian Academy of Sciences, program Molecular
and Cellular Biology
42(No Transcript)