Download presentation
Presentation is loading. Please wait.
Published byGodwin Berry Modified over 9 years ago
1
Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems Moscow, Russia ISBRA, Ft. Lauderdale, 14.IV.2009
2
Plan Co-evolution of transcription factors and their binding motifs Evolution of regulatory systems and regulons
3
Regulators and their motifs Cases of motif conservation at surprisingly large distances Subtle changes at close evolutionary distances Correlation between contacting nucleotides and amino acid residues
4
NrdR (regulator of ribonucleotide reducases and some other replication-related genes): conservation at large distances
5
DNA motifs and protein-DNA interactions CRP PurR IHFTrpR Entropy at aligned sites and the number of contacts (heavy atoms in a base pair at a distance <cutoff from a protein atom)
6
The CRP/FNR family of regulators
7
Correlation between contacting nucleotides and amino acid residues CooA in Desulfovibrio spp. CRP in Gamma-proteobacteria HcpR in Desulfovibrio spp. FNR in Gamma-proteobacteria DD COOA ALTTEQLSLHMGATRQTVSTLLNNLVR DV COOA ELTMEQLAGLVGTTRQTASTLLNDMIR EC CRP KITRQEIGQIVGCSRETVGRILKMLED YP CRP KXTRQEIGQIVGCSRETVGRILKMLED VC CRP KITRQEIGQIVGCSRETVGRILKMLEE DD HCPR DVSKSLLAGVLGTARETLSRALAKLVE DV HCPR DVTKGLLAGLLGTARETLSRCLSRMVE EC FNR TMTRGDIGNYLGLTVETISRLLGRFQK YP FNR TMTRGDIGNYLGLTVETISRLLGRFQK VC FNR TMTRGDIGNYLGLTVETISRLLGRFQK TGTCGGCnnGCCGACA TTGTgAnnnnnnTcACAA TTGTGAnnnnnnTCACAA TTGATnnnnATCAA Contacting residues: REnnnR TG: 1 st arginine GA: glutamate and 2 nd arginine
8
The correlation holds for other factors in the family
9
The LacI family: subtle changes in motifs at close distances G A n CGCG GnGn GCGC
10
The LacI family: systematic analysis 1369 DNA-binding domains in 200 orthologous rows =35%, =71 а.о. 4484 binding sites, L=20н., =45% Calculate mutual information between columns of TF and site alignments Set threshold on mutual information of correlated pairs
11
Definitions Sites LAFDHDQILQMAQERLQGKVRYQP-IGFELLPEKFSLRQLQRMYETVLGRS---LDKRNF LAFDHNQILDYGYQRLRNKLEYSP-IAFEVLPELFTLNDLFQLYTTVLGED--FADYSNF tTAaTGgCTTTAtGcCACTAT LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHNQILDYGYQRLRNKLEYSP-IAFEVLPELFTLNDLFQLYTTVLGED--FADYSNF LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN—-FSDYSNF LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN--FSDYSNF TTAaaGTAAtAaTTACCATAA AaAtTGTCTTTAtGcCACTAT TTATGGTAAATTcTACCATAA TTATgGTCAgTTTcACcAaAA tTAaTGgCTTTAtGcCACTAT TTaGTCgAAATAaccaACtAA TTATCGTCAtCtcGACGACAA TttAGGTAAgTTATACTTTTA Z-score Mutual information Protein alignment
12
Correlated pairs
13
Higher order correlations -ATIKDVAKRANVSTTTV- AATTGTGAGCGCTCACT SLSQ TL TQ
14
Not a phylogenetic trace
15
NrtR (regulator of NAD metabolism)
16
Comparison with the recently solved structure: correlated positions indeed bind the DNA (more exactly, form a hydrophobic cluster)
17
NiaR: changed dimer structure? The GalR family and C- proteins of RM- systems: direct and inverted repeats BirA: changed spacing
18
Catalog of events Expansion and contraction of regulons New regulators (where from?) Duplications of regulators with or without regulated loci Loss of regulators with or without regulated loci Re-assortment of regulators and structural genes … especially in complex systems Horizontal transfer
19
Trehalose/maltose catabolism in alpha-proteobacteria Duplicated LacI-family regulators: lineage- specific post-duplication loss
20
The binding motifs are very similar (the blue branch is somewhat different: to avoid cross-recognition?)
21
Utilization of an unknown galactoside in gamma-proteobacteria Loss of regulator and merger of regulons: It seems that laci-X was present in the common ancestor (Klebsiella is an outgroup) Yersinia and Klebsiella: two regulons, GalR and Laci-X Erwinia: one regulon, GalR
22
Utilization of maltose/maltodextrin in Firmicutes Displacement: invasion of a regulator from a different subfamily (horizontal transfer from a related species?) – blue sites
23
Orthologous TFs with completely different regulons (alpha-proteobaceria and Xanthomonadales)
24
Catabolism of gluconate in proteobacteria
25
Extreme variability of the regulation of “marginal” regulon members γ Pseudomonas spp. β
26
Cryptic sites and loss of regulators Loss of RbsR in Y. pestis (ABC-transporter also is lost) Start codon of rbsD RbsR binding site
27
Regulon expansion, or how FruR has become CRA CRA (a.k.a. FruR) in Escherichia coli: –global regulator –well-studied in experiment (many regulated genes known) Going back in time: looking for candidate CRA/FruR sites upstream of (orthologs of) genes known to be regulated in E.coli
28
Common ancestor of gamma-proteobacteria icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria
29
Common ancestor of the Enterobacteriales icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria Enterobacteriales
30
Common ancestor of Escherichia and Salmonella icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria Enterobacteriales E. coli and Salmonella spp.
31
Regulation of amino acid biosynthesis in the Firmicutes Interplay between regulatory RNA elements and transcription factors Expansion of T-box systems (normally – RNA structures regulating aminoacyl-tRNA-synthetases)
32
Aminoacyl- tRNA synthetases Amino acid biosynthetic genes Amino acid transporters TGG: T-box Partial alignment of predicted T-boxes
33
Aminoacyl- tRNA synthetases Amino acid biosynthetic genes Amino acid transporters … continued (in the 5’ direction) anti-anti (specifier) codon
34
Why T-boxes? May be easily identified In most cases functional specificity may be reliably predicted by the analysis of the specifier codons (anti-anti-codons) Sufficiently long to retain phylogenetic signal => T-boxes are a good model of regulatory evolution
35
805 T-boxes in 96 bacteria Firmicutes –aa-tRNA synthetases –enzymes –transporters –all amino acids excluding glutamate Actinobacteria (regulation of translation – predicted) –branched chain (ileS) –aromatic (Atopobium minutum) Delta-proteobacteria –branched chain (leu – enzymes) Thermus/Deinococcus group (aa-tRNA synthases) –branched chain (ileS, valS) –glycine Chloroflexi, Dictyoglomi –aromatic (trp – enzymes) –branched chain (ileS) –threonine
36
Recent duplications and bursts: ARG-T-box in Clostridium difficile
37
… caused by loss of transcription factor AhrC
38
Duplications and changes in specificity: ASN/ASP/HIS T-boxes
39
Blow-up 1
40
Blow-up 2. Prediction Regulators lost in lineages with expanded HIS-T-box regulon??
41
… and validation conserved motifs upstream of HIS biosynthesis genes candidate transcription factor yerC co-localized with the his genes present only in genomes with the motifs upstream of the his genes genomes with neither YerC motif nor HIS-T-boxes: attenuators Bacillales (his operon) Clostridiales Thermoanaerobacteriales Halanaerobiales Bacillales
42
The evolutionary history of the his genes regulation in the Firmicutes
43
More duplications: THR-T-box in C. difficile and B. cereus
44
T-boxes: Summary / History
45
Life without Fur
46
Regulation of iron homeostasis (the Escherichia coli paradigm) Iron: essential cofactor (limiting in many environments) dangerous at large concentrations FUR (responds to iron): synthesis of siderophores transport (siderophores, heme, Fe 2+, Fe 3+ ) storage iron-dependent enzymes synthesis of heme synthesis of Fe-S clusters Similar in Bacillus subtilis
47
Regulation of iron homeostasis in α-proteobacteria Experimental studies: FUR/MUR: Bradyrhizobium, Rhizobium and Sinorhizobium RirA (Rrf2 family): Rhizobium and Sinorhizobium Irr (FUR family): Bradyrhizobium, Rhizobium and Brucella RirA Irr FeSheme RirA degraded Fur Fe Fur Iron uptak e systems Sideroph ore uptake Fe / Fe uptake Transcription factors 2+3+ Iron storage ferritins FeS synthesis Heme synthesis Iron-requiring enzymes [iron cofactor] IscR Irr [- Fe] [+Fe] [- Fe] [+Fe] [ Fe]- FeS FeS status of cell
48
Distribution of transcription factors in genomes Search for candidate motifs and binding sites using standard comparative genomic techniques
49
FUR/MUR branch of the FUR family Fur in - and - proteobacteria Fur in - proteobacteria Fur in Firmicutes in proteobacteria Fur MBNC03003593 RB2654 19538 AGR C 620 RL mur Nwi 0013 RPA0450 BJ fur ROS217 18337 Jann 1799 SPO2477 STM1w01000993 MED193 22541 OB2597 02997 SKA53 03101 Rsph03000505 ISM 15430 GOX0771 ZM01411 Saro02001148 Sala 1452 ELI1325 OA2633 10204 PB2503 04877 CC0057 Rrub02001143 Amb1009 Amb4460 SM mur MBNC03003179 BQ fur2 BMEI0375 Mesorhizobium sp. BNC1(I) Sinorhizobium meliloti Bartonella quintana Rhodopseudomonas palustris Bradyrhizobium japonicum Caulobacter crescentus Zmomonas mobilisy Rhodobacter sphaeroides Silicibacter sp. TM1040 Silicibacter pomeroyi Agrobacterium tumefaciens Rhizobium leguminosarum Brucella melitensis Mesorhizobium sp. BNC1(II) Rhodobacterales bacterium HTCC2654 Nitrobacter winogradskyi Nham 0990 Nitrobacter hamburgensis X14 Jannaschia sp. CC51 Roseovarius sp.217 Roseobacter sp. MED193 Oceanicola batsensis HTCC2597 Loktanella vestfoldensis SKA53 Roseovarius nubinhibens ISM Gluconobacter oxydans Erythrobacter litoralis Novosphingobium aromaticivorans Sphinopyxis alaskensis RB2256 Oceanicaulis alexandrii HTCC2633 Rhodospirillum rubrum Parvularcula bermudensisHTCC2503 Magnetospirillum magneticum (I) EE36 12413 Sulfitobacter sp. EE-36 ECOLI PSEAE NEIMA HELPY BACSU Helicobacter pylori : sp|O25671 Bacillus subtilis: P54574sp| Neisseria meningitidis : sp|P0A0S7 Pseudomonas aeruginosa : sp|Q03456 Escherichia coli : P0A9A9sp| Mur Fur Magnetospirillum magneticum (II) RHE_CH00378 Rhizobiumetli PU1002 04436 Pelagibacter ubique HTCC1002 Irr in proteobacteria proteobacteria Regulator of manganese uptake genes (sit, mntH) Regulator of iron uptake and metabolism genes
50
of - proteobacteria - Mur Caulobacter crescentus Zymomonas mobilis Gluconobacter oxydans Erythrobacter litoralis Novosphingobium aromaticivorans Rhodospirillum rubrum Magnetospirillum magneticum Escherichia coli Sphinopyxis alaskensis Parvularcula bermudensis - Oceanicaulis alexandrii Bacillus subtilis Sequence logos for the known Fur-binding sites in Escherichia coli and Bacillus subtilis Identified Mur-binding sites FUR and MUR boxes
51
Fur in - and - proteobacteria Fur in - proteobacteria Fur in Firmicutes Irr in proteo- bacteria: regulator of iron homeostasis proteobacteria Fur ECOLI PSEAE NEIMA HELPY BACSU Helicobacter pylori: sp|O25671 Bacillus subtilis: P54574sp| Neisseria meningitidis : sp|P0A0S7 Pseudomonas aeruginosa : sp|Q03456 Escherichia coli : P0A9A9sp| Mur / Fur I rr- AGR C 249 SM irr RL irr1 RL irr2 MLr5570 MBNC03003186 BQ fur1 BMEI1955 BMEI1563 BJ blr1216 RB2654 182 SKA53 01126 ROS217 15500 ISM 00785 OB2597 14726 Jann 1652 Rsph03001693 EE36 03493 STM1w01001534 MED193 17849 SPOA0445 RC irr RPA2339 RPA0424* BJ irr* Nwi 0035* Nham 1013* Nitrobacter hamburgensisX14 Nitrobacter winogradskyi Bradyrhizobium japonicum (I) Agrobacterium tumefaciens Rhizobium leguminosarum (I) Mesorhizobium sp. BNC1 Sinorhizobium meliloti Mesorhizobiumloti Bartonella quintana Brucella melitensis (I) Bradyrhizobium japonicum (II) Rhodobacter sphaeroides Rhodobactercapsulatus Silicibacter pomeroyi Silicibacter sp. TM1040 Roseobacter sp. MED193 Sulfitobacter sp. EE-36 Jannaschia sp. CC51 Oceanicola batsensis HTCC2597 Roseovarius nubinhibens ISM Roseovariussp.217 Loktanella vestfoldensis SKA53 Rhodobacterales bacterium HTCC2654 Rhizobiumetli RHE CH00106 Rhizobium leguminosarum (II) Brucella melitensis (II) Rhodopseudomonas palustris (II) Rhodopseudomonas palustris (I) PU1002 04361 Pelagibacter ubique HTCC1002 Irr branch of the FUR family
52
Irr boxes Rhizobiaceae plus Bradyrhizobiaceae Rhodobacteriaceae Rhodospirillales
53
RirA/NsrR family (Rhizobiales)
54
IscR family
55
Regulation of genes in functional subsystems Rhizobiales Bradyrhizobiaceae Rhodobacteriales The Zoo (likely ancestral state)
56
Reconstruction of history Appearance of the iron-Rhodo motif Frequent co-regulation with Irr Strict division of function with Irr
57
All logos and Some Very Tempting Hypotheses: 1.Cross-recognition of FUR and IscR motifs in the ancestor. 2.When FUR had become MUR, and IscR had been lost in Rhizobiales, emerging RirA (from the Rrf2 family, with a rather different general consensus) took over their sites. 3.Iron-Rhodo boxes are recognized by IscR: directly testable 1 2 3
58
Summary and open problems Regulatory systems are very flexible –easily lost –easily expanded (in particular, by duplication) –may change specificity –rapid turnover of regulatory sites With more stories like these, we can start thinking about a general theory –catalog of elementary events; how frequent? –mechanisms (duplication, birth e.g. from enzymes, horizontal transfer) –conserved (regulon cores) and non-conserved (marginal regulon members) genes in relation to metabolic and functional subsystems/roles –(TF family-specific) protein-DNA recognition code –distribution of TF families in genomes; distribution of regulon sizes; etc.
59
People Andrei A. Mironov – software, algorithms Alexandra Rakhmaninova – SDP, protein-DNA correlations Olga Kalinina (on loan to EMBL) – SDP Yuri Korostelev – protein-DNA correlations Olga Laikova – LacI Dmitry Ravcheev– CRA/FruR Dmitry Rodionov (on loan to Burnham Institute) – iron etc. Alexei Vitreschak – T-boxes and riboswitches Andy Jonson (U. of East Anglia) – experimental validation (iron) Leonid Mirny (MIT) – protein-DNA, SDP Andrei Osterman (Burnham Institute) – experimental validation Howard Hughes Medical Institute Russian Foundation of Basic Research Russian Academy of Sciences, program “Molecular and Cellular Biology”
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.