Presentation is loading. Please wait.

Presentation is loading. Please wait.

Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems.

Similar presentations


Presentation on theme: "Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems."— Presentation transcript:

1 Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems Moscow, Russia ISBRA, Ft. Lauderdale, 14.IV.2009

2 Plan Co-evolution of transcription factors and their binding motifs Evolution of regulatory systems and regulons

3 Regulators and their motifs Cases of motif conservation at surprisingly large distances Subtle changes at close evolutionary distances Correlation between contacting nucleotides and amino acid residues

4 NrdR (regulator of ribonucleotide reducases and some other replication-related genes): conservation at large distances

5 DNA motifs and protein-DNA interactions CRP PurR IHFTrpR Entropy at aligned sites and the number of contacts (heavy atoms in a base pair at a distance <cutoff from a protein atom)

6 The CRP/FNR family of regulators

7 Correlation between contacting nucleotides and amino acid residues CooA in Desulfovibrio spp. CRP in Gamma-proteobacteria HcpR in Desulfovibrio spp. FNR in Gamma-proteobacteria DD COOA ALTTEQLSLHMGATRQTVSTLLNNLVR DV COOA ELTMEQLAGLVGTTRQTASTLLNDMIR EC CRP KITRQEIGQIVGCSRETVGRILKMLED YP CRP KXTRQEIGQIVGCSRETVGRILKMLED VC CRP KITRQEIGQIVGCSRETVGRILKMLEE DD HCPR DVSKSLLAGVLGTARETLSRALAKLVE DV HCPR DVTKGLLAGLLGTARETLSRCLSRMVE EC FNR TMTRGDIGNYLGLTVETISRLLGRFQK YP FNR TMTRGDIGNYLGLTVETISRLLGRFQK VC FNR TMTRGDIGNYLGLTVETISRLLGRFQK TGTCGGCnnGCCGACA TTGTgAnnnnnnTcACAA TTGTGAnnnnnnTCACAA TTGATnnnnATCAA Contacting residues: REnnnR TG: 1 st arginine GA: glutamate and 2 nd arginine

8 The correlation holds for other factors in the family

9 The LacI family: subtle changes in motifs at close distances G A n CGCG GnGn GCGC

10 The LacI family: systematic analysis 1369 DNA-binding domains in 200 orthologous rows =35%, =71 а.о. 4484 binding sites, L=20н., =45% Calculate mutual information between columns of TF and site alignments Set threshold on mutual information of correlated pairs

11 Definitions Sites LAFDHDQILQMAQERLQGKVRYQP-IGFELLPEKFSLRQLQRMYETVLGRS---LDKRNF LAFDHNQILDYGYQRLRNKLEYSP-IAFEVLPELFTLNDLFQLYTTVLGED--FADYSNF tTAaTGgCTTTAtGcCACTAT LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHNQILDYGYQRLRNKLEYSP-IAFEVLPELFTLNDLFQLYTTVLGED--FADYSNF LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN—-FSDYSNF LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN--FSDYSNF TTAaaGTAAtAaTTACCATAA AaAtTGTCTTTAtGcCACTAT TTATGGTAAATTcTACCATAA TTATgGTCAgTTTcACcAaAA tTAaTGgCTTTAtGcCACTAT TTaGTCgAAATAaccaACtAA TTATCGTCAtCtcGACGACAA TttAGGTAAgTTATACTTTTA Z-score Mutual information Protein alignment

12 Correlated pairs

13 Higher order correlations -ATIKDVAKRANVSTTTV- AATTGTGAGCGCTCACT SLSQ TL TQ

14 Not a phylogenetic trace

15 NrtR (regulator of NAD metabolism)

16 Comparison with the recently solved structure: correlated positions indeed bind the DNA (more exactly, form a hydrophobic cluster)

17 NiaR: changed dimer structure? The GalR family and C- proteins of RM- systems: direct and inverted repeats BirA: changed spacing

18 Catalog of events Expansion and contraction of regulons New regulators (where from?) Duplications of regulators with or without regulated loci Loss of regulators with or without regulated loci Re-assortment of regulators and structural genes … especially in complex systems Horizontal transfer

19 Trehalose/maltose catabolism in alpha-proteobacteria Duplicated LacI-family regulators: lineage- specific post-duplication loss

20 The binding motifs are very similar (the blue branch is somewhat different: to avoid cross-recognition?)

21 Utilization of an unknown galactoside in gamma-proteobacteria Loss of regulator and merger of regulons: It seems that laci-X was present in the common ancestor (Klebsiella is an outgroup) Yersinia and Klebsiella: two regulons, GalR and Laci-X Erwinia: one regulon, GalR

22 Utilization of maltose/maltodextrin in Firmicutes Displacement: invasion of a regulator from a different subfamily (horizontal transfer from a related species?) – blue sites

23 Orthologous TFs with completely different regulons (alpha-proteobaceria and Xanthomonadales)

24 Catabolism of gluconate in proteobacteria

25 Extreme variability of the regulation of “marginal” regulon members γ Pseudomonas spp. β

26 Cryptic sites and loss of regulators Loss of RbsR in Y. pestis (ABC-transporter also is lost) Start codon of rbsD RbsR binding site

27 Regulon expansion, or how FruR has become CRA CRA (a.k.a. FruR) in Escherichia coli: –global regulator –well-studied in experiment (many regulated genes known) Going back in time: looking for candidate CRA/FruR sites upstream of (orthologs of) genes known to be regulated in E.coli

28 Common ancestor of gamma-proteobacteria icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria

29 Common ancestor of the Enterobacteriales icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria Enterobacteriales

30 Common ancestor of Escherichia and Salmonella icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria Enterobacteriales E. coli and Salmonella spp.

31 Regulation of amino acid biosynthesis in the Firmicutes Interplay between regulatory RNA elements and transcription factors Expansion of T-box systems (normally – RNA structures regulating aminoacyl-tRNA-synthetases)

32 Aminoacyl- tRNA synthetases Amino acid biosynthetic genes Amino acid transporters TGG: T-box Partial alignment of predicted T-boxes

33 Aminoacyl- tRNA synthetases Amino acid biosynthetic genes Amino acid transporters … continued (in the 5’ direction) anti-anti (specifier) codon

34 Why T-boxes? May be easily identified In most cases functional specificity may be reliably predicted by the analysis of the specifier codons (anti-anti-codons) Sufficiently long to retain phylogenetic signal => T-boxes are a good model of regulatory evolution

35 805 T-boxes in 96 bacteria Firmicutes –aa-tRNA synthetases –enzymes –transporters –all amino acids excluding glutamate Actinobacteria (regulation of translation – predicted) –branched chain (ileS) –aromatic (Atopobium minutum) Delta-proteobacteria –branched chain (leu – enzymes) Thermus/Deinococcus group (aa-tRNA synthases) –branched chain (ileS, valS) –glycine Chloroflexi, Dictyoglomi –aromatic (trp – enzymes) –branched chain (ileS) –threonine

36 Recent duplications and bursts: ARG-T-box in Clostridium difficile

37 … caused by loss of transcription factor AhrC

38 Duplications and changes in specificity: ASN/ASP/HIS T-boxes

39 Blow-up 1

40 Blow-up 2. Prediction Regulators lost in lineages with expanded HIS-T-box regulon??

41 … and validation conserved motifs upstream of HIS biosynthesis genes candidate transcription factor yerC co-localized with the his genes present only in genomes with the motifs upstream of the his genes genomes with neither YerC motif nor HIS-T-boxes: attenuators Bacillales (his operon) Clostridiales Thermoanaerobacteriales Halanaerobiales Bacillales

42 The evolutionary history of the his genes regulation in the Firmicutes

43 More duplications: THR-T-box in C. difficile and B. cereus

44 T-boxes: Summary / History

45 Life without Fur

46 Regulation of iron homeostasis (the Escherichia coli paradigm) Iron: essential cofactor (limiting in many environments) dangerous at large concentrations FUR (responds to iron): synthesis of siderophores transport (siderophores, heme, Fe 2+, Fe 3+ ) storage iron-dependent enzymes synthesis of heme synthesis of Fe-S clusters Similar in Bacillus subtilis

47 Regulation of iron homeostasis in α-proteobacteria Experimental studies: FUR/MUR: Bradyrhizobium, Rhizobium and Sinorhizobium RirA (Rrf2 family): Rhizobium and Sinorhizobium Irr (FUR family): Bradyrhizobium, Rhizobium and Brucella RirA Irr FeSheme RirA degraded Fur Fe Fur Iron uptak e systems Sideroph ore uptake Fe / Fe uptake Transcription factors 2+3+ Iron storage ferritins FeS synthesis Heme synthesis Iron-requiring enzymes [iron cofactor] IscR Irr [- Fe] [+Fe] [- Fe] [+Fe] [ Fe]- FeS FeS status of cell

48 Distribution of transcription factors in genomes Search for candidate motifs and binding sites using standard comparative genomic techniques

49 FUR/MUR branch of the FUR family Fur in  - and  - proteobacteria Fur in  - proteobacteria Fur in Firmicutes in  proteobacteria Fur MBNC03003593 RB2654 19538 AGR C 620 RL mur Nwi 0013 RPA0450 BJ fur ROS217 18337 Jann 1799 SPO2477 STM1w01000993 MED193 22541 OB2597 02997 SKA53 03101 Rsph03000505 ISM 15430 GOX0771 ZM01411 Saro02001148 Sala 1452 ELI1325 OA2633 10204 PB2503 04877 CC0057 Rrub02001143 Amb1009 Amb4460 SM mur MBNC03003179 BQ fur2 BMEI0375 Mesorhizobium sp. BNC1(I) Sinorhizobium meliloti Bartonella quintana Rhodopseudomonas palustris Bradyrhizobium japonicum Caulobacter crescentus Zmomonas mobilisy Rhodobacter sphaeroides Silicibacter sp. TM1040 Silicibacter pomeroyi Agrobacterium tumefaciens Rhizobium leguminosarum Brucella melitensis Mesorhizobium sp. BNC1(II) Rhodobacterales bacterium HTCC2654 Nitrobacter winogradskyi Nham 0990 Nitrobacter hamburgensis X14 Jannaschia sp. CC51 Roseovarius sp.217 Roseobacter sp. MED193 Oceanicola batsensis HTCC2597 Loktanella vestfoldensis SKA53 Roseovarius nubinhibens ISM Gluconobacter oxydans Erythrobacter litoralis Novosphingobium aromaticivorans Sphinopyxis alaskensis RB2256 Oceanicaulis alexandrii HTCC2633 Rhodospirillum rubrum Parvularcula bermudensisHTCC2503 Magnetospirillum magneticum (I) EE36 12413 Sulfitobacter sp. EE-36 ECOLI PSEAE NEIMA HELPY BACSU Helicobacter pylori : sp|O25671 Bacillus subtilis: P54574sp| Neisseria meningitidis : sp|P0A0S7 Pseudomonas aeruginosa : sp|Q03456 Escherichia coli : P0A9A9sp| Mur Fur  Magnetospirillum magneticum (II) RHE_CH00378 Rhizobiumetli  PU1002 04436 Pelagibacter ubique HTCC1002  Irr in  proteobacteria  proteobacteria Regulator of manganese uptake genes (sit, mntH) Regulator of iron uptake and metabolism genes

50 of - proteobacteria -  Mur Caulobacter crescentus Zymomonas mobilis Gluconobacter oxydans Erythrobacter litoralis Novosphingobium aromaticivorans Rhodospirillum rubrum Magnetospirillum magneticum Escherichia coli Sphinopyxis alaskensis Parvularcula bermudensis - Oceanicaulis alexandrii Bacillus subtilis Sequence logos for the known Fur-binding sites in Escherichia coli and Bacillus subtilis Identified Mur-binding sites FUR and MUR boxes

51 Fur in  - and  - proteobacteria Fur in  - proteobacteria Fur in Firmicutes Irr in  proteo- bacteria: regulator of iron homeostasis  proteobacteria Fur ECOLI PSEAE NEIMA HELPY BACSU Helicobacter pylori: sp|O25671 Bacillus subtilis: P54574sp| Neisseria meningitidis : sp|P0A0S7 Pseudomonas aeruginosa : sp|Q03456 Escherichia coli : P0A9A9sp| Mur / Fur  I rr- AGR C 249 SM irr RL irr1 RL irr2 MLr5570 MBNC03003186 BQ fur1 BMEI1955 BMEI1563 BJ blr1216 RB2654 182 SKA53 01126 ROS217 15500 ISM 00785 OB2597 14726 Jann 1652 Rsph03001693 EE36 03493 STM1w01001534 MED193 17849 SPOA0445 RC irr RPA2339 RPA0424* BJ irr* Nwi 0035* Nham 1013* Nitrobacter hamburgensisX14 Nitrobacter winogradskyi Bradyrhizobium japonicum (I) Agrobacterium tumefaciens Rhizobium leguminosarum (I) Mesorhizobium sp. BNC1 Sinorhizobium meliloti Mesorhizobiumloti Bartonella quintana Brucella melitensis (I) Bradyrhizobium japonicum (II) Rhodobacter sphaeroides Rhodobactercapsulatus Silicibacter pomeroyi Silicibacter sp. TM1040 Roseobacter sp. MED193 Sulfitobacter sp. EE-36 Jannaschia sp. CC51 Oceanicola batsensis HTCC2597 Roseovarius nubinhibens ISM Roseovariussp.217 Loktanella vestfoldensis SKA53 Rhodobacterales bacterium HTCC2654  Rhizobiumetli RHE CH00106 Rhizobium leguminosarum (II) Brucella melitensis (II) Rhodopseudomonas palustris (II) Rhodopseudomonas palustris (I) PU1002 04361 Pelagibacter ubique HTCC1002 Irr branch of the FUR family

52 Irr boxes Rhizobiaceae plus Bradyrhizobiaceae Rhodobacteriaceae Rhodospirillales

53 RirA/NsrR family (Rhizobiales)

54 IscR family

55 Regulation of genes in functional subsystems Rhizobiales Bradyrhizobiaceae Rhodobacteriales The Zoo (likely ancestral state)

56 Reconstruction of history Appearance of the iron-Rhodo motif Frequent co-regulation with Irr Strict division of function with Irr

57 All logos and Some Very Tempting Hypotheses: 1.Cross-recognition of FUR and IscR motifs in the ancestor. 2.When FUR had become MUR, and IscR had been lost in Rhizobiales, emerging RirA (from the Rrf2 family, with a rather different general consensus) took over their sites. 3.Iron-Rhodo boxes are recognized by IscR: directly testable 1 2 3

58 Summary and open problems Regulatory systems are very flexible –easily lost –easily expanded (in particular, by duplication) –may change specificity –rapid turnover of regulatory sites With more stories like these, we can start thinking about a general theory –catalog of elementary events; how frequent? –mechanisms (duplication, birth e.g. from enzymes, horizontal transfer) –conserved (regulon cores) and non-conserved (marginal regulon members) genes in relation to metabolic and functional subsystems/roles –(TF family-specific) protein-DNA recognition code –distribution of TF families in genomes; distribution of regulon sizes; etc.

59 People Andrei A. Mironov – software, algorithms Alexandra Rakhmaninova – SDP, protein-DNA correlations Olga Kalinina (on loan to EMBL) – SDP Yuri Korostelev – protein-DNA correlations Olga Laikova – LacI Dmitry Ravcheev– CRA/FruR Dmitry Rodionov (on loan to Burnham Institute) – iron etc. Alexei Vitreschak – T-boxes and riboswitches Andy Jonson (U. of East Anglia) – experimental validation (iron) Leonid Mirny (MIT) – protein-DNA, SDP Andrei Osterman (Burnham Institute) – experimental validation Howard Hughes Medical Institute Russian Foundation of Basic Research Russian Academy of Sciences, program “Molecular and Cellular Biology”

60


Download ppt "Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems."

Similar presentations


Ads by Google