Comparative genomics and metabolic reconstruction of bacterial pathogens Mikhail Gelfand Institute for Information Transmission Problems, RAS GPBM-2004.

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Control of Expression In Bacteria –Part 1
Biology Ch. 12 Review.
Chapter 18 Regulation of Gene Expression in Prokaryotes
Transcriptional regulation of the fad regulon genes of Escherichia coli by ArcA Chao WANG Sept. 13, 2006.
Regulation and Control of Metabolism in Bacteria
Medical Genetics & Genomics
Warm up Mon 11/3/14 Adv Bio 1. What does the phrase “gene regulation” mean? 2. If the lac operon cannot bind to the repressor.. What would be the outcome?
Gene Expression AP Biology.
From computer to the wet lab and back Comparative genomics in the discovery of a family of bacterial transporters with a new mode of action Mikhail Gelfand.
Сравнительная геномика – окончание М.Гельфанд «Сравнительная геномика и системная биология» БиБи, 4 курс + ШБ, 2 год + ПФУ весна 2013.
Gene regulation. Gene expression models  Prokaryotes and Eukaryotes employ common and different methods of gene regulation  Prokaryotic models 1. Trp.
AP Biology Chapter 13: Gene Regulation
Gene Expression Viruses Biotechnology
Chapter 18 Regulation of Gene Expression.
CHAPTER 8 Metabolic Respiration Overview of Regulation Most genes encode proteins, and most proteins are enzymes. The expression of such a gene can be.
Negative regulatory proteins bind to operator sequences in the DNA and prevent or weaken RNA polymerase binding.
Control of Gene Expression
OPERONS: BACTERIAL GENE CONTROL. OPERONS Bacterial cells A group of genes that work together Illustrate how genes expression (“on”) and repression (“off”)
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Section 8.6: Gene Expression and Regulation
The construction of cells DNA or RNA Protein Carbohydrates Lipid etc.
The construction of cells DNA or RNA Protein Carbohydrates Lipid etc. 04.
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Express yourself That darn ribosome Mighty Mighty Proteins Mutants RNA to the Rescue
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Differential Expression of Genes  Prokaryotes and eukaryotes precisely regulate gene expression in response to environmental conditions  In multicellular.
12.4 Gene Regulation and Mutation
© 2012 Pearson Education, Inc. Lecture by Edward J. Zalisko PowerPoint Lectures for Campbell Biology: Concepts & Connections, Seventh Edition Reece, Taylor,
Regulatory factors 1) Gene copy number 2) Transcriptional control 2-1) Promoters 2-2) Terminators, attenuators and anti-terminators 2-3) Induction and.
Draw 8 boxes on your paper
Prokaryotic and Eukaryotic Gene Regulation.  Regulatory gene  Transcriptional control  Posttranscriptional  Translational control  Posttranslational.
© Wiley Publishing All Rights Reserved. Protein and Specialized Sequence Databases.
CONTROL MECHANISMS 5.5. Controlling Transcription and Translation of Genes  Housekeeping Genes: needed at all times: needed for life functions vital.
Comparative genomics: functional characterization of new genes and regulatory interactions using computer analysis Mikhail Gelfand Institute for Information.
Evolution of bacterial regulatory systems Mikhail Gelfand Institute for Information Transmission Problems, RAS BGRS-2004, Novosibirsk.
Comparative genomics and metabolic reconstruction of bacterial genomes Mikhail S. Gelfand Meeting of HHMI International Research Scholars Tallinn, 2004.
Ch 16: Molecular Basis of Inheritance Ch 17: From Gene to Protein Ch 18: Control of Gene Expression Ch 18: Gene Mutation.
Bacterial Gene Expression and Regulation
Sequencing the World of Possibilities for Energy & Environment Annotation: function prediction and metabolic reconstruction Thanos Lykidis Genome Biology.
1 Gene regulation in Prokaryotes Bacteria were models for working out the basic mechanisms, but eukaryotes are different. Some genes are constitutive,
Chapter 16 – Control of Gene Expression in Prokaryotes
Gene Expression. Cell Differentiation Cell types are different because genes are expressed differently in them. Causes:  Changes in chromatin structure.
Exploiting Gene Clusters to Curate Annotations October, 2003 Ross Overbeek, Fellowship for Interpretation of Genomes (FIG)
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Chapter 8 Lecture Outline
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Chapter 11 Molecular Mechanisms of Gene regulation Jones and Bartlett Publishers © 2005.
Controlling Gene Expression. Control Mechanisms Determine when to make more proteins and when to stop making more Cell has mechanisms to control transcription.
Regulation of Gene Expression in Bacteria and Their Viruses
GO-Slim term Cluster frequency cytoplasm 1944 out of 2727 genes, 71.3% 70 out of 97 genes, 72.2% out of 72 genes, 86.1% out.
SDPpred: a method for identification of amino acid residues that determine differences in functional specificity of homologous proteins and application.
Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems.
Chap 18 The Genetics of Viruses and Bacteria. Structure of Virus Approximately 20 nm in diameter Their genome can contain DNA or RNA. Enclosed by a.
1 Computational functional genomics Lital Haham Sivan Pearl.
I. Introduction Tetrahydrobiopterin (BH4) is a cofactor used in various processes. It has been extensively studied in mammalian systems were BH4 has a.
Ribonucleotide reductases (RNRs) catalyse the reduction of ribonucleotides to their corresponding 2`-deoxyribonucleotides and therefore play an essential.
José A. Cardé Serrano, PhD Universidad Adventista de las Antillas Biol 223 Genética Agosto 2010.
Protein. Protein and Roles 1: biological process unknown 1.1 Structural categories 1.2 organism categories 1.3 cellular component o unlocalized.
Methionine and cysteine are the two sulfur-containing amino acids. In addition to its general function as a component of proteins, methionine is specifically.
Control of Gene Expression in Prokaryotes
Characterization of Transition Metal-Sensing Riboswitches
CHAPTER 8 MICROBIAL GENETICS: BIO 244 MICROBIOLOGY
GENE EXPRESSION AND REGULATION
Institute for Information Transmission Problems
The Mimivirus Giant double stranded DNA virus Discovered in amoebas
Ch 18: Regulation of Gene Expression
Gene Expression AP Biology.
Chapter 18 Bacterial Regulation of Gene Expression
Presentation transcript:

Comparative genomics and metabolic reconstruction of bacterial pathogens Mikhail Gelfand Institute for Information Transmission Problems, RAS GPBM-2004

Metabolic reconstruction Identification of missing genes in complete genomes Search for candidates –Analysis of individual genes to assign general function: homology functional patterns structural features –Comparative genomics to predict specificity: analysis of regulation positional clustering gene fusions phylogenetic patterns

Enzymes Identification of a gap in a pathway (universal, taxon-specific, or in individual genomes) Search for candidates assigned to the pathway by co-localization and co-regulation (in many genomes) Prediction of general biochemical function from (distant) similarty and functional patterns Tentative filling of the gap Verification by analysis of phylogenetic patterns: –Absence in genomes without this pathway –Complementary distribution with known enzymes for the same function

Transporters Identification of candidates assigned to the pathway by co-localization and co-regulation (in many genomes) Prediction of general function by analysis of transmembrane segments and similarty Prediction of specificity by analysis of phylogenetic patterns: –End product if present in genomes lacking this pathway (substituting the biosynthetic pathway for an essential compound) –Input metabolite if absent in genomes without the pathway (catabolic, also precursors in biosynthetic pathways) –Entry point in the middle if substituting an upper or side part of the pathway in some genomes

Missing link in fatty acid biosynthesis in Streptococci acp P fab D accA accDaccB accC fabHfab F fab G fabZ fabI Gene fabI of Enoyl-ACP reductase (EC ) is missing in the genome 12B, and a number of Streptococci fabI (Enoyl-ACP reductase, EC ) target of triclosan. Enzymatic activity, but no gene in Streptococci

Identification of a candidate by positional clustering Genome X TR? fabIhyp ? hyp TR? FRNS Genome Y Clostridium acetobutylicum TR? Streptococcus pyogenes ? hyp TR? ? fabH acpP ? fabGfabFaccAaccDaccC accB fabZ fabD fabGfabF accBfabD accAaccDaccCfabZ fabGfabFaccAaccDaccC accB fabZ fabDfabH acpP fabH acpP fabH acpP fabGfabFaccAaccDaccC accB fabZ fabD

Binding sites of FabR (“Tr?”, HTH) HTH fabK fabH acpP fabGfabFaccAaccDaccC accB fabZ fabD Fad ( ) 1234

Metabolic reconstruction of the thiamin biosynthesis (new genes/functions shown in red) thiN (confirmed) (Gram-positive bacteria) (Gram-negative bacteria) Transport of HMP Transport of HET Purine pathway

Carbohydrate metabolism in Streptococcus and Lactococcus spp. Only biochemical data, genes unknown Experimentally verified genes Biochemical data and genomic predictions Only genomic predictions

An uncharacterized locus in invasive species S. pneumoniae S. pyogenes S. equi S. agalactiae S. suis

Structure of the genome loci S. pyogenes, S. agalactiae S. equi S. pneumoniae TIGR4 S. suis S. pneumoniae R6

Gene functions 3-(4-deoxy-beta-D-gluc-4-enuronosyl)-N- acetyl-D-glucosamine PTS transporter hydrolase isomerase oxidoreductase dehydrogenase kinase aldolase pyruvate + D-glyceraldehyde 3-phosphate hyaluronidase (hyaluronate lyase) RegR

Candidate regulatory signal

Structure of the genome loci - 2 S. pyogenes, S. agalactiae S. equi S. pneumoniae TIGR4 S. suis S. pneumoniae R6

Possible function Pathway exists in invasive species Sometimes co-localized with hyaluronidase Always co-regulated with hyaluronidase Thus: Utilization of hyaluronate May be involved in pathogenesis

Comparative genomics of zinc regulons Two major roles of zinc in bacteria: Structural role in DNA polymerases, primases, ribosomal proteins, etc. Catalytic role in metal proteases and other enzymes

Genomes and regulators nZUR FUR family ??? AdcR ? MarR family pZUR FUR family

Regulators and signals nZUR-  nZUR-  AdcRpZUR TTAACYRGTTAA GATATGTTATAACATATC GAAATGTTATANTATAACATTTC GTAATGTAATAACATTAC TAAATCGTAATNATTACGATTTA

Transporters Orthologs of the AdcABC and YciC transport systems Paralogs of the components of the AdcABC and YciC transport systems Candidate transporters with previously unknown specificity

zinT: regulation zinT is isolated fusion: adcA-zinT E. coli, S. typhi, K. pneumoniaeGamma-proteobacteria Alpha-proteobacteria B. subtilis, S. aureus S. pneumoniae, S. mutans, S. pyogenes, L. lactis, E. faecalis Bacillus group Streptococcus group zinT is regulated by zinc repressors (nZUR- , nZUR- , pZUR) adcA-zinT is regulated by zinc repressors (pZUR, AdcR) (ex. L.l.) A. tumefaciens, R. sphaeroides

ZinT: protein sequence analysis E. coli, S. typhi, K. pneumoniae, A. tumefaciens, R. sphaeroides, B. subtilis L. lactis Y. pestis, V. cholerae, B. halodurans TMZnAdcA S. aureus, E. faecalis, S. pneumoniae, S. mutans, S. pyogenes ZinT

ZinT: summary zinT is sometimes fused to the gene of a zinc transporter component adcA zinT is expressed only in zinc-deplete conditions ZinT is attached to cell surface (has a TM- segment) ZinT has a zinc-binding domain ZinT: conclusions: ZinT is a new type of zinc-binding component of zinc ABC transporter

Zinc regulation of PHT (pneumococcal histidine triad) proteins of Streptococci S. pneumoniae S. equi S. agalactiae lmbphtDphtE phtBphtA lmbphtD S. pyogenes phtY lmbphtD zinc regulation shown in experiment

Structural features of PHP proteins PHT proteins contain multiple HxxHxH motifs PHT proteins of S. pneumoniae are paralogs (65-95% id) Sec-dependent hydrophobic leader sequences are present at the N-termini of PHT proteins Localization of PHT proteins from S. pneumoniae on bacterial cell surface has been confirmed by flow cytometry

PHH proteins: summary PHT proteins are induced in zinc-deplete conditions PHT proteins are localized at the cell surface PHT proteins have zinc-binding motifs A hypothesis: PHT proteins represent a new family of zinc transporters

… incorrect  Zinc-binding domains in zinc transporters: EEEHEEHDHGEHEHSH HSHEEHGHEEDDHDHSH EEHGHEEDDHHHHHDED DEHGEGHEEEHGHEH (histidine-aspartate- glutamate-rich) Histidine triads in streptococci: HGDHYHY 7 out of 21 HGDHYHF 2 out of 21 HGNHYHF 2 out of 21 HYDHYHN 2 out of 21 HMTHSHW 2 out of 21 (specific pattern of histidines and aromatic amino acids)

Analyis of PHP proteins (cont’d) The phtD gene forms a candidate operon with the lmb gene in all Streptococcus species –Lmb: an adhesin involved in laminin binding, adherence and internalization of streptococci into epithelial cells PhtY of S. pyogenes: –phtY regulated by AdcR –PhtY consists of 3 domains: PHTinternalinH-rich 4 HIS TRIADS LRR IR HDYNHNHTYEDEEGH AHEHRDKDDHDHEHED

PHH proteins: summary-2 PHT proteins are induced in zinc-deplete conditions PHT proteins are localized at the cell surface PHT proteins have structural zinc-binding motifs phtD forms a candidate operon with an adhesin gene PhtY contains an internalin domain responsible for the streptococcal invasion Hypothesis PHT proteins are adhesins involved in the attachment of streptococci to epithelium cells, leading to invasion

Zinc and (paralogs of) ribosomal proteins L36L33L31S14 E. coli, S.typhi ––– +– K. pneumoniae ––– – Y. pestis,V. cholerae –  –– +– B subtilis –– + –– + S. aureus –– – ––– + Listeria spp. –– –– + E. faecalis – –  – – –– + – S. pne., S. mutans –– – ––– S. pyo., L. lactis –– – ––– + nZUR pZUR AdcR

Zn-ribbon motif (Makarova-Ponomarev-Koonin, 2001) L36L33L31S14 E. coli, S.typhi (–)–(–) +– K. pneumoniae (–)–(–) –– Y. pestis,V. cholerae (–)  –(–) +– B subtilis (–)(–) + –(–) + S. aureus (–)(–) – ––(–) + Listeria spp. (–)(–) ––(–) + E. faecalis (–) (–)  – – –(–) + – S. pne., S. mutans (–)(–) – ––(–) S. pyo., L. lactis (–)(–) – ––(–) + nZUR pZUR AdcR

Summary of observations: Makarova-Ponomarev-Koonin, 2001: –L36, L33, L31, S14 are the only ribosomal proteins duplicated in more than one species –L36, L33, L31, S14 are four out of seven ribosomal proteins that contain the zinc-ribbon motif (four cysteines) –Out of two (or more) copies of the L36, L33, L31, S14 proteins, one usually contains zinc-ribbon, while the other has eliminated it Among genes encoding paralogs of ribosomal proteins, there is (almost) always one gene regulated by a zinc repressor, and the corresponding protein never has a zinc ribbon motif

Bad scenario Zn-rich conditions Zn-deplete conditions: all Zn utilized by the ribosomes, no Zn for Zn-dependent enzymes

Regulatory mechanism ribosomes Zn-dependent enzymes R Sufficient Zn Zn starvation R repressor

Good scenario Zn-rich conditions Zn-deplete conditions: some ribosomes without Zn, some Zn left for the enzymes

Prediction … (Proc Natl Acad Sci U S A Aug 19;100(17): ) … and confirmation (Mol Microbiol Apr;52(1): )

Andrei A. Mironov Anna Gerasimova Olga Kalinina Alexei Kazakov (hyaluronate) Ekaterina Kotelnikova Galina Kovaleva Pavel Novichkov Olga Laikova (hyaluronate) Ekaterina Panina (zinc) (now at UCLA, USA) Elizabeth Permina Dmitry Ravcheev Alexandra B. Rakhmaninova Dmitry Rodionov (thiamin) Alexey Vitreschak (thiamin) (on leave at LORIA, France) Howard Hughes Medical Institute Ludwig Institute of Cancer Research Russian Fund of Basic Research Programs “Origin and Evolution of the Biosphere” and “Molecular and Cellular Biology”, Russian Academy of Sciences Andrei Osterman (Burnham Institute, San-Diego, USA) (fatty acids)