Testing Bacterial Proteins for Evidence of Horizontal Gene Transfer James Godde, John Iverson, Kabi Neupane, and Sara Penhale.

Slides:



Advertisements
Similar presentations
“Classical” view of bacteria genome Single chromosome May have plasmids and phage Simple gene structure Genes have recognisable phenotype Vibrio y Bacteriodes.
Advertisements

Bacterial Physiology A Proteomic Approach to Oral Diseases Oral Diseases Peter Zilm Microbiology Laboratory Dental School The University of Adelaide.
Lichens and Ascomycota broadly Alternative markers to COI ITS.
Phylogeny Systematics Cladistics
Pathogenomics: Focusing studies of bacterial pathogenicity through evolutionary analysis of genomes.
Sequence Similarity Searching Class 4 March 2010.
The Statistical Significance of Max-gap Clusters Rose Hoberman David Sankoff Dannie Durand.
7 The Genetics of Bacteria and Their Viruses. 2 3 Plasmids Many DNA sequences in bacteria are mobile and can be transferred between individuals and among.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Bas E. Dutilh Phylogenomics Using complete genomes to determine the phylogeny of species.
Exploring the Biology of Disulfide-Rich Hyperthermophiles through Protein Phylogenetic Profiles Navapoln Ramakul 1, Morgan Beeby 12, and Todd O. Yeates.
AN INTRODUCTION TO TAXONOMY: THE BACTERIA
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Genetic transfer and recombination
Assessment of sequence alignment Lecture Introduction The Dot plot Matrix visualisation matching tool: – Basics of Dot plot – Examples of Dot plot.
T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo.
Recombination:. Different recombinases have different topological mechanisms: Xer recombinase on psi. Unique product Uses topological filter to only perform.
Identify gene markers for different taxonomic groups in Archaea and Bacteria Genomes Dongying Wu 1,2, Jonathan A. Eisen 1,2 1. DOE Joint Genome Institute,
“CRISPR genome editing” Precise gene regulation/modification using the simple CRISPR/Cas9 system Thank you. Nucleeases will be used as a new tool that.
The case:  2005: No production Continued sampling  2006: Detected plants expressing transgene - demonstrated pollen transfer and seed dispersal (Reichman.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
CRISPR-associated Proteins
Sequence-based Similarity Module (BLAST & CDD only ) & Horizontal Gene Transfer Module (Ortholog Neighborhood & GC content only)
Introduction to Phylogenetics
Construction of Substitution Matrices
Calculating branch lengths from distances. ABC A B C----- a b c.
Subsystem: Succinate dehydrogenase The super-macromolecular respiratory complex II (succinate:quinone oxidoreductase) couples the oxidation of succinate.
1 Bacterial Genomes Remember no nucleus!! Bacterial chromosome - Large ds circular DNA molecule = haploid - E. coli has about 4,300 genes (~4.2 Mb) 100x.
Big Picture Of ≈1.7 million species classified so far, roughly 6000 are microbes True number of microbes is obviously larger than 6000 “Imagine if our.
Mysterious Sequence Repeats in Phage Genomes AKHIL GARG BNFO 301 APRIL 30, 2015.
Consensus SDR Sequence in E. coli near Important Genes Nav Saini BNFO 301 4/29/15.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Significance Tests for Max-Gap Gene Clusters Rose Hoberman joint work with Dannie Durand and David Sankoff.
Nothing in (computational) biology makes sense except in the light of evolution after Theodosius Dobzhansky (1970) Comparative genomics, genome context.
ORF Calling. Why? Need to know protein sequence Protein sequence is usually what does the work Functional studies Crystallography Proteomics Similarity.
A B C D E F G H I J K FigS1. Supplemental Figure S1. Evolutionary relationships of Arabidopsis and tomato Aux/IAA proteins. The evolutionary history was.
Lecture-7 Genome editing CRISPR
Construction of Substitution matrices
Chapter 25: Phylogeny and Systematics. “Taxonomy is the division of organisms into categories based on… similarities and differences.” p. 495, Campbell.
How many genes are there?
Ribonucleotide reductases (RNRs) catalyse the reduction of ribonucleotides to their corresponding 2`-deoxyribonucleotides and therefore play an essential.
Section 26.5: Horizontal Gene Transfer By Monica Macaro.
1 Repeats!. 2 Introduction  A repeat family is a collection of repeats which appear multiple times in a genome.  Our objective is to identify all families.
A Robust and Accurate Binning Algorithm for Metagenomic Sequences with Arbitrary Species Abundance Ratio Zainab Haydari Dr. Zelikovsky Summer 2011.
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
Chapter 1 Evolution, Biology, & Science. Studying Biology Biology – QUESTION
Gaurav Arora 1, Vinayak Mathur 2, and Anne Rosenwald 2 1 Gallaudet University, Department of Science, Technology, and Mathematics, Washington, DC
Phylogenetic genome analysis, phylogenomics
bacteria and eukaryotes
Cyanobacteria’s Repeat Sequences… where did they come from?
CRISPR + CAS = Defensive or Immune System
Evolution of eukaryote genomes
Convergent Evolution: Gene Sharing by Eukaryotic Plant Pathogens
Strategies for annotation of a genome
Varodom Charoensawan, Derek Wilson, Sarah A. Teichmann 
CRISPRs and Tandem Repeats
16.1 – Genetic Variation in Bacteria
Chapter 26.5: Horizontal Gene Transfer
The CRISPR/Cas9 system.1 Clustered regularly interspaced palindromic repeats (CRISPR) refers to sequences in the bacterial genome. The CRISPR/Cas9 system.1.
Figure 1 Adaptive immune system of bacteria and archaea
The CRISPR/Cas9 system.1 Clustered regularly interspaced palindromic repeats (CRISPR) refers to sequences in the bacterial genome. The CRISPR/Cas9 system.1.
BSC1010: Intro to Biology I K. Maltz Chapter 21.
Extra chromosomal Agents Transposable elements
Evolution of Genomes Chapter 21.
KEY CONCEPT The current tree of life has three domains.
KEY CONCEPT The current tree of life has three domains.
Sequence Analysis Alan Christoffels
Presentation transcript:

Testing Bacterial Proteins for Evidence of Horizontal Gene Transfer James Godde, John Iverson, Kabi Neupane, and Sara Penhale

Repetitive DNA Found in abundance in Eukaryotes Only 1% of the human genome encodes protein, while more than half of the genome consists of repetitive DNA. Relatively rare in Prokaryotes Nearly 89% of the E. coli genome encodes protein, while less than 1% consists of repetitive DNA

Different Classes of Repetitive DNA

CRISPRs What is a CRISPR? –Clustered Regularly Interspersed Short Palindromic Repeats –Class of repeats found exclusively in prokaryotes How widespread are they? Frequency of Occurrence Unknown What is their function? Function Unknown How did they get there in the first place? Mode of Transmission Unknown

What are Cas genes? In addition to the CRISPR sequences themselves, there are a number of genes usually found in close association with the regions of repetitive DNA These genes were termed Cas (CRISPR associated) genes There are 4 Cas genes which have been characterized to date. The function of each gene can be guessed at due to similarities they share with known genes: Cas 1 is homologous to a DNA repair gene Cas 2 is homologous to a transposase Cas 3 is homologous to a helicase Cas 4 is homologous to RecB exonuclease

Finding Cas genes Cas genes were found by using NCBI BLAST to search for homologs to previously characterized Cas genes (Jansen et al., 2002), as well as to any newly characterized ones In addition to homology with other genes, Cas genes had to be located near CRISPR sequences themselves

Cas 1Cas 2Cas 3 Cas 4

Formation of a total evidence tree Cas genes have been found in 115 different species of prokaryotes Analysis was limited to the 58 species for which sequence data were available for all four Cas genes Protein sequences for all Cas genes were concatenated and aligned using Clustal W Combined dataset was used to draw a neighbor-joining tree with MacVector

Classical rRNA-based Phylogeny Archaea Eukarya Bacteria Yang et al., 2005

Classical rRNA-based Phylogeny Archaea Bacteria Yang et al., 2005

Method:Neighbor Joining; Best Tree; tie breaking = Random Distance:Absolute (# differences) Gaps distributed proportionally Nanoarchaem Pyrococcus hor 1 Archaeoglobus 2 Methanobacterium Thermotoga Rubrobacter Clostridium ther Desulfobacterium 2 Thermoanaerobacter Fusobacterium Moorella 2 Porphyromonas Bacteroides Methanosarcina bar Methanosarcina acet Methanococcus Pyrococcus hor 2 Pyrococcus fur Chloroflexus Corynebacterium Chlorobium 2 Desulfovibrio desul Rhodospillium 1 Salmonella typhi CT18 Salmonella typhimurium E. coli K12 E. coli 0157 Geobacter sulf Photobacterium (mega) Sulfolobus tok Sulfolobus sol Archaeoglobus 1 Methanosarcina maz Leptospira (lai) Streptococcus pyo 1 Streptococcus aga 2603 Streptococcus aga NEM316 Streptococcus pyo 2 Streptococcus mut Moorella 1 Geobacter meta Methylococcus Magnetococcus Chlorobium 1 Desulfovibrio vul (mega) Shewanella (Sargasso Sea) Rhodospillium 2 Xanthomonas Chromobacterium Azotobacter Bacillus halo Desulfobacterium 1 Pyrobaculum aero Thermus HB8 (mega) Synechocystis (mega) Nostoc pun Nostoc Archaea

Method:Neighbor Joining; Best Tree; tie breaking = Random Distance:Absolute (# differences) Gaps distributed proportionally Nanoarchaem Pyrococcus hor 1 Archaeoglobus 2 Methanobacterium Thermotoga Rubrobacter Clostridium ther Desulfobacterium 2 Thermoanaerobacter Fusobacterium Moorella 2 Porphyromonas Bacteroides Methanosarcina bar Methanosarcina acet Methanococcus Pyrococcus hor 2 Pyrococcus fur Chloroflexus Corynebacterium Chlorobium 2 Desulfovibrio desul Rhodospillium 1 Salmonella typhi CT18 Salmonella typhimurium E. coli K12 E. coli 0157 Geobacter sulf Photobacterium (mega) Sulfolobus tok Sulfolobus sol Archaeoglobus 1 Methanosarcina maz Leptospira (lai) Streptococcus pyo 1 Streptococcus aga 2603 Streptococcus aga NEM316 Streptococcus pyo 2 Streptococcus mut Moorella 1 Geobacter meta Methylococcus Magnetococcus Chlorobium 1 Desulfovibrio vul (mega) Shewanella (Sargasso Sea) Rhodospillium 2 Xanthomonas Chromobacterium Azotobacter Bacillus halo Desulfobacterium 1 Pyrobaculum aero Thermus HB8 (mega) Synechocystis (mega) Nostoc pun Nostoc Proteobacteria

Conclusions The total evidence tree is a good representation of the individual Cas gene trees, and can be used to draw the same conclusions The trees support the hypothesis that Cas genes have been passed via horizontal gene transfer More work is required to eliminate the alternate hypothesis that the trees reflect convergent evolution in response to similar environments

Yang, S. Doolittle, R. F., and Bourne, P. E Phylogeny determined by protein domain content. PNAS 102: Jansen, R., van Embden, J. D., Gaastra, W., and Schouls, L. M Identification of genes that are associated with DNA repeats in prokaryotes. Mol. Microbiol. 43: References