Questions we can address with bioinformatic analysis and genome sequence comparison: 1.Why is a given pathogen more virulent? 2.What is the geographic.

Slides:



Advertisements
Similar presentations
Recombinant DNA Technology
Advertisements

DNA BLAST Lab.
Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
© Wiley Publishing All Rights Reserved. How Most People Use Bioinformatics.
Key Area : Genetic Control of Metabolism in Micro-organisms Unit 2: Metabolism and Survival.
          Sequence Analysis with Artemis and.
Protein Structure Database Introduction Database of Comparative Protein Structure Models ModBase 生資所 g 詹濠先.
Comparative genomics Joachim Bargsten February 2012.
Supported by the NSF Plant Genome Research and REU Programs *Supported by the NSF Plant Genome Research and REU Programs Tutorial of bioinformatics and.
Protein Functional Site Prediction The identification of protein regions responsible for stability and function is an especially important post-genomic.
Alignment of mRNAs to genomic DNA Sequence Martin Berglund Khanh Huy Bui Md. Asaduzzaman Jean-Luc Leblond.
Copyright OpenHelix. No use or reproduction without express written consent1 Organization of genomic data… Genome backbone: base position number sequence.
CSE 182: Biological Data Analysis Instructor: Vineet Bafna TA: Ryan Kelley
Evaluating alignments using motif detection Let’s evaluate alignments by searching for motifs If alignment X reveals more functional motifs than Y using.
1. How does conjugation work? Sex in Bacteria How do bacteria exchange DNA.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Genome Evolution: Duplication (Paralogs) & Degradation (Pseudogenes)
A Comprehensive Workflow for Microbial Genome Sequencing From Swab to Publication Madison I. Dunitz 1, David A. Coil 1, Jenna M. Lang 1, Guillaume Jospin.
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)
Comparative Genomics of Viruses: VirGen as a case study Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune Pune
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Sigma Factors & Transcriptional Regulation of P. syringae TTSS Alexander Wong.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Influenza Research Database (IRD): A Web-based Resource for Influenza Virus Data and Analysis Victoria Hunt 1 *, R. Burke Squires 1, Jyothi Noronha 1,
Using DNA Subway in the Classroom Red Line Lesson Sketch.
Pathway Assignments. The assignment – Annotating Pathways KEGG Pathway Database.
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
BASys: A Web Server for Automated Bacterial Genome Annotation Gary Van Domselaar †, Paul Stothard, Savita Shrivastava, Joseph A. Cruz, AnChi Guo, Xiaoli.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
GeneWise and Artemis Exercises Spliced Alignment using GeneWise Click on the GeneWise hyperlink on the course links page,
UCSC Genome Browser 1. The Progress 2 Database and Tool Explosion : 230 databases and tools 1996 : first annual compilation of databases and tools.
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
1. How does conjugation work? Sex in Bacteria How do bacteria exchange DNA.
Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
Genomics (BIO 426) James Madison University. Why are you here? Have you taught Genomics before? Plan to teach it soon? Might you teach it sometime? Just.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Recent advances in understanding gene –for – gene interactions.
P P.s. tabaci Null P.s. syringae Hrp - (TTSS) mutant HR P.s. syringae >50 pathovars based on host specificity Tobacco Bean Tomato P.s. pv. tabaci P HR.
Copyright OpenHelix. No use or reproduction without express written consent1.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Methods by which pathogens cause disease: Adhesion: bacteria must bind to the cell surfaces Colonization: bacteria produce proteins and colonize parts.
Genome sequencing and annotation Comprehensive identification of virulence gene candidates by various means Bioinformatic prioritization of virulence gene.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
What is sequencing? Video: WlxM (Illumina video) WlxM.
Winthrop June 28 – July 2, 2014 Terrell L. Hodge Western Michigan University
Metagenomic Species Diversity.
NGS Analysis Using Galaxy
How to use a bioinformatics website!
John Rathjen and group ANU
Greg Challis Department of Chemistry, University of Warwick, UK
There are four levels of structure in proteins
Comparison of HTGs involved in nutrition synthesis, CIP, bacterial cell wall synthesis, population regulation, and plant or fungal cell wall degradation.
INFORMATION FLOW AARTHI & NEHA.
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 21, Issue 3, Pages (October 2017)
Libo Shan, Ping He, Jen Sheen  Cell Host & Microbe 
Volume 21, Issue 3, Pages (October 2017)
GENOMICS Copyright © 2009 Pearson Education, Inc..
Part II SeqViewer AraCyc Help
Core genome phylogeny of V. anguillarum strains.
TF candidate selection pipeline.
Phylogenetic tree of 38 Pseudomonas type strains, based on a concatenated nine-gene MLST analysis. Phylogenetic tree of 38 Pseudomonas type strains, based.
*Supported by the NSF Plant Genome Research and REU Programs
Presentation transcript:

Questions we can address with bioinformatic analysis and genome sequence comparison: 1.Why is a given pathogen more virulent? 2.What is the geographic range of different pathogen strains and how are they changing with time? 3.Why does a given pathogen attack one host but not another? Closely related strains caused different symptoms on the same host Distantly related strains are pathogens of woody plants Distantly related strains are pathogens of the same herbaceous species

Pseudomonas syringae is a plant pathogenic bacterium divided into “pathovars” depending largely on the host plant from which they were isolated MLST analysis reveals a population structure composed of 4 major clades Host specificity is only partially related to phylogenetic relationship

Three strains representing three of the major clades sequenced to completion bean (Pph 1448A) bean (Psy B728a) tomato (Pto DC3000)

More recently draft genome sequences have become available for three P. syringae pathogens of woody plants kiwi (Pan M303091) olive (Psv NCPPB 3335) horse chestnut (Pae 2250) bean (Pph 1448A) bean (Psy B728a) tomato (Pto DC3000)

P. savastanoiolive P. aesculihorse chestnut P. actinidaekiwi What enables these strains to be pathogens of woody plants? Are the properties shared (or not) between different clades? Candidate determinants: 1. Type III effectors 2. iron acquisition capabilities 3. metabolism of compounds associated with woody tissue Pathogens of woody plants:

1. Type III effectors Translocated into the plant cell by the Type III secretion system Regulated by the hrpL alternative sigma factor Impact host range chiefly through suppression of plant defenses Tools for finding type III effector genes: 1.Look for genes named as Type III effectors by the automated annotation pipeline 2.Look for genes associated with predicted binding sites for HrpL 3.Look for regions showing BLAST similarity to known Type III effectors 4.Compare genomes and examine regions known to be enriched for Type III effectors in other strains

2. Siderophores: extracellular iron-chelating compounds used by microbes to scavenge iron from their environment Synthesized by non-ribosomal peptides synthases – enormous modular enzymes Tools for finding siderophores: 1.Look for genes named with the following keywords by the automated annotation pipeline non-ribosomal siderophore pyoverdine, achromobactin, yersiniabactin 2.Compare genomes (sometimes helps in identification of unannotated fragments 3.Look for REALLY big genes

3. Metabolism: Are these strains able to thrive in woody hosts because they can derive nutrition from wood while others can’t? Challenge: Metabolic modeling from sequence data and comparison of metabolic pathways is not easily automated.

Rodriguez-Palenzuela et al found that P. savastanoi encodes genes allowing degradation of aromatic compounds (assoc with woody plants) to readily metabolizable compounds Genes shaded gray are present in P. savastanoi but not the three pathovars with closed genome sequences (pathogens of herbaceous plants) Read more about aromatic metabolism here:

Metabolic questions 1.Are similar genes present in P. aesculi and P. actinidae? 2.Do the genes appear in a genomic island when compared to the related herbaceous pathogen P syringae phaseolicola 1448A?

Materials for genome analysis 1.Annotated pseudomolecule: contigs concatenated into a single string of nucleotides contigs within scaffolds delineated by 50 “Ns” Scaffolds delineated with TIGR linker NNNNNAATTAATTAATTNNNNN gene calls and functional assignments generated by RAST 2. HrpL binding sites predicted 3. Regions similar to effector genes IDed using BLAST Sequence/annotation visualization: Artemis Artemis Comparison Tool MAUVE (see Dave) RAST

Teaching materials Scroll to the bottom of the home page:

Handout with instructions for using Artemis and ACT Open files and save as text

Alternative to handout: Go to “View genomes” at PPI website

Sequences in the P. syringae Hop database can also be used for BLAST analysis of Genbank nr

Artemis output Overview window DNA view window Feature annotation window

Artemis Comparison Tool output (for three genomes) Note usefulness for visualizing variable regions

To work on: 1. Select one of the three “tree” pathogens 2. Make an inventory of the effector genes using the tools available Do the genes appear to be complete? Do they have hrp boxes? Are there hrp boxes with good scores (>15) near the starts of genes that do not appear to be effectors? 3. How many siderophores and non-ribosomal peptide synthases do you find? 4. Are genes linked to catechol/anthranilate metabolism and similar to those in P. savastanoi present? Are they in a conserved location in the three tree pathogens? Are they in a genomic island relative to sequenced herbaceous pathogens?