The Whole Genome Sequencing Revolution Martin Wiedmann Gellert Family Professor of Food Safety Department of Food Science Cornell University, Ithaca, NY.

Slides:

Advertisements

Similar presentations

Rebecca E. Colman 1, Robert J. Brinkerhoff 2, Adina Doyle 1, Chris Ray 3, Paul Keim 1, Sharon K. Collinge 3, and David M. Wagner 1 1 Northern Arizona University,

Advertisements

A framework for the future: Building molecular tools to understand the epidemiology of Clostridium difficile in Scotland.

Course on Introduction to microbial whole genome sequencing and analysis Mette Voldby Larsen DTU – Center for Biological Sequence Analysis (CBS) Henrik.

Food Safety National Center for Emerging and Zoonotic Infectious Diseases Division of Foodborne, Waterborne, and Environmental Diseases.

Next-generation sequencing

Source attribution in Campylobacter jejuni Daniel Wilson Nuffield Department of Clinical Medicine JR Microbiology Seminar 16 th.

Foodborne Outbreak Investigation, Hanoi, Vietnam 01 – 05 June 2009 Foodborne Diseases Integrating efforts from feed to food Dr Danilo Lo Fo Wong.

Investigating Foodborne Disease Outbreaks: The CDC Perspective Ian Williams, PhD, MS Chief, Outbreak Response and Prevention Branch Division of Foodborne,

Foodborne Disease Surveillance in the U.S.: FoodNet, PulseNet, and Outbreak Alert! Caroline Smith DeWaal Center for Science in the Public Interest (U.S.)

Molecular Epidemiology: Impact on Food Regulation and Future Needs

1 Don L. Zink, Ph.D. Center for Food Safety and Applied Nutrition U.S. Food & Drug Administration College Park, MD The Challenge of Emerging Foodborne.

DNA fingerprinting Every human carries a unique set of genes (except twins!) The order of the base pairs in the sequence of every human varies In a single.

Phylogeny - based on whole genome data

United States Department of Agriculture Food Safety Inspection Service SALMONELLA SUBTYPING RESULTS IN RAW PRODUCTS FSIS Notice /3/2010 Policy Development.

Molecular Surveillance of Foodborne Infections Peter Gerner-Smidt, MD, PhD Chief of PulseNet USA CDC

Lee H. Harrison, MD Associate Professor

FDA Tree Nut Risk Assessment and Human Salmonellosis

Beyond Phylogeny: Evolutionary analysis of a mosaic pathogen Dr Rosalind Harding Departments of Zoology and Statistics, Oxford University,UK.

Department of Food Science

Listeriosis in the United States Benjamin J. Silk, PhD, MPH Staff Epidemiologist Enteric Diseases Epidemiology Branch, CDC Public meeting on the Interagency.

United States Department of Agriculture Food Safety and Inspection Service FSIS Foodborne Illness Investigations: Current Thinking Scott A. Seys, MPH Chief,

United States Department of Agriculture Food Safety and Inspection Service NACMPI February 5-6, 2008 Attribution February 5, 2008 Curtis Travis, PhD Science.

Strategy for developing a molecular subtyping tool for a foodborne bacterial pathogen using a whole genome analysis approach: the case of Salmonella Enteritidis.

Listeriosis in the United States Frederick J Angulo, DVM, PhD Enteric Diseases Epidemiology Branch Division of Foodborne, Bacterial and Mycotic Diseases.

Whole Genome Sequencing aka “WGS” - utility in foodborne illness outbreak detection and investigations Dan Rice FDA ORA – Pacific Regional Lab Northwest.

E. coli O157:H7 -- Illness trends and recent data from outbreak investigations, United States Shiga Toxin –Producing E. coli Addressing the Challenges,

United States Department of Agriculture Food Safety and Inspection Service Directive Foodborne Illness Investigations District Office Correlation.

Investigation of the Hald model as a method to improve foodborne illness source attribution estimates Antonio Vieira, DVM, MPH, PhD Enteric Diseases Epidemiology.

United States Department of Agriculture Food Safety and Inspection Service Use of Subtyping Data by FSIS: A Public Health Based Approach to Salmonella.

100K Genome Project By: Amanda Crichton and Laura Henkel.

United States Department of Agriculture Food Safety and Inspection Service Stage 1: Epidemiology and Identify the Food.

Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources

Lessons Learned from Salmonella in Eggs Outbreaks Don L. Zink, Ph.D. Center for Food Safety and Applied Nutrition U.S. Food & Drug Administration 1.

Pathogen Reduction Dialogue Panel 2 HACCP Impacts on Contamination Levels in Meat and Poultry Products: FSIS Perspective Delila R. Parham, DVM Office of.

Genome-wide longitudinal analysis of emm1 invasive Group A Streptococcus isolated from Belgian patients during 1994 ˗ 2013 J. Coppens 1, B. B. Xavier,

Neanderthals Noonan, et al. Sequencing and Analysis of Neanderthal Genomic DNA Green, et al. Analysis of one million base pairs of Neanderthal DNA Kristine.

Data Needed to Measure HACCP Impacts on Public Health Jack Guzewich, R.S., M.P.H. Pathogen Reduction Dialogue Panel 2 May 6, 2002.

Genomes & The Tree of Life

Pathogenicity of Bacteria. Campylobacter spp. Salmonella spp. Escherichia coli 76 Million Cases of Food-borne illness every year in the USA 325,000 result.

U.S. Food and Drug Administration Notice: Archived Document The content in this document is provided on the FDA’s website for reference purposes only.

Lecture #2 Characteristics of Life Studying Life What Characteristics do all living things share?

Listeria monocytogenes Prevalence, Persistence, and Control Haley F. Oliver, Ph.D. Associate Professor Purdue University.

1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.

Jean B. Patel, PhD, D(ABMM) Division of Healthcare Quality Promotion National Center for Emerging and Zoonotic Infectious Disease Centers for Disease Control.

Presented by: Najmeh Parhizgari PhD student of medical virology at TUMS Insights to Genetic Characterization Tools for Epidemiological Tracking of Francisella.

Outbreak Investigation

2. Centers for Disease Control and Prevention (CDC), Atlanta, GA, USA

Whole Genome Sequencing for Epidemiologists – A Brief Introduction

Steffany Cavallo, MPH (Tennessee) Carlota Medus, PhD, MPH (Minnesota)

Nucleotide variation in the human genome

Martin Wiedmann Cornell University

Recalls & Tracebacks Carrie Rigdon, PhD, MPH

Tracking a hospital outbreak of KPC-producing ST11 Klebsiella pneumoniae with whole genome sequencing Y. Jiang, Z. Wei, Y. Wang, X. Hua, Y. Feng, Y.

Presenter- Janet Nale ISOLATION AND CHARACTERISATION OF TEMPERATE BACTRIOPHAGES OF THE HYPERVIRULENT Clostridium difficile 027 STRAIN.

Epidemiologist Supervisor Foodborne Diseases Unit

Drivers and Constraints – application of molecular typing in surveillance of foodborne diseases in EU/EEA Johanna Takkinen, on behalf of ECDC FWD team.

Whole genome sequencing: New methods for traceback investigations

Future Directions Unknowns:

The Use of Molecular Epidemiology and

Genomics of medical importance

Outbreak Investigation

Whole genome sequencing options for bacterial strain typing and epidemiologic analysis based on single nucleotide polymorphism versus gene-by-gene–based.

Plague: Out of the Foothills

Tracking a hospital outbreak of KPC-producing ST11 Klebsiella pneumoniae with whole genome sequencing Y. Jiang, Z. Wei, Y. Wang, X. Hua, Y. Feng, Y.

Whole genome sequencing as a tool to investigate a cluster of seven cases of listeriosis in Austria and Germany, 2011–2013 D. Schmid, F. Allerberger,

Complex Outbreak Response

Contact investigations for outbreaks of Mycobacterium tuberculosis: advances through whole genome sequencing T.M. Walker, P. Monk, E. Grace Smith, T.E.A.

CDC to Inspect All Major US Egg-Producing Facilities

Francois Balloux, Ola Brønstad Brynildsrud, Lucy van Dorp, Liam P

Presentation transcript:

The Whole Genome Sequencing Revolution Martin Wiedmann Gellert Family Professor of Food Safety Department of Food Science Cornell University, Ithaca, NY Phone:

Outline Subtyping for disease surveillance: from PFGE to WGS WGS challenges: when are two isolates the same or different? Can we find identical isolates in different locations? Looking in the future

PulseNet allows international outbreak detection and traceback – a hypothetical example Food isolate, deposited into PulseNet Human case

Whole Genome Sequencing It all started with the human genome project Sequencing of a bacterial genome is now feasible at costs of <$100/isolate Costs will continue to drop Commonly used platforms include Roche 454 Illumina HiSeq/MiSeq Applied Biosystems SOLiD Systems Life Technologies/Thermofisher Ion Torrent; PacBio RS Nanopore based systems (e.g., Oxford Nanopore MinION)

The genome sequence revolution

DNA sequencing- based subtyping Isolate 1AACATGCAGACTGACGATTCGACGTAGGCTAGACGTTGACTG Isolate 2AACATGCAGACTGACGATTCGTCGTAGGCTAGACGTTGACTG Isolate 3AACATGCAGACTGACGATTCGACGTAGGCTAGACGTTGACTG Isolate 4AACATGCATACTGACGATTCGTCGAAGGCTAGACGTTGACTG SNP: single nucleotide polymorphism

Challenges with use of PFGE as a subtyping method in outbreak investigations Two isolates may show the same PFGE type even though they are genetically distinct PFGE only interrogates small part of the genome Two isolates may show “slightly” (?? - the “3-band rule”) different PFGE patterns despite sharing a very recent common ancestor Could be due to lateral genes transfer, loss of plasmid, rearrangements, point mutations etc.

Xbal SpeI L Den Bakker et al AEM. Includes isolates form Salmonella outbreak linked to sausages (Rhode Island) and isolates from pistachios

Tip-dated maximum clade credibility tree based on SNP data for 47 Montevideo isolates

98 MLVA types Salmonella Enteritidis is most common cause of human salmonellosis – poorly resolved by current subtyping technologies. 52 PFGE types 163 combined MLVA-PFGE types

Full genome sequencing identified the following differences between these isolates: (i)28 single nucleotide polymorphisms (SNPs) and (ii)three indels, including a 33 kbp prophage that accounted for the observed difference in AscI PFGE patterns. Both isolates were found to harbor a 50 kbp putative mobile genomic island encoding translocation and efflux functions that has not been observed in other Listeria genomes. Gilmour et al. BMC Genomics 2010, 11:120

In addition, whole genome sequencing showed that 5 Listeria isolates collected in 2010 from the same facility were also closely related genetically to isolates from ill people.

Listeria Outbreaks and Incidence, Pre-PulseNet Early PulseNet Listeria Initiative No. outbreaks Incidence (per million pop) Era Outbreaks per year Median cases per outbreak WGS Data are preliminary and subject to change

March 2015: Listeriosis cases linked to Blue Bell ice cream

Outline Subtyping for disease surveillance: from PFGE to WGS WGS challenges: when are two isolates the same or different? Can we find identical isolates in different locations? Looking in the future

The challenge Identical bacteria (100% match over the whole genome) can be found in different places that can be potential sources of foodborne disease outbreaks

The theoretical background Bacteria divide asexually: Bacterial populations can be seen as large populations of “identical twins” Mutation rate during replication is low: extremes of the suggested mutation rates range from 2.25 × to 4.50 × per bp per generation – With a genome size of around 5 Million bp per bacterial genome (5 × 10 6 ) between approx. 450 and 9,000 generations are needed for a single SNP difference – Eyre et al. estimated evolutionary rate of 0.74 SNVs per successfully sequenced genome per year for C. difficile (N. Engl. J. Med. 2013) “Whole-genome sequencing … identified 13% of cases that were genetically related (≤2 SNVs) but without any evidence of plausible previous contact through a hospital, residential area, or family doctor.” – Unknown bacterial generation time in different environments complicates interpretation

2000 US outbreak - Environmental persistence of L. monocytogenes 1988: one human listeriosis case linked to hot dogs produced by plant X 2000: 29 human listeriosis cases linked to sliced turkey meats from plant X

Real world observations

In one case, isolates with < 3 SNP differences were found in retail delis in there different states

Conclusions Even with WGS, epidemiological data are still essential Number of SNP differences/allele differences that is meaningful differs by organism, strain, outbreak/cluster, and growth environment – Number of bacterial generations per calendar year can differ hugely (think dry environment versus active infection in an animal population) Best way to determine “meaningful” SNP differences is through combination of phylogenetic and epidemiological data

Looking in the future WGS will get cheaper and will be used more – STEC next, probably Salmonella Enteritidis after that – Detection of more clusters and outbreaks WGS database will grow rapidly with inclusion of environmental isolates – More outbreak will be linked to source by using WGS matches between food or environmental isolates and human isolates as stating point More broad application of WGS by private labs, maybe customers and consumers?

Conclusions WGS is a game changer and will significantly improve detection of outbreaks, adulteration, etc. – False alarms will occur though Pathogen detection in environments, by regulatory agencies, will lead to inclusion of WGS data in CDC/FDA/USDA databases (GenomeTrakr) – Environmental pathogen monitoring by industry will become even more important

30

Analysis of genome wide SNPs (wgSNPs) Identifies all high confidence SNPs over whole genome (approx. 3 to 5 million nucleotides)

Whole genome multilocus sequence typing (MLST) Allows for simpler analysis and clear naming of subtypes Performs comparison on a gene by gene level Isolate AIsolate BIsolate C Gene 1111 Gene Gene 3552 Etc. Gene 1, wgMLST typeAAB