New method developments in pollen DNA barcoding for forensic applications Karen Leanne Bell, Emory University Homeland Security Symposium Series, University.

Slides:



Advertisements
Similar presentations
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Advertisements

Metabarcoding 16S RNA targeted sequencing
Determination of host-associated bacterial communities In the rhizospheres of maize, acorn squash, and pinto beans.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Getting Biodiesel From Jathropa The jatropha seeds, commonly known as the physic nuts, contain up to 40% oil. The jatropha oil can be used directly as.
Genomic DNA purification
Kamila Balušíková.  DNA – sequence of genes, repetitive sequence of noncoding regions  RNA  Proteins gene expression.
Real-Time PCR (Quantitative PCR)
Bioinformatics/PCR Lab How does having a certain genetic marker affect chances of getting brain cancer?
DNA BARCODING CHILLIES BIO-NERDS : Say Wah Yugraj Singh Tanja Obradovic Jenny Pham Lovita Bharossa Buai Chuol Diana Corzo.
DNA basics DNA is a molecule located in the nucleus of a cell Every cell in an organism contains the same DNA Characteristics of DNA varies between individuals.
explain how crime scene evidence is
DNA Barcoding Amy Driskell Laboratories of Analytical Biology
Discovery of new biomarkers as indicators of watershed health and water quality Anamaria Crisan & Mike Peabody.
A Bayesian method for DNA barcoding Kasper Munch, Wouter Boomsma, Eske Willerslev, Rasmus Nielsen, University of Copenhagen.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Genomics – Next-Gen sequencing and Microarrays
Accurate estimation of microbial communities using 16S tags Julien Tremblay, PhD
DNA barcoding: bane or boon (or both) for taxonomy? Donal A. Hickey, Concordia University, Montreal. Collaborators: Mehrdad Hajibabaei and Gregory Singer.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
1. 2 VARIANTS OF PCR APPLICATIONS OF PCR MECHANICS OF PCR WHAT IS PCR? PRIMER DESIGN.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Accurate estimation of microbial communities using 16S tags
All rights Reserved Cengage/NGL/South-Western © 2016.
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
Chapter 10 Advanced Concepts in DNA © 2012 Cengage Learning. All Rights Reserved.
Canadian Bioinformatics Workshops
Valentin Vasselon 1, Agnès Bouchez 1, Isabelle Domaizon 1, Maria Kahlert 2, Frédéric Rimet 1 Towards standardization of DNA extraction for next- generation.
Library QA & QC Day 1, Video 3
CNR ITB, Bari Section - BioInformatics and Genomics MOLECULAR BIODIVERSITY Barcode: A new challenge for Bioinformatics Cecilia Saccone Meeting FIRB 2005.
De Novo Assembly of Mitochondrial Genomes from Low Coverage Whole-Genome Sequencing Reads Fahad Alqahtani and Ion Mandoiu University of Connecticut Computer.
PRELIMINARY CONCLUSIONS
Explain how crime scene evidence is
16S rRNA Experimental Design
Riann Egusquiza D145 Presentation January 12, 2017
Quantitative Phylogenetic Assessment of Microbial Communities in Diverse Environments Xinjun Zhang.
Explain how crime scene evidence is
Canadian Bioinformatics Workshops
Biotechnology.
Metagenomic Species Diversity.
Metagenomics: From Bench to Data Analysis 19-23rd September S rRNA-based surveys for Community Analysis: How Quantitative are they? Dr.
최지혜.
Micelle PCR reduces artifact formation in 16S microbiota profiling
All rights Reserved Cengage/NGL/South-Western © 2016.
Comparison of WGA methods for genotyping fetal nucleated red blood cells for the application of non-invasive prenatal diagnosis Zhouwei Huang­­1, Angela.
Research in Computational Molecular Biology , Vol (2008)
All rights Reserved Cengage/NGL/South-Western © 2016.
3.5 Genetic modification and biotechnology
Pipelines for Computational Analysis (Bioinformatics)
Lecture 7. Topics in RNA Bioinformatics (Single-Cell RNA Sequencing)
Workshop on the analysis of microbial sequence data using ARB
Human Cells Human genomics
3.5 – Genetic Modification & Biotechnology
Reverse Complement PCR: fast, low cost amplicon based NGS
Metabarcoding Presentation
Slot blot hybridization
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
Explain how crime scene evidence is
Explain how crime scene evidence is
Visualising and Exploring BS-Seq Data
Forensic identification of 12 mammals species based on size variation of mitochondrial cytochrome b gene using multiplex PCR assay  M. Črček, K. Drobnič 
Plant Genetics A living organism of the kind exemplified by trees, shrubs, herbs, grasses, ferns, and mosses, typically growing in a permanent site, absorbing.
Aim: How is genetics used in law enforcement?
DNA Fingerprinting and Forensic Analysis
1.1.3 MI.
Explain how crime scene evidence is
explain how crime scene evidence is
Explain how crime scene evidence is
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

New method developments in pollen DNA barcoding for forensic applications Karen Leanne Bell, Emory University Homeland Security Symposium Series, University of Texas El Paso, June

Overview Background and aims Testing the DNA metabarcoding method Quantification Whole genome shotgun approach Removal of non-pollen material Ongoing method development 2

Pollen species identification Applications: Forensics Allergen monitoring Pollination ecology Determining plant community composition Typical sample: Mixture of pollen species May be low quantity 3

Pollen DNA metabarcoding Advantages over microscopic examination: Greater taxonomic resolution 4 SPECIES FAMILY GENUS FAMILY GENUS Visual DNA SPECIES Visual DNA Image: C. Chu

Pollen DNA metabarcoding Advantages over microscopic examination: Some species lack features on pollen Few people trained in palynology Many more could be trained in pollen DNA metabarcoding 5

Pollen DNA metabarcoding Advantages over microscopic examination: Some species lack features on pollen Few people trained in palynology Faster with large numbers of samples 6

Pollen DNA metabarcoding Advantages over microscopic examination: Some species lack features on pollen Few people trained in palynology Faster with large numbers of samples Disadvantages: Not quantitative Species missing from databases No standard bioinformatics pipeline 7

Overview Background and aims Testing the DNA metabarcoding method Quantification Whole genome shotgun approach Removal of non-pollen material Ongoing method development 8

ITS rbcL matK trnL Pollen DNA metabarcoding Testing the method with known pollen mixtures 9

Pollen DNA metabarcoding: Concepts and definitions DNA barcoding: The identification of species based on DNA sequence of a standard gene region DNA metabarcoding: Simultaneous identification of all species in a mixture using high-throughput sequencing of a standard gene region 10

Mixed -species pollen samples from objects of interest Pollen exine lysis DNA isolation Amplification of barcoding marker Species 1 Species 2 Sample A Species 3 Species 4 Sample B Sample C … Apply unique index to each sample Index A Index B Index C Pool and sequence Sort and remove index sequences Sample A: -Species 1 -Species 2 Sample B: -Species 1 -Species 3 -Species 4 Sample C … -Species 1 -Species 2 -Species 3 DNA sequence data Sample A: -DNA barcode 1 -DNA barcode 2 Sample B: -DNA barcode 1 -DNA barcode 3 -DNA barcode 4 Sample C … -DNA barcode 1 -DNA barcode 2 -DNA barcode 3 Search DNA sequence database Bell, K.L., Burgess, K.S., Okamoto, K.C., Aranda, R., and Brosi, B.J. (2016). Forensic Science International: Genetics, 21:

Testing the DNA metabarcoding method Why test the method? Don’t know how accurate the method is Don’t know how many samples can be multiplexed Don’t know if it is quantitative 12

Testing the basic pollen DNA metabarcoding method How many species can be detected within a sample? To what extent is this affected by taxonomic relatedness? How rare can a species be in a sample before it becomes undetectable? 13

Testing the method with “knowns” Mixtures of varying complexity: Number of species Relatedness Proportion of rarest species 450bp ITS2 500bp rbcL Illumina MiSeq: Dual-index 96 samples/flow cell Species identification with customized bioinformatics pipeline Bell, K.L., Burgess, K.S., Loeffler, V.M., Read, T.D., and Brosi, B.J. (in prep.). Testing the success of DNA metabarcoding of pollen for species identification and quantification with artificial mixtures of increasing complexity 14

ITS2 results: Increasing number of species 2 species detected3 species identified to species, Zea mays undetected, 2 species identified to genus Broussonetia papyrifera correctly identified, Zea mays undetected, Artemisia tridentata identified to genus 4 species identified to species, Zea mays undetected, 2 species identified to genus 2 species identified to species, Zea mays undetected, Artemisia tridentata identified to genus 5 species identified to species, Zea mays undetected, 2 species identified to genus 3 species identified to species, Zea mays undetected, Artemisia tridentata identified to genus 6 species identified to species, Zea mays undetected, 2 species identified to genus 15

ITS2 results: Effect of taxonomic relatedness Same genus 2 species detected Different APGIII lineage 2 species detected Same family 2 species detected Zea mays undetected Same APGIII lineage 2 species detected Monocot vs dicot 2 species detected 16

ITS2 results: Increasing rarity Different familiesSame family 2 species detected 17

Testing the DNA metabarcoding method: conclusions Success varied across species Success dependent on rarity Not affected by: Species richness Taxonomic relatedness Not quantitative 18

DNA metabarcdoing: ongoing research Bioinformatics pipeline for mixed-amplicon DNA metabarcoding including rbcL Effect of sequencing depth Standard methodology 19

Overview Background and aims Testing the DNA metabarcoding method Quantification Whole genome shotgun approach Removal of non-pollen material Ongoing method development 20

Quantitative pollen DNA metabarcoding: goals 21

Quantitative pollen DNA metabarcoding: benefits Benefits for various applications Pollination biology: Proportion of time on a particular plant species impacts pollination effectiveness Forensics: Fine-scale geographic/temporal signature May not differ qualitatively 22

Quantification & biases Sources of bias: Pollen preservation bias DNA isolation bias Amplification bias Copy number bias Phylogenetic trends Correction 23

Quantification: testing for biases Pollen preservation bias: Quantify DNA obtained from stored specimens DNA isolation bias: Quantify DNA obtained from fresh specimens Amplification bias: Context dependent Testing on mixtures of known species composition Copy number bias: rbcL and ITS2 qPCR methods 24

Quantification: copy number bias qPCR: Follows PCR amplification in real time using fluorescent dye Quantifies number of copies of gene of interest Determine copy number of chloroplast, ribosome and single-copy gene Look for phylogenetic patterns in copy number 25

Copy number bias: progress Primers developed for chloroplast for all major plant lineages Primers developed for ribosomal DNA for all major plant lineages Primers developed for single-copy nuclear gene for: Pinus taeda – gymnosperm Magnolia grandiflora – basal angiosperm Hibiscus syriacus – eudicot 26

Copy number bias: qPCR analysis Step 1: For 3 species with single-copy nuclear DNA primers: Use qPCR to find copy number of chloroplast, ribosome, and single-copy nuclear gene Calculate copy number per pollen grain for chloroplast & ribosome using: qPCR copy number of 3 genes Ploidy Number of cells per pollen grain 27

Copy number bias: qPCR analysis Step 1: For 3 species with single-copy nuclear DNA primers: Use qPCR to find copy number of chloroplast, ribosome, and single-copy nuclear gene Step 2: For all other species: Measure DNA concentration Use qPCR to find copy number of chloroplast & ribosome Calculate copy number per pollen grain for chloroplast & ribosome using: qPCR copy number of 3 genes DNA concentration Genome size Number of cells per pollen grain 28

Copy number bias: qPCR analysis Step 1: For 3 species with single-copy nuclear DNA primers: Use qPCR to find copy number of chloroplast, ribosome, and single-copy nuclear gene Step 2: For all other species: Measure DNA concentration Use qPCR to find copy number of chloroplast & ribosome For 3 species with single-copy nuclear DNA primers compare copy number calculation from both methods for accuracy 29

Copy number bias: sampling & phylogenetic analysis 100 species for analysis: 25 orders, 3 species each in different families 5 orders, 2 species-pairs in same genus, one species from another family Test for phylogenetic patterns in copy number Make generalizations for lineage, order, or family 30

Quantification: DNA isolation bias 31

Quantification: other biases Pollen preservation bias: Not examined Could be examined using stored samples Amplification bias: Could be examined with results of known mixtures Results will only apply under same PCR conditions Varies with other species in mixture 32

Quantification: ongoing research Test recently designed qPCR primers Quantify copy number of 100 species of pollen Determine phylogenetic trends for all biases Apply correction factors to convert from proportion of DNA sequencing reads to proportion of pollen grains Include correction factors in bioinformatics pipelines 33

Overview Background and aims Testing the DNA metabarcoding method Quantification Whole genome shotgun approach Removal of non-pollen material Ongoing method development 34

Whole Genome Shotgun (WGS) pollen DNA metabarcoding Random DNA Random DNA Random DNA Random DNA 35

WGS pollen DNA metabarcoding Advantages over PCR-based approaches: No amplification bias No copy number bias Finds variation between species identical in “barcode” genes Current challenges: Genome size bias Few species with genome data for reference library 36

Testing the WGS method How many species can be detected within a sample? To what extent is this affected by taxonomic relatedness? How rare can a species be in a sample before it becomes undetectable? How does it compare to mixed-amplicon sequencing? 37

Testing the method with “knowns” Subset of known mixtures used for mixed-amplicon sequencing Simulated data for same mixtures Illumina MiSeq: 21 indexed samples/flow cell 24-30M paired reads of 150bp Species identification with customized bioinformatics pipeline Bell, K.L., Macpherson, M., Burgess, K.S., Read, T.D., and Brosi, B. (in prep.) Assessing the potential for whole genome shotgun methods in mixed-species pollen identification using artificial mixtures 38

WGS results: Simulated data Same mixtures as laboratory samples, with exception of those containing Carya illinoinensis Mixture complexities: Up to 8 species Pairs of varying relatedness Down to 10% of pollen grains, 1% of DNA content 39

WGS results: Known pollen mixtures Results better than simulated data Fewer false positives All species identified except Carya illinoinensis 40

WGS results: Overall success All species could be identified given presence in reference database Advantages over mixed-amplicon sequencing: Species level identifications of species with identical ITS2 sequences Artemisia tridentata Populus tremuloides Detection of species that amplified poorly with ITS2 Zea mays Disadvantages over mixed-amplicon sequencing: Species absent from database: Carya illinoinensis 41

Testing the WGS DNA metabarcoding method: conclusions Demonstrated proof of concept Success varied only with database presence Not affected by: Species richness Taxonomic relatedness Rarity Probably not quantitative 42

WGS: ongoing research Is it quantitative? Quantitative comparison to standard DNA metabarcoding Effect of sequencing depth Standard methodology 43

Overview Background and aims Testing the DNA metabarcoding method Quantification Whole genome shotgun approach Removal of non-pollen material Ongoing method development 44

Ongoing method development Improving bioinformatics pipeline for mixed- amplicon DNA metabarcoding: Including multiple gene regions Correcting for biases Improving methods to remove non-pollen plant material Developing standard methods for DNA metabarcoding and Whole Genome Shotgun methods 45

Possible future directions Developing a geographical database to assist with determining geographic origin of pollen samples Trialing standard methods on known mixtures that resemble forensic samples Developing standard methods for specific types of forensic and other analyses 46

Acknowledgemen ts 47 Funding and Institutional Support: US Army Research Office (grants W911NF & W911NF to Brosi, Read, & Burgess) Emory University Individuals: Berry Brosi (Emory) Kevin Burgess (Columbus State) Tim Read (Emory) Cindy Chu (Emory) Emily Dobbs (Emory) Julie Fowler (Emory) Virginia Loeffler (Emory) Donated plant material: Atlanta Botanical Garden Missouri Botanical Garden New York Botanical Garden Montgomery Botanical Center Royal Botanic Gardens Victoria

48 Karen Leanne Bell Emory University