Metagenomics. What is metagenomics? Term first used in 1998 by Jo Handelsman "the application of modern genomics techniques to the study of communities.

Slides:



Advertisements
Similar presentations
Cyber Metagenomics; Challenge to See The Unseen Majority in The Ocean
Advertisements

16S sequencing for microbiome studies Nicola Segata and Nick Loman
Clostridium difficile Colitis or Dysbiosis. Symbiostasis/Dysbiosis.
Tucson High School Biotechnology Course Spring 2010.
Use of the genomic data o Reconstruction of metabolic properties o Nature’s Microbiome o NGS in Population Genetics.
Metabarcoding 16S RNA targeted sequencing
Basic Microbiome Analysis with QIIME
Microbiome Analysis from sample to data MGL Users Group June 18, 2014.
Yaron Fireizen, Vinay Rao, Lacy Loos, Nathan Butler, Dr. Julie Anderson, Dr. Evan Weiher ▪ Biology Department ▪ University of Wisconsin-Eau Claire From.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey What is Metagenomics?  Traditional microbial genomics 
Practical Bioinformatics Community structure measures for meta-genomics István Albert Bioinformatics Consulting Center Penn State.
Greg Phillips Veterinary Microbiology
ACTIVITY 2: SIZE AND SCALE MATTER! Original drawings by John Tenniel.
Microbial Diversity.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Molecular Microbial Ecology Lecture 1 Professor Ralph Kirby Faculty of Life Sciences Extension 5511 Room B322.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
The Microbiome and Metagenomics
Zachary Bendiks. Jonathan Eisen  UC Davis Genome Center  Lab focus: “Our work focuses on genomic basis for the origin of novelty in microorganisms (how.
Introduction to metagenomics Agnieszka S. Juncker Center for Biological Sequence Analysis Technical University of Denmark.
Metagenomics Binning and Machine Learning
Metagenomic Analysis Using MEGAN4
DNA Fingerprinting of Bacterial Communities. Overview Targets gene for ribosomal RNA (16S rDNA) Make many DNA copies of the gene for the entire community.
Molecular Microbial Ecology
Discovery of new biomarkers as indicators of watershed health and water quality Anamaria Crisan & Mike Peabody.
From Metagenomic Sample to Useful Visual Anna Shcherbina 01/10/ Anna Shcherbina Bioinformatics Challenge Day 02/02/2013 From Metagenomic Sample to.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Systematics the study of the diversity of organisms and their evolutionary relationships Taxonomy – the science of naming, describing, and classifying.
Species  OTUs  OPUs  Species  OTUs  OPUs. Rosselló-Mora & Amann 2001, FEMS Rev. 25:39-67 Taxa circumscription depends on the observable characters.
Probes can be designed in an evolutionary hierarchy.
Diversity of uncultured candidate division SR1 in anaerobic habitats James P. Davis Microbial & Molecular Genetics Oklahoma State University.
Accurate estimation of microbial communities using 16S tags Julien Tremblay, PhD
Roadmap for Soil Community Metagenomics of DOE’s FACE & OTC Sites
Microbial diversity and virulence probing of five different body sites Anu Rebbapragada, Pub. Health Ontario Central Lab. Canada Wei-Jen Lin, Cal State.
Diversity and quantification of candidate division SR1 in various anaerobic environments James P. Davis and Mostafa Elshahed Microbiology and Molecular.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Microbial biomass and community composition of a tallgrass prairie soil subjected to simulated global warming and clipping A. Belay-Tedla, M. Elshahed,
Elucidating factors behind pair wise distances discrepancies between short and near full-length sequences. We hypothesized that since the 16S rRNA molecule.
The Microbiome and Metagenomics
Accurate estimation of microbial communities using 16S tags
Diversity of Soil Microbes. Approaches for Assessing Diversity Microbial community Organism isolation Culture Nucleic acid extraction Molecular characterization.
A Robust and Accurate Binning Algorithm for Metagenomic Sequences with Arbitrary Species Abundance Ratio Zainab Haydari Dr. Zelikovsky Summer 2011.
MEGAN analysis of metagenomic data Daniel H. Huson, Alexander F. Auch, Ji Qi, et al. Genome Res
Convenience Sample of 4 Adults and 6 Infants. Adults 4 visits over 2 weeks; infants 2 visits over 2 weeks Adult specimens: 1) plaque (by method, teeth,
Introducing DOTUR, a Computer Program for Defining Operational Taxonomic Units and Estimating Species Richness Patric D. Schloss and Jo Handelsman Department.
Date of download: 6/23/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A)
Use of Slow Release Nitrogen Fertilizer and its effect on soil quality. Soil bacterial population Hernandez, Jorge D., Garcia, Rosalia. and Lightfoot,
Metagenomics The study of metagenomes, genetic material recovered directly from environmental samples. Term: Coined in 1998 to refer to the idea that a.
General Microbiology (Micr300)
Computational Characterization of Short Environmental DNA Fragments Jens Stoye 1, Lutz Krause 1, Robert A. Edwards 2, Forest Rohwer 2, Naryttza N. Diaz.
Date of download: 7/7/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A) DNA.
Tools for microbial community analysis. What I am not going to talk  Culture dependent analysis  Isolate all possible colonies  Infer community  Test.
Soil Microbiome of Native and Invasive Marsh Grasses in Blackbird Creek, Delaware Lathadevi K.Chintapenta 1#, Gulnihal Ozbay 1#, Venu Kalavacharla 1* Figure.
Quantitative Phylogenetic Assessment of Microbial Communities in Diverse Environments Xinjun Zhang.
16S RNA sequencing analysis
Metagenomic Species Diversity.
Peter Sterk EBI Metagenomics Course 2014
Environmental Biochemistry University of Oldenburg Fremantle, 2013
PNAS 2012 Alpha diversity: how many species are in each sample?
Genomic Data Manipulation Thinking about data visually
Figure 1. The relationships of bacterial operational taxonomic unit richness (A) and phylogenetic diversity (B) with aridity index based on 97% sequence.
Research in Computational Molecular Biology , Vol (2008)
Denaturing Gradient Gel Electrophoresis
Genomic Data Manipulation
Microbiome: 16S rRNA Sequencing
H = -Σpi log2 pi.
Fractions of 16S rRNA genes from bacteria (top panel) and archaea (bottom panel) in public databases from primer-amplified metagenomes (with and without.
Microbiome studies for microbial disease pathogenesis research
Example usage of mockrobiota MC resource for marker gene and metagenome sequencing pipelines. Example usage of mockrobiota MC resource for marker gene.
Bacterial composition of olive fermentations is affected by microbial inoculation. Bacterial composition of olive fermentations is affected by microbial.
Presentation transcript:

Metagenomics

What is metagenomics? Term first used in 1998 by Jo Handelsman "the application of modern genomics techniques to the study of communities of microbial organisms directly in their natural environments, bypassing the need for isolation and lab cultivation of individual species” Chen, K.; Pachter, L. (2005) Comput. Biol.

Milestones in metagenomics 1985: Idea for sequencing the environment first proposed – Norman Pace and colleagues (ASM News) 1991: First published 16S rRNA“metagenomic” study (Schmidt et al., J. Bacteriol.) 2003: Sargasso sea project: > 2000 species, 148 novel bacteria (Venter et al., Science) 2004: Shotgun sequencing of seawater: > 5000 different viruses (Breibart et al., PNAS) 2004: Complete bacterial genomes assembled from environmental samples (Tyson et al., Nature) 2006: First published environmental sequences generated using NGS(454) technology (Poinar et al., Nature)

Metagenomics opens our eyes to the hidden world Viruses are the most abundant biological entities Culturable bacteria represent only about 1% of the total bacterial population Unculturable microorganisms form the vast majority of lifeforms on earth

From descriptive biology… Early studies focused simply on describing the microbial communities in different environments – How many species? In soil In sea water In hot springs In acid drainage In poop Only addressed alpha diversity – Done with Sanger sequencing

…To hypothesis testing NGS sequencing  higher throughput, lower cost Allow testing of how microbial communities differ: – On various substrates – Based on climate – After environmental peturbation – When in competition Comparing populations – Examination of Beta diversity

Metagenomic Strategies Total genomic DNA (RNA for some viruses) – Gene discovery Target genes – Antibiotic resistance – “Detoxification” genes – 16S rRNA

16S rRNA gene sequencing Highly conserved regions – Identical in all bacteria – Single PCR primer pair can amplify 16S rRNA genes from diverse bacteria Highly variable regions – Conserved within species – Divergent between species Image from Alimetrics.net

General 16S rRNA sequencing strategy Isolate “environmental” DNA Amplify 16S rRNA genes using PCR and primers recognizing conserved regions – Incorporate sequencing adaptors into primers – (add barcodes/tags to primers for multiplexing) Perform NGS – usually Roche 454 (longer read lengths) Use sequence data to identify types and abundances of bacterial “species” Measure community diversity (alpha) Compare diversity between communities, locations, treatments, etc. (beta)

Extracting “information” from sequences Types of bacteria Relative abundances of species identified Interspecific relationships Step 1:Pick operational taxonomic units (OTUs) Step 2:Take a single sequence from the cluster to represent the OTU Step 3:Compare each representative OTU sequence with a 16S rRNA gene database

OTU picking Form clusters consisting of reads with highly similar sequences – De novo picking Reads are clustered by comparing amongst themselves (slow) – Reference-based Reads are clustered by comparing with a reference dataset (fast but less informative)

De novo OTU picking

Closed reference-based picking Representative sequence

Open reference-based picking

Assign taxonomy information to OTUs Representative sequences are used to search database(s) of 16S rRNA genes from known bacterial species The OTU inherits the taxonomic descriptors of the top, legitimate match OTUs, count, taxonomy information and metadata are stored in a BIOM table

Example BIOM table (head) {"id": "None","format": "Biological Observation Matrix 1.0.0","format_url": "htt p://biom-format.org","type": "OTU table","generated_by": "QIIME 1.7.0","date": " T19:06: ","matrix_type": "sparse","matrix_element_type": "int","shape": [419, 9],"data": [[0,0,1],[1,1,1],[2,0,1],[3,2,1],[4,3,1],[5,0,1],[5,1,1],[6,4,1],[7,3,1],[8,0,1],[8,1,1],[8,2,1],[8,4,1],[9,5,1],[10,3,1],[11,1,1],[1 1,3,1],[12,6,1],[13,4,2],[13,6,1],[14,1,1],[14,2,1],[15,7,1],[16,1,1],[17,8,1],[ 18,8,1],[19,2,1],[20,8,1],[21,3,1],[21,4,1],[22,7,1],[23,7,1],[24,1,2],[25,3,1], [26,1,1],[27,7,2],[28,0,1],[29,8,1],[30,1,1],[31,8,2],[32,1,1],[32,3,1],[33,8,1],[34,1,1],[34,2,1],[35,1,1],[36,7,1],[37,3,3],[38,7,1],[39,7,2],[40,6,1],[41,3,1 ],[41,7,2],[42,3,1],[42,4,1],[43,7,1],[44,5,1],[45,4,16],[45,5,12],[46,0,6],[46, 1,2],[46,7,3],[46,8,5],[47,3,1],[48,7,1],[49,5,1],[50,4,1],[51,5,1],[52,3,1],[53,1,1],[53,3,2],[53,6,2],[54,0,37],[54,1,10],[54,3,1],[54,8,4],[55,5,1],[56,0,5], [56,1,4],[56,2,1],[56,3,2],[56,4,1],[56,5,1],[56,6,3],[56,7,9],[56,8,2],[57,5,1],[58,0,1],[59,0,1],[59,1,1],[59,2,10],[59,3,2],[59,4,2],[59,5,24],[59,6,1],[60,3,1],[61,2,1],[62,7,1],[63,2,1],[64,6,1],[65,7,1],[66,1,1],[67,3,1],[68,3,1],[69, 6,1],[70,7,1],[71,6,1],[72,0,2],[72,1,3],[72,8,2],[73,0,1],[73,8,2],[74,0,1],[74

BIOM table (tail), "c__Clostridia", "o__Clostridiales", "f__Lachnospiraceae", "g__", "s__"]}},{"id": "denovo406", "metadata": {" taxonomy": ["k__Bacteria", "p__Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachnospiraceae", "g__[Rum inococcus]", "s__gnavus"]}},{"id": "denovo407", "metadata": {"taxonomy": ["k__Bacteria", "p__Firmicutes", "c__C lostridia", "o__Clostridiales", "f__Lachnospiraceae", "g__", "s__"]}},{"id": "denovo408", "metadata": {"taxonom y": ["k__Bacteria", "p__Firmicutes", "c__Erysipelotrichi", "o__Erysipelotrichales", "f__Erysipelotrichaceae", " g__[Eubacterium]", "s__dolichum"]}},{"id": "denovo409", "metadata": {"taxonomy": ["k__Bacteria", "p__Firmicutes ", "c__Clostridia", "o__Clostridiales", "f__Lachnospiraceae", "g__", "s__"]}},{"id": "denovo410", "metadata": { "taxonomy": ["k__Bacteria", "p__Firmicutes", "c__Bacilli", "o__Turicibacterales", "f__Turicibacteraceae", "g__T uricibacter", "s__"]}},{"id": "denovo411", "metadata": {"taxonomy": ["k__Bacteria", "p__Firmicutes", "c__Clostr idia", "o__Clostridiales", "f__Lachnospiraceae", "g__", "s__"]}},{"id": "denovo412", "metadata": {"taxonomy": [ "k__Bacteria", "p__Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachnospiraceae"]}},{"id": "denovo413", "metadata": {"taxonomy": ["k__Bacteria", "p__Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachnospir aceae", "g__", "s__"]}},{"id": "denovo414", "metadata": {"taxonomy": ["k__Bacteria", "p__Bacteroidetes", "c__Ba cteroidia", "o__Bacteroidales", "f__", "g__", "s__"]}},{"id": "denovo415", "metadata": {"taxonomy": ["k__Bacter ia", "p__Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachnospiraceae", "g__", "s__"]}},{"id": "denovo 416", "metadata": {"taxonomy": ["k__Bacteria", "p__Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachno spiraceae", "g__[Ruminococcus]", "s__gnavus"]}},{"id": "denovo417", "metadata": {"taxonomy": ["k__Bacteria", "p __Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachnospiraceae", "g__", "s__"]}},{"id": "denovo418", " metadata": {"taxonomy": ["k__Bacteria", "p__Firmicutes", "c__Clostridia", "o__Clostridiales", "f__Lachnospirace ae"]}}],"columns": [{"id": "PC.636", "metadata": null},{"id": "PC.635", "metadata": null},{"id": "PC.356", "met adata": null},{"id": "PC.481", "metadata": null},{"id": "PC.354", "metadata": null},{"id": "PC.593", "metadata" : null},{"id": "PC.355", "metadata": null},{"id": "PC.607", "metadata": null},{"id": "PC.634", "metadata": null }]}

Alpha diversity Measure of species richness at local scale – Locations – Conditions – Samples Measured in different ways: – Classical diversity measures Shannon index, Simpson Index, etc. – Phylogenetic measures Phylogenetic distances over whole trees

Assigning sequences to OTUs  loss of information OTU1 OTU2 ?

Phylogenetic metrics OTU1 OTU2 Genetic distance Store in a distance matrix

Beta diversity Comparison of microbial communities based on their composition Metrics assess differences between these communities in: – Overall composition Comparison of distance matrices  principle coordinates analysis – Composition at varying taxonomic levels Species, genus, family, etc.  G-tests, ANOVA, etc. – Correlation studies  Pearson test