 16S rRNA gene marker  intra-gene variability  primer selection  size & information content Primer selection, information content, alignment and length.

Slides:



Advertisements
Similar presentations
16S sequencing for microbiome studies Nicola Segata and Nick Loman
Advertisements

Metabarcoding 16S RNA targeted sequencing
Next-generation sequencing
Cloning lab results Cloning the human genome Physical map of the chromosomes Genome sequencing Integrating physical and recombination maps Polymorphic.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
PCR - Polymerase Chain Reaction PCR is an in vitro technique for the amplification of a region of DNA which lies between two regions of known sequence.
The Microbiome and Metagenomics
Zachary Bendiks. Jonathan Eisen  UC Davis Genome Center  Lab focus: “Our work focuses on genomic basis for the origin of novelty in microorganisms (how.
International Livestock Research Institute, Nairobi, Kenya. Introduction to Bioinformatics: NOV David Lynn (M.Sc., Ph.D.) Trinity College Dublin.
From Haystacks to Needles AP Biology Fall Isolating Genes  Gene library: a collection of bacteria that house different cloned DNA fragments, one.
Discussion on Metagenomic Data for ANGUS Course Adina Howe.
Multiple Sequence Alignment May 12, 2009 Announcements Quiz #2 return (average 30) Hand in homework #7 Learning objectives-Understand ClustalW Homework#8-Due.
Molecular Microbial Ecology
Species  OTUs  OPUs  Species  OTUs  OPUs. Rosselló-Mora & Amann 2001, FEMS Rev. 25:39-67 Taxa circumscription depends on the observable characters.
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
Probes can be designed in an evolutionary hierarchy.
Accurate estimation of microbial communities using 16S tags Julien Tremblay, PhD
Work by Antonio Izzo Based on 36 soil cores from a total of 9 plots contained within a 2.5 hectare region.
Prokaryote Taxonomy & Diversity
3- NON-RIBOSOMAL GENE RECONSTRUCTION  Core / auxiliary / strain specific genes  Housekeeping genes and accordance with global reconstruction  MLSA 
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
3- RIBOSOMAL RNA GENE RECONSTRUCITON  Phenetics Vs. Cladistics  Homology/Homoplasy/Orthology/Paralogy  Evolution Vs. Phylogeny  The relevance of the.
Christian Rinke Microbial Genomics DOE, Joint Genome Institute Introduction to ARB (From A User's Perspective)
DNA barcoding: bane or boon (or both) for taxonomy? Donal A. Hickey, Concordia University, Montreal. Collaborators: Mehrdad Hajibabaei and Gregory Singer.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
SPECIES AT THE GENOMIC LEVEL. DDH has been the gold standard  the “sex” for higher eukaryotes Stackebrandt et al., 2002, Int J Syst Evol Microbiol. 52:
POLYMERASE CHAIN REACTION (PCR) Bridges Polymerase Chain Reaction  Simple reaction  Produces many copies of a specific fragment of DNA  Live.
Chapter 7 Analyzing DNA and gene structure, variation and expression 1.Sequencing and genotyping DNA Standard/manual DNA sequencing using dideoxynucleotide.
Elucidating factors behind pair wise distances discrepancies between short and near full-length sequences. We hypothesized that since the 16S rRNA molecule.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
The Microbiome and Metagenomics
Census of Marine Life (CoML) / Sloan Foundation
Accurate estimation of microbial communities using 16S tags
Construction of Substitution matrices
Finding genes in the genome
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Canadian Bioinformatics Workshops
MEGAN analysis of metagenomic data Daniel H. Huson, Alexander F. Auch, Ji Qi, et al. Genome Res
Date of download: 6/23/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A)
Discussion on Genomic/Metagenomic Data for ANGUS Course Adina Howe.
Date of download: 7/7/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A) DNA.
16S rRNA Experimental Design
Metagenomic Species Diversity.
Metagenomics: From Bench to Data Analysis 19-23rd September S rRNA-based surveys for Community Analysis: How Quantitative are they? Dr.
Profiling microbial communities with T-RFLP
Lecture 1 Introduction to recombinant DNA Technology
Research in Computational Molecular Biology , Vol (2008)
Workshop on the analysis of microbial sequence data using ARB
Relationship between Genotype and Phenotype
Lecture 4: Probe & primer design
Microbiome: 16S rRNA Sequencing
VISUALIZING COMPLEX BACTERIAL POPULATIONS IN ANIMAL MODELS
Daniel A. Peterson, Daniel N. Frank, Norman R. Pace, Jeffrey I. Gordon 
Bacterial genomics: The controlled chaos of shifty pathogens
Comparing read recruitment, de novo, and insertion tree strategies for phylogenetic diversity computation. Comparing read recruitment, de novo, and insertion.
Phylogenetic tree of 38 Pseudomonas type strains, based on the V3-V5 region sequence of the 16S rRNA gene (V3 primer, positions 442 to 492; and V5 primer,
Proportion of 16S rRNA gene sequences in each category of phylogenetic novelty relative to cultures for each environment, by amplicons, metagenomes (without.
Daniel A. Peterson, Daniel N. Frank, Norman R. Pace, Jeffrey I. Gordon 
Practical Contents DNA Extraction Gel Electrophoresis
Diagram of fusion primers and dual-indexing approach for library construction. Diagram of fusion primers and dual-indexing approach for library construction.
Relationship between Genotype and Phenotype
RealTime-PCR.
(A) The result of a qPCR assay comparing fluorescence with cycle number. (A) The result of a qPCR assay comparing fluorescence with cycle number. The results.
Sequence variation of 16S rRNA gene primer-binding sites.
ITS rRNA gene locus. ITS rRNA gene locus. Schematic of the eukaryotic ribosomal gene cluster. The SILVA database contains sequences of the 18S gene, while.
Recovery template. Recovery template. The recovery template (internal control) has the same sequence as the PCR product except the probe region has been.
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
Unrooted neighbor-joining tree of 16S rRNA gene sequences from low-G+C-content gram-positive bacteria, obtained from clone libraries. Unrooted neighbor-joining.
Presentation transcript:

 16S rRNA gene marker  intra-gene variability  primer selection  size & information content Primer selection, information content, alignment and length

16s rRNA gene marker Conserved 2º structure Natural gene amplificationGenealogy reconstruction Ludwig and Schleifer, 1994 FEMS Rev 15:

Intra-gene variability  secondary structure shows differences in the conservation of homologous sites  highly conserved zones give information on deep-genealogies (higher resolution for distantly related)  hypervariable zones give information on recent events (higher resolution for close relatives) Anderson et al., 2008 PLoS ONE, 3: e2836 Stahl and Amann, 1991 John Wiley and Sons

Primer selection  universality  Universal primers target highly conserved regions  Universality depends on the known dataset  Different phyla may have differences in the “universal” regions (e.g. EUB 338)  Primers used for rRNA cloning may give biased results  Metagenomics without amplification steps may reveal hidden diversity EUB338 I Most Bacteria GCTGCCTCCCGTAGGA GT EUB338 II Planctomycetales GCAGCCACCCGTAGGT GT EUB338 III Verrucomicrobiales GCTGCCACCCGTAGGT GT Daims et al System Appl Microbiol 22,

Primer selection  size of the amplicon GM Valt 8 GM5 GM5-clamp F R 518 GM R F 945 Bac1055F R 1529 S 1505  ideally the almost complete gene (~ 1520 nucleotides) should be sequenced  many amplifications skip sequencing the helix 50 (~ 1490 nucleotides)  many clone libraries are based on just partial amplicons (~ 900 nucleotides)  Pairs GM3 (8) – GM4 (1492) most widely used

16S rRNA sequencing has grown exponentially in parallel to the development of sequencing techniques Yarza et al., Nature Revs : Tamames & Rosselló-Móra 2012 TIM 20: rRNA cataloguing radioactive Sanger sequencing non- radioactive Sanger sequencing reverse transcription sequencing NSG The database is exponentially increasing 99% environmental sequences 1% cultured organisms 3.8 x10 6 sequences 700,000 / year (last three) Sources of sequences and quality  rRNA Cataloguing (up to late 80’s), bad quality  reverse transcription sequencing (up to late 90’s), bad quality  Sanger methods (radioactive, biotin-labelled, terminal-dye… still in use)  cloning DNA, good quality  direct amplification, good quality  DGGE/TGGE, short sequences, bad quality  NSG, short sequences  454 technology (now up to 800nuc, mean of 500nuc), moderate quality  illumina (now 2x 250nuc), too short

16S rRNA sequencing has grown exponentially in parallel to the development of sequencing techniques Quast et al., 2013, Nuc Acid Res. 41: D590-D596 SILVA release 119 (July 2014) rate of rejection of about 30% of the existing sequences short sequences are generally worse than long stretches

We divided the 16S rRNA gene into 6 regions of 250 nucleotides -Calculated taxa recovery in each stretch -Compare with that of the full sequence Regions V1 & V2 Regions V3 & V4 Regions V5 & V6 Categoryminimum Species98.7% Genus94.5% Family86.5% Order82.0% Class78.5% Phylum75.0% Yarza et al., Nature Revs :

-77% of the 16S rRNA gene sequences < 900pb -The 5‘region (V1-V2) overestimates species -The remaining regions tend to underestimate all taxa -Increases in length tend to mirror that of the full sequence Yarza et al., Nature Revs :

Size & information content  complete sequences give complete information  partial sequences lose phylogenetic signal  short sequences lose resolution 1500 nuc 900 nuc 300 nuc

Primer selection & size of amplicons  selection of primers is important for representative results  the length of the amplified/sequenced gene  adequate phylogenetic signal  short sequences may lose resolution