Alastair Grant Environmental Sciences, University of East Anglia

Slides:



Advertisements
Similar presentations
RNA-Seq as a Discovery Tool
Advertisements

Control of Expression In Bacteria –Part 1
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
The Microbiome and Metagenomics
Metagenomic Analysis Using MEGAN4
Probes can be designed in an evolutionary hierarchy.
Library screening Heterologous and homologous gene probes Differential screening Expression library screening.
Metatranscriptomics: Challenges and Progress Cindi Hoover DOE Joint Genome Institute May 17, 2012.
The iPlant Collaborative
Research Papers.. Biology 423L Research Paper: Genetics behind cloning of a human gene: Outline due Oct. 23.
RNA Sequencing I: De novo RNAseq
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
Metatranscriptomics: Challenges and Progress
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Metagenomic Analysis Using MEGAN4 Peter R. Hoyt Director, OSU Bioinformatics Graduate Certificate Program Matthew Vaughn iPlant, University of Texas Super.
 Read quality  Adaptor trimming  Read sequence collapse Preprocessing Genome mapping  Map read to the spruce genome (Pabies1.0- genome.fa) using Patman
Advancing Science with DNA Sequence Natalia Ivanova MGM Workshop September 29, 2011 Metagenome analysis: use case.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Methodology U937 Human Immune Cells Control (No treatment) (n=4) Estrogen (5 uM) (n=4) 4-nonylphenol (5 uM) (n=4) Cultured Cells, RNA Isolation, RT (to.
Summer Bioinformatics Workshop 2008 BLAST Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State University – Rochester Center
Work Presentation Novel RNA genes in A. thaliana Gaurav Moghe Oct, 2008-Nov, 2008.
Canadian Bioinformatics Workshops
Shotgun sequencing reveals transkingdom alterations in immunodeficiency associated enteropathy Xiaoxi Dong (Oregon State University), Jialu Hu (Oregon.
Canadian Bioinformatics Workshops
MEGAN analysis of metagenomic data Daniel H. Huson, Alexander F. Auch, Ji Qi, et al. Genome Res
Date of download: 7/7/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A) DNA.
Risheng Chen et al BMC Genomics
Metagenomic Species Diversity.
Metagenomics: From Bench to Data Analysis 19-23rd September S rRNA-based surveys for Community Analysis: How Quantitative are they? Dr.
The Transcriptional Landscape of the Mammalian Genome
RNA-Seq for the Next Generation RNA-Seq Intro Slides
Metagenomics Rob Edwards.
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Gene expression from RNA-Seq
CHAPTER 20 PART 3: A LITTLE MORE ADVANCED BIOTECHNOLOGY TOOLS
Metagenomic assembly Cedric Notredame
Canadian Bioinformatics Workshops
The Basics of Molecular Biology
Beijing, China / August Searching for nitrifying bacteria from sewage sludge assisted by metagenomic and metatranscriptomic analyses for optimisation.
Workshop on the analysis of microbial sequence data using ARB
Analysis of Double-Stranded RNA from Microbial Communities Identifies Double- Stranded RNA Virus-like Elements  Carolyn J. Decker, Roy Parker  Cell Reports 
Metagenomics Image: Iverson et al. 2012, Science.
Metagenomics and metatranscriptomics: Windows on CF-associated viral and microbial communities  Yan Wei Lim, Robert Schmieder, Matthew Haynes, Dana Willner,
Gene-expression changes associated with stunting.
Metagenomics and metatranscriptomics: Windows on CF-associated viral and microbial communities  Yan Wei Lim, Robert Schmieder, Matthew Haynes, Dana Willner,
Add anti-foaming agent
Detection and identification of plasma bacterial and viral elements in HIV/AIDS patients in comparison to healthy adults  S.-K. Li, R. K-K. Leung, H.-X.
Comparative Genomics.
Volume 21, Issue 8, Pages (August 2014)
RNA sequencing (RNA-Seq) and its application in ovarian cancer
DNA and the Genome Key Area 3a Gene Expression RNA.
Fractions of 16S rRNA genes from bacteria (top panel) and archaea (bottom panel) in public databases from primer-amplified metagenomes (with and without.
Edwards Allen, Zhixin Xie, Adam M. Gustafson, James C. Carrington  Cell 
Predicted location and functional classification of differentially expressed transcripts. Predicted location and functional classification of differentially.
Volume 24, Issue 13, Pages (July 2014)
Gene Expression Analysis
Gautam Dey, Tobias Meyer  Cell Systems 
Jong-Eun Park, Hyerim Yi, Yoosik Kim, Hyeshik Chang, V. Narry Kim 
A Novel Family of Mammalian Taste Receptors
AZA treatment induces a distinct gene-expression pattern in stromal cells. AZA treatment induces a distinct gene-expression pattern in stromal cells. (A-C)
Basic Local Alignment Search Tool
Comparison of N. gonorrhoeae gene expression in infected men in vivo and isolates grown in vitro. Comparison of N. gonorrhoeae gene expression in infected.
Differential protein, mRNA, lncRNA and miRNA regulation by p53.
Sequence Analysis - RNA-Seq 1
CD4+CLA+CD103+ T cells from human blood and skin share a transcriptional profile. CD4+CLA+CD103+ T cells from human blood and skin share a transcriptional.
Genome-wide Functional Analysis Reveals Factors Needed at the Transition Steps of Induced Reprogramming  Chao-Shun Yang, Kung-Yen Chang, Tariq M. Rana 
Transcripts enriched and depleted in NB TICs compared with SKPs and other tumor tissues. Transcripts enriched and depleted in NB TICs compared with SKPs.
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

Alastair Grant Environmental Sciences, University of East Anglia Metatransciptomics Alastair Grant Environmental Sciences, University of East Anglia

Definition: Metatranscriptomics is the study of the function and activity of the complete set of transcripts (RNA-seq) from environmental samples. http://www.metagenomics.wiki/pdf/definition/metatranscriptomics

Hot topic WoK search (topic: metatranscriptom*) 15th September 2016 494 hits 22 of them “highly cited” Is it for you??????????? Stop and think – what would you hope to get out of a metatranscriptomic study?

Total RNA or mRNA? Environmental RNA is 90-98% rRNA rRNA gives unbiased measure of taxonomy C.f. Turner et al, 2013 But not much mRNA So not much functional information

Total RNA: Turner et al, 2013, ISME J, 7:2248 Taxonomic identification using SSU and LSU RNA Total RNA does have some substantial advantages

Allows assessment of relative abundances, without PCR bias Big difference in Eukaryote abundance. Fungi abundant in Pea. Nematodes common in Oat. Would have been difficult to see using PCR amplified SSU

Total RNA Turner et al, 2013, ISME J, 7:2248 Wheat has least effect. Oat and pea have bigger effects. Avenicin minus oat has large effect on Eukaryotes, but not on bacteria. But I should say, that we only wrote this paper because we were having such a struggle to deplete rRNA

But assuming you still want to look at function….

First isolate your mRNA But bacterial mRNA lacks poly-A tail Get rid of rRNA Hybridisation MicrobeExpress (Ambion) Sample specific 16S and 23S probes (Stewart et al, 2010) Ribo-Zero (Epicentre) Duplex specific nuclease (Yi et al, 2011) MEssageAmp (Ambion – adds a poly-A tail) There are papers that make it sound easy – Tom Turner tried most of these, and I know we weren’t the only ones to find it challenging

Best solution for us…… 4:1 Ribo-Zero bacteria: Ribo-Zero plant root/seed Usually ~10% reads were rRNA Intact RNA is important Working in rhizosphere –so plant roots were an issue. Others have come out in favour of Ribo-Zero

Using internal standard allows quantification Gifford et al, 2011 Moran et al, 2013 Prepare known RNA molecule Spike into sample before extraction at known concentration Compare # reads of spike with # reads of each mRNA Helpful in studies of environmental function (e.g. biogeochemistry) Estimates ~200 mRNA molecules/cell Half-lives of 1-8 minutes Something that is common practice in some areas of analytical chemistry, but is novel to many Useful in looking at functional activity in the environment. Less important for experimental comparisons.

Gifford et al., 2011 Genes for Ammonia processing abundant NirK only other abundant N processing gene (nitrite) S processing surprisingly high

Community-level gene expression profile based on GOS peptide database. Jorge Frias-Lopez et al. PNAS 2008 50% of transcripts were unique ?databases dominated by culturables? Community-level gene expression profile based on GOS peptide database. (A) GOS protein clusters with DNA or cDNA matches at bit scores ≥40 are shown in the Venn diagram. Numbers of reads assigned to GOS protein clusters, when >70, are plotted for both cDNA-unique protein clusters and DNA-unique protein clusters. GOS protein clusters shared by DNA and cDNA libraries (shaded in gray) were further illustrated in B. (B) GOS protein clusters shared by cDNA and DNA libraries were ranked by their cluster-based expression ratio (representation of each cluster in the cDNA library normalized by its representation in the DNA library). Furthermore, each protein cluster was categorized (and color-coded) according to its abundance in the DNA library. Representative protein clusters were highlighted from each category and discussed in the text. ©2008 by National Academy of Sciences

91% of gene families from mRNA were novel Only 29% of those from DNA were novel

Summary so far…. We get information on what is going on But lots of this activity is uncharacterised

Comparing metagenome; metatranscriptome and single cell genome sequences can be helpful….

Metagenome Metatranscriptome No metatranscriptome data for uncontaminated sample Metagenome Metatranscriptome Black = clean Blue = moderate Red = severe pollution

Oceanospirillales – 80-90% of individuals in plume But this is a pretty gross insult to the community, and the impacted community is very low diversity Oceanospirillales – 80-90% of individuals in plume Draft genome from single cell sequencing. All genes in metagenome. Most expressed in metatranscriptome

At the other end of the spectrum…

abstract “Transcripts down regulated following glyphosate treatment involved carbohydrate and amino acid metabolism, and upregulated transcripts involved protein metabolism and respiration”

But the reality is pretty subtle – a bit depressing to go to all of that effort and find such small changes. Go for gross differences! – without that, the core metabolism is inevitably fairly similar.

Making sense of the data An example pipeline – not necessarily the best

Martinez et al, 2016, Scientific Reports

After “flatulogenic diet” – functional categories

After “flatulogenic diet” – Orthologous IDs

Downregulated functions are coloured

But all this is based on assigning 1.85% of high quality reads

Bioinformatics Remove rRNA (e.g. SortMeRNA) Quality screening (e.g. Sickle) Blastx (Rapsearch2 or Diamond – not Blast) Taxonomic classification (e.g. MEGAN) Functional annotation (into Seed or KEGG categories) Clustering (e.g. CD-Hit) or assembling (e.g. Trinity) Mapping reads to clusters/contigs (e.g. RSEM) Differential expression (e.g. DESeq2)

Three ways of viewing the data Contigs or clusters A bit of each Function What does it do? Taxonomy What is it from?

Of high quality reads…. 11% of high quality reads have matches in NR 68.2% of 200+bp contigs had matches in NR 67.7% had matches in Uniprot 7% of remainder had Blastn hits in NT 35% of reads mapped to contigs But only 13% to annotated contigs Many highly abundant but unidentified contigs. Lots of short and/or rare transcripts (65%)

Conclusions Total RNA is a good place to start Unbiassed and comprehensive community composition mRNA tells us about functions More work than metagenomes Much more work than 16S amplicons Unidentifiable transcripts a bit of a mystery More common than in metagenomes Novel biochemical pathways? Or something we’ve not thought of? A challenge, but potentially a big reward