Use Case 5: Biomarker Potential and Limitations of Circulating miRNA Performed by the Data Management and Resource Repository (DMRR) ERCC Data Analysis.

Presentation transcript:

Use Case 5: Biomarker Potential and Limitations of Circulating miRNA
Performed by the Data Management and Resource Repository (DMRR), ERCC Data Analysis
Williams Z., et al. (2013) Comprehensive profiling of circulating microRNA via small RNA sequencing of cDNA libraries reveals biomarker potential and limitations. PNAS 110: 4255–4260.

The goal of this use case is to show that the Genboree Workbench and the exceRpt small RNA-seq analysis pipeline can replicate the results of Williams et al. We developed these pipelines because, as the Extracellular RNA Communication Consortium (ERCC) generates more datasets, it is vital that they be analyzed in a reproducible and comparable way. The dataset from Williams et al. is freely available in the Sequence Read Archive (SRA), which makes it a good practice dataset for becoming familiar with the Workbench and exceRpt.

Biological Question of Interest
This work from the Tuschl lab at Rockefeller University is mainly a technical study. The authors examined extracellular RNA (exRNA) from the bloodstream of mothers and fathers, their newborn babies, and the placenta. They wanted to quantify how much exRNA could be found and whether biomarker RNAs could be detected at that level. The placenta served as a model of a tumor: if exRNAs from the placenta can be detected, the same should be true of tumors.

Biological Samples to Be Analyzed
The dataset analyzed in this use case is available from the Sequence Read Archive (SRA) under BioProject PRJNA

Use Case 5: Data Scrubbing
To match the sample names from the article with the SRA accessions in the exceRpt output, we visit the BioProject page at the SRA.


Use Case 5: Data Comparison
After matching sample IDs between the two pipelines, we can import both tables into Excel and start generating comparison plots.
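The matching and import steps can also be scripted. The sketch below is only an illustration: the sample names, run accessions, and read counts are hypothetical placeholders, not values from the article or from the exceRpt output, and `join_tables` is a helper name introduced here.

```python
# Hypothetical sample names and SRA run accessions. The real pairs come
# from the BioProject page and are not reproduced here.
sample_to_run = {
    "sample_A": "SRR0000001",
    "sample_B": "SRR0000002",
    "sample_C": "SRR0000003",
}

# Read counts keyed by article sample name (placeholder values).
article_counts = {
    "sample_A": {"miRNA": 1000, "tRNA": 50},
    "sample_B": {"miRNA": 2000, "tRNA": 80},
}

# Read counts keyed by SRA run accession (placeholder values).
excerpt_counts = {
    "SRR0000001": {"miRNA": 990, "tRNA": 120},
    "SRR0000002": {"miRNA": 2010, "tRNA": 200},
    "SRR0000003": {"miRNA": 500, "tRNA": 40},
}


def join_tables(sample_to_run, article, excerpt):
    """Return one row per sample that is present in both tables."""
    rows = []
    for sample, run in sorted(sample_to_run.items()):
        if sample in article and run in excerpt:
            rows.append({
                "sample": sample,
                "run": run,
                "article": article[sample],
                "exceRpt": excerpt[run],
            })
    return rows


joined = join_tables(sample_to_run, article_counts, excerpt_counts)
for row in joined:
    print(row["sample"], row["run"],
          row["article"]["miRNA"], row["exceRpt"]["miRNA"])
```

Samples that appear in only one of the two tables are dropped, so every joined row can be plotted as a direct article-versus-exceRpt comparison.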

Results from the Article’s Analysis Pipeline

We can compare the number of reads mapped to various categories of RNA in Table S1 of the article with the corresponding numbers from the exceRpt read mapping summary. The categories in common are total input reads, microRNA, ribosomal RNA, and transfer RNA.
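One way to make this comparison quantitative is a per-category log2 ratio of exceRpt counts to article counts. The sketch below assumes this metric and a two-fold cutoff, neither of which comes from the slides. All numbers are invented placeholders and do not reproduce Table S1 or any exceRpt result.

```python
import math

# The four categories the two pipelines report in common.
CATEGORIES = ["input reads", "miRNA", "rRNA", "tRNA"]


def log2_ratios(article_row, excerpt_row, pseudocount=1):
    """log2(exceRpt / article) per category; the pseudocount avoids log(0)."""
    return {
        c: math.log2((excerpt_row[c] + pseudocount) /
                     (article_row[c] + pseudocount))
        for c in CATEGORIES
    }


def discordant(ratios, threshold=1.0):
    """Categories whose counts differ by more than 2**threshold fold."""
    return [c for c in CATEGORIES if abs(ratios[c]) > threshold]


# Placeholder counts for one hypothetical sample (NOT real data).
article_row = {"input reads": 1_000_000, "miRNA": 400_000,
               "rRNA": 50_000, "tRNA": 10_000}
excerpt_row = {"input reads": 1_000_000, "miRNA": 390_000,
               "rRNA": 52_000, "tRNA": 45_000}

ratios = log2_ratios(article_row, excerpt_row)
flagged = discordant(ratios)
print(flagged)
```

A ratio near zero means the two pipelines agree for that category; the cutoff is arbitrary and should be tuned to the dataset.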

Use Case 5: exceRpt Pipeline Results - Number of Input Reads

Use Case 5: exceRpt Pipeline Results - MicroRNA

Use Case 5: exceRpt Pipeline Results - Ribosomal RNA

Use Case 5: exceRpt Pipeline Results - Transfer RNA

Use Case 5: Summary
The exceRpt analysis pipeline recapitulates the analysis from Williams et al., except for significant differences in the number of reads mapped to transfer RNA.

We used exceRpt version 2.8.8; the latest version is listed in the version history at the Genboree Workbench.

v2.2.8: We moved the alignment against endogenous repetitive elements (RE) so that it occurs after the main small RNA alignments performed by sRNAbench. We did this because we noticed that the RE library could ‘compete’ for reads that would be better annotated as coming from tRNAs, piRNAs, or other transcripts. This competitive alignment never affected miRNAs, which are always aligned before other annotated RNAs, but we expect this update to capture reads aligning to repetitive small RNAs more faithfully, especially tRNAs, piRNAs, and snoRNAs. exceRpt still aligns to REs as a final step before aligning to exogenous sequences, because this is critical for removing highly repetitive endogenous sequences that might otherwise be mistaken for exogenous sequences.
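The effect of alignment order can be illustrated with a toy model in which each read is assigned to the first library that contains it. This is not exceRpt's implementation: the libraries are plain sets of made-up sequences, exact matching stands in for alignment, and `assign_reads` is a name introduced here.

```python
def assign_reads(reads, libraries, order):
    """Assign each read to the first library in `order` that contains it."""
    assignment = {}
    for read in reads:
        assignment[read] = next(
            (name for name in order if read in libraries[name]),
            "unassigned",
        )
    return assignment


# Toy libraries (made-up sequences). "GGGG" is present in both the tRNA
# and the repetitive-element (RE) library, so the two compete for it.
libraries = {
    "miRNA": {"AAAA"},
    "tRNA": {"CCCC", "GGGG"},
    "RE": {"GGGG", "TTTT"},
}
reads = ["AAAA", "CCCC", "GGGG", "TTTT", "ACGT"]

# RE step placed before the other small RNA libraries.
re_first = assign_reads(reads, libraries, ["miRNA", "RE", "tRNA"])
# RE step placed after the other small RNA libraries.
re_last = assign_reads(reads, libraries, ["miRNA", "tRNA", "RE"])

for read in reads:
    print(read, re_first[read], re_last[read])
```

Only the read shared by the tRNA and RE libraries changes its annotation between the two orders, while the miRNA read is unaffected because miRNAs come first in both.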

Acknowledgements
Baylor College of Medicine, Houston, TX: Aleksandar Milosavljevic, Matthew Roth
Genboree Team at Baylor: Andrew Jackson, Sai Lakshmi Subramanian, William Thistlethwaite, Sameer Paithankar, Neethu Shah, Aaron Baker
Yale University, New Haven, CT: Mark Gerstein, Joel Rozowsky, Rob Kitchen, Fabio Navarro
Gladstone Institute: Alexander Pico, Anders Riutta
Pacific Northwest Diabetes Research Institute, Seattle, WA: David Galas, Roger Alexander
Rockefeller University, New York, NY: Tom Tuschl
NIH: John Satterlee, Lillian Kuo, Dena Procaccini

Use Case 5: References
1. Williams Z., et al. (2013) Comprehensive profiling of circulating microRNA via small RNA sequencing of cDNA libraries reveals biomarker potential and limitations. PNAS 110: 4255–4260.
2. Farazi T.A., et al. (2012) Bioinformatic analysis of barcoded cDNA libraries for small RNA profiling by next-generation sequencing. Methods 58:
3. ExRNA Portal Software Resources

Useful Links
exRNA Portal Software Resources
exRNA Atlas
Genboree Workbench
Data Coordination Center Wiki
exRNA Data Analysis Tools Wiki
Use Case Tutorials
exRNA Portal Data Resource