NIH Extracellular RNA Communication Consortium 2nd Investigators’ Meeting May 19th, 2014 Sai Lakshmi Subramanian – (Primary


NIH Extracellular RNA Communication Consortium 2nd Investigators’ Meeting
May 19th, 2014
Sai Lakshmi Subramanian – (Primary
Kevin Riehle –
Bioinformatics Research Laboratory, Baylor College of Medicine, Houston, TX
Data Coordination Component (DCC) of exRNA Communication Consortium
Robert Kitchen - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT
Data Integration and Analysis Component (DIAC) of exRNA Communication Consortium

Outline
– Introduction to Genboree
– Genboree Workbench Basics
– Data analysis tools
  – RNA-Seq data – RSEQtools
    – Setting up long RNA-Seq data analysis in the Genboree Workbench
  – Small RNA-Seq data – smallRNA-seq Pipeline
    – Setting up small RNA-Seq data analysis in the Genboree Workbench

Genboree Workbench Basics
– Genboree URL -
– Watch Video Tutorial – Introduction to Genboree Workbench /edit
– How do I create a new account in Genboree? commons?faq_id=493
– How to find Help within the Genboree Workbench? commons?faq_id=497
  – Help >> Getting Started
  – Help >> Tool Map
  – Help >> Tool Help Resources

Toolset Menus
Drag & drop INPUT ENTITIES to Input Data Panel
Drag & drop OUTPUT DESTINATIONS to Output Targets Panel
Genome-Based Tables
– Genomic annotations
– High-density score data
Generic Entity Tables
– Loosely typed metadata
– AVP based
– Authorization & access
– User data repository management
Genboree Core Services
Genboree User Data Services

Genboree Workbench Basics
– Create Group commons?faq_id=489
– Create Database commons?faq_id=491
– Create Project commons?faq_id=492
– Upload Data Files commons?faq_id=495

exRNA Data analysis tools
RSEQtools
– A modular and flexible framework to perform common RNA-Seq analysis tasks
– Currently implemented pipeline: generates genome-wide signal maps (hg18/hg19) as well as gene-level expression quantifications (RPKM) for UCSC known genes.
Developed by Gerstein Lab – DIAC, exRNA consortium
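To make the RPKM unit above concrete, here is a minimal Python sketch of the standard definition (reads per kilobase of transcript per million mapped reads). The function and parameter names are illustrative assumptions, and RSEQtools may count reads or define gene length differently (for example over composite gene models), so treat this as the textbook formula rather than the pipeline's own code.

```python
def rpkm(read_count: int, gene_length_bp: int, total_mapped_reads: int) -> float:
    """Standard RPKM: reads per kilobase of gene per million mapped reads.

    All names here are hypothetical; this is not RSEQtools code.
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    # 1e9 = 1e3 (bases per kilobase) * 1e6 (reads per million)
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


if __name__ == "__main__":
    # 500 reads on a 2 kb gene in a library of 10 million mapped reads
    print(rpkm(500, 2000, 10_000_000))
```

Dividing by both gene length and library size is what lets values be compared across genes of different lengths and across samples sequenced to different depths.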

RNA-Seq Analysis using RSEQtools
INPUT:
– Takes 1 or 2 FASTQ sequence files
– Gene Annotations: UCSC KnownGene for hg18, hg19
OUTPUT:
– Alignments in SAM and BAM formats, compressed in AnalysisName_alignments.tar.gz
– All result files compressed in AnalysisName_results.tar.gz
– Key result files for quick access (also found in AnalysisName_results.tar.gz):
  – File with gene expression values
  – Mapping bias file
  – Annotation coverage file
– Signal tracks of mapped reads (for visualization in genome browsers), uploaded as a signal track in the user database (if specified in the UI)
– FastQC output plots
– Custom Bowtie2 index, if generated
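Once a results archive such as AnalysisName_results.tar.gz has been downloaded, its contents can be inspected before unpacking. The sketch below uses only Python's standard `tarfile` module; the function name and the example path are assumptions, and the file names inside a real archive depend on the analysis and are not listed in the slides.

```python
import tarfile
from typing import List


def list_result_files(archive_path: str) -> List[str]:
    """Return the names of the regular files inside a .tar.gz archive."""
    with tarfile.open(archive_path, "r:gz") as archive:
        return [member.name for member in archive.getmembers() if member.isfile()]


if __name__ == "__main__":
    # Hypothetical local path; replace with the archive you downloaded.
    for name in list_result_files("AnalysisName_results.tar.gz"):
        print(name)
```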

RNA-Seq Analysis pipeline in the Genboree Workbench

Example Dataset for RNA-Seq analysis using RSEQtools

exRNA Data analysis tools
smallRNA-seq Pipeline
– pre-processing: remove adapter & primer sequences
– contaminants: no poly-A purification leaves rRNAs, etc.
– mapping: small reads tend to multi-map to large genomes
– diversity: many species of small RNA (not just miRNAs!)
– quantification: different normalization
Developed by Gerstein Lab – DIAC, exRNA consortium
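The pre-processing step matters because small RNAs are usually shorter than the read length, so the sequencer reads into the 3' adapter. The Python sketch below illustrates the idea with naive exact matching; it is not the tool the pipeline uses (the slides do not name one), and the function name, the `min_overlap` parameter, and the example adapter sequence are assumptions. Real trimmers also tolerate sequencing errors.

```python
def trim_adapter(read: str, adapter: str, min_overlap: int = 3) -> str:
    """Remove a full or partial 3' adapter from a read (exact matching only)."""
    # Case 1: the whole adapter occurs in the read; keep what precedes it.
    pos = read.find(adapter)
    if pos != -1:
        return read[:pos]
    # Case 2: the read ends partway through the adapter; look for the longest
    # read suffix that equals an adapter prefix of at least min_overlap bases.
    for k in range(min(len(adapter) - 1, len(read)), min_overlap - 1, -1):
        if read.endswith(adapter[:k]):
            return read[:-k]
    # No adapter found: return the read unchanged.
    return read


if __name__ == "__main__":
    adapter = "TGGAATTCTCGGGTGCCAAGG"      # example adapter, an assumption
    insert = "TAGCTTATCAGACTGATGTTGA"      # a 22-nt miRNA-sized insert
    print(trim_adapter(insert + adapter[:10], adapter))
```

A short `min_overlap` removes more adapter remnants but also risks clipping genuine bases that happen to match the adapter's first few nucleotides, which is one reason trimming settings affect downstream quantification.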

smallRNA-seq Pipeline
INPUT:
– Takes a single-end FASTQ sequence file
– Supported genome build: hg19
OUTPUT:
– All result files compressed in AnalysisName_results.tar.gz
  – Read length distribution
  – Endogenous mapping results – rRNA, miRNA, tRNA, snoRNA, piRNA, Rfam
  – Exogenous mapping results – plant and virus miRNAs in miRBase
  – 4 files containing the sequence reads following:
    – Adapter removal
    – rRNA removal
    – Endogenous mapping
    – Exogenous mapping
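The read length distribution listed above is a simple tally of sequence lengths, typically taken after adapter removal. A minimal Python sketch follows, assuming plain 4-line FASTQ records (no wrapped sequence lines); the function name and the file path in the usage example are hypothetical, and this is not the pipeline's own implementation.

```python
from collections import Counter
from typing import Iterable


def read_length_distribution(fastq_lines: Iterable[str]) -> Counter:
    """Count reads by length, assuming 4 lines per FASTQ record."""
    lengths: Counter = Counter()
    for index, line in enumerate(fastq_lines):
        if index % 4 == 1:  # second line of each record holds the sequence
            lengths[len(line.strip())] += 1
    return lengths


if __name__ == "__main__":
    # Hypothetical path to an adapter-trimmed, uncompressed FASTQ file.
    with open("trimmed_reads.fastq") as handle:
        for length, count in sorted(read_length_distribution(handle).items()):
            print(length, count)
```

In small RNA data a peak around 21 to 23 nucleotides is commonly attributed to miRNAs, so this tally is a quick sanity check on library quality.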

smallRNA-seq pipeline in the Genboree Workbench

Example Dataset for smallRNA-seq Pipeline

Live Demo of long and small RNA-seq pipelines in the Genboree Workbench
Tutorial Screencasts to set up your analysis in the Genboree Workbench
– Long RNA-Seq data analysis
– small RNA-Seq data analysis