Use Case 1: Exogenous exRNA in plasma of patients with Colorectal Cancer and Ulcerative Colitis Wednesday, Nov 5 th, 2014 6:00 – 8:30 pm Organized and.

Slides:



Advertisements
Similar presentations
RightNow 8 -- Adding a new report: New > Report: ORAnalytics > Reports > New Report
Advertisements

Genboree Microbiome Workbench 16S Workshop Part I March 11 th, 2014 Julia Cope Emily Hollister Kevin Riehle.
Modules 6 and 7: Genboree and Epigenome Comparison Aleksandar Milosavljevic Epigenomics Data Analysis and Coordination Center (EDACC) Presented at the.
Exploring the Human Transcriptome
Peter Tsai Bioinformatics Institute, University of Auckland
Use Case 3: Circulating miRNA Changes Associated with Alzheimer’s and Parkinson’s Diseases Thursday, 23 April, :30 pm Organized and Hosted by the.
MCB Lecture #21 Nov 20/14 Prokaryote RNAseq.
Introduction to Bioinformatics - Tutorial no. 9 RNA Secondary Structure Prediction.
Accelerating biology with bioinformatics: collaboration with lab scientists Lewitter, RECOMB BE, July 2011.
RNA-seq Analysis in Galaxy
Small RNA-Seq Data Analysis Tools in the Genboree Workbench Sai Lakshmi Subramanian Lab of Prof. Aleksandar Milosavljevic Bioinformatics Research Laboratory.
NCBI resources III: GEO and expression data analysis Yanbin Yin Fall
Use Case 2: HDL-bound small RNA in Lupus 1 Data and background slides kindly provided by Kasey Vickers, Vanderbilt University. Use Case 2: High-Density.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
® IBM Software Group © 2006 IBM Corporation The Eclipse Data Perspective and Database Explorer This section describes how to use the Eclipse Data Perspective,
Supplementary Material Supplementary Tables Supplementary Table 1. Sequencing statistics for ChIP-seq samples. Supplementary Table 2. Pearson correlation.
NGS Analysis Using Galaxy
Introduction to RNA-Seq and Transcriptome Analysis
Bioinformatics and OMICs Group Meeting REFERENCE GUIDED RNA SEQUENCING.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
Eran Yanowski, Eran Hornstein’s: Monitor drug impact on the transcriptome of mouse beta cells (primary and cell-line) using Transeq/RNA-Seq Report.
Predicting MicroRNA Genes and Target Site using Structural and Sequence Features: Machine Learning Approach Malik Yousef Institute of Applied Research,
Stefan Aigner Christian Carson Rusty Gage Gene Yeo Crick-Jacobs Center Salk Institute Analysis of Small RNAs in Stem Cell Differentiation.
NIH Extracellular RNA Communication Consortium 2 nd Investigators’ Meeting May 19 th, 2014 Sai Lakshmi Subramanian – (Primary
Welcome to DNA Subway Classroom-friendly Bioinformatics.
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
EDACC Quality Characterization for Various Epigenetic Assays
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Fig S1. miR-10b overexpression in exosomes derived from MDA-MB-231 cells stably expressing miR-10b as compared to vector control exosomes. Supplementary.
Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.
Input data for analysis Users that have expression values (dataset 1_ chicken affy_foldchane.txt. can upload that file as shown in slide 30.
Sai Lakshmi Subramanian ERC Consortium Data Management & Resource Repository (DMRR) Baylor College of Medicine, Houston, TX 5 th NIH ERCC Investigators’
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Use Case 5: Biomarker Potential and Limitations of Circulating miRNA Performed by the Data Management and Resource Repository (DMRR) ERCC Data Analysis.
IGV tools. Pipeline Download genome from Ensembl bacteria database Export the mapping reads file (SAM) Map reads to genome by CLC Using the mapping.
Investigate Variation of Chromatin Interactions in Human Tissues Hiren Karathia, PhD., Sridhar Hannenhalli, PhD., Michelle Girvan, PhD.
Contains details of your submission Manifest file FILE EXTENSION -.manifest.json FORMAT - JSON format REQUIRED - Genboree login name, group name, database.
No reference available
Use Case 3: Circulating miRNA Changes Associated With Alzheimer’s and Parkinson’s Diseases Wednesday, Nov 5 th, :00 – 8:30 pm Organized and Hosted.
ExRNA Data Analysis Tools in the Genboree Workbench Organized and Hosted by the Data Management and Resource Repository (DMRR) Sai Lakshmi Subramanian.
First of all: “Darnit Jim, I’m a doctor not a bioinformatician!”
Stitching the Tutorials Together Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.
Expression profiling & functional genomics Exercises.
MiRNAs and siRNAs 5 th March 2013 Saeideh Jafarinejad 4/22/2013 Rhumatology Research Center Lab(RRC lab)
Canadian Bioinformatics Workshops
William Thistlethwaite 6th NIH ERCC Investigators’ Meeting
Integrative Genomics Viewer (IGV)
Regulatory Genomics Lab
Stubbs Lab Bioinformatics – 4 Alignment Summary Report & Count files with htseq-count Nov 29, 2016 Joe Troy.
How to store and visualize RNA-seq data
Alteryx User Group August 2016.
Circulating MicroRNAs as Biomarkers of Colorectal Cancer: Results From a Genome- Wide Profiling and Validation Study  María Dolores Giráldez, Juan José.
Pathway Informatics December 5, 2018 Ansuman Chattopadhyay, PhD
Expression profiling of snoRNAs in normal hematopoiesis and AML
miRNA expression patterns in stools from healthy subjects.
AGEseq: Analysis of Genome Editing by Sequencing
Mapping of small RNAs to LINE‐1 sequence (related to Figs 3 and 4)‏
Figure 2. ROC curves for different group comparisons
Regulatory Genomics Lab
Transcriptomics Data Visualization Using Partek Flow Software
Negative and Rational Indices all slides © Christine Crisp
Adding &Subtracting.
Regulatory Genomics Lab
© The Author(s) Published by Science and Education Publishing.
Expression profiles of 87 miRNAs expressed in SC
Annual percent change (APC) in age-specific colorectal cancer (CRC), colon cancer and rectal cancer incidence rates in Europe, 1990–2016. *Indicates that.
Global analysis of the chemical–genetic interaction map.
A, Changes in BRAFV600E cfDNA allele fraction from baseline after one dose of treatment for 12 patients with serial samples available, classified according.
Presentation transcript:

Use Case 1: Exogenous exRNA in plasma of patients with Colorectal Cancer and Ulcerative Colitis Wednesday, Nov 5 th, :00 – 8:30 pm Organized and Hosted by the Data Management and Resource Repository (DMRR) Data kindly provided by David Galas, Pacific Northwest Diabetes Research Institute (PNDRI) ERCC Data Analysis Workshop

Background: Comparison of human plasma small RNA profiles of patients with colorectal cancer to those with ulcerative colitis, indicated that a large fraction of reads were not mapping to the human genome. This raised the question as to what was the origin of those small RNAs? Results: Mapping suggested that a significant fraction of small RNA reads were derived from bacterial, fungal, and plant sources. Use Case 1: Exogenous exRNA Wang K., Hong L., Yuan Y., Etheridge A., Zhou Y., Huang D., Wilmes P., & Galas D. (2012) The Complex Exogenous RNA Spectra in Human Plasma: An Interface with Human Gut Biota? PLoS ONE 7: e

We will use the Genboree Workbench to check what fraction of reads do not map to the human genome. We will also use the output of the small RNA-seq Pipeline to answer the following questions: Do all plasma small RNAs map to the human genome (slide 17)? Which miRNAs are normally present in human plasma (slide 18)? What are the sources of small RNAs found in human plasma that do not map to the human genome (exercise)? Use Case 1: Exogenous exRNA Wang K., Hong L., Yuan Y., Etheridge A., Zhou Y., Huang D., Wilmes P., & Galas D. (2012) The Complex Exogenous RNA Spectra in Human Plasma: An Interface with Human Gut Biota? PLoS ONE 7: e

Biological Samples to Be Analyzed Patient NumberSampleInput File NameBiosample Metadata # in KB #1Plasma (Colorectal) SM1_crc1_sequence.fastq.gz EXR PF-BS #2Plasma (Colorectal) SM2_crc2_sequence.fastq.gz EXR PF-BS #3Plasma (Colorectal) SM3_crc3_sequence.fastq.gz EXR PM-BS #4Plasma (Ulcerative) SM6_uc1_sequence.fastq.gz EXR-93163PMC-BS #5Plasma (Ulcerative) SM7_uc2_sequence.fastq.gz EXR-93164PMC-BS #6Plasma (Ulcerative) SM8_uc3_sequence.fastq.gz EXR-93166PFC-BS #7Plasma (Control) SM11_norm1_sequence.fastq.gz EXR-D3340PMN-BS #8Plasma (Control) SM12_norm2_sequence.fastq.gz EXR-D3176PFN-BS #9Plasma (Control) SM3_norm3 _sequence.fastq.gz EXR-D3142PFN-BS Input files are located in the Data Selector in the following Group  Database  Folder: Group: exRNA Metadata Standards Database: Use Case 1: Exogenous exRNA in Colorectal Cancer and Ulcerative Colitis Folder: 1. Inputs (FASTQ) 4 Use Case 1: Exogenous exRNA

Genboree Workbench – Getting Started Getting Started – lic-commons/wiki/Getting_startedhttp://genboree.org/theCommons/projects/pub lic-commons/wiki/Getting_started Genboree Workbench Icons Explanation – lic-commons/wiki/genboree_iconshttp://genboree.org/theCommons/projects/pub lic-commons/wiki/genboree_icons FAQs – public-commonshttp://genboree.org/theCommons/ezfaq/index/ public-commons 5

Genboree Workbench – Create Database Create a Genboree Workbench Database – public-commons?faq_id=491http://genboree.org/theCommons/ezfaq/show/ public-commons?faq_id=491 hg19 6 Note: - You will be using this newly created Genboree Workbench Database to hold the output of tool runs. This will be the database that we’re referring to when we say ‘your database’. Note: - You will be using this newly created Genboree Workbench Database to hold the output of tool runs. This will be the database that we’re referring to when we say ‘your database’.

Running the Pipeline: Select Input Files 7 Note: You will input (1) fastq file per tool run. So, for each fastq file you wish to analyze, you will need to repeat the process shown on the next 3 slides. Note: You will input (1) fastq file per tool run. So, for each fastq file you wish to analyze, you will need to repeat the process shown on the next 3 slides.

8 Running the Pipeline: Select Output Database Note: Drag Your newly created database to Output Targets. Note: Drag Your newly created database to Output Targets.

9 Running the Pipeline: Select Tool

10 Running the Pipeline: Submit Job

11 Post-processing: Select Input Files Note: These zip files will be in your database, in the folder that you named: Files/smallRNAseqPipeline/[your analysis name]/ Note: These zip files will be in your database, in the folder that you named: Files/smallRNAseqPipeline/[your analysis name]/

12 Post-processing: Select Output Database Note: Drag Your newly created database to Output Targets. Note: Drag Your newly created database to Output Targets.

13 Post-processing: Select Tool

14 Post-processing: Submit Job

15 Post-processing: Begin Analysis (Excel) Note: The processed files to the left will be in your database, but will be in the folder that you named: Files/processPipelineRuns/ [your analysis name]/ Note: The processed files to the left will be in your database, but will be in the folder that you named: Files/processPipelineRuns/ [your analysis name]/

Use Case 1: Pipeline Results - Number of Input Reads 16 Case 3) Sample ID inputclipped calibrato rrRNAnot_rRNAgenome miRNA sense miRNA antisensetRNA sense tRNA antisense piRNA sense piRNA antisense snoRNA sense snoRNA antisense Rfam sense Rfam antisense miRNA plantVirus sense norm127,002,90110,349,566NA3,483,7066,865,8603,154,174156, , ,323 norm227,957,1859,872,947NA3,253,5516,619,3962,969,73072, , ,776 norm328,214,2619,316,527NA2,929,0746,387,4532,901,24791, , ,048 crc221,132,6744,455,562NA1,508,6052,946,9571,307,65747,20485, ,266 crc323,547,3685,737,688NA1,901,3563,836,3321,721,24855,287107, ,076 crc122,729,8582,431,702NA704,8871,726,815767,52313,76841, ,817 uc120,626,9935,265,060NA1,714,1803,550,8801,553,15821,834611, ,915 uc218,186,2595,742,022NA1,937,7193,804,3031,642,32921,30747, ,510 uc328,426,8197,095,086NA2,447,3024,647,7842,099,770155,573810, ,218 Summary Table from small RNAseq Pipeline Wang et al (2012)

17 Sample not_rRNAgenome Mapped Fraction Unmapped Fraction norm %54% norm %55% norm %55% crc %56% crc %55% crc %56% uc %56% uc %57% uc %55% Fraction mapping to the human genome = genome / not_rRNA Summary Table from small RNA-Seq Pipeline Use Case 1: Pipeline Results - Do all plasma small RNAs map to the human genome? Wang et al (2012)

18 Fraction of reads mapping to miRNA = miRNA_sense / not_rRNA Summary Table from Pipeline Use Case 1: Pipeline Results – Reads Mapping to miRNA sampleinputclippedrRNA not rRNAgenome miRNA sense miRNA antisensetRNA sense tRNA antisense piRNA sense piRNA antisense snoRNA sense snoRNA antisense miRNA plantVirus sense miRNA sense average norm1393%151%51%100%46%2.28%0.0002%0.2148%0.0006%0.0106%0.0024%0.0016%0.0003%0.1795%1.60% norm2422%149%49%100%45%1.10%0.0002%0.1776%0.0008%0.0092%0.0040%0.0018%0.0027%0.1477% norm3442%146%46%100%45%1.44%0.0002%0.1956%0.0006%0.0115%0.0031%0.0020%0.0004%0.1260% crc2717%151%51%100%44%1.60%0.0003%0.1864%0.0006%0.0171%0.0067%0.0029%0.0057%0.1108%1.28% crc3614%150%50%100%45%1.44%0.0003%0.2072%0.0005%0.0174%0.0045%0.0020%0.0074%0.1062% crc11316%141%41%100%44%0.80%0.0002%0.1030%0.0003%0.0141%0.0036%0.0014%0.0002%0.1052% uc1581%148%48%100%44%0.61%0.0002%0.3210%0.0007%0.0186%0.0064%0.0052%0.0051%0.0821%1.51% uc2478%151%51%100%43%0.56%0.0001%0.1857%0.0004%0.0175%0.0039%0.0015%0.0044%0.1185% uc3612%153%53%100%45%3.35%0.0002%0.2215%0.0007%0.0155%0.0051%0.0029%0.0037%0.1123% Wang et al (2012)

19 We can look for the answer to this question in the processed pipeline output file DG_miRNA_Quantifications_RPM.txt. Use Case 1: Which miRNAs are normally present in human plasma? miRNAnorm1norm2norm3crc1crc2crc3uc1uc2uc3normcrcuccrc/normuc/norm hsa-let-7b-5p hsa-miR-451a hsa-let-7a-5p hsa-miR-378a-3p hsa-miR-143-3p hsa-let-7f-5p hsa-miR-486-5p hsa-miR hsa-miR hsa-miR hsa-miR-423-5p hsa-miR-24-3p hsa-miR hsa-miR-146a-5p hsa-miR-21-5p hsa-miR-140-3p hsa-miR-122-5p hsa-miR-148a-3p hsa-let-7g-5p We added columns for averages and differential expression, and then sorted by the average expression level in normal plasma. Average s Diff. Expr.

 55-60% of miRNA reads do not map to the human genome.  ~1.5% of reads map to human miRNA.  A fraction of a percent of reads map to plant or viral miRNA. Use Case 1: Summary 20