The PATRIC RNASeq Service

Slides:



Advertisements
Similar presentations
IMGS 2012 Bioinformatics Workshop: RNA Seq using Galaxy
Advertisements

Peter Tsai Bioinformatics Institute, University of Auckland
RNA-seq Analysis in Galaxy
Before we start: Align sequence reads to the reference genome
NGS Analysis Using Galaxy
Next generation sequencing Xusheng Wang 4/29/2010.
RNA-Seq Visualization
Introduction to RNA-Seq and Transcriptome Analysis
Li and Dewey BMC Bioinformatics 2011, 12:323
The Ensembl Gene set The “Genebuild” 21 April 2008.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Introduction to RNA-Seq & Transcriptome Analysis
BASys: A Web Server for Automated Bacterial Genome Annotation Gary Van Domselaar †, Paul Stothard, Savita Shrivastava, Joseph A. Cruz, AnChi Guo, Xiaoli.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
TopHat Mi-kyoung Seo. Today’s paper..TopHat Cole Trapnell at the University of Washington's Department of Genome Sciences Steven Salzberg Center.
Transcriptome Analysis
RNA-seq workshop ALIGNMENT
An Introduction to RNA-Seq Transcriptome Profiling with iPlant.
BIF Group Project Group (A)rabidopsis: David Nieuwenhuijse Matthew Price Qianqian Zhang Thijs Slijkhuis.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
Introduction to RNAseq
Gene expression (signal intensity) Control Osmotic Salt Drought Root Control Gene expression (signal intensity) Treatment.
The iPlant Collaborative
No reference available
Work Presentation Novel RNA genes in A. thaliana Gaurav Moghe Oct, 2008-Nov, 2008.
Canadian Bioinformatics Workshops
RNA Seq Analysis Aaron Odell June 17 th Mapping Strategy A few questions you’ll want to ask about your data… - What organism is the data from? -
Canadian Bioinformatics Workshops
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop.
What’s new in GO?. Priorities Annotation outreach Reference genomes User advocacy Ontology development Software.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on
Mahmuda Akter, Paige Fairrow-Davis, and Rebecca Seipelt-Thiemann
Introduction to Genes and Genomes with Ensembl
GCC Workshop 9 RNA-Seq with Galaxy
MCB 7200: Molecular Biology
Cancer Genomics Core Lab
WS9: RNA-Seq Analysis with Galaxy (non-model organism )
RNA Sequencing Day 7 Wooohoooo!
Regulatory Genomics Lab
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Analysis of genomes and transcriptomes using ChIP-seq and RNA-seq
Gene expression from RNA-Seq
RNA-Seq analysis in R (Bioconductor)
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Optimizing Biological Data Integration
exRNA Metadata Standards
Genome Sequence Annotation Server
PBIO 4500/5500: Biotechnology and Genetic Engineering
How to store and visualize RNA-seq data
High-Throughput Analysis of Genomic Data [S7] ENRIQUE BLANCO
Canadian Bioinformatics Workshops
Introductory RNA-Seq Transcriptome Profiling
Kallisto: near-optimal RNA seq quantification tool
Genome Sequence Annotation Server
From: Computational analysis of bacterial RNA-Seq data
TSS Annotation Workflow
ID Mapping tools: Converting Accessions between Databases
Transcriptome analysis
Mutation Altering the miR-184 Seed Region Causes Familial Keratoconus with Cataract  Anne E. Hughes, Declan T. Bradley, Malcolm Campbell, Judith Lechner,
Working with RNA-Seq Data
Regulatory Genomics Lab
Transcriptomics Data Visualization Using Partek Flow Software
BIOBASE Training TRANSFAC® ExPlain™
Computational Pipeline Strategies
Introduction to RNA-Seq & Transcriptome Analysis
Regulatory Genomics Lab
Common Errors in Student Annotation Submissions contributions from Paul Lee, David Xiong, Thomas Quisenberry Annotating multiple genes at the same locus.
RNA-Seq Data Analysis UND Genomics Core.
Presentation transcript:

The PATRIC RNASeq Service

Executing the Service Reads are aligned to a reference genome Counts are normalized across experiments Transcripts are assembled and transcript boundaries are identified Transcript abundance is quantified Tests for differential gene expression are performed Operon structures are predicted Results are formatted for visualization in a genome browser

1 2 3 4 5 Mero 25 vs MHB Mero 75 vs MHB Mero 25 vs MHB-NaCl Mero 75 vs MHB-NaCL MHB vs MHB-NaCl

Mapping RNA Reads to a Reference

RNA-seq strategies Tuxedo Pipeline [1] C. Trapnell, A. Roberts, L. Goff, G. Pertea, D. Kim, D. R. Kelley, H. Pimentel, S. L. Salzberg, J. L. Rinn, and L. Pachter, “Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks,” Nature Protocols, vol. 7, no. 3, pp. 562–578, Mar. 2012. [2] R. McClure, D. Balasubramanian, Y. Sun, M. Bobrovskyy, P. Sumby, C. A. Genco, C. K. Vanderpool, and B. Tjaden, “Computational analysis of bacterial RNA-Seq data.” [Online]. Available: http://nar.oxfordjournals.org. [Accessed: 15-Apr-2016]. RNA-seq strategies Tuxedo Pipeline

§ Caenorhabditis elegans § Danio rerio § Drosophila melanogaster § Gallus gallus § Homo sapiens § Macaca mulatta § Mus musculus § Mustela putorius furo § Rattus norvegicus § Sus scrofa

Setting up a Submission to the PATRIC RNA Seq Service Upload data into a private workspace Select a strategy from two current strategies Rockhopper (developed for bacteria) Tuxedo (developed for eukaryotes but can be used on bacteria) Select a reference genome for mapping reads Can be a public PATRIC genome Can be one of your private genomes in your workspace Create or select an output folder in your workspace Create conditions and associate them to sets of reads

RNA-seq in response to antibiotic stress Bioproject: PRJNA234525 Identification of transcriptional pathways associated with antibiotic stress in Acinetobacter baumannii Once they reached an OD600 of 0.4, the cultures to receive antibiotic treatment were amended with 25% or 75% of the approximate MIC value of one antibiotic (meropenem, ciprofloxacin, amikacin sulfate, or polymyxin-B) David Rasko Institute for Genome Sciences/Microbiology and Immunology http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM1432601 http://www.ncbi.nlm.nih.gov/bioproject/234524

RNA-seq in response to antibiotic stress The conditions are: MHB = Meuller Hinton Broth and represents “normal” growth conditions MHB-NaCl = Meuller Hinton Broth and 200mM NaCl. This should invoke an osmotic stress response COL25 = 25% MIC of polymyxin-b COL75 = 75% MIC of polymyxin-b The reference genome is Acinetobacter baumannii 34654, a public genome in PATRIC In this execution of the service, I’m using data generated at UMD on Acinetobacter baumannii and downloaded from the SRA While biological replicates exist, I’m simplifying things for this example The conditions are: MHB = Meuller Hinton Broth and represents “normal” growth conditions MHB- NaCl = Meuller Hinton Broth and 200mM NaCl. This should invoke an osmotic stress response COL25 = 25% MIC of polymyxin-b (a natural bacteriocide found in topical creams such as Neosporin) COL75 = 75% MIC of polymyxin-b The reference genome is Acinetobacter baumannii 34654, a public genome in PATRIC

Selecting parameters

Choosing the target genome

Assigning output folder and name

Uploading reads

Assigning condition names

Loading reads for analysis

Submitting the job

Results Different strategies produce different results Both strategies produces this information Transcripts Operons Alignments Differential expression matrix

Transcripts The output contains structural information Transcription start and stop Translation start and stop The output contains functional information Gene product, gene synonyms The output contains cross references to the PATRIC databases Contig identifiers and gene identifiers The output contains relative abundance measures of the transcripts The output contains q-values for the differentially expressed transcripts

Visualizing Transcript Data in the Genome Browser Using the genome finder, locate the reference genome Filter the reference genome using the Contig ID Using the genome browser, zoom in on the area of interest Note that I can confirm that the second transcript starting at 215347 is predicted in multiple experiments.

Work with the Results Off-line Data is easily downloaded Easy to import into MS-Excel

Integrate the Results with the Public Data in PATRIC The Differential Expression Import Service transforms and integrates differential expression data for viewing in PATRIC The data can be generated by the PATRIC RNA-seq service using any of the strategies The data can be generated using other chip based technologies using external software Differential expression objects are created in your workspace Differential expression objects contain expression data from different conditions