Bioinformatics Core Director

Slides:



Advertisements
Similar presentations
RNA-seq library prep introduction
Advertisements

Functional Genomics with Next-Generation Sequencing
The Past, Present, and Future of DNA Sequencing
RNA-Seq based discovery and reconstruction of unannotated transcripts
RNAseq.
Jeff Dangl, UNC Chapel Hill Phil Hugenholtz, Susannah Tringe, JGI Ruth Ley, Cornell Rhizosphere Grand Challenge Pilot Project Scott Clingenpeel Project.
Transcriptomics Breakout. Topics Discussed Transcriptomics Applications and Challenges For Each Systems Biology Project –Host and Pathogen Bacteria Viruses.
Peter Tsai Bioinformatics Institute, University of Auckland
Greg Phillips Veterinary Microbiology
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Transcriptomics Jim Noonan GENE 760.
Ribosomal Profiling Data Handling and Analysis
Real Time PCR = Quantitative PCR.
DNA Analysis Facility User Educational Series December 11, 2009.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Roadmap for Soil Community Metagenomics of DOE’s FACE & OTC Sites
The iPlant Collaborative
RNA surveillance and degradation: the Yin Yang of RNA RNA Pol II AAAAAAAAAAA AAA production destruction RNA Ribosome.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
No reference available
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
Motif Search and RNA Structure Prediction Lesson 9.
CyVerse Workshop Transcriptome Assembly. Overview of work RNA-Seq without a reference genome Generate Sequence QC and Processing Transcriptome Assembly.
Canadian Bioinformatics Workshops
 Facilities Open House Functional Genomics Facility Molishree Joshi, Ph.D. 6/1/2015 Contact Information:
Building Excellence in Genomics and Computational Bioscience miRNA Workshop: miRNA biogenesis & discovery Simon Moxon
Next generation sequencing
The Transcriptional Landscape of the Mammalian Genome
RNA-Seq for the Next Generation RNA-Seq Intro Slides
Ø Novel approaches for linkage mapping in dairy cattle
Moderní metody analýzy genomu
Figure 1. The overall workflow of RNA-seq QC
Nina Tanaskovic – SEQAPP – Crete 2016.
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Gene expression from RNA-Seq
Short Read Sequencing Analysis Workshop
The RNA-Seq Bid Idea: Statistical Design and Analysis for RNA Sequencing Data The RNA-Seq Big Idea Team: Yaqing Zhao1,2, Erika Cule1†, Andrew Gehman1,
Lab meeting
Lecture 7. Topics in RNA Bioinformatics (Single-Cell RNA Sequencing)
Day 4 Session 22: Questions and follow-up…. James C. Fleet, PhD
Many Sample Size and Power Calculators Exist On-Line
Measuring transcriptomes with RNA-Seq
Sequence comparison: Significance of similarity scores
Step 1: amplification and cloning procedures
Budding yeast has a small genome of approximately 6000 genes.
Comparative Analysis of Single-Cell RNA Sequencing Methods
Nienke van der Stoep, James R Gorman, Frederick W Alt  Immunity 
Study overview and analysis workflow.
RNA sequencing (RNA-Seq) and its application in ovarian cancer
Mapping of small RNAs to LINE‐1 sequence (related to Figs 3 and 4)‏
Sequence comparison: Significance of similarity scores
Correlation of mouse gene expression with bacterial gene expression.
Novel p53 target genes identified by RNA-Seq, pSILAC and ChIP-Seq.
A Presentation by Regina Strelecki
Sequence Analysis - RNA-Seq 2
Introduction to RNA-Seq & Transcriptome Analysis
Relationship between Genotype and Phenotype
Schematic representation of a transcriptomic evaluation approach.
RT-PCR analysis of GFP splice variants in prp18a-1 mutants.
Volume 14, Issue 6, Pages (June 2004)
iPSC Lines for Purchase
Toward Accurate and Quantitative Comparative Metagenomics

Transcriptomic Analysis of GmSIN1 OE-1 Transgenic Soybean.
RNA-Seq Data Analysis UND Genomics Core.
Lack of pilE expression in NZCM238 is associated with lower TNF-α secretion by 16HBE cells during coculture. Lack of pilE expression in NZCM238 is associated.
Presentation transcript:

Bioinformatics Core Director Usability of Marginal Data Jyothi Thimmapuram Bioinformatics Core Director jyothit@purdue.edu bioinformatics@purdue.edu www.bioinformatics.purdue.edu ISMB 2018 June 7, 2018

Marginal Data Marginal: close to the lower limit of qualification, acceptability, or function; barely exceeding the minimum requirements; almost insufficient Not ideal or optimal to address the hypothesis for which the data are generated

Experiment Failures Experimental Design – Insufficient replicates Wrong type of reads Insufficient number of reads Contamination – During sample collection In the sequencing facility Wrong sample IDs Data collection – Mistakes in lab protocol Sequencing machine failures

Lab Protocol Arenz et. al., 2015. J Microbiol Methods. 117:1-3 • Plant DNA can confound molecular studies of bacterial endophytes. • Blocking primers that greatly increased efficiency of Illumina-based bacterial amplification.

Contamination - Mislabeling WT Mutant

Rescuing a failed experiment Using partial data eg., using the replicates with enough depth and high quality when many replicates are available. Re-purpose the data Maybe you cannot answer the question you started with the data you have, but can answer some other question eg., RNA-Seq data for transcriptome assembly & characterization

Using partial data Samples %Contamination Season1_P_Leaf1 82.25 9.98 Season1_P_Leaf2 82.36 Season2_P_Leaf2 10.01 Season1_P_Root1 88.47 Season2_P_Root1 7.64 Season1_P_Root2 88.52 Season2_P_Root2 7.66 Season1_S_Leaf1 91.73 Season2_S_Leaf1 49.01 Season1_S_Leaf2 91.63 Season2_S_Leaf2 49.35 Season1_S_Root1 77.07 Season2_S_Root1 12.22 Season1_S_Root2 77.17 Season2_S_Root2 12.25

Other uses of marginal data Guide future experiments for data collection For eg., mitochondrial/chloroplast blocking primers for plant microbiome 16S amplification k Bacteria;p Proteobacteria

Data Analysis Failures Reference genome – Genome or transcriptome Different version / strains of the reference Appropriate methods – Assembler/ Aligner Counting Statistical methods Missing data

Price A, Gibas C (2017) The quantitative impact of read mapping to non-native reference genomes in comparative RNA-Seq studies. PLOS ONE 12(7): e0180904. https://doi.org/10.1371/journal.pone.0180904

Data Interpretation Failures FDR 0.05 – 5% probability that we are rejecting the hypothesis even when it is true. If FDR for ‘YFG” is > 0.05?

Interpretation of Results Victoria et. al. 2015. Aging Cell. 14:1055 - 1066 Length distribution and annotation of small RNAs circulating in mice serum. Two major small RNA peaks were detected in the serum from the studied mice: at 20–24 nt, consistent with the size of miRNAs, and at 30–33 nt consisted of reads mapping to tRNA genes (a). A total of 76% and 24% of the total reads mapped to the mouse small noncoding RNAs were derived from tRNAs and miRNAs, respectively (b).

Guidance for Future Studies FAILURE MODE EXPERIMENT ANALYSIS Use Partial Data Re-purpose Data Guidance for Future Studies Fixable

Thank you!