Presenting Results Laura Biggins v1.0 1.

Slides:



Advertisements
Similar presentations
C HAPTER 1.1 Analyzing Categorical Data. I NDIVIDUALS AND V ARIABLES Individuals are the objects described by a set of data. Individuals may be people,animals,
Advertisements

Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Visualising and Exploring BS-Seq Data
Unit 9.2 / Lesson5 / presentation5a Setting up the spreadsheet.
Analysis of Variance The contents in this chapter are from Chapter 15 and Chapter 16 of the textbook. One-Way Analysis of Variance Multiple Comparisons.
8.1 Types of Data Displays Remember to Silence Your Cell Phone and Put It In Your Bag!
Peter Tsai Bioinformatics Institute, University of Auckland
Lecture 23: Tues., Dec. 2 Today: Thursday:
Projects Key dates lists of suggested projects published * *You are highly encouraged to choose a project yourself or find a relevant project.
Predicting protein functions from redundancies in large-scale protein interaction networks Speaker: Chun-hui CAI
Introduction to research Analyzing and interpreting research data
Experimental Statistics I.  We use data to answer research questions  What evidence does data provide?  How do I make sense of these numbers without.
Sometimes, Tables can be confusing
High Throughput Sequencing
Pathway analysis Daniel Hurley Pathway analysis: summary A popular buzzword… but what does it mean? A popular buzzword… but what does it mean? How do.
Test Taking Tips How to help yourself with multiple choice and short answer questions for reading selections A. Caldwell.
NGS Analysis Using Galaxy
1Module 2: Analyzing Gene Lists Canadian Bioinformatics Workshops
Gene Set Enrichment Analysis (GSEA)
Sample size vs. Error A tutorial By Bill Thomas, Colby-Sawyer College.
Science Fair I have my data….now how do I analyze it?
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
Networks and Interactions Boo Virk v1.0.
Graphical summaries of data
Statistical Reasoning for everyday life
Lecture 1.2 Field work (lab work). Analysis of data.
EXPLORING DEAD GENES Adrienne Manuel I400. What are they? Dead Genes are also called Pseudogenes Pseudogenes are non functioning copies of genes in DNA.
Plotting Data Exploring Computer Science Lesson 5-7.
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
Tutorial session 2 Network annotation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
An Internet of Things: People, Processes, and Products in the Spotfire Cloud Library Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist.
Copyright OpenHelix. No use or reproduction without express written consent1.
Sort the graphs. Match the type of graph to it’s name.
Beyond Slope and Points David Harris, Escondido USD/K12 Alliance Susan Gomez Zwiep, CSU Long Beach/K12 Alliance CMC Palm Springs Oct, 2013 Lesson Available:
Recording Data. Record Data Record Data in a Table or Chart. Make sure to have as much information as possible Record Everything that you do.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Writing. Academic Writing Allow about 20 minutes In TASK 1 candidates are presented with a graph, table,chart or diagram and are asked to describe, summarise.
Class 11, October 8, 2015 Lessons 2.4 & 2.5.  By the end of this lesson, you should understand: ◦ The scale on graphs can change the perception of the.
Bioinformatics support at School of Biological Sciences
Investigate Variation of Chromatin Interactions in Human Tissues Hiren Karathia, PhD., Sridhar Hannenhalli, PhD., Michelle Girvan, PhD.
Interpreting Data from Tables and Graphs. What are Tables and Graphs Tables and graphs are visual representations. They are used to organize information.
Copyright OpenHelix. No use or reproduction without express written consent1.
QUALITY MANAGEMENT TOOLS:. Bar Charts are a efficient way to compare classes or groups of data. A class or group can have a single category of data or.
No reference available
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Bud length (mm)bud width (mm) Let’s pretend this is my data. I’ve entered it in Excel. Note that I have labeled my columns appropriately.
Chapter 2:.  Come up to board and write the number of different types of social media YOU have used TODAY; write anywhere; no need to organize in any.
CyVerse Workshop Transcriptome Assembly. Overview of work RNA-Seq without a reference genome Generate Sequence QC and Processing Transcriptome Assembly.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Unit 2: Exploring Data with Graphs and Numerical Summaries Lesson 2-2a – Graphs for Categorical Data Probability & Stats Essential Question: How do we.
Chapter 2:.  Come up to board and write the number of different types of social media YOU have used TODAY; write anywhere; no need to organize in any.
RNA Quantitation from RNAseq Data
a Cytoscape plugin to assess enrichment of
EGASP 2005 Evaluation Protocol
Exploring and Presenting Results
Biases and their Effect on Biological Interpretation
Gene expression from RNA-Seq
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
How to make great graphs that make sense!
Artefacts and Biases in Gene Set Analysis
Using Excel to Make a Graph
Exploring and Understanding ChIP-Seq data
Visualising and Exploring BS-Seq Data
Volume 9, Issue 3, Pages (September 2017)
Artefacts and Biases in Gene Set Analysis
Volume 149, Issue 7, Pages (June 2012)
Exploring and Presenting Results
Categorical Gene Set Practical Results
Cancer Cell Line Encyclopedia
Global analysis of the chemical–genetic interaction map.
Presentation transcript:

Presenting Results Laura Biggins v1.0 1

2 I have my results in a table… what next? Plot everything?

Artefacts 3 Artefacts in the data can be caused by a whole myriad of reasons during any stage from library preparation to the final step of the analysis where the gene lists are produced. RNA-seq – transcript length, expression level Ribosomal, cytoskeleton, extracellular, secreted multi-mapping reads – multi vs single ribosomal, translation Bisulphite – CpG density GC content – low and high GC fragments are underrepresented in libraries Location, average copy number Starting population of cells – remember to include background Completely random genes….

Differential power RNA-seq – transcript length, expression level Bisulphite – CpG density Non-random distribution – CpG density 4

Mapping – multi-mapping – genome Splice variants – Analysis at transcript vs gene level 5

Copy number variation 6

Categories to be wary of ribosomal cytoskeleton extracellular secreted translation glycoprotein 7

Beware… 8 GC < 0.35

9 GC > 0.6

10 All genes on chr 2, 8, 13

11 No of transcripts > 4 Random sets of 1000 genes put through DAVID

Artefacts – checking your gene list 12 Make sure background is appropriate Be suspicious of some ontology categories – Ribosomal, cytoskeleton, extracellular, secreted, translation gene_screen – Shiny app to check for obvious differences in target genes compared to background population

What next? 13

Figure examples 14

Figure examples 15

GO graph 16 Genes are often annotated with many functions

Displaying Results Interpreting and exploring results How can the results be displayed so that I can interpret and explore them most easily? – Understanding the functional terms (incl GO hierarchy) – Finding relevant information amongst the masses ( GOslim, redundant terms, clustering) Presenting results How should I present my results? What information should I include? 17

Interpreting and Exploring Results How can the results be displayed so that I can interpret them most easily? Understanding the functional categories – GOrilla – hierarchical map – Panther - interactive pie charts Reducing redundancy – DAVID – clusters of similar functions – REVIGO - semantic similarity – GOslims 18

GOrilla 19 cbl-gorilla.cs.technion.ac.il/

Panther 20

GOrilla 21 cbl-gorilla.cs.technion.ac.il/

Exploring Results How can the results be displayed so that I can interpret them most easily? Understanding the functional categories – Gorilla – hierarchical map – Panther - interactive pie charts Reducing redundancy – DAVID – clusters of similar functions – REVIGO - semantic similarity – GOslims 22

GOrilla 23 cbl-gorilla.cs.technion.ac.il/

Exploring results 24

Reducing redundancy 25

Reducing redundancy 26

Reducing redundancy 27

Reducing redundancy Use a clustering tool Use a GOslim – various versions available, may lose the interesting detail Select non-redundant terms yourself – be consistent – P-value filter, top x number of categories, largest categories, most enriched 28

What information should be included? 29

Figure examples 30

Figure examples 31

Figure examples 32

Summary Beware of artefacts – if something looks too good to be true it probably is…. Remember your background population Do not try and plot absolutely everything Choose a method to deal with redundant terms Think about what you’re plotting and whether it makes sense Do not be afraid of including tables 33

Exercise 2 34

35

36

Panther plots 37