Exploring and Presenting Results

Slides:



Advertisements
Similar presentations
Spreadsheets in Finance and Forecasting Introduction to Charts, Summary Values and Pivot Tables.
Advertisements

Using visualization and network analysis to assist function analysis of microarray data Hepatitis C Virus (HCV) Micorarray Data Function Analysis Current.
Scaffold Download free viewer:
Automatic methods for functional annotation of sequences Petri Törönen.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Scientific Figure Design v2.0 Simon Andrews, Anne Segonds-Pichon, Boo Virk
Networks and Interactions Boo Virk v1.0.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
Presenting Results Laura Biggins v1.0 1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Top X interactions of PIN Network A interactions Coverage of Network A Figure S1 - Network A interactions are distributed evenly across the top 60,000.
PLACING AND LINKING GRAPHICS
CPSC 203 Introduction to Computers T43, T46 & T68 By Jie (Jeff) Gao.
CuffDiff ran successfully. Output files include gene_exp.diff What are the next steps? Use Navigation bar to find files; they may be under DNA Subway if.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Network construction and exploration using CORNET and Cytoscape - Excercises SPICY WORKSHOP Wageningen, March 8 th 2012 Stefanie De Bodt.
© 2012 The McGraw-Hill Companies, Inc. All rights reserved. word 2010 Chapter 4 Working with Tables and Graphics.
Plot type specific considerations
NVivo Software – A Qualitative Research
Plotting in Excel Ken youssefi Engineering 10.
Descriptive Statistics: Tabular and Graphical Methods
Scientific Figure Design
Multimedia Design.
SeqMonk tools for methylation analysis
SeqMonk tools for methylation analysis
Revision of Modes of Representation or Cognitive Levels
Networks and Interactions
QLIK Overview & Desk Aid
OVERVIEW Objectives Follow a design document to prepare images for inclusion in a Web page Run a batch process to prepare multiple images in one step Use.
Exploring and Presenting Results
Microsoft Office Illustrated
A C++ generic model for the GLAST Geometric Description
Pathway Visualization
Helping Children Learn
Lesson 10 – Thinking about Structure
Lesson 10 – Thinking about Structure
METHOD TEST PREP EDUCATIONAL SERIES
Using Excel to Graph Data
Enhancing a Document Part 1
Context Is Everything Meaningful Alternative Text
Analysing ChIP-Seq Data
Enhancing a Document Part 1
Overview Expression data basics Introduction Biological network data
Artefacts and Biases in Gene Set Analysis
Global comparison of MinSpan pathways with databases of human‐defined pathways The pairwise connection specificity index (CSI) was calculated for all pathway.
Tutorial 3 – Querying a Database
Overview Gene Ontology Introduction Biological network data
Navya Thum February 06, 2013 Day 5: MICROSOFT EXCEL Navya Thum February 06, 2013.
Building Worksheet Charts
Exploring and Understanding ChIP-Seq data
Visualising and Exploring BS-Seq Data
Volume 9, Issue 3, Pages (September 2017)
Graphs with SPSS.
CHAPTER 1 Exploring Data
Directions: On each slide you will find fish of different colors, sizes, and numbers. You will also find bubbles for grouping the fish. On some slides.
Volume 23, Issue 4, Pages (April 2018)
MAPPFinder and You: An Introductory Presentation
Volume 149, Issue 7, Pages (June 2012)
Categorical Gene Set Practical Results
ChEA, ChIP Enrichment Analysis.
Volume 37, Issue 6, Pages (December 2012)
Pathway Visualization
Functional networks of proteins associated with SL3a, SL3abc, SL123, and D3 transcripts. Functional networks of proteins associated with SL3a, SL3abc,
Exploring Microsoft® Office 2016 Series Editor Mary Anne Poatsy
David Quigley  Journal of Investigative Dermatology 
Word Masterclass By Dan Carline.
Recursive Relationship
Global analysis of the chemical–genetic interaction map.
Presentation transcript:

Exploring and Presenting Results Simon Andrews, Laura Biggins simon.andrews@babraham.ac.uk laura.biggins@babraham.ac.uk v2019-02

Functional enrichment results Gene set information Gene set name Gene set source Gene set description Statistical information Raw p-value Corrected p-value Enrichment value Count information Hit genes in category Hit genes outside category Background genes in category Background genes outside category

Functional enrichment results Gene set information Gene set name Gene set source Gene set description Statistical information Raw p-value Corrected p-value Enrichment value Count information Hit genes in category Hit genes outside category Background genes in category Background genes outside category

Presenting results Tables are a natural way to present the key data .

Graphical Representations Need to add something over a table Relationships between multiple result values Representation of redundancy between categories Relationship to original data Context of surrounding pathway

Plotting relationships between values P-value (y) Enrichment (x) Count (radius) Redundancy (colour) http://wencke.github.io/

Redundancy in gene lists Gene ontology is hierarchical - a gene is placed in the most specific category and will also appear in all the parent categories Root ontology terms 1 2 3 general specific Parent Child Parent child redundancy – a gene is placed in the most specific category and will also appear in all the parent categories.

Redundancy in gene lists Most genes are associated with multiple GO terms There are many annotation sources, not just gene ontologies Ontology structure is more complex than just parent-child relationships

Redundancy: g:profiler filtering

Redundancy: DAVID clustering david clusters by overlapping genes

Redundancy: Gorilla GO images Good thing about GOrilla – shows the hierarchy of the GO terms Also a way of presenting results cbl-gorilla.cs.technion.ac.il/

Redundancy: Gorilla GO images But – if lots of info, difficult to interpret. Can see that we don’t just have a neat parent-child relationships cbl-gorilla.cs.technion.ac.il/

Redundancy: Enrichr overlap plots The heatmap - (clustergram as they call it) is quite a nice way of exploring the categories. It has some options to play around with the display The network may not be available for all gene-set libraries. Each node represents a term and a link between two nodes means that the two terms have some gene content similarity. Initially, the network is force directed, but if you drag the node to a fixed position, it will stay there. Like the other SVG figures, the network can be exported in three image formats. The grid may not be available for all gene-set libraries. Each grid square represents a term and is arranged based on its gene-set similarity with other terms. It shows only the top 10 terms sorted by combined score. The brighter the square, the more significant that term is. Clicking on the grid allows you to toggle to an alternate view that colors the grid based on its correlation score with neighbors with white dots representing the significant terms. The z-score and p-value is a measure of how clustered the top 10 terms are on the grid.

Redundancy: REVIGO (from Gorilla) http://revigo.irb.hr/

Redudancy: Giraph (via David) Giraph plot Proximity of circles is related to the proportion of overlapping genes between categories. Size of circle represents the number of genes. https://github.com/laurabiggins/Giraph

Relationship to original data Quantitative values for genes in category Direction and magnitude of change Look at genes in category which aren’t hits Relative numbers Supportive changes?

Relationship to original data

Pathways: Reactome

Pathways: ClueGo App within Cytoscape ClueGO integrates GO terms as well as pathways Creates a functionally organized GO/pathway term network

Summary Tables are often sufficient Figures can add extra information Must include name, enrichment, corrected p-value Other values are useful, but don’t put everything Figures can add extra information Plotting multiple metrics Illustrating redundancy Relating to original data Mapping to pathways