Canadian Bioinformatics Workshops www.bioinformatics.ca.

Slides:



Advertisements
Similar presentations
Detecting active subnetworks in molecular interaction networks with missing data Luke Hunter Texas A&M University SHURP 2007 Student.
Advertisements

Pathways analysis Iowa State Workshop 11 June 2009.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
BioPathways SIG, July Networks in Biology Molecular interaction and similarity networks are vital for understanding gene function.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Signaling pathways –SPIKE Network visualization.
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Exploring PPI networks using Cytoscape
Ch10. Intermolecular Interactions and Biological Pathways
Cytoscape A powerful bioinformatic tool Mathieu Michaud
1Module #: Title of Module. Network visualization and analysis with Cytoscape Gary Bader July 15, 2013 – Network Analysis, UCLA.
Tutorial session 1 Network generation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Hyun Seok Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute, Yonsei University College of Medicine Lecture 13. Network Analysis MES
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Copyright OpenHelix. No use or reproduction without express written consent1.
Networks and Interactions Boo Virk v1.0.
Creating Metabolic Network Models using Text Mining and Expert Knowledge J.A. Dickerson, D. Berleant, Z. Cox, W. Qi, and E. Wurtele Iowa State University.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Tutorial session 2 Network annotation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Visualization and analysis of microarray and gene ontology data with treemaps Eric H Baehrecke, Niem Dang, Ketan Babaria and Ben Shneiderman Presenter:
CellFateScout step- by-step tutorial for a case study Version 0.94.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Copyright OpenHelix. No use or reproduction without express written consent1.
BIological NetwOrk Manager Cytoscape plugin Andrei Zinovyev Institut Curie/INSERM/Ecole de Mines, UMR 900 “Computational Systems Biology of Cancer”
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
Tutorial session 1 Network generation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
W e present BiNoM [1,2], a Cytoscape plugin, developed to facilitate the manipulation of biological networks represented in standard systems biology formats.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Copyright OpenHelix. No use or reproduction without express written consent1.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
Canadian Bioinformatics Workshops
Human Genetics Integrative Bioinformatics using Cytoscape (and R2)
Goal: Help users learn to use Cytoscape to answer biologically relevant questions by providing information, tutorials and linking new.
July 18, Cytoscape User Tutorial John “Scooter” Morris, Ph.D. Resource for Biocomputing, Visualization, and Informatics, UCSF.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
a Cytoscape plugin to assess enrichment of
CSCI2950-C Lecture 12 Networks
Canadian Bioinformatics Workshops
Pathway Visualization
System Structures Identification
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Network biology An introduction to STRING and Cytoscape
Pathway Visualization
Presentation transcript:

Canadian Bioinformatics Workshops

2Module #: Title of Module

Module 4 Network Visualization and Analysis

Module 4: Network Visualization and Analysis bioinformatics.ca Network Analysis Workflow Load Networks e.g. PPI data – Import network data into Cytoscape Load Attributes e.g. gene expression data – Get data about networks into Cytoscape Analyze and Visualize Networks Prepare for Publication A specific example of this workflow: Cline, et al. “Integration of biological networks and gene expression data using Cytoscape”, Nature Protocols, 2, (2007).

Module 4: Network Visualization and Analysis bioinformatics.ca Network Visualization and Analysis Outline Network introduction Network visualization Cytoscape software tool for network visualization and analysis Network analysis

Module 4: Network Visualization and Analysis bioinformatics.ca Networks Represent relationships – Physical, regulatory, genetic, functional interactions Useful for discovering relationships in large data sets – Better than tables in Excel Visualize multiple data types together – See interesting patterns

Module 4: Network Visualization and Analysis bioinformatics.ca Network Representations

Biological Pathways/Networks?

Module 4: Network Visualization and Analysis bioinformatics.ca Mapping Biology to a Network A simple mapping – one compound/node, one interaction/edge A more realistic mapping – Cell localization, cell cycle, cell type, taxonomy – Only represent physiologically relevant interaction networks Edges can represent other relationships Critical: understand what nodes and edges mean

Module 4: Network Visualization and Analysis bioinformatics.ca Protein Sequence Similarity Network

Module 4: Network Visualization and Analysis bioinformatics.ca Six Degrees of Separation Everyone in the world is connected by at most six links Which path should we take? Shortest path by breadth first search – If two nodes are connected, will find the shortest path between them Are two proteins connected? If so, how? Biologically relevant?

humangenetics-amc.nl Gene Function Prediction – shows connections to sets of genes/proteins involved in same biological process Detection of protein complexes/other modular structures – discover modularity & higher order organization (motifs, feedback loops) Network evolution – biological process(es) conservation across species Prediction of new interactions and functional associations – Statistically significant domain- domain correlations in protein interaction network to predict protein-protein or genetic interaction jActiveModules, UCSD PathBlast, UCSD MCODE, University of Toronto DomainGraph, Max Planck Institute Applications of Network Biology

humangenetics-amc.nl Identification of disease subnetworks – identification of disease network subnetworks that are transcriptionally active in disease. Subnetwork-based diagnosis – source of biomarkers for disease classification, identify interconnected genes whose aggregate expression levels are predictive of disease state Subnetwork-based gene association – map common pathway mechanisms affected by collection of genotypes June 2009 Agilent Literature Search Mondrian, MSKCC PinnacleZ, UCSD Applications of Network Informatics in Disease

Module 4: Network Visualization and Analysis bioinformatics.ca What’s Missing? Dynamics –Pathways/networks represented as static processes Difficult to represent a calcium wave or a feedback loop –More detailed mathematical representations exist that handle these e.g. Stoichiometric modeling, Kinetic modeling (VirtualCell, E-cell, …) Need to accumulate or estimate comprehensive kinetic information Detail – atomic structures Context – cell type, developmental stage

Module 4: Network Visualization and Analysis bioinformatics.ca What Have We Learned? Networks are useful for seeing relationships in large data sets Important to understand what the nodes and edges mean Important to define the biological question - know what you want to do with your gene list or network Many methods available for gene list and network analysis – Good to determine your question and search for a solution – Or get to know many methods and see how they can be applied to your data

Module 4: Network Visualization and Analysis bioinformatics.ca Network Visualization Outline Automatic network layout Visual features Visually interpreting a network

Module 4: Network Visualization and Analysis bioinformatics.ca Automatic network layout

Module 4: Network Visualization and Analysis bioinformatics.ca Automatic network layout Force-directed: nodes repel and edges pull Good for up to 500 nodes – Bigger networks give hairballs - Reduce number of edges Advice: try force directed first, or hierarchical for tree-like networks Tips for better looking networks – Manually adjust layout – Load network into a drawing program (e.g. Illustrator) and adjust labels

Focus Overview Zoom PKC Cell Wall Integrity

Module 4: Network Visualization and Analysis bioinformatics.ca Visual Features Node and edge attributes – String, integer, float, Boolean, list – E.g. represent gene, interaction attributes Visual attributes – Node, edge visual properties – Colour, shape, size, borders, opacity...

Module 4: Network Visualization and Analysis bioinformatics.ca Visually Interpreting a Network Guilt-by-association Dense clusters Global relationships

Module 4: Network Visualization and Analysis bioinformatics.ca What Have We Learned? Automatic layout is required to visualize networks Networks help you visualize interesting relationships in your data Avoid hairballs by focusing analysis Visual attributes enable multiple types of data to be shown at once – useful to see their relationships

Module 4: Network Visualization and Analysis bioinformatics.ca Network Visualization and Analysis using Cytoscape Network visualization and analysis using Cytoscape software Cytoscape basics Cytoscape network analysis examples

Network visualization and analysis UCSD, ISB, Agilent, MSKCC, Pasteur, UCSF, Unilever, UToronto, U Texas Pathway comparison Literature mining Gene Ontology analysis Active modules Complex detection Network motif search

Network Analysis using Cytoscape Databases Literature Expert knowledge Experimental Data Find biological processes underlying a phenotype Network Information Network Analysis

Manipulate NetworksFilter/Query Automatic Layout Interaction Database Search

Module 4: Network Visualization and Analysis bioinformatics.ca Active Community Help – Tutorials, case studies – Mailing lists for discussion – Documentation, data sets Annual Conference: San Diego, May 18-21, ,000s users, 2500 downloads/month >100 Plugins Extend Functionality – Build your own, requires programming Cline MS et al. Integration of biological networks and gene expression data using Cytoscape Nat Protoc. 2007;2(10):

Module 4: Network Visualization and Analysis bioinformatics.ca What Have We Learned? Cytoscape is a useful, free software tool for network visualization and analysis Provides basic network manipulation features Plugins are available to extend the functionality

Cytoscape Demo Version

Desktop Canvas Network overview Network manager Attribute browser CytoPanels FYI

yFiles Organic FYI

yFiles Circular FYI

Module 4: Network Visualization and Analysis bioinformatics.ca Network Layout 15 algorithms available through plugins Demo: Move, zoom/pan, rotate, scale, align FYI

Create Subnetwork FYI

Create Subnetwork FYI

Module 4: Network Visualization and Analysis bioinformatics.ca Visual Style Customized views of experimental data in a network context Network has node and edge attributes E.g. expression data, GO function, interaction type Mapped to visual attributes E.g. node/edge size, shape, colour… E.g. Visualize gene expression data as node colour gradient on the network FYI

Visual Style Load “Your Favorite Network” FYI

Visual Style Load “Your Favorite Expression” Dataset FYI

Visual Style Map expression values to node colours using a continuous mapper FYI

Visual Style Expression data mapped to node colours FYI

Module 4: Network Visualization and Analysis bioinformatics.ca Network Filtering FYI

Interaction Database Search FYI

Module 4: Network Visualization and Analysis bioinformatics.ca FYI

Module 4: Network Visualization and Analysis bioinformatics.ca FYI

Module 4: Network Visualization and Analysis bioinformatics.ca We are on a Coffee Break & Networking Session

Module 4: Network Visualization and Analysis bioinformatics.ca VistaClara Visualization for gene expression data Heat maps, sorting, animation

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Lab Cytoscape – expression data visualization – Load the sample network file: galFiltered.sif – Lay it out – try different layouts – Load expression data - galExpData.pvals Use File->Import->Attribute from Table – Examine node attributes – Visualize gene expression data using the Visual Mapper – Install the VistaClara plugin from the plugin manager – Play the expression data as a movie

Module 4: Network Visualization and Analysis bioinformatics.ca BiNGO plugin Calculates over-representation of a subset of genes with respect to a background set in a specific GO category Input: subnetwork, or list – Background set by user Output: tree with nodes color reflecting overrepresentation; also as lists Caveats: Gene identifiers must match; low GO term coverage, GO bias, Background determining

Module 4: Network Visualization and Analysis bioinformatics.ca BiNGO Maere, S., Heymans, K. and Kuiper, M Bioinformatics 21, , 2005 Hypergeometric p-value Multiple testing correction (Benjamini-Hochberg FDR)

Module 4: Network Visualization and Analysis bioinformatics.ca Cerebral /

Module 4: Network Visualization and Analysis bioinformatics.ca Find Active Subnetworks Active modules –Input: network + p-values for gene expression values e.g. from GCRMA –Output: significantly differentially expressed subgraphs Method –Calculate z-score/node, Z A score/subgraph, correct vs. random expression data sampling –Score over multiple experimental conditions –Simulated annealing used to find high scoring networks Ideker T, Ozier O, Schwikowski B, Siegel AF Bioinformatics. 2002;18 Suppl 1:S233-40

Module 4: Network Visualization and Analysis bioinformatics.ca Active Module Results Network: yeast protein-protein and protein-DNA network Expression data: 3 gene knock out conditions (enzyme, TF activator, TF repressor) Note: non-deterministic, multiple runs required for confidence of result robustness Ideker T et al. Science May 4;292(5518):

Module 4: Network Visualization and Analysis bioinformatics.ca Network Clustering Clusters in a protein-protein interaction network have been shown to represent protein complexes and parts of pathways Clusters in a protein similarity network represent protein families Network clustering is available through the ClusterMaker Cytoscape plugin Bader & Hogue, BMC Bioinformatics (1):2

Proteasome 26S Proteasome 20S Ribosome RNA Pol core RNA Splicing

Module 4: Network Visualization and Analysis bioinformatics.ca Text Mining Computationally extract gene relationships from text, usually PubMed abstracts Useful if network is not in a database –Literature search tool BUT not perfect –Problems recognizing gene names –Natural language processing is difficult Agilent Literature Search Cytoscape plugin iHOP (

Agilent Literature Search

Cytoscape Network produced by Literature Search. Abstract from the scientific literature Sentences for an edge

Module 4: Network Visualization and Analysis bioinformatics.ca Analysis Lab Find Network Motifs - Netmatch plugin Network motif is a sub-network that occurs significantly more often than by chance alone Input: query and target networks, optional node/edge labels Output: topological query matches as subgraphs of target network Supports: subgraph matching, node/edge labels, label wildcards, approximate paths

Finding specific biological relevant TF-PPI sub-networks QueryResults Ferro et al. Bioinformatics 2007

Find Signaling Pathways Potential signaling pathways from plasma membrane to nucleus via cytoplasm Raf-1 Mek MAPK TFs Nucleus - Growth Control Mitogenesis MAP Kinase Cascade Ras NetMatch query Shortest path between subgraph matches Signaling pathway example NetMatch Results

Find Expressed Motifs ProteinDifferential Expression Significance YLR075W1.7255E-4 YGR085C2.639E-4 YPR102C3.7183E-4 NetMatch query NetMatch Results Find specific subgraphs where certain nodes are significantly differentially expressed

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Tips & Tricks “Root graph” – “There is one graph to rule them all….” – The networks in Cytoscape are all “views” on a single graph. – Changing the attribute for a node in one network will also change that attribute for a node with the same ID in all other loaded networks – There is no way to “copy” a node and keep the same ID – Make a copy of the session

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Tips & Tricks Network views – When you open a large network, you will not get a view by default – To improve interactive performance, Cytoscape has the concept of “Levels of Detail” Some visual attributes will only be apparent when you zoom in The level of detail for various attributes can be changed in the preferences To see what things will look like at full detail: – View  Show Graphics Details

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Tips & Tricks Sessions – Sessions save pretty much everything: Networks Properties Visual styles Screen sizes – Saving a session on a large screen may require some resizing when opened on your laptop

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Tips & Tricks Logging – By default, Cytoscape writes it’s logs to the Error Dialog: Help  Error Dialog – Can change a preference to write it to the console Edit  Preferences  Properties… Set logger.console to true Don’t forget to save your preferences Restart Cytoscape – (can also turn on debugging: cytoscape.debug, but I don’t recommend it)

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Tips & Tricks Memory – Cytoscape uses lots of it – Doesn’t like to let go of it – An occasional restart when working with large networks is a good thing – Destroy views when you don’t need them – Java doesn’t give us a good way to get the memory right at start time Since version 2.7, Cytoscape does a much better job at “guessing” good default memory sizes than previous versions

Module 4: Network Visualization and Analysis bioinformatics.ca Cytoscape Tips & Tricks.cytoscape directory – Your defaults and any plugins downloaded from the plugin manager will go here – Sometimes, if things get really messed up, deleting (or renaming) this directory can give you a “clean slate” Plugin manager – “Outdated” doesn’t necessarily mean “won’t work” – Plugin authors don’t always update their plugins immediately after new releases