Identifying functional subnetworks in large-scale datasets Benno Schwikowski Institut Pasteur – Systems Biology Group

Slides:



Advertisements
Similar presentations
BiNoM, a Cytoscape plugin for accessing and analyzing pathways using standard systems biology formats Eric Bonnet Computational Systems Biology of Cancer.
Advertisements

Detecting active subnetworks in molecular interaction networks with missing data Luke Hunter Texas A&M University SHURP 2007 Student.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Pathways analysis Iowa State Workshop 11 June 2009.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
BioPathways SIG, July Networks in Biology Molecular interaction and similarity networks are vital for understanding gene function.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
ISMB, June From protein sequences…to protein networks Database DNA and protein sequences Query Sequence GACTGCATTAC Family of homologous genes.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Genetics: From Genes to Genomes
ISMB, June From protein sequences…to protein networks Database DNA and protein sequences Query Sequence GACTGCATTAC Family of homologous genes.
Discussion summary Cytoscape introduction Thomas Skøt Jensen Center for Biological Sequence Analysis The Technical University of Denmark.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Epistasis Analysis Using Microarrays Chris Workman.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
Exploring PPI networks using Cytoscape
Ch10. Intermolecular Interactions and Biological Pathways
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
1Module #: Title of Module. Network visualization and analysis with Cytoscape Gary Bader July 15, 2013 – Network Analysis, UCLA.
EnrichNet: network-based gene set enrichment analysis Presenter: Lu Liu.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Hyun Seok Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute, Yonsei University College of Medicine Lecture 13. Network Analysis MES
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Networks and Interactions Boo Virk v1.0.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Copyright OpenHelix. No use or reproduction without express written consent1.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
EB3233 Bioinformatics Introduction to Bioinformatics.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
A collaborative tool for sequence annotation. Contact:
Introduction to biological molecular networks
Exploring and Exploiting the Biological Maze Zoé Lacroix Arizona State University.
GO based data analysis Iowa State Workshop 11 June 2009.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Human Genetics Integrative Bioinformatics using Cytoscape (and R2)
Goal: Help users learn to use Cytoscape to answer biologically relevant questions by providing information, tutorials and linking new.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
a Cytoscape plugin to assess enrichment of
Networks and Interactions
Canadian Bioinformatics Workshops
Pathway Visualization
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Nicole Anguiano and Anu Varshneya
Schedule for the Afternoon
Volume 58, Issue 4, Pages (May 2015)
Pathway Visualization
Presentation transcript:

Identifying functional subnetworks in large-scale datasets Benno Schwikowski Institut Pasteur – Systems Biology Group

Benno Schwikowski The three levels of this talk 1.Discovery of pathways active in HepC infection 2.Cytoscape plug-ins 3.Cytoscape platform

Benno Schwikowski Hepatitis C infection One person out of 30 is infected No vaccine exists In 20% of chronic infections, liver fibrosis and cirrhosis Frequently requires liver transplants

Benno Schwikowski Studying HepC infection mRNA changes 50% of transplant livers become re-infected with Hepatitis C Study expression of 7000 genes in re-infected livers after transplantation –1-24 month post-transplant –Samples in 3-6 month intervals 28 biopsies from 11 patients –Mixture of hepatocytes, hepatic stellate cell, Kupffer cells, various types of blood cells Compare against pre-transplant reference pool

Benno Schwikowski Result of mRNA expression analysis Most genes (5968 of 7000) were significantly under- or overexpressed in one or more experiments High patient-to-patient variation

Benno Schwikowski Our approach 1.Construct seed network among known molecular players 2.Expand seed network to include differentially expressed genes 3.Identify putative pathways by the Active Modules approach

Seed network Protein-protein Protein-DNA Phosphorylation Activation Repression Covalent bond Methylation Types of interactions

Benno Schwikowski InteractionFetcher plug-in Purpose Dynamically retrieves remote information for selected nodes –From SQL database –Requests data via XML-RPC protocol Currently implemented types Protein/gene synonyms Orthologs Sequences (DNA, protein, DNA upstream) –Gene, protein, Interactions/associations Options Cross-species queries Ortholog information from Homologene Inferred interactions (interologs) Interactive links to Source Web pages 100% open-source (client and server)

Benno Schwikowski 2. Expand seed network Purpose Bring significantly up-/downregulated genes “into the picture” Approach Add interactions with differentially expressed genes (“in silico pull-down”) –Use BIND, HPRD databases –Only human-curated interactions

Network after InteractionFetcher expansion

Benno Schwikowski Identifying putative pathways Why clustering can be problematic Many clustering methods are not model-based  significance of clusters is unclear Any given cluster may not be supported by all experiments – noise problem Clusters tend to contain unrelated genes with vaguely similar profiles

Benno Schwikowski The three levels of this talk 1.Discovery of pathways active in HepC infection 2.Cytoscape plug-ins 3.Cytoscape platform

Benno Schwikowski How can the clustering issues be addressed? The ActiveModules Plug-in Define “up-/downregulated” on the basis of a well-defined statistical model Also derive clusters from some of the input experiments Use additional evidence to focus on “plausible” clusters  protein interactions

Benno Schwikowski Interaction networks Schwikowski, Uetz, Fields Nature Biotechnology (2000)

Benno Schwikowski Modular organization of interaction networks

Benno Schwikowski A lot of interaction data is becoming available Databases on... Protein-protein interactions Protein-DNA interactions Genetic interactions Metabolic pathways Cell signaling pathways, similarity relationships, literature-based relationships

Benno Schwikowski Multi-criteria detection of modules Experiments Genes  2. Differential Gene/Protein Abundances/Activities 1. Interaction network between genes/proteins

Perturbations /conditions Rank adjustment: Binomial summation P z = 1-  (z A(j) ) r A(j) =  -1 (1-p A(j) ) m = total number of conditions j = size of subset of conditions Final Score Ideker, Ozier, Schwikowski, Siegel (2002): Bioinformatics 18. S Scoring a module candidate

Benno Schwikowski Pathways in Rosetta’s compendium (300 conditions)

Benno Schwikowski The three levels of this talk 1.Discovery of pathways active in HepC infection 2.Cytoscape plug-ins 3.Cytoscape platform

Benno Schwikowski Active Modules plug-in applied to HCV re-infection data Iterative application results in four significant highly overlapping subnetworks Repeat analysis only retaining “late-active” re-infection experiments –Eliminates pathways activated by transplant operation –Cutoff: 8 months

Which observations can we make locally? Network after InteractionFetcher expansion Bold: Differentially regulated subnetwork Red/Green: Late- active subnetwork

Benno Schwikowski Cytotalk plug-in Overrepresentation analysis using Cytotalk plug-in, R, of overrepresentation of genes in Gene Ontology classes Cytotalk enables interactive communication with –C/C++ programs –Java processes –Python –UNIX shell scripts –R, R scripts Can be run on same machine or any other Internet- connected machine Can function as Cytoscape plug-in 100% open-source

Benno Schwikowski The three levels of this talk 1.Discovery of pathways active in HepC infection 2.Cytoscape plug-ins 3.Cytoscape platform

Benno Schwikowski Some Network Visualization Tools Pajek - Slovenia Osprey - SLRI, Toronto VisANT - BU Biolayout - EBI GraphViz PowerPoint Others Cytoscape (only open-source biology)

Cytoscape

Benno Schwikowski Cytoscape Basic Concepts Objects visualized as nodes Relationships visualized as edges Attributes (name, sequence, source,...) Mapping attributes  drawing customizable through visual mapper

Cytoscape file formats YDR216W pd YIL056W YDR216W pd YKR042W YDR216W pd YGL096W YDR216W pd YDR077W [...] GENEDESCexp0.sigexp1.sigexp0.sigexp1.sig GENE0G GENE1G GENE2G GENE3G [...] Sample interaction file

Benno Schwikowski Display gene & protein expression protein interactions (physical and non-physical) protein classifications Analysis plug-in modules Java: platform independent + web- start 100% open-source Cytoscape

Visual Styles Display gene expression as clear text

Visual Styles Map expression values to node colors using a continuous mapper

Visual Styles Expression data mapped to node colors

Multidimensional attributes Cytoscape, pre-release plug-in Data from Ideker et al., Science (2001)

Layout 16 algorithms available through plug-ins Zooming, hide/show, alignment

yFiles Circular

Benno Schwikowski

Cytoscape Core – Differences to most other approaches Emphasis on data analysis & integration No built-in semantics (added by plug-ins) Very simple concepts Human-readable input formats Extensibility

Benno Schwikowski Cytoscape extensibility Core: 100% open source Java –Plug-in API –Plug-ins are independently licensed “Just need to do the biology” Template code samples Plug-in

Biomodules plug-in Prinz S, Avila-Campillo I, Aldridge C, Srinivasan A, Dimitrov K, Siegel AF, and Galitski T Genome Res :

Benno Schwikowski Cytoscape Plugins Modules in Complex Networks Iliana Avila-Campillo, Tim Galitski Discovering Regulatory and Signaling Circuits in Molecular Interaction Networks Trey Ideker, Owen Ozier, Benno Schwikowski, Andrew Siegel Data Integration in Juvenile Diabetes Research Marta Janer, Paul Shannon A network motif sampler David Reiss, Benno Schwikowski

Benno Schwikowski Cytoscape Core Features Visualize and lay out networks Display network data using visual styles Easily organize multiple networks Bird’s eye view navigation of large networks Supports SIF and GML, molecular profiling formats, node/edge attributes Functional annotation from GO + KEGG Metanode support (hierarchical groupings) Extensible through plugins (20 developed)

Benno Schwikowski Baliga et al. Genome Research June 2004

Benno Schwikowski Collaborators: HCV Institute for Systems Biology, Seattle, WA David Reiss Iliana Avila-Campillo Vesteinn Thorsson Tim Galitski

Benno Schwikowski

Collaborators: Cytoscape ISB Leroy Hood Rowan Christmas Agilent Technologies Unilever PLC Long-term funding from NIH and participating institutions UCSD Trey Ideker Chris Workman Memorial-Sloan Kettering Cancer Center Chris Sander Gary Bader Ethan Cerami Pasteur Melissa Cline Andrea Splendiani Tero Aittokallio

Shannon, P., et al. (2003). Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res 13,

Benno Schwikowski Collaborators: Active Networks Trey Ideker Owen Ozier Andrew Siegel Richard Karp

Benno Schwikowski Levels of Biological Information DNA mRNA Protein Pathways Networks Cells Tissues Organs Individuals Populations Ecologies