Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.

Slides:



Advertisements
Similar presentations
Network analysis Sushmita Roy BMI/CS 576
Advertisements

Detecting active subnetworks in molecular interaction networks with missing data Luke Hunter Texas A&M University SHURP 2007 Student.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles Authors: Chia-Hao Chin 1,4,
University at BuffaloThe State University of New York Young-Rae Cho Department of Computer Science and Engineering State University of New York at Buffalo.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Biological Networks Feng Luo.
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
Network Statistics Gesine Reinert. Yeast protein interactions.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Gene and Protein Networks II Monday, April CSCI 4830: Algorithms for Molecular Biology Debra Goldberg.
Global topological properties of biological networks.
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Biological networks: Types and origin
Joint analysis of genetic and physical interactions in S. cerevisiae Igor Ulitsky Ron Shamir lab School of Computer Science Tel Aviv University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Exploring PPI networks using Cytoscape
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Biological Pathways & Networks
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Network Biology Presentation by: Ansuman sahoo 10th semester
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Bioinformatics Lab. Centrality and Graph Mining. Bioinformatics Lab. Introduction Many real world systems can be described as networks.  Social relationships:
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
CSCE555 Bioinformatics Lecture 18 Network Biology Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Network resilience.
Introduction to complex networks Part I: Structure
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
Informatics tools in network science
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Biological Network Analysis
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Cohesive Subgraph Computation over Large Graphs
CSCI2950-C Lecture 12 Networks
Biological networks CS 5263 Bioinformatics.
Dynamics and context-specificity in biological networks
Biological Networks Analysis Degree Distribution and Network Motifs
Department of Computer Science University of York
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
SEG5010 Presentation Zhou Lanjun.
Presentation transcript:

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological Sequence Analysis Technical University of Denmark

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Network Example - The Internet

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Co-authorship at Max Planck

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Network Measures Degree k i Degree distribution P(k) Mean path length Network Diameter Clustering Coefficient

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Paths: metabolic, signaling pathways Cliques: protein complexes Hubs: regulatory modules Subgraphs: maximally weighted Network Analysis

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Graphs Graph G=(V,E) is a set of vertices V and edges E A subgraph G’ of G is induced by some V’  V and E’  E Graph properties: –Connectivity (node degree, paths) –Cyclic vs. acyclic –Directed vs. undirected

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Sparse vs Dense G(V, E) where |V|=n, |E|=m the number of vertices and edges Graph is sparse if m~n Graph is dense if m~n 2 Complete graph when m=n 2

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Connected Components G(V,E) |V| = 69 |E| = 71

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Connected Components G(V,E) |V| = 69 |E| = 71 6 connected components

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Paths A path is a sequence {x 1, x 2,…, x n } such that (x 1,x 2 ), (x 2,x 3 ), …, (x n-1,x n ) are edges of the graph. A closed path x n =x 1 on a graph is called a graph cycle or circuit.

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Shortest-Path Between Nodes

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Shortest-Path Between Nodes

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Longest Shortest-Path

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Small-world Network Every node can be reached from every other by a small number of hops or steps High clustering coefficient and low mean-shortest path length –Random graphs don’t necessarily have high clustering coefficients Social networks, the Internet, and biological networks all exhibit small-world network characteristics

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark gene A gene B regulates gene A gene B binds gene A gene B reaction product is a substrate for regulatory interactions (protein-DNA) functional complex B is a substrate of A (protein-protein) metabolic pathways Network Representation

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Representation of Metabolic Reactions

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Network Measures: Degree

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark P(k) is probability of each degree k, i.e fraction of nodes having that degree. For random networks, P(k) is normally distributed. For real networks the distribution is often a power- law: P(k) ~ k  Such networks are said to be scale-free Degree Distribution

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Interconnected Regions: Modules

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark k: neighbors of I n I : edges between node I’s neighbors The density of the network surrounding node I, characterized as the number of triangles through I. Related to network modularity The center node has 8 (grey) neighbors There are 4 edges between the neighbors C = 2*4 /(8*(8-1)) = 8/56 = 1/7 Clustering Coefficient

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Hierarchical Networks

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Detecting Hierarchical Organization

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Knock-out Lethality and Connectivity

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Target the hubs to have an efficient safe sex education campaign Lewin Bo, et al., Sex i Sverige; Om sexuallivet i Sverige 1996, Folkhälsoinstitutet, 1998

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Scale-Free Networks are Robust Complex systems (cell, internet, social networks), are resilient to component failure Network topology plays an important role in this robustness –Even if ~80% of nodes fail, the remaining ~20% still maintain network connectivity Attack vulnerability if hubs are selectively targeted In yeast, only ~20% of proteins are lethal when deleted, and are 5 times more likely to have degree k>15 than k<5.

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Other Interesting Features Cellular networks are assortative, hubs tend not to interact directly with other hubs. Hubs tend to be “older” proteins (so far claimed for protein- protein interaction networks only) Hubs also seem to have more evolutionary pressure—their protein sequences are more conserved than average between species (shown in yeast vs. worm) Experimentally determined protein complexes tend to contain solely essential or non-essential proteins—further evidence for modularity.

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Summary: Network Measures Degree k i The number of edges involving node i Degree distribution P(k) The probability (frequency) of nodes of degree k Mean path length The avg. shortest path between all node pairs Network Diameter –i.e. the longest shortest path Clustering Coefficient –A high CC is found for modules

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Finding Overrepresented Motifs

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Metabolic and Transcription Factor Networks

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Overrepresented Motifs

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Identifying protein complexes in protein- protein interaction networks

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Identifying protein complexes from PPI data Barabasi & Oltvai, Nature Reviews, 2004 Identifying protein complexes from protein-protein interaction data require computational tools.

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark The MCODE algorithm The three steps of MCODE 1.Vertex weighting 2.Complex prediction 3.Post-processing Molecular Complex Detection MCODE

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Vertex (nodes) weighting Vertex weighting 1.Find neighbors

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Vertex (nodes) weighting Vertex weighting 1.Find neighbors 2.Get highest k-core graph K-core graph: A graph of minimal degree k, i.e. All nodes must have at least k connections

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Vertex (nodes) weighting Vertex weighting 1.Find neighbors 2.Get highest k-core graph 3.Calculate density of k- core graph Density: Number of observed edges, E, divided by the total number of possible edges, E max E max = V (V-1)/2 (networks without loops)

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Vertex (nodes) weighting Vertex weighting 1.Find neighbors 2.Get highest k-core graph 3.Calculate density of k- core graph 4.Calculate vertex (node) weight: Density * k max Density: Number of observed edges, E, divided by the total number of possible edges, E max E max = V (V-1)/2 (networks without loops)

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Molecular complex prediction Complex prediction 1.Seed complex by nodes with highest weight 2.Include neighbors if the vertex weight is above threshold (VWP) 3.Repeat step 2 until no more nodes can be included

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Post-processing Complex post-processing 1.Complexes must contain at least a 2- core graph 2.Include neighbors if the vertex weight is above the fluff parameter (optional) 3.Haircut: Remove nodes with a degree less than two (optional)

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Identifying active subgraphs

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Ideker T, Ozier O, Schwikowski B, Siegel AF. Discovering regulatory and signaling circuits in molecular interaction networks. Bioinformatics. 2002;18 Suppl 1:S Active Subgraphs Find high scoring subnetwork based on data integration

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Scoring a Sub-graph Ideker T, Ozier O, Schwikowski B, Siegel AF. Discovering regulatory and signaling circuits in molecular interaction networks. Bioinformatics. 2002;18 Suppl 1:S

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Significance Assessment of Active Module Ideker T, Ozier O, Schwikowski B, Siegel AF. Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics. 2002;18 Suppl 1:S Score distributions for the 1st - 5th best scoring modules before (blue) and after (red) randomizing Z- scores (“states”). Randomization disrupts correlation between gene expression and network location.

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Finding “Active” Pathways in a Large Network is Hard Finding the highest scoring subnetwork is NP hard, so we use heuristic search algorithms to identify a collection of high-scoring subnetworks (local optima) Simulated annealing and/or greedy search starting from an initial subnetwork “seed” Considerations: Local topology, sub-network score significance (is score higher than would be expected at random?), multiple states (conditions)

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Summary Network measures –degree, network diameter, degree distributions, clustering coefficient Network modularity and robustness from hubs Analyzing networks –Finding motifs, identifying modules (complexes) Data integration –Finding active subnetworks

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark /3 1/ /21 3