Biological Networks & Network Evolution Eric Xing

Slides:



Advertisements
Similar presentations
Complex Networks Advanced Computer Networks: Part1.
Advertisements

Network analysis Sushmita Roy BMI/CS 576
Scale Free Networks.
Biological Networks Analysis Degree Distribution and Network Motifs
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
Analysis and Modeling of Social Networks Foudalis Ilias.
School of Information University of Michigan Network resilience Lecture 20.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Overview of part 1 Historical perspective
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Synopsis of “Emergence of Scaling in Random Networks”* *Albert-Laszlo Barabasi and Reka Albert, Science, Vol 286, 15 October 1999 Presentation for ENGS.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Biological Networks Feng Luo.
Network Statistics Gesine Reinert. Yeast protein interactions.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Global topological properties of biological networks.
Advanced Topics in Data Mining Special focus: Social Networks.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Error and Attack Tolerance of Complex Networks Albert, Jeong, Barabási (presented by Walfredo)
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Structure, evolution and dynamics of transcriptional regulatory networks M. Madan Babu, PhD National Institutes of Health.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
Bioinformatics Center Institute for Chemical Research Kyoto University
Network resilience.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Network (graph) Models
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Topics In Social Computing (67810)
Biological networks CS 5263 Bioinformatics.
Network biology : protein – protein interactions
Biological Networks Analysis Degree Distribution and Network Motifs
Peer-to-Peer and Social Networks Fall 2017
Modelling Structure and Function in Complex Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Advanced Algorithms and Models for Computational Biology -- a machine learning approach Biological Networks & Network Evolution Eric Xing Lecture 22, April 10, 2006 Reading:

Molecular Networks Nodes – molecules. Links – inteactions / relations. Regulatory networks Interaction networks Expression networks Metabolic networks Nodes – molecules. Links – inteactions / relations.

Other types of networks Disease Spread [Krebs] Electronic Circuit Food Web Internet [Burch & Cheswick] Social Network

Metabolic networks Nodes – metabolites (0.5K). Edges – directed biochemichal reactions (1K). Reflect the cell’s metabolic circuitry. KEGG database: http://www.genome.ad.jp/kegg/kegg2.html

Graph theoretic description of metabolic networks “Graph theoretic description for a simple pathway (catalyzed by Mg2+ -dependant enzymes) is illustrated (a). In the most abstract approach (b) all interacting metabolites are considered equally.” Barabasi & Oltvai. NRG. (2004) 5 101-113

Protein Interaction Networks Nodes – proteins (6K). Edges – interactions (15K). Reflect the cell’s machinery and signlaing pathways.

Experimental approaches Yeast Two-Hybrid Protein coIP

Graphs and Networks Graph: a pair of sets G={V,E} where V is a set of nodes, and E is a set of edges that connect 2 elements of V. Directed, undirected graphs Large, complex networks are ubiquitous in the world: Genetic networks Nervous system Social interactions World Wide Web

Global topological measures Indicate the gross topological structure of the network Degree Path length Clustering coefficient [Barabasi]

Connectivity Measures Node degree: the number of edges incident on the node (number of network neighbors.) Undetected networks Degree distribution P(k): probability that a node has degree k. Directed networks, i.e., transcription regulation networks (TRNs) i Degree of node i = 5 Incoming degree = 2.1 each gene is regulated by ~2 TFs Outgoing degree = 49.8 each TF targets ~50 genes

Characteristic path length Lij is the number of edges in the shortest path between vertices i and j The characteristic path length of a graph is the average of the Lij for every possible pair (i,j) Diameter: maximal distance in the network. Networks with small values of L are said to have the “small world property” In a TRN, Lij represents the number of intermediate TFs until final target i j Starting TF Final target Indicate how immediate a regulatory response is 1 intermediate TF Average path length = 4.7 Path length = 1

Clustering coefficient The clustering coefficient of node i is the ratio of the number Ei of edges that exist among its neighbors, over the number of edges that could exist: CI=2TI/nI(nI-1) The clustering coefficient for the entire network C is the average of all the Ci 4 neighbours 6 possible links 1 existing link Measure how inter-connected the network is Average coefficient = 0.11 Clustering coefficient = 1/6 = 0.17

A Comparison of Global Network Statistics (Barabasi & Oltvai, 2004) A. Random Networks [Erdos and Rényi (1959, 1960)] B. Scale Free [Price,1965 & Barabasi,1999] C.Hierarchial Mean path length ~ ln(k) Phase transition: Connected if: Preferential attachment. Add proportionally to connectedness Mean path length ~ lnln(k) Copy smaller graphs and let them keep their connections.

Local network motifs Regulatory modules within the network SIM MIM FBL FFL [Alon]

SIM = Single input motifs HCM1 First region shows feedback loop of SBF to itself and also Tos 4 to the original SBF and MBP factors ECM22 STB1 SPO1 YPR013C [Alon; Horak, Luscombe et al (2002), Genes & Dev, 16: 3017 ]

MIM = Multiple input motifs SBF MBF First region shows feedback loop of SBF to itself and also Tos 4 to the original SBF and MBP factors SPT21 HCM1 [Alon; Horak, Luscombe et al (2002), Genes & Dev, 16: 3017 ]

FFL = Feed-forward loops SBF Yox1 Pog1 First region shows feedback loop of SBF to itself and also Tos 4 to the original SBF and MBP factors Tos8 Plm2 [Alon; Horak, Luscombe et al (2002), Genes & Dev, 16: 3017 ]

FBL = Feed-back loops MBF SBF Tos4 First region shows feedback loop of SBF to itself and also Tos 4 to the original SBF and MBP factors SBF Tos4 [Alon; Horak, Luscombe et al (2002), Genes & Dev, 16: 3017 ]

What network structure should be used to model a biological network? Strogatz S.H., Nature (2001) 410 268 lattice random

Calculating the degree connectivity of a network 1 2 3 5 4 6 7 8 1 2 3 4 5 6 7 8 degree connectivity frequency Degree connectivity distributions:

Connectivity distributions for metabolic networks A. fulgidus (archaea) E. coli (bacterium) averaged over 43 organisms C. elegans (eukaryote) Jeong et al. Nature (2000) 407 651-654

Protein-protein interaction networks Jeong et al. Nature 411, 41 - 42 (2001) Wagner. RSL (2003) 270 457-466 (color of nodes is explained later)\

Random versus scaled exponential degree distribution Degree connectivity distributions differs between random and observed (metabolic and protein-protein interaction) networks. Strogatz S.H., Nature (2001) 410 268 log frequency log frequency log degree connectivity log degree connectivity

What is so “scale-free” about these networks? No matter which scale is chosen the same distribution of degrees is observed among nodes

Models for networks of complex topology Erdos-Renyi (1960) Watts-Strogatz (1998) Barabasi-Albert (1999)

Random Networks: The Erdős-Rényi [ER] model (1960): N nodes Every pair of nodes is connected with probability p. Mean degree: (N-1)p. Degree distribution is binomial, concentrated around the mean. Average distance (Np>1): log N Important result: many properties in these graphs appear quite suddenly, at a threshold value of PER(N) If PER~c/N with c<1, then almost all vertices belong to isolated trees Cycles of all orders appear at PER ~ 1/N

The Watts-Strogatz [WS] model (1998) Start with a regular network with N vertices Rewire each edge with probability p QUESTION: What happens for intermediate values of p? For p=0 (Regular Networks): high clustering coefficient high characteristic path length For p=1 (Random Networks): low clustering coefficient low characteristic path length

WS model, cont. There is a broad interval of p for which L is small but C remains large Small world networks are common : There is a quick drop of L due to the introduction of few “short cuts”

Scale-free networks: The Barabási-Albert [BA] model (1999) The distribution of degrees: In real network, the probability of finding a highly connected node decreases exponentially with k ER Model ER Model WS Model actors power grid www

BA model, cont. Two problems with the previous models: The BA model: N does not vary the probability that two vertices are connected is uniform The BA model: Evolution: networks expand continuously by the addition of new vertices, and Preferential-attachment (rich get richer): new vertices attach preferentially to sites that are already well connected.

Scale-free network model GROWTH: starting with a small number of vertices m0 at every timestep add a new vertex with m ≤ m0 PREFERENTIAL ATTACHMENT: the probability Π that a new vertex will be connected to vertex i depends on the connectivity of that vertex: Barabasi & Bonabeau Sci. Am. May 2003 60-69 Barabasi and Albert. Science (1999) 286 509-512

Scale Free Networks a) Connectivity distribution with N = m0+t=300000 and m0=m=1(circles), m0=m=3 (squares), and m0=m=5 (diamons) and m0=m=7 (triangles) b) P(k) for m0=m=5 and system size N=100000 (circles), N=150000 (squares) and N=200000 (diamonds) Barabasi and Albert. Science (1999) 286 509-512

Comparing Random Vs. Scale-free Networks Two networks both with 130 nodes and 215 links) The importance of the connected nodes in the scale-free network: 27% of the nodes are reached by the five most connected nodes, in the scale-free network more than 60% are reached. Five nodes with most links First neighbors of red nodes Modified from Albert et al. Science (2000) 406 378-382

Failure and Attack Failure: Removal of a random node. Albert et al. Science (2000) 406 378-382 Failure: Removal of a random node. Attack: The selection and removal of a few nodes that play a vital role in maintaining the network’s connectivity. a macroscopic snapshot of Internet connectivity by K. C. Claffy

Failure and Attack, cont. Random networks are homogeneous so there is no difference between failure and attack Fraction nodes removed from network Diameter of the network Modified from Albert et al. Science (2000) 406 378-382

Failure and Attack, cont. Scale-free networks are robust to failure but susceptible to attack Fraction nodes removed from network Diameter of the network Modified from Albert et al. Science (2000) 406 378-382

The phenotypic effect of removing the corresponding protein: Yeast protein-protein interaction networks Lethal Slow-growth Non-lethal Unknown Jeong et al. Nature 411, 41 - 42 (2001)

Lethality and connectivity are positively correlated Average and standard deviation for the various clusters. Pearson’s linear correlation coefficient = 0.75 Number of links % of essential proteins Jeong et al. Nature 411, 41 - 42 (2001)

Genetic foundation of network evolution Network expansion by gene duplication A gene duplicates Inherits it connections The connections can change Gene duplication slow ~10-9/year Connection evolution fast ~10-6/year Barabasi & Oltvai. NRG. (2004) 5 101-113

The transcriptional regulation network of Escherichia coli. Shai S. Shen-Orr, Ron Milo, Shmoolik Mangan & Uri Alon (2002) Nature Genetics 31 64 - 68

Motifs in the networks Deployed a motif detection algorithm on the transcriptional regulation network. Identified three recurring motifs (significant with respect to random graphs). Shai S. Shen-Orr, Ron Milo, Shmoolik Mangan & Uri Alon (2002) Nature Genetics 31 64 - 68

Convergent evolution of gene circuits Are the components of the feed-forward loop for example homologous? Circuit duplication is rare in the transcription network Conant and Wagner. Nature Genetics (2003) 34 264-266

Acknowledgements Itai Yanai and Doron Lancet Mark Gerstein Roded Sharan Jotun Hein Serafim Batzoglou for some of the slides modified from their lectures or tutorials

Reference Barabási and Albert. Emergence of scaling in random networks. Science 286, 509-512 (1999). Yook et al. Functional and topological characterization of protein interaction networks. Proteomics 4, 928-942 (2004). Jeong et al. The large-scale organization of metabolic networks. Nature 407, 651-654 (2000). Albert et al. Error and attack tolerance in complex networks. Nature 406 , 378 (2000). Barabási and Oltvai, Network Biology: Understanding the Cell's Functional Organization, Nature Reviews, vol 5, 2004