Gene and Protein Networks I Wednesday, April 11 2006 CSCI 4830: Algorithms for Molecular Biology Debra Goldberg.

Slides:



Advertisements
Similar presentations
Complex Networks: Complex Networks: Structures and Dynamics Changsong Zhou AGNLD, Institute für Physik Universität Potsdam.
Advertisements

Complex Networks Advanced Computer Networks: Part1.
Collective Dynamics of ‘Small World’ Networks C+ Elegans: Ilhan Savut, Spencer Telford, Melody Lim 29/10/13.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Advanced Topics in Data Mining Special focus: Social Networks.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
University at BuffaloThe State University of New York Young-Rae Cho Department of Computer Science and Engineering State University of New York at Buffalo.
Biological Networks Feng Luo.
Network Design IS250 Spring 2010 John Chuang. 2 Questions  What does the Internet look like? -Why do we care?  Are there any structural invariants?
Network Statistics Gesine Reinert. Yeast protein interactions.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Gene and Protein Networks Monday, April CSCI : Computational Genomics Debra Goldberg
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Gene and Protein Networks II Monday, April CSCI 4830: Algorithms for Molecular Biology Debra Goldberg.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Statistical Properties of Massive Graphs (Networks) Networks and Measurements.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Stefano Boccaletti Complex networks in science and society *Istituto Nazionale di Ottica Applicata - Largo E. Fermi, Florence, ITALY *CNR-Istituto.
V 1 – Introduction Mon, Oct 15, 2012 Bioinformatics 3 — Volkhard Helms.
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Biological Networks & Network Evolution Eric Xing
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Complex Network Theory – An Introduction Niloy Ganguly.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Complex Network Theory – An Introduction Niloy Ganguly.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Class 19: Degree Correlations PartII Assortativity and hierarchy
Network resilience.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Lecture 23: Structure of Networks
Structures of Networks
Biological networks CS 5263 Bioinformatics.
Network biology : protein – protein interactions
Assessing Hierarchical Modularity in Protein Interaction Networks
Lecture 23: Structure of Networks
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Lecture 23: Structure of Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Gene and Protein Networks I Wednesday, April CSCI 4830: Algorithms for Molecular Biology Debra Goldberg

Outline 1. Introduction 2. Network models 3. Implications from topology 4. Confidence assessment, edge prediction

Outline 1. Introduction 2. Network models 3. Implications from topology 4. Confidence assessment, edge prediction

What is a network? A collection of objects (nodes, vertices) Binary relationships (edges) May be directed Also called a graph

Networks are everywhere

Social networks from Nodes: People Edges: Friendship

Sexual networks Nodes: People Edges: Romantic and sexual relations

Transportation networks Nodes: Locations Edges: Roads

Power grids Nodes: Power station Edges: High voltage transmission line

Airline routes Nodes: Airports Edges: Flights

Internet Nodes: MBone Routers Edges: Physical connection

World-Wide-Web Nodes: Web documents Edges: Hyperlinks

Gene and protein networks

Metabolic networks Nodes: Metabolites Edges: Biochemical reaction (enzyme) from web.indstate.edu

Protein interaction networks Gene function predicted from Nodes: Proteins Edges: Observed interaction

Gene regulatory networks Inferred from error-prone gene expression data from Wyrick et al Nodes: Genes or gene products Edges: Regulation of expression

Signaling networks Nodes: Molecules ( e.g., Proteins or Neurotransmitters) Edges: Activation or Deactivation from pharyngula.org

Signaling networks Nodes: Molecules (e.g., Proteins or Neurotransmitters) Edges: Activation or Deactivation from

Synthetic sick or lethal (SSL) Cells live (wild type) Cells live Cells die or grow slowly X Y X Y X Y X Y

SSL networks Gene function, drug targets predicted Nodes: Nonessential genes Edges: Genes co-lethal from Tong et al X Y

Other biological networks Coexpression –Nodes: genes –Edges: transcribed at same times, conditions Gene knockout / knockdown –Nodes: genes –Edges: similar phenotype (defects) when suppressed

What they really look like…

We need models!

Outline 1. Introduction 2. Network models 3. Implications from topology 4. Confidence assessment, edge prediction

Traditional graph modeling RandomRegular from GD2002

Introduce small-world networks

Small-world Networks Six degrees of separation 100 – 1000 friends each Six steps: But… We live in communities

Small-world measures Typical separation between two vertices –Measured by characteristic path length Cliquishness of a typical neighborhood –Measured by clustering coefficient v C v = 1.00 v C v = 0.33

Watts-Strogatz small-world model

Measures of the W-S model Path length drops faster than cliquishness Wide range of p has both small-world properties

Small-world measures of various graph types Cliquishness Characteristic Path Length Regular graph HighLong Random graph LowShort Small-world graph HighShort

Another network property: Degree distribution P (k) The degree (notation: k ) of a node is the number of its neighbors The degree distribution is a histogram showing the frequency of nodes having each degree

Degree distribution of E-R random networks Binomial degree distribution, well-approximated by a Poisson Degree = k P( k) Erdös-Rényi random graphs Network figures from Strogatz, Nature 2001

Degree distribution of many real-world networks Scale-free networks Degree distribution follows a power law P (k = x) =  x -  Degree = k P( k) log k log P( k)

Model for scale-free networks Growth and preferential attachment –New node has edge to existing node v with probability proportional to degree of v –Biologically plausible?

Another scale-free network model Duplication and divergence –New nodes are copies of existing nodes –Same neighbors, then some gain/loss Solé, Pastor-Satorras, et al. (2002)

Other degree distributions Amaral, Scala, et al., PNAS (2000)

Hierarchical Networks Ravasz, et al., Science 2002

3. Scaling clustering coefficient (DGM) 2. Clustering coefficient independent of N Properties of hierarchical networks 1. Scale-free

C of 43 metabolic networks Independent of N Ravasz, et al., Science 2002

Scaling of the clustering coefficient C(k) Metabolic networks Ravasz, et al., Science 2002

Summary of network models Random Poisson degree distribution Small world high CC, short pathlengths Scale-free power law degree distribution Hierarchical high CC, modular, power law degree distribution

Many real-world networks are small-world, scale-free World-wide-web Collaboration of film actors (Kevin Bacon) Mathematical collaborations (Erdös number) Power grid of US Syntactic networks of English Neural network of C. elegans Metabolic networks Protein-protein interaction networks

There is information in a gene’s position in the network We can use this to predict Relationships –Interactions –Regulatory relationships Protein function –Process –Complex / “molecular machine”

Outline 1. Introduction 2. Network models 3. Implications from topology 4. Confidence assessment, edge prediction

SSL “hubs” might be good cancer drug targets (Tong et al, Science, 2004) Normal cell Cancer cells w/ random mutations Alive Dead

Lethality Hubs are more likely to be essential Jeong, et al., Nature 2001

Degree anti-correlation Few edges directly between hubs Edges between hubs and low-degree genes are favored Maslov and Sneppen, Science 2002

Outline 1. Introduction 2. Network models 3. Implications from topology 4. Confidence assessment, edge prediction

Confidence assessment Traditionally, biological networks determined individually –High confidence –Slow New methods look at entire organism –Lower confidence (  50% false positives) Inferences made based on this data

Confidence assessment Can use topology to assess confidence if true edges and false edges have different network properties Assess how well each edge fits topology of true network Can also predict unknown relations Goldberg and Roth, PNAS 2003

Use clustering coefficient, a local property Number of triangles = | N(v)  N(w) | Normalization factor? N(x) = the neighborhood of node x y x v w v w...

Mutual clustering coefficient Jaccard Index:Meet / Min:Geometric: |N(v)  N(w)| |N(v)  N(w)| |N(v)  N(w)| |N(v)| · |N(w)| |N(v)  N(w)| min ( |N(v)|, |N(w)| ) Hypergeometric: a p-value

Mutual clustering coefficient Hypergeometric: P (intersection at least as large by chance) -log = neighbors of node v = neighbors of node w = nodes in graph

Prediction A v-w edge would have a high clustering coefficient v w

Interaction generality Confidence measure for edge based on topology around neighbors. Saito, Suzuki, and Hayashizaki 2002,2003

Confidence assessment Integrate experimental details with local topology –Degree –Clustering coefficient –Degree of neighbors –Etc. Used logistic regression Bader, et al., Nature Biotechnology 2003

The synthetic lethal network has many triangles Xiaofeng Xin, Boone Lab

2-hop predictors for SSL SSL – SSL (S-S) Homology – SSL (H-S) Co-expressed – SSL (X-S) Physical interaction – SSL (P-S) 2 physical interactions (P-P) v w S:Synthetic sickness or lethality (SSL) H:Sequence homology X:Correlated expression P:Stable physical interaction Wong, et al., PNAS 2004

Multi-color motifs S:Synthetic sickness or lethality H:Sequence homology X:Correlated expression P:Stable physical interaction R:Transcriptional regulation Zhang, et al., Journal of Biology 2005