V 1 – Introduction Mon, Oct 15, 2012 Bioinformatics 3 — Volkhard Helms.

Slides:



Advertisements
Similar presentations
Complex Networks Advanced Computer Networks: Part1.
Advertisements

22C:19 Discrete Math Graphs Fall 2010 Sukumar Ghosh.
22C:19 Discrete Math Graphs Fall 2014 Sukumar Ghosh.
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Synopsis of “Emergence of Scaling in Random Networks”* *Albert-Laszlo Barabasi and Reka Albert, Science, Vol 286, 15 October 1999 Presentation for ENGS.
Advanced Topics in Data Mining Special focus: Social Networks.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Mining and Searching Massive Graphs (Networks)
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
22C:19 Discrete Math Graphs Spring 2014 Sukumar Ghosh.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Mathematics of Networks (Cont)
Professor Yashar Ganjali Department of Computer Science University of Toronto
Microarrays.
L – Modelling and Simulating Social Systems with MATLAB Lesson 6 – Graphs (Networks) Anders Johansson and Wenjian Yu (with S. Lozano.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Bioinformatics 3 – WS 15/16V 1 – 1 V1 - Introduction A cell is a crowded environment => many different proteins, metabolites, compartments, … On a microscopic.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Models and Algorithms for Complex Networks Introduction and Background Lecture 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Class 2: Graph Theory IST402.
Bioinformatics 3 V 2 – Clusters, Dijkstra, and Graph Layout Mon, Oct 26, 2015.
Course Name: Comparative Genomics Conducted by- Shigehiko kanaya & Md. Altaf-Ul-Amin.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Cohesive Subgraph Computation over Large Graphs
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Topics In Social Computing (67810)
Biological networks CS 5263 Bioinformatics.
Network biology : protein – protein interactions
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Bioinformatics 3 — Volkhard Helms
How Do “Real” Networks Look?
Modelling Structure and Function in Complex Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

V 1 – Introduction Mon, Oct 15, 2012 Bioinformatics 3 — Volkhard Helms

Bioinformatics 3 – WS 12/13V 1 – 2 How Does a Cell Work? A cell is a crowded environment => many different proteins, metabolites, compartments, … On a microscopic level => direct two-body interactions At the macroscopic level => complex behavior Can we understand the behavior from the interactions? Medalia et al, Science 298 (2002) 1209 => Connectivity

Bioinformatics 3 – WS 12/13V 1 – 3 The view of traditional molecular biology Molecular Biology: "One protein — one function" mutation => phenotype Linear one-way dependencies: regulation at the DNA level, proteins follow DNA => RNA => protein => phenotype Structural Biology: "Protein structure determines its function" biochemical conditions => phenotype No feedback, just re-action: genetic information molecular structure biochemical function phenotype=>

Bioinformatics 3 – WS 12/13V 1 – 4 The Network View of Biology Molecular Systems Biology: "It's both + molecular interactions" genetic information molecular structure biochemical function phenotype=> molecular interactions => highly connected network of various interactions, dependencies => study networks

Bioinformatics 3 – WS 12/13V 1 – 5 Example: Proteins in the Cell Cycle From Lichtenberg et al, Science 307 (2005) 724: color coded assignment of proteins in time-dependent complexes during the cell cycle => protein complexes are transient => describe with a time dependent network

Bioinformatics 3 – WS 12/13V 1 – 6 Major Metabolic Pathways static connectivity dynamic response to external conditions different states during the cell cycle

Bioinformatics 3 – WS 12/13V 1 – 7 bahnhoefe/netzplaene/index.html

Bioinformatics 3 – WS 12/13V 1 – 8 Metabolism of E. coli from the KEGG Pathway database

Bioinformatics 3 – WS 12/13V 1 – 9 Königsberg (1736) Can one cross all seven bridges once in one continuous (closed) path??? Images: google maps, wikimedia

Bioinformatics 3 – WS 12/13V 1 – 10 The Königsberg Connections Turn problem into a graph: i) each neighborhood is a node ii) each bridge is a link iii) straighten the layout k = 3 k = 5 k = 3 Continuous path <=> ≤2 nodes with odd degree Closed continuous path <=> only nodes with even degree see also:

Bioinformatics 3 – WS 12/13V 1 – 11 Draw in one stroke:

Bioinformatics 3 – WS 12/13V 1 – 12 Quantify the "Hairy Monsters" Schwikowski, Uetz, Fields, Nature Biotech. 18, 1257 (2001) Network Measures: Degree distribution P(k) => structure of the network Cluster coefficient C(k) => local connectivity Average degree => density of connections Connected components => subgraphs No. of edges, nodes => size of the network

Bioinformatics 3 – WS 12/13V 1 – 13 Lecture – Overview Protein-Protein-Interaction Networks: pairwise connectivity => data from experiments, quality check PPI: static network structure => network measures, clusters, modules, … Gene regulation: cause and response => Boolean networks Metabolic networks: steady state of large networks => FBA, extreme pathways Metabolic networks / signaling networks: dynamics => ODEs, modules, stochastic effects Protein complexes: spatial structure => experiments, spatial fitting, docking Protein association: => interface properties, spatial simulations Systems Biology

Bioinformatics 3 – WS 12/13V 1 – 14 Literature Lecture slides — available before the lecture Textbooks Suggested reading => check our web page => check computer science library

Bioinformatics 3 – WS 12/13V 1 – 15 How to pass this course Schein = you need to pass 3 out of 4 short tests and you need to pass the final exam Short tests:4 tests of 45 min each planned are: first half of lectures V6, V12, V18, V24  average grade is computed from 3 best tests If you have passed 2 tests but failed 1-2 tests, you can select one failed test for an oral re-exam. Final exam: written test of 120 min about assignments requirements for participation: 50% of the points from the assignments one assignment task blackboard 2 short tests passed planned: Fri, Feb. 8, 2013???

Bioinformatics 3 – WS 12/13V 1 – 16 Assignments Tutors: Christian Spaniol, Ruslan Akulenko, Mohamed Hamed, Nadine Schaadt, 10 assignments with 100 points each => one solution for two students (or one) => content: data analysis + interpretation — think! => hand-written or one printable PDF/PS file per => attach the source code of the programs for checking (no suppl. data) => no 100% solutions required!!! Hand in at the following Fri electronically until 13:00 or printed at the start of the lecture. Assignments are part of the course material (not everything is covered in lecture) => present one task at the blackboard Tutorial: Wed, 12:00–14:00, E2 1, room 007

Bioinformatics 3 – WS 12/13V 1 – 17 Some Graph Basics Network Graph Formal definition: A graph G is an ordered pair (V, E) of a set V of vertices and a set E of edges. undirected graphdirected graph If E = V (2) => fully connected graph G = (V, E)

Bioinformatics 3 – WS 12/13V 1 – 18 Graph Basics II Subgraph: G' = (V', E') is a subset of G = (V, E) Weighted graph: Weights assigned to the edges Note: no weights for vertices => why??? Practical question: how to define useful subgraphs?

Bioinformatics 3 – WS 12/13V 1 – 19 Walk the Graph Path = sequence of connected vertices start vertex => internal vertices => end vertex Vertices u and v are connected, if there exists a path from u to v. otherwise: disconnected Two paths are independent (internally vertex-disjoint), if they have no internal vertices in common. How many paths connect the green to the red vertex? How long are the shortest paths? Find the four trails from the green to the red vertex. How many of them are independent? Length of a path = number of vertices || sum of the edge weights Trail = path, in which all edges are distinct

Bioinformatics 3 – WS 12/13V 1 – 20 Local Connectivity: Degree Degree k of a vertex = number of edges at this vertex Directed graph => discern k in and k out Degree distribution P(k) = fraction of nodes with k connections k01234 P(k)03/71/7 2/7 k0123 P(k in )1/75/701/7 P(k out )2/73/71/7

Bioinformatics 3 – WS 12/13V 1 – 21 Basic Types: Random Network Generally: N vertices connected by L edges More specific: distribute the edges randomly between the vertices Maximal number of links between N vertices: => propability p for an edge between two randomly selected nodes: => average degree λ => path lengths grow with log(N) => small world

Bioinformatics 3 – WS 12/13V 1 – 22 Random Network: P(k) Network with N vertices, L edges => Probability for a random link: Probability that random node has links to k other particular nodes: Probability that random node has links to any k other nodes: Limit of large graph: N → oo, p = / N

Bioinformatics 3 – WS 12/13V 1 – 23 Random Network: P(k) Many independently placed edges => Poisson statistics kP(k | λ = 2) e-05 => Small probability for k >> λ

Bioinformatics 3 – WS 12/13V 1 – 24 Basic Types: Scale-Free Growing network a la Barabasi and Albert (1999): start from a small "nucleus" add new node with n links connect new links to existing nodes with probability α k (preferential attachment; β(BA) = 1) => "the rich get richer" Properties: power-law degree distribution: self-similar structure with highly connected hubs (no intrinsic length scale) => path lengths grow with log(log(N)) => very small world with γ = 3 for the BA model

Bioinformatics 3 – WS 12/13V 1 – 25 The Power-Law Signature Power law (Potenzgesetz) Take log on both sides: Plot log(P) vs. log(k) => straight line Note: for fitting γ against experimental data it is often better to use the integrated P(k) => integral smoothes the data

Bioinformatics 3 – WS 12/13V 1 – 26 Scale-Free: Examples The World-Wide-Web: => growth via links to portal sites Flight connections between airports => large international hubs, small local airports Protein interaction networks => some central, ubiquitous proteins

Bioinformatics 3 – WS 12/13V 1 – 27 Saturation: Ageing + Costs Example: network of movie actors (with how many other actors did an actor appear in a joint movie?) Each actor makes new acquaintances for ~40 years before retirement => limits maximum number of links Example: building up a physical computer network It gets more and more expensive for a network hub to grow further => number of links saturates cost

Bioinformatics 3 – WS 12/13V 1 – 28 Hierarchical, Regular, Clustered… Tree-like network with similar degrees => like an organigram => hierarchic network All nodes have the same degree and the same local neighborhood => regular network Note: most real-world networks are somewhere in between the basic types P(k) for these example networks? (finite size!)

Bioinformatics 3 – WS 12/13V 1 – 29 Summary Further Reading: R. Albert and A–L Barabási, „Statistical mechanics of complex networks“ Rev. Mod. Phys. 74 (2002) What you learned today: => networks are everywhere => how to get the "Schein" for BI3 => basic network types and definitions: random, scale-free, degree distribution, Poisson distribution, ageing, … => algorithm on a graph: Dijkstra's shortest path algorithm => looking at graphs: graph layout Next lecture: => clusters, percolation