ANALYSIS OF GENETIC NETWORKS USING ATTRIBUTED GRAPH MATCHING.

Slides:



Advertisements
Similar presentations
Scale Free Networks.
Advertisements

Network biology Wang Jie Shanghai Institutes of Biological Sciences.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Networks FIAS Summer School 6th August 2008 Complex Networks 1.
COMPLEX NETWORK APPROACH TO PREDICTING MUTATIONS ON CARDIAC MYOSIN Del Jackson CS 790G Complex Networks
Structural Web Search Using a Graph-Based Discovery System Nitish Manocha, Diane J. Cook, and Lawrence B. Holder University of Texas at Arlington
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Computer Science 1 Web as a graph Anna Karpovsky.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Motif Discovery in Protein Sequences using Messy De Bruijn Graph Mehmet Dalkilic and Rupali Patwardhan.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Biological Pathways & Networks
Presentation by: Kyle Borge, David Byon, & Jim Hall
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
Toward Automatically Drawn Metabolic Pathway Atlas with Peripheral Node Abstraction Algorithm Myungha Jang, Arang Rhie, and Hyun-Seok Park * Bioinformatics.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Optimal Network Alignment with Graphlet Degree Vectors
Qiong Cheng, Robert Harrison, Alexander Zelikovsky Computer Science in Georgia State University Oct IEEE 7 th International Conference on BioInformatics.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Networks Igor Segota Statistical physics presentation.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
A connected simple graph is Eulerian iff every graph vertex has even degree. The numbers of Eulerian graphs with, 2,... nodes are 1, 1, 2, 3, 7, 16, 54,
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Social Networking: Large scale Networks
March 3, 2009 Network Analysis Valerie Cardenas Nicolson Assistant Adjunct Professor Department of Radiology and Biomedical Imaging.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Informatics tools in network science
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Abstract Networks. WWW (2000) Scientific Collaboration Girvan & Newman (2002)
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Lecture 23: Structure of Networks
Structures of Networks
CSCI2950-C Lecture 12 Networks
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Biological networks CS 5263 Bioinformatics.
Applications of graph theory in complex systems research
Building and Analyzing Genome-Wide Gene Disruption Networks
Department of Computer Science University of York
Lecture 23: Structure of Networks
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Presentation transcript:

ANALYSIS OF GENETIC NETWORKS USING ATTRIBUTED GRAPH MATCHING

BACKGROUND Completion of sequencing projects Need for functional discovery Emerging area of study: Large scale genomic analysis Similarity of living systems

GENETIC NETWORKS Modelling genetic networks Interaction of genes and proteins Relationship between topology and function

MOTIVATION Common biological processes Comparison of networks Discovering missing interactions Discovering missing genes

GRAPH MATCHING mpn132mpn124mpn141mpn145mpn134mpn133mge234mge235mge236mge312mge314mge310mge313mge336mge337 Search-based Algorithm Pruning Techniques G1 G2

ROADMAP Scale-Free Networks Modelling Genetic Networks Graph Matching Algorithm Results

SCALE-FREE NETWORKS

COMPLEX NETWORKS Small-world model –WWW –Human acquaintances network –Citation networks –Biological networks

SMALL-WORLD Features: –Characteristic path length –Clustering coefficient –Sparseness

SMALL-WORLD Somewhere in between regular & random graphs

SMALL-WORLD Highly clustered Short diameter

SCALE-FREE NETWORKS Complex networks: biological, social, www, power grid, citation etc. Power low connectivity: P(k) = k -  Hubs - authorities

SCALE-FREE NETWORKS Application for testing scale free behavior Yeast Helicobacter Pylori Mycoplasma Pnuemonia Mycoplasma Genitelium Linear log-log graph Slope = 

SCALE-FREE NETWORKS Slope is calculated by least mean square method

TOPOLOGY & FUNCTIONALITY Small diameter – ease of dissemination of information – ease of restoring after disturbance Cliquishness –Alternate paths are found Heterogeneity –Random removal does not effect the network –Hubs are vulnerable to attack

BIOLOGICAL ASPECTS Multifunctionality –Grouped into functional units Stability Reason: Most of the interactions are between hubs and authorities

MODELLING GENETIC NETWORKS

TYPES OF GENETIC NETWORKS Categorized by data sources –Metabolic pathways –Gene expression arrays –Protein interactions –Gene interactions

INTERACTION MAPS High level perspective –Nodes: Genes or proteins –Edges: Presence of an interaction Data sources –Two-hybrid analysis –Fusion analysis –Chromosomal proximity –Phylogenetic analysis

GRAPH MATCHING

PROBLEM DEFINITION Attributed Relational Graph (ARG) G = { V, E, X}. V = {v 1, v 2, …, v n } Nodes E = {e 1, e 2, …, e m } Edges X = {x 1, x 2,…,x n } Attributes

INEXACT SUBGRAPH MATCHING Allow for : Mismatching attribute values Missing nodes Missing links Also called error-correcting subgraph isomorphism NP-Complete

SEARCH TECHNIQUES Cost function Pruning (Structure Constraints) Backtracking

ATTRIBUTED GRAPH MATCHING TOOL

ATTRIBUTE MATCHING -Amino Acid Sequence Content Composition – array of 20, percentage of each aa –Amino acid grouped into classes: array of 6 –Amino acid triples grouped into classes: array of 216  MKVLNKNEL 6 x 6 x 6

ATTRIBUTE MATCHING Difference in amino acid composition values of gene pairs for M. Genitalium and M. Pneumoniae. Score observations

STRUCTURAL CONSTRAINTS Effect of scale-free behaviour –Connectivity information: Highly heterogeneous, thus start with most connected and work around it –Pruning strategy: comparibility is determined by power low

STRUCTURAL CONSTRAINTS Neigborhood connectivity –Choose the neighbor at the next stage Backtracking –Component by component –Go back to the neighbor with the most connectivity within the component

TEST CASE Mycoplasma Genitalium: –smallest genome (470 ORFs) Mycoplasma Pnuemoniae: –Very similar, superset (688 ORFs)

TEST CASE... Mycoplasma Genitalium: –232 nodes –211 links Mycoplasma Pnuemoniae: –267 nodes –257 links Inputs: MGE links MPN links MGE synonyms MPN synonyms MGE amino acid sequence MPN amino acid sequence

RESULTS MGEMPN

DISCOVERY OF MISSING DATA Missing link Link between in MPN632 and MPN637 is missing in our data but exists in literature

DISCOVERY OF MISSING DATA Missing node with known COG MPN MPN237---MPN238---MPN678 MG MG MG MG459 MG459 is ortholog of MPN678

DISCOVERY OF MISSING DATA Missing node without known ortholog

CONCLUSION Large-scale genomics Interaction data captures system structure and dynamics Graph matching exploits the scale-free characteristics Novel interactions and genes can be identified

ACKNOWLEDGEMENT YASEMİN TÜRKELİ