Dynamic Network Inference Most statistical work is done on gene regulatory networks, while inference of metabolic pathways and signaling networks are done.

Slides:



Advertisements
Similar presentations
Bayesian network for gene regulatory network construction
Advertisements

Analysis of High-Throughput Screening Data C371 Fall 2004.
DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Modelling and Identification of dynamical gene interactions Ronald Westra, Ralf Peeters Systems Theory Group Department of Mathematics Maastricht University.
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
Introduction of Probabilistic Reasoning and Bayesian Networks
Models and methods in systems biology Daniel Kluesing Algorithms in Biology Spring 2009.
Mutual Information Mathematical Biology Seminar
Dynamic Network Inference Most statistical work is done on gene regulatory networks, while inference of metabolic pathways and signaling networks are done.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Gene Regulatory Networks - the Boolean Approach Andrey Zhdanov Based on the papers by Tatsuya Akutsu et al and others.
Networks in Cellular Biology A. Metabolic Pathways Boehringer-Mannheim Enzyme catalyzed set of reactions controlling concentrations of metabolites B. Regulatory.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
6. Gene Regulatory Networks
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Super-Resolution of Remotely-Sensed Images Using a Learning-Based Approach Isabelle Bégin and Frank P. Ferrie Abstract Super-resolution addresses the problem.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
What Is a Gene Network?. Gene Regulatory Systems “Programs built into the DNA of every animal.” Eric H. Davidson.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
BsysE595 Lecture Basic modeling approaches for engineering systems – Summary and Review Shulin Chen January 10, 2013.
Modeling and identification of biological networks Esa Pitkänen Seminar on Computational Systems Biology Department of Computer Science University.
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Probabilistic Robotics Bayes Filter Implementations.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Reconstruction of Transcriptional Regulatory Networks
Stochastic modeling of molecular reaction networks Daniel Forger University of Michigan.
Combinatorial State Equations and Gene Regulation Jay Raol and Steven J. Cox Computational and Applied Mathematics Rice University.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
Lecture 2: Statistical learning primer for biologists
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
1 EE571 PART 3 Random Processes Huseyin Bilgekul Eeng571 Probability and astochastic Processes Department of Electrical and Electronic Engineering Eastern.
Tracking with dynamics
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION Kalman Filter with Process Noise Gauss- Markov.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Learning gene regulatory networks in Arabidopsis thaliana
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
Filtering and State Estimation: Basic Concepts
1 Department of Engineering, 2 Department of Mathematics,
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Regulation Analysis using Restricted Boltzmann Machines
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
SEG5010 Presentation Zhou Lanjun.
Presentation transcript:

Dynamic Network Inference Most statistical work is done on gene regulatory networks, while inference of metabolic pathways and signaling networks are done by other means. Like in phylogenetics, network inference has two components – graph structure (topology) and continuous aspects, such as parameters of the distributions relating neighboring nodes. Like genome annotation, networks are often hidden structures that influences something that can be observed. Inference of Boolean Networks ODEs with Noise Metabolic Pathways Dynamic Bayesian Networks Integrated Modelling of Networks: Metabolism and Genes Signal Transduction Pathways

Inferring Metabolic Pathways Arita (2000) Metabolic reconstruction using shortest path Simulation Practice and Theory Arita (2000) Graph Modelling of Metabolism JS Art Intel ?? Boyer (2003) Ab Initia reconstruction of metabolic pathways Bioinformatics Practice and Theory Partial Injections Given reaction Create atom to atom mapping between molecules Solvable by Subgraph Isomorphism Algorithms Partial Injections can be concatenated

Example:Tryptophan from 4-erythrose via Chorismate Automaton generating paths with at least 6 carbons transferred and maximum length 6. Bold paths corresponds to biological pathway.

Inferring Signalling Pathways Albert et al. (2007) A Novel Method for Signal Trnasduction Network Inference from Indirect Experimental Evidence JCompuBiol Li et l. (2006) Predicting Essential Components of Signal Transduction Networks: A Dynamic Model of Guard Cell Abscisic Acid Signaling. PLOS Biol Paths from i to j has parity weight of path modulo 2 Questions: there might be several paths from i to j. The effect of i on j, depends on the state of other nodes – ie cannot be viewed in isolation E critical - set of experimentally verified interactions Graph G=(E,V) with some some nodes real, some pseudo. Pseudo are non-observed, but simplifies explanation. Edges are labelled 0-excitation, 1 inhibition. B A C BTR – Binary Transitive Reduction: Find a subgraph E’ : E critical < E’ < E such that parity remains the same. Observations: Inhibition/Excitation relationships between all real pairs PVC – Pseudo Vertex Collapse. Procedure to remove pseudovertices without changing parity.

Abscisic Acid (ABA) Signaling and Simulations Manually Curated Inferred by Algorithm 54 vertices, 92 edges --- Identical strong connected component vertices, 84 edges Simulation: How large is proposed networks relative to theoretical lower bounds.

Shannon Entropies: Mutual Information: M(X,Y) = H(Y) – H(Y given X) = H(X)-H(X given Y) For j=1 to k Find k-sets with significant mutual information. Assign rule Y X H(X) =.97 H(Y) = 1.00 H(X,Y) = 1.85 D’haeseler et al.(2000) Genetic network Inference: from co-expression clustering to reverse engineering. Bioinformatics genes Random firing rules Thus network inference is easy. However, it is not Reverse Engeneering Algorithm-Reveal X Y Discrete known Generations No Noise

BOOL-1, BOOL2, QNET1 Akutsu et al. (2000) Inferring qualitative relations in genetic networks and metabolic pathways. Bioinformatics Bool-2 Activation v j  v i Inhibition v j --! v i Qualitative Network QNET X1X1 X2X2 Algorithm if (  X i * X j <0) delete “n 1 activates n 2 ” from E if (  X i * X j >0) delete “n 1 inhibits n 2 ” from E If O(2 2k [2k +  ]log(n)) INPUT patterns are given uniformly randomly, BOOL-1 correctly identifies the underlying network with probability 1-n , where  is any fixed real number > 1. Algorithm For each gene do (n) For each boolean rule (<= k inputs) not violated, keep it. Qnet Bool-1 p noise is the probability that experiment reports wrong boolean rule uniformly.

ODEs with Noise Z X Y Feed forward loop (FFL) Cao and Zhao (2008) “Estimating Dynamic Models for gene regulation networks” Bioinformatics This can be modeled by Where Objective is to estimate from noisy measurements of expression levels If noise is given a distribution the problem is well defined and statistical estimation can be done Data and estimation Goodness of Fit and Significance

Inference in the Presence of Knowledge Marton Munz Dynamic mass action systems on 10 components were sampled with a bias towards sparseness Kinetic parameters were sampled Dynamic trajectories were generated Normal noise was added Equation system minimizing SSE was chosen Adding deterministic knowledge was added

Inference in the Presence of Knowledge Marton Munz Dynamic mass action systems on 10 components were sampled with a bias towards sparseness Kinetic parameters were sampled Dynamic trajectories were generated Normal noise was added Equation system minimizing SSE was chosen Adding deterministic knowledge was added

Gaussian Processes Examples: Brownian Motion: All increments are N(,  t) distributed.  t is the time period for the increment. No equilibrium distribution. Ornstein-Uhlenbeck Process – diffusion process with centralizing linear drift. N(, ) as equilibrium distribution. One TF (transcription factor – black ball) (f(t)) whose concentration fluctuates over times influence k genes (x j ) (four in this illustration) through their TFBS (transcription factor binding site - blue). The strength of its influence is described through a gene specific sensitivity, S j. D j – decay of gene j, B j – production of gene j in absence of TF Definition: A Stochastic Process X(t) is a GP if all finite sets of time points, t 1,t 2,..,t k, defines stochastic variable that follows a multivariate Normal distribution, N(  ), where  is the k-dimensional mean and  is the k*k dimensional covariance matrix.

Gaussian Processes Gaussian Processes are characterized by their mean and variances thus calculating these for x j and f at pairs of time, t and t’, points is a key objective Rattray, Lawrence et al. Manchester time level Observable Hidden and Gaussian t’ t Correlation between two time points of f Correlation between two time points of same x’es Correlation between two time points of different x’es Correlation between two time points of x and f This defines a prior on the observables Then observeand a posterior distribution is defined

Graphical Models Correlation Graphs Conditional Independence Gaussian Causality Graphs Labeled Nodes: each associated a stochastic variable that can be observed or not. Edges/Hyperedges – directed or undirected – determines the combined distribution on all nodes

Dynamic Bayesian Networks Perrin et al. (2003) “Gene networks inference using dynamic Bayesian networks” Bioinformatics 19.suppl Take a graphical model i.Make a time series of of it ii.Model the observable as function of present network Example: DNA repair Inference about the level of hidden variables can be made

Network Integration A genome scale computational study of the interplay between transcriptional regulation and metabolism. A genome scale computational study of the interplay between transcriptional regulation and metabolism. (T. Shlomi, Y. Eisenberg, R. Sharan, E. Ruppin) Molecular Systems Biology (MSB), 3:101, doi: /msb , 2007 Chen-Hsiang Yeang and Martin Vingron, "A joint model of regulatory and metabolic networks" (2006). BMC Bioinformatics. 7, pp (Boolean vector) Regulatory state Metabolic state Modified from Ruppin Genome-scale integrated model for E. coli (Covert 2004) 1010 genes ( 104 TFs, 906 genes) 817 proteins 1083 reactions

Feasibility of Network Inference: Very Hard Data very noisy Number of network topologies very large Why it is hard: Other sources of knowledge – experiments Evolution What could help: A biological conclusion defines a large set of networks Why poor network inference might be acceptable: Declaring biology unknowable would be very radical Conceptual clarification of problem What statistics can do Optimal analysis of data Power studies (how much data do you need) Statistics can’t draw conclusion if the data is insufficient or too noisy (I hope not)

Summary Network Inference – topology and continuous parameters Interpretation: From Integrative Genomics to Systems Biology: Often the topology is assumed identical Inference of Boolean Networks ODEs with Noise Metabolic Pathways Dynamic Bayesian Networks Integrated Modeling of Networks: Metabolism and Genes Signal Transduction Pathways