6. Gene Regulatory Networks

Slides:



Advertisements
Similar presentations
Systems biology SAMSI Opening Workshop Algebraic Methods in Systems Biology and Statistics September 14, 2008 Reinhard Laubenbacher Virginia Bioinformatics.
Advertisements

Stochastic algebraic models SAMSI Transition Workshop June 18, 2009 Reinhard Laubenbacher Virginia Bioinformatics Institute and Mathematics Department.
CS188: Computational Models of Human Behavior
Bayesian network for gene regulatory network construction
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
A Tutorial on Learning with Bayesian Networks
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
Dynamic Bayesian Networks (DBNs)
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
An Introduction to Variational Methods for Graphical Models.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
Introduction of Probabilistic Reasoning and Bayesian Networks
Predicting Gene Expression using Logic Modeling and Optimization Abhimanyu Krishna New Challenges in the European Area: Young Scientist’s 1st International.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Lecture 5: Learning models using EM
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
Gene Regulatory Networks - the Boolean Approach Andrey Zhdanov Based on the papers by Tatsuya Akutsu et al and others.
Simulation and Application on learning gene causal relationships Xin Zhang.
. Approximate Inference Slides by Nir Friedman. When can we hope to approximate? Two situations: u Highly stochastic distributions “Far” evidence is discarded.
Artificial Intelligence Term Project #3 Kyu-Baek Hwang Biointelligence Lab School of Computer Science and Engineering Seoul National University
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl
Learning In Bayesian Networks. Learning Problem Set of random variables X = {W, X, Y, Z, …} Training set D = { x 1, x 2, …, x N }  Each observation specifies.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Bayes Net Perspectives on Causation and Causal Inference
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
What Is a Gene Network?. Gene Regulatory Systems “Programs built into the DNA of every animal.” Eric H. Davidson.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Using Bayesian Networks to Analyze Expression Data By Friedman Nir, Linial Michal, Nachman Iftach, Pe'er Dana (2000) Presented by Nikolaos Aravanis Lysimachos.
Bayesian Learning By Porchelvi Vijayakumar. Cognitive Science Current Problem: How do children learn and how do they get it right?
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Slides for “Data Mining” by I. H. Witten and E. Frank.
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
Learning In Bayesian Networks. General Learning Problem Set of random variables X = {X 1, X 2, X 3, X 4, …} Training set D = { X (1), X (2), …, X (N)
Introduction to biological molecular networks
Learning and Acting with Bayes Nets Chapter 20.. Page 2 === A Network and a Training Data.
1 Identifying Differentially Regulated Genes Nirmalya Bandyopadhyay, Manas Somaiya, Sanjay Ranka, and Tamer Kahveci Bioinformatics Lab., CISE Department,
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Review of statistical modeling and probability theory Alan Moses ML4bio.
CSC321: Introduction to Neural Networks and Machine Learning Lecture 17: Boltzmann Machines as Probabilistic Models Geoffrey Hinton.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
1 CISC 841 Bioinformatics (Fall 2008) Review Session.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Qian Liu CSE spring University of Pennsylvania
Learning gene regulatory networks in Arabidopsis thaliana
CSCI 5822 Probabilistic Models of Human and Machine Learning
Christopher A. Penfold Vicky Buchanan-Wollaston Katherine J. Denby And
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
Filtering and State Estimation: Basic Concepts
1 Department of Engineering, 2 Department of Mathematics,
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Presentation transcript:

6. Gene Regulatory Networks EECS 600: Systems Biology & Bioinformatics Instructor: Mehmet Koyuturk 1

Regulation of Gene Expression 6. Gene Regulatory Networks Regulation of Gene Expression Transcriptional Regulation of telomerase protein component gene hTERT 2 EECS 600: Systems Biology & Bioinformatics

Genetic Regulation & Cellular Signaling 6. Gene Regulatory Networks Genetic Regulation & Cellular Signaling 3 EECS 600: Systems Biology & Bioinformatics

Organization of Genetic Regulation 6. Gene Regulatory Networks Organization of Genetic Regulation Negative ligand-independent repression at chromatin level Up-regulation Gene Genetic network that controls flowering time in A. thaliana (Blazquez et al, EMBO Reports, 2001) Down-regulation 4 EECS 600: Systems Biology & Bioinformatics

Gene Regulatory Networks Transcriptional Regulatory Networks Nodes with outgoing edges are limited to transcription factors Can be reconstructed by identifying regulatory motifs (through clustering of gene expression & sequence analysis) and finding transcription factors that bind to the corresponding promoters (through structural/sequence analysis) 5 EECS 600: Systems Biology & Bioinformatics

Gene Regulatory Networks Gene expression networks General model of genetic regulation Identify the regulatory effects of genes on each other, independent of the underlying regulatory mechanism Can be inferred from correlations in gene expression data, time-series gene expression data, and/or gene knock-out experiments Observation Inference 6 EECS 600: Systems Biology & Bioinformatics

Boolean Network Model Binary model, a gene has only two states 6. Gene Regulatory Networks Boolean Network Model Binary model, a gene has only two states ON (1): The gene is expressed OFF (0): The gene is not expressed Each gene’s next state is determined by a boolean function of the current states of a subset of other genes A boolean network is specified by two sets Set of nodes (genes) State of a gene: Collection of boolean functions 7 EECS 600: Systems Biology & Bioinformatics

Logic Diagram Cell cycle regulation 6. Gene Regulatory Networks Logic Diagram Cell cycle regulation Retinoblastma (Rb) inhibits DNA synthesis Cyclin Dependent Kinase 2 (cdk2) & cyclin E inactivate Rb to release cell into S phase Up-regulated by CAK complex and down- regulated by p21/WAF1 p53 8 EECS 600: Systems Biology & Bioinformatics

Wiring Diagram 6. Gene Regulatory Networks 9 EECS 600: Systems Biology & Bioinformatics

Dynamics of Boolean Networks 6. Gene Regulatory Networks Dynamics of Boolean Networks Gene activity profile (GAP) Collection of the states of individual genes in the genome (network) The number of possible GAPs is 2n The system ultimately transitions into attractor states Steady state (point) attractors Dynamic attractors: state cycle Each transient state is associated with an attractor (basins of attraction) In practice, only a small number of GAPs correspond to attractors What is the biological meaning of an attractor? 10 EECS 600: Systems Biology & Bioinformatics

State Space of Boolean Networks 6. Gene Regulatory Networks State Space of Boolean Networks Equate cellular with attractors Attractor states are stable under small perturbations Most perturbations cause the network to flow back to the attractor Some genes are more important and changing their activation can cause the system to transition to a different attractor This slide is taken from the presentation by I. Shmulevich 11 EECS 600: Systems Biology & Bioinformatics

Identification of Boolean Networks 6. Gene Regulatory Networks Identification of Boolean Networks We have the “truth table” available Binarize time-series gene expression data REVEAL Use mutual information to derive logical rules that determine each variable If the mutual information between a set of variables and the target variable is equal to the entropy of that variable, then that set of variables completely determines the target variable For each variable, consider functions consisting of 1 variable, then 2, then 3, …, then i…, until one is found Once the minimum set of variables that determine a variable is found, we can infer the function from the truth table In general, the indegrees of genes in the network is small 12 EECS 600: Systems Biology & Bioinformatics

REVEAL 6. Gene Regulatory Networks 13 EECS 600: Systems Biology & Bioinformatics

Limitations of Boolean Networks 6. Gene Regulatory Networks Limitations of Boolean Networks The effect of intermediate gene expression levels is ignored It is assumed that the transitions between states are synchronous A model incorporates only a partial description of a physical system Noise Effects of other factors One may wish to model an open system A particular external condition may alter the parameters of the system Boolean networks are inherently deterministic 14 EECS 600: Systems Biology & Bioinformatics

Probabilistic Models Stochasticity can account for 6. Gene Regulatory Networks Probabilistic Models Stochasticity can account for Noise Variability in the biological system Aspects of the system that are not captured by the model Random variables include Observed attributes Expression level of a particular gene in a particular sample Hidden attributes The boolean function assigned to a gene? 15 EECS 600: Systems Biology & Bioinformatics

Probabilistic Boolean Networks 6. Gene Regulatory Networks Probabilistic Boolean Networks Each gene is associated with multiple boolean functions Each function is associated with a probility Can characterize the stochastic behavior of the system 16 EECS 600: Systems Biology & Bioinformatics

6. Gene Regulatory Networks Bayesian Networks A Bayesian network is a representation of a joint probability distribution A Bayesian network B=(G, ) is specified by two components A directed acyclic graph G, in which directed edges represent the conditional dependence between expression levels of genes (represented by nodes of the graph) A function  that specifies the conditional distribution of the expression level of each gene, given the expression levels of its parents Gene A is gene B’s parent if there is a directed edge from A to B P(B | Pa(B)) = (B, Pa(B)) 17 EECS 600: Systems Biology & Bioinformatics

Conditional Independence 6. Gene Regulatory Networks Conditional Independence In a Bayesian network, if no direct between two genes, then these genes are said to be conditionally independent The probability of observing a cellular state (configuration of expression levels) can be decomposed into product form 18 EECS 600: Systems Biology & Bioinformatics

Variables in Bayesian Network 6. Gene Regulatory Networks Variables in Bayesian Network Discrete variables Again, genes’ expression levels are modeled as ON and OFF (or more discrete levels) If a gene has k parents in the network, then the conditional distribution is characterized by rk parameters (r is the number of discrete levels) Continuous variables Real valued expression levels We have to specify multivariate continuous distribution functions Linear Gaussian distribution: Hybrid networks 19 EECS 600: Systems Biology & Bioinformatics

Equivalence Classes of Bayesian Nets 6. Gene Regulatory Networks Equivalence Classes of Bayesian Nets Observe that each network structure implies a set of independence assumptions Given its parents, each variable is independent of its non- descendants More than one graph can imply exactly the same set of independencies (e.g., X->Y and Y->X) Such graphs are said to be equivalent By looking at observations of a distribution, we cannot distinguish between equivalent graphs An equivalence class can be uniquely represented by a partially directed graph (some edges are undirected) 20 EECS 600: Systems Biology & Bioinformatics

Learning Bayesian Networks 6. Gene Regulatory Networks Learning Bayesian Networks Given a training set D = {x1, x2, …, xn} of m independent instances of the n random variables, find an equivalence class of networks B=(G, ) that best matches D x’s are the gene expression profiles Based on Bayes’ formula, the posterior probability of a network given the data can be evaluated as where C is a constant (independent of G) and is the marginal likelihood that averages the probability of data over all possible parameter assignments to G 21 EECS 600: Systems Biology & Bioinformatics

6. Gene Regulatory Networks Learning Algorithms The Bayes score S(G : D) depends on the particular choice of priors P(G) and P( | G) The priors can be chosen to be structure equivalent, so that equivalent networks will have the same score decomposable, so that the score can be represented as the superposition of contributions of each gene The problem becomes finding the optimal structure (G) We can estimate the gain associated with addition, removal, and reversal of an edge Then, we can use greedy-like heuristics (e.g., hill climbing) 22 EECS 600: Systems Biology & Bioinformatics

6. Gene Regulatory Networks Causal Patterns Bayesian networks model dependencies between multiple measurements How about the mechanism that generated these measurements? Causal network model: Flow of causality Model not only the distribution of observations, but also the effect of observations If gene X codes for a transcription factor of gene Y, manupilating X will affect Y, but not vice versa But in Bayesian networks, X->Y and Y->X are equivalent Intervention experiments (as compared to passive observation): Knock X out, then measure Y 23 EECS 600: Systems Biology & Bioinformatics

Dynamic Bayesian Networks 6. Gene Regulatory Networks Dynamic Bayesian Networks Dependencies do not uncover temporal relationships Gene expression varies over time Dynamic Bayesian Networks model the dependency between a gene’s expression level at time t and expression levels of parent genes at time t-1 24 EECS 600: Systems Biology & Bioinformatics

Linear Additive Regulation Model 6. Gene Regulatory Networks Linear Additive Regulation Model The expression level of a gene at a certain time point can be calculated by the weighted sum of the expression levels of all genes in the network at a previous time point ei : expression level of gene i wij : effect of gene j on gene i uk: kth external variable ik: effect of kth external variable on gene j i : gene-specific bias Can be fitted using linear regression 25 EECS 600: Systems Biology & Bioinformatics