Analysis of the yeast transcriptional regulatory network.

Slides:



Advertisements
Similar presentations
General Linear Model With correlated error terms  =  2 V ≠  2 I.
Advertisements

Dimension reduction (2) Projection pursuit ICA NCA Partial Least Squares Blais. “The role of the environment in synaptic plasticity…..” (1998) Liao et.
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
The multi-layered organization of information in living systems
Dimension reduction (1)
Promoter and Module Analysis Statistics for Systems Biology.
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
From Sequence to Expression: A Probabilistic Framework Eran Segal (Stanford) Joint work with: Yoseph Barash (Hebrew U.) Itamar Simon (Whitehead Inst.)
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Radial Basis Functions
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
L16: Micro-array analysis Dimension reduction Unsupervised clustering.
Reconstructing Transcription Network in S.cerevisiae WANG Chao Oct. 4, 2004.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Functional annotation and network reconstruction through cross-platform integration of microarray data X. J. Zhou et al
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Indiana University Bloomington, IN Junguk Hur Computational Omics Lab School of Informatics Differential location analysis A novel approach to detecting.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Fuzzy K means.
The Hardwiring of development: organization and function of genomic regulatory systems Maria I. Arnone and Eric H. Davidson.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
Bryan Heck Tong Ihn Lee et al Transcriptional Regulatory Networks in Saccharomyces cerevisiae.
Assigning Numbers to the Arrows Parameterizing a Gene Regulation Network by using Accurate Expression Kinetics.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Learning Regulatory Networks that Represent Regulator States and Roles Keith Noto and Mark Craven K. Noto and M. Craven, Learning Regulatory.
ResponseNet revealing signaling and regulatory networks linking genetic and transcriptomic screening data CSE Fall.
Solution Space? In most cases lack of constraints provide a space of solutions What can we do with this space? 1.Optimization methods (previous lesson)
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
Reconstruction of Transcriptional Regulatory Networks
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Abstract ODE System Model of GRNs Summary Evolving Small GRNs with a Top-Down Approach Javier Garcia-Bernardo* and Margaret J. Eppstein Department of Computer.
Steady-state Analysis of Gene Regulatory Networks via G-networks Intelligent Systems & Networks Group Dept. Electrical and Electronic Engineering Haseong.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
Chapter 28 Cononical Correction Regression Analysis used for Temperature Retrieval.
Class 23, 2001 CBCl/AI MIT Bioinformatics Applications and Feature Selection for SVMs S. Mukherjee.
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
. Finding Motifs in Promoter Regions Libi Hertzberg Or Zuk.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Transcription factor binding motifs (part II) 10/22/07.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Yeast Cell-Cycle Regulation Network inference Wang Lin.
Basis Expansions and Generalized Additive Models Basis expansion Piecewise polynomials Splines Generalized Additive Model MARS.
Biological Network Analysis
Dimension reduction (2) EDR space Sliced inverse regression Multi-dimensional LDA Partial Least Squares Network Component analysis.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Reverse-engineering transcription control networks timothy s
Biological networks CS 5263 Bioinformatics.
The minimum cost flow problem
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
1 Department of Engineering, 2 Department of Mathematics,
Linear Model Selection and regularization
Analyzing Time Series Gene Expression Data
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Label propagation algorithm
Presentation transcript:

Analysis of the yeast transcriptional regulatory network

Transcription Factor (TF) A TF is a protein that binds to DNA sequences and regulates the transcriptions of corresponding genes. Usually the binding site of a TF is one small segment of specific promoter sequence. The activity of a TF is regulated according to the cell’s need, largely through signal transduction. It may not be directly observed, but can be reflected by the genes it regulates.

Expression regulatory network Identifying the expression regulatory network is a crucial step towards understanding the cellular regulation system. Inferring network from microarray data alone Inferring network from microarray data and TF-TG (Target Gene) Information

Ihmels J, Friedlander G, Bergmann S, Sarig O, Ziv Y, Barkai N. Revealing modular organization in the yeast transcriptional network. Nat Genet Aug;31(4):370-7.

Segal E et al. Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet Jun;34(2):

TF Activity Use TF-TG relation benefit the regulatory network identification TF expression level is not a good measure of the TF activity. The activated protein level of a TF, rather than its expression level, is what controls gene expression. The activity of a transcription factor is regulated according to the cell’s need, largely through signal transduction. It may not be directly observed, but can be reflected by the genes it regulates.

Identify TF Activity by NCA Network Component Analysis Liao JC et al. Network component analysis: reconstruction of regulatory signals in biological systems. Proc Natl Acad Sci U S A Dec 23;100(26):

NCA compared with PCA, ICA

NCA Model Without further constraints, [E] cannot be uniquely decomposed to [A] and [P].

Criteria for Unique NCA [E] = [A][P] 1.The connectivity matrix [A] must have full-column rank. 2.When a node in the regulatory layer is removed along with all of the output nodes connected to it, the resulting network must be characterized by a connectivity matrix that still has full-column rank. This condition implies that each column of [A] must have at least L-1 zeros. 3.[P] must have full row rank. In other words, each regulatory signal cannot be expressed as a linear combination of the other regulatory signals.

Criteria 2

Estimation of [E]=[A][P] Iteratively estimate [A] and [P]: A 0  P 1  A 1  P 2 … until convergence Convergence criterion: decrease of least square error < cutoff

NCA, infer TF activity in Yeast [E] = [A] [P] How to define the restrictions to CS? i.e. which CS{i,j}=0?

Identify the TF-TG relation by ChIP-chip experiment

Yeast cell cycle regulation 441 genes vs. 33 transcription factors

Inference of regulatory network by Two-stage constrained factor analysis Yu T, Li KC. Inference of transcriptional regulatory network by two-stage constrained space factor analysis. Bioinformatics Nov 1;21(21):

Inference of regulatory network by Two- stage constrained factor analysis Shortcoming of Liao et. al.’s approach: E = AP Let C ij = I{E ij }, the constraint of where the loading matrix A can be non-zero C comes from very noisy source. Estimate C, A, P simultaneously.

Model setting Gene expression matrix Gene x Condition Regulation strength matrix (to be estimated) Gene x TF TF activity matrix (to be estimated) TF x Condition Error matrix Connection constraint matrix Gene x TF 1: connection; 0: no connection Constrained by: Up to here, it is the NCA model by Liao et al.

Model Fitting byand However, we do not assume full knowledge on C. We require C to be bounded Higher-confidence set, from biological evidence Lower-confidence set, from ChIP data

Model Fitting Difficulties: Simultaneous estimation of both the structure and coefficients amounts to finding optimum in a very complex function. The number of parameters to be estimated is overwhelming. Solution: Find a reasonable local optimum. Use the high-confidence set to find a starting point as close to the global optimum as possible. Implementation: Stepwise model fitting. Start with a network backbone with only the high-confidence set, and grow the network gradually, drawing new connections from the low- confidence set.

Set C=C MIN, estimate each activity profile t k by the consensus of the expression of the regulated genes. Is the reduction of total RSS in the last few steps too small? From (C MAX -C), find the TF-gene pair that best agree with current estimate of B and T NO Estimate B and T by alternating least squares, using ridge regression. YES Fix estimate of T, regress each gene expression profile on the activity profiles of TF ’ s that are associated with it in C MAX. Use BIC and p-value to select TF ’ s.

Result Data: Regular growth ChIP data; cell-cycle microarray data; 99 TFs enter our study. Start with 891 evidenced relationships and lower-confidence relationships. Final network has 3846 TF-gene connections.

TF ’ s that exhibit correlated expression and activity:

Time-shifting between a TF ’ s activity profile and its expression profile: (1) Fit the activity profile using cubic spline (2) interpolate the spline to get shifted profile (3) obtain correlation between the expression profile and shifted activity profile (4) maximize absolute correlation with regard to minute shift.

TF ’ s that have activity lagging behind expression:

SWI4 TF ’ s that have activity lagging behind expression:

Between-TF regulations: