Network inference from repeated observations of node sets Neil Clark, Avi Ma'ayan.

Slides:



Advertisements
Similar presentations
Bayesian network for gene regulatory network construction
Advertisements

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Methods to read out regulatory functions
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
The multi-layered organization of information in living systems
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
CAVEAT 1 MICROARRAY EXPERIMENTS ARE EXPENSIVE AND COMPLICATED. MICROARRAY EXPERIMENTS ARE THE STARTING POINT FOR RESEARCH. MICROARRAY EXPERIMENTS CANNOT.
Correlation Aware Feature Selection Annalisa Barla Cesare Furlanello Giuseppe Jurman Stefano Merler Silvano Paoli Berlin – 8/10/2005.
The STRING database Michael Kuhn EMBL Heidelberg.
Computational characterization of biomolecular networks in physiology and disease Kakajan Komurov, Ph.D Department of Systems Biology University of Texas.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Network Motifs: simple Building Blocks of Complex Networks R. Milo et. al. Science 298, 824 (2002) Y. Lahini.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
Biological networks Construction and Analysis. Recap Gene regulatory networks –Transcription Factors: special proteins that function as “keys” to the.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Protein Classification A comparison of function inference techniques.
Sequence comparison: Significance of similarity scores Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Gene Set Enrichment Analysis (GSEA)
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Conclusions As empirical data about sets of related entities accrues, there are more constraints on possible network realizations that can fit the data;
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Kristen Horstmann, Tessa Morris, and Lucia Ramirez Loyola Marymount University March 24, 2015 BIOL398-04: Biomathematical Modeling Lee, T. I., Rinaldi,
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
High-resolution computational models of genome binding events Yuan (Alan) Qi Joint work with Gifford and Young labs Dana-Farber Cancer Institute Jan 2007.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Affymetrix/BioCarta comparison & Java-based pathway analysis Michael Edmonson 2/26/2003.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
Understanding Network Concepts in Modules Dong J, Horvath S (2007) BMC Systems Biology 2007, 1:24.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Introduction to biological molecular networks
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
GMT files and Network are Related Content of GMT files can be converted to networks based on co-occurrence.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Title: Assign Pathways to Gene Set June 21, 2007 Guanming Wu.
1 Using Graph Theory to Analyze Gene Network Coherence José A. Lagares Jesús S. Aguilar Norberto Díaz-Díaz Francisco A. Gómez-Vela
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
David Amar, Tom Hait, and Ron Shamir
CSCI2950-C Lecture 12 Networks
Time-Course Network Enrichment
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Schedule for the Afternoon
Dynamic Causal Modelling for M/EEG
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Anastasia Baryshnikova  Cell Systems 
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
BRD4 expression and genomic distribution in B-CLL.
Presentation transcript:

Network inference from repeated observations of node sets Neil Clark, Avi Ma'ayan

Network Inference Protein-Protein interaction networkCell signaling network

Overview Network inference - the deduction of an underlying network of interactions from indirect data. 1.A general class of network inference problem 2.Network inference approach 3.Application: 1.inference of physical interactions: PPI 2.Inference of gene associations: Stem cell genes 3.inference of statistical interactions: Drug/side effect network

GMT files

The inference problem Input: a set of entities (genes or proteins or...) in the form of a GMT file - the results of experiments, or sampling more generally. Assumptions: 1An underlying network exists which relates the interactions between the entities in the GMT file 2Each line of the GMT file contains information on the connectivity of the underlying network The problem: Given a GMT file can we extract enough information to resolve the underlying network?

A synthetic example

Approach... Forget for the moment that we know the underlying network and pretend we only have the GMT file. Attempt to use the accumulation of our course data to infer the fine details of the underlying network. Consider the set of all networks that are consistent with our data - there are likely to be many. Use an algorithm to sample this ensemble of networks randomly. The mean adjacency matrix gives the probability of each link being present within the ensemble.

Inference live!

Information content

Analytic Approximation When applying this approach to real data typically there are large numbers of nodes Sample space of networks can be very large -> computationally demanding Write a simple analytical approximation which mimics the action of the algorithm.

Compare analytic approximation

Correction for sampling bias Destroy any information by a random permutation of the GMT file and compare the actual edge weight to the distribution of edge weights from the randomly permuted GMT files:

Application to Infer PPIs Malovannaya A et al. Analysis of the human endogenous coregulator complexome. Cell May 27;145(5):787-99

PPI network

Validataion Compare inferred PPI network to the following databases: – BioCarta – HPRD PPIInnateDB – IntAct – KEGG – MINT mammalia – MIPS – BioGrid

Comparison

Validation

Application to stem cells We used two types of high-throughput data from the ESCAPE database ( Chip X data: from Chip-Chip and Chip-seq experiments. – 203,190 protein DNA binding interactions in the proximity of coding regions from 48 ESC-relevant source proteins. Logof followed by microarray data: A manually compiled database of Protein-mRNA regulatory interactions deriving from loss-of-function gain-of- function followed by microarray profiling. – 154,170 interactions from 16 ESC-relevant regulatory proteins from loss-of-function studies, and 54 from gain- of-function studies.

Chip X network

Logof network

Combining networks Each data source gives a different perspective on the associations between the genes New insights may possibly be gained by combining the different perspectives. e.g. small but consistent associations across different perspectives will be revealed by the enhanced signal-to-noise ratio.

Combination of Chip X and Logof

An extension of the approach...

Application II: Inference of Network of statistical relationships in AERS database Adverse Event Reporting System (AERS) database contains records of.... AERS Record 1Drug 1, Drug 2,...Side-effect 1, Side-effect 2,... AERS Record 2Dug 3, Drug 4,...Side-effect 3, Side effect 4,...…

AERS sub network

AERS Large-scale Adjacency Matrix

And finally…

Summary We described a general class of problem in network inference. A network of physical interactions between proteins is inferred based on high-throughput IP/MS experiments The method has been applied to examine associations between stem-cell genes from multiple perspectives We have begun to apply the approach to the inference of statistical interactions between drugs and side- effects based on the AERS database More details can be found on the website 