Structure Learning for Inferring a Biological Pathway Charles Vaske Stuart Lab.

Slides:



Advertisements
Similar presentations
Experiment Design for Affymetrix Microarray.
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Modelling and Identification of dynamical gene interactions Ronald Westra, Ralf Peeters Systems Theory Group Department of Mathematics Maastricht University.
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
M. Kathleen Kerr “Design Considerations for Efficient and Effective Microarray Studies” Biometrics 59, ; December 2003 Biostatistics Article Oncology.
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
Florian Markowetz markowetzlab.org Joining the dots… Network analysis of gene perturbation data.
Statistical association of genotype and phenotype.
Separation of Scales. Interpretation of Networks Most publications do not consider this.
Using Gene Ontology Models and Tests Mark Reimers, NCI.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
1. Elements of the Genetic Algorithm  Genome: A finite dynamical system model as a set of d polynomials over  2 (finite field of 2 elements)  Fitness.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
Yongjin Park, Stanley Shackney, and Russell Schwartz Accepted Computational Biology and Bioinformatics.
Collaborative Ordinal Regression Shipeng Yu Joint work with Kai Yu, Volker Tresp and Hans-Peter Kriegel University of Munich, Germany Siemens Corporate.
6. Gene Regulatory Networks
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Whole Genome Expression Analysis
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Probabilistic Models that uncover the hidden Information Flow in Signalling Networks.
CSCE555 Bioinformatics Lecture 16 Identifying Differentially Expressed Genes from microarray data Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun.
Alexander van Oudenaarden Lab Final Presentation Mashaal Sohail
Identifying Candidate Pathways to Explain Phenotypes in Genome-Wide Mutant Screens Mark Craven Department of Biostatistics & Medical Informatics University.
Inferring strengths of protein-protein interactions from experimental data using linear programming Morihiro Hayashida, Nobuhisa Ueda, Tatsuya Akutsu Bioinformatics.
The Frequency Dependence of Osmo-Adaptation in Saccharomyces cerevisiae Jerome T. Mettetal, et al. Science 319, 482 (2008); William J. Gibson.
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Control of Gene Expression
Lecture Topic 5 Pre-processing AFFY data. Probe Level Analysis The Purpose –Calculate an expression value for each probe set (gene) from the PM.
More statistical stuff CS 394C Feb 6, Today Review of material from Jan 31 Calculating pattern probabilities Why maximum parsimony and UPGMA are.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
CHAPTER 5 SIGNAL SPACE ANALYSIS
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Modelling from Sequence to Gene Regulatory Network to Phenotype.
1 Identifying Differentially Regulated Genes Nirmalya Bandyopadhyay, Manas Somaiya, Sanjay Ranka, and Tamer Kahveci Bioinformatics Lab., CISE Department,
No reference available
CGH Data BIOS Chromosome Re-arrangements.
Convolutional Restricted Boltzmann Machines for Feature Learning Mohammad Norouzi Advisor: Dr. Greg Mori Simon Fraser University 27 Nov
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
Fan-out in Gene Regulatory Networks Kyung Hyuk Kim Senior Fellow Department of Bioengineering University of Washington, Seattle 2 nd International Workshop.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Dynamical Modeling in Biology: a semiotic perspective Junior Barrera BIOINFO-USP.
Performance of Digital Communications System
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Maximum Ingredient Level Optimization Workbook MIOW User Guide Rashed A. Alhotan, Graduate Student Dmitry V. Vedenov, Associate Professor Gene M. Pesti,
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Learning Deep Generative Models by Ruslan Salakhutdinov
Learning gene regulatory networks in Arabidopsis thaliana
Biological networks CS 5263 Bioinformatics.
Probabilistic Models for Linear Regression
Filtering and State Estimation: Basic Concepts
Confidence as Bayesian Probability: From Neural Origins to Behavior
Technical reproducibility and biological variability.
Phenotype gain and loss dynamics match protein family dynamics.
Data Analysis – Part1: The Initial Questions of the AFCS
Volume 45, Issue 5, Pages (March 2012)
Statistical analysis of the effect of Z-FA-FMK on the SMN protein expression in biological triplicates (sample 1, 2, and 3) of SMA fibroblast cells (GM03813).
Enrichment of miRNAs among DPAR genes and their relative expression under dietary restriction. Enrichment of miRNAs among DPAR genes and their relative.
Inferring Cellular Processes from Coexpressing Genes
Pre-processing AFFY data
Design Issues Lecture Topic 6.
Presentation transcript:

Structure Learning for Inferring a Biological Pathway Charles Vaske Stuart Lab

Biological Pathways Cell is a dynamical system Somewhat modularized (into pathways) Given pathway elements, how do they communicate? –Protein modification –Gene expression changes

Input Set of genes essential to phenotype RNAi perturbation - gene knockdown Expression measurement S Genes - essential to phenotype - each is individually perturbed E Genes - affected by S Genes - expression is measured

Input E Genes S genes & controls

Desired Output: Structure Probabilistic Model Binary Variable Domain Restricted factor form –Deterministic signalling –Shared measurement error rates Markowetz, et al. 2005

Model Averaging Maximum Likelihood estimate might not be interesting Gain a posterior on particular model features

First Attempt: All linear models Calculate likelihood of data under each model Find posterior of individual edges Tiered Sgenes (matches initial discovery method)

Link Robustness Noise level estimated from replicate spots Added noise to data, reran experiment 10 times

Link Significance

Experimentally verified

Scaling to More Sgenes All linear permutations is a hack MCMC structure sampling?