1 Harvard Medical SchoolMassachusetts Institute of Technology Identifying Differentially Expressed Genes in Time Series Microarrays Jonathan J. Smith 1.

Slides:



Advertisements
Similar presentations
Systems biology SAMSI Opening Workshop Algebraic Methods in Systems Biology and Statistics September 14, 2008 Reinhard Laubenbacher Virginia Bioinformatics.
Advertisements

Linear Models for Microarray Data
CPL The Convergence of Bioinformatics and Medical Informatics -- PL Chang, M.D.
1 Phenotype Prediction by Integrative Network Analysis of SNP and Gene Expression Microarrays Hsun-Hsien Chang 1, Michael McGeachie 1,2 1 Children’s Hospital.
1 Harvard Medical SchoolMassachusetts Institute of Technology Inferring Nonstationary Gene Networks from Temporal Gene Expression Data Hsun-Hsien Chang.
Statistical methods and tools for integrative analysis of perturbation signatures Mario Medvedovic Laboratory for Statistical Genomics and Systems Biology.
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Microarrays Dr Peter Smooker,
DNA Microarray Bioinformatics - #27611 Program Normalization exercise (from last week) Dimension reduction theory (PCA/Clustering) Dimension reduction.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Part II: Discriminative Margin Clustering Joint work with: Rob Tibshirani, Dept of Statistics Patrick O. Brown, School of Medicine Stanford University.
1 Microarray Cancer Data Visualization Analysis in Relation to Pharmacogenomics By Ngozi Nwana.
Temporal Processes Eran Segal Weizmann Institute.
Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.
Goal: Reconstruct Cellular Networks Biocarta. Conditions Genes.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Computational Approaches in Epigenomics Guo-Cheng Yuan Department of Biostatistics and Computational Biology Dana-Farber Cancer Institute Harvard School.
1 April, 2005 Chapter C4.1 and C5.1 DNA Microarrays and Cancer.
ICA-based Clustering of Genes from Microarray Expression Data Su-In Lee 1, Serafim Batzoglou 2 1 Department.
Epistasis Analysis Using Microarrays Chris Workman.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Comprehensive Gene Expression Analysis of Prostate Cancer Reveals Distinct Transcriptional Programs Associated With Metastatic Disease Kevin Paiz-Ramirez.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
Identification of network motifs in lung disease Cecily Swinburne Mentor: Carol J. Bult Ph.D. Summer 2007.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
第十讲 概率图模型导论 Chapter 10 Introduction to Probabilistic Graphical Models
Applications of Biological Network Presented and Created By : Harshit Bhatt.
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Apostolos Zaravinos and Constantinos C Deltas Molecular Medicine Research Center and Laboratory of Molecular and Medical Genetics, Department of Biological.
Lectures 2 – Oct 3, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
A quick introduction to Oncinfo Lab Dr. Habil Zare, PhD PI of Oncinfo Lab Department of Computer Science Texas State University 18 September 2015.
S. F. Molaeezadeh-31 may 2008Gene expression modeling through positive Boolean functions 1 Seminar Title: Gene expression modeling through positive Boolean.
Computational Diagnostics A new research group at the Max Planck Institute for molecular Genetics, Berlin.
A meta-analysis of differential coexpression across age Jesse Gillis.
Drug–Target Network Hussein Hijazi Fall 2012 Michigan State University 1.
Identification of Different Phenotypes of Breast Cancer Based on Two-Step Selective Clustering Analysis of Gene Expression Profiling of Several Signal.
The Broad Institute of MIT and Harvard Differential Analysis.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5.
The National Cancer Research Network is part of the National Institute for Health Research CANCER GENETIC TRIALS Leicestershire, Northamptonshire and Rutland.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 3.
1 CISC 841 Bioinformatics (Fall 2008) Review Session.
基于 R/Bioconductor 进行生物芯片数据分析 曹宗富 博奥生物有限公司
Dan Frumkin*, Adam Wasserstrom*, Shai Kaplan, Uriel Feige & Ehud Shapiro * equal contributors Genomic variability within an organism exposes its cell lineage.
Canadian Bioinformatics Workshops
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Adding Dynamic Nodes to Reliability Graph with General Gates using Discrete-Time Method Lab Seminar Mar. 12th, 2007 Seung Ki, Shin.
Journal club Jun , Zhen.
Meredith L. Wilcox FIU, Department of Epidemiology/Biostatistics
Induced sputum proteome in healthy subjects and asthmatic patients
A Bayesian Approach to Learning Causal networks
Computational Diagnostics
A Short Tutorial on Causal Network Modeling and Discovery
Loyola Marymount University
Transcriptional Signature of Histone Deacetylases in Breast cancer
Somi Jacob and Christian Bach
Principles and Strategies for Developing Network Models in Cancer
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Presentation transcript:

1 Harvard Medical SchoolMassachusetts Institute of Technology Identifying Differentially Expressed Genes in Time Series Microarrays Jonathan J. Smith 1 Hsun-Hsien Chang 2 Marco F. Ramoni 2 1 Department of Mathematics, MIT 2 Division of Health Sciences and Technology, Harvard-MIT New England Statistics Symposium April 17, 2010

2 Harvard Medical SchoolMassachusetts Institute of Technology Background Microarray technologies enable profiling expression of thousands of genes in parallel on a single chip. Comparative analysis of gene expression across tissue states extracts signature genes for disease diagnosis. –Identify differentially expressed genes across tissue states, using t-statistics, fold-change, signal-to-noise ratio, principal component analysis, etc. Research trend: –Microarray technologies are cost down. –Collect times series gene expression microarrays to study biological functions.

3 Harvard Medical SchoolMassachusetts Institute of Technology Approach Challenge: –Existing methods (t-statistics, fold-change, SNR, PCA) cannot be extended to longitudinal expression analysis because temporal information is not well represented. Propose to use the framework of Bayesian networks to capture both the functional and temporal dependencies.

4 Harvard Medical SchoolMassachusetts Institute of Technology Bayesian Networks Bayesian networks are directed acyclic graphs where: –Node corresponds to random variables. –Directed arcs encode conditional probabilities of the target nodes on the source nodes.

5 Harvard Medical SchoolMassachusetts Institute of Technology Representation of Functional Dependence Case 1Case Tissue state 1 Case M Tissue state 2 Phenotypes are modeled by a binomial variable. G Pheno The gene is independent of the phenotypes. Gene G Expression of human subjects is modeled by a log-normal variable. Pheno GG The gene is dependent on the phenotypes.

6 Harvard Medical SchoolMassachusetts Institute of Technology Representation of Temporal Dependence Case 1Case Tissue state 1 Case M Tissue state 2 G(1) G(2) G(T) The time series expression of gene G is considered a 1 st order Markov chain. G(1)G(2)G(T)

7 Harvard Medical SchoolMassachusetts Institute of Technology Differentially Expressed Time Series G(1)G(2)G(T) Pheno The expression series is independent of the phenotypes. G(1)G(2)G(T) Pheno The expression series is dependent on the phenotypes. G(1)G(2)G(T) Pheno Phenotype variable modulates gene expression at every time point.

8 Harvard Medical SchoolMassachusetts Institute of Technology p( | Data ) Identify Function-Dependent Genes p( | Data ) Bayes Factor = p( Data | )

9 Harvard Medical SchoolMassachusetts Institute of Technology Clinical Study on Breast Cancer Breast cancer is the most prevalent cancer in women. Identification of genes inducing breast cancer will help drug development. We used breast cancer microarray data from Gene Expression Omnibus (accession number GSE11352). Our method identified 40 genes that may drive breast cancer development. Biologists confirmed that these genes are involved in cell death, developmental disorder, and endocrine system disorder (all are prerequisites of breast cancer).

10 Harvard Medical SchoolMassachusetts Institute of Technology Conclusion Develop a Bayesian network method for identification of genes in longitudinal expression microarray data. –Functional dependence: genes modulated by phenotypes. –Temporal dependence: gene expression time series modeled by 1 st order Markov chain. –Use Bayes factor to select differentially expressed genes.