Stanford University Boolean Analysis of Large Gene-expression Datasets Debashis Sahoo PhD Candidate, Electrical Engineering Joint work with David Dill,

Slides:



Advertisements
Similar presentations
Graphing Data on the Coordinate Plane
Advertisements

Mining Association Rules from Microarray Gene Expression Data.
DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Cells and Organs of the Immune System David Chaplin, MD, PhD
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Genetic algorithms applied to multi-class prediction for the analysis of gene expressions data C.H. Ooi & Patrick Tan Presentation by Tim Hamilton.
Part II: Discriminative Margin Clustering Joint work with: Rob Tibshirani, Dept of Statistics Patrick O. Brown, School of Medicine Stanford University.
University of CreteCS4831 The use of Minimum Spanning Trees in microarray expression data Gkirtzou Ekaterini.
1 Exploratory Tools for Follow-up Studies to Microarray Experiments Kaushik Sinha Ruoming Jin Gagan Agrawal Helen Piontkivska Ohio State and Kent State.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Simulation and Application on learning gene causal relationships Xin Zhang.
Feature Screening Concept: A greedy feature selection method. Rank features and discard those whose ranking criterions are below the threshold. Problem:
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
Feature Selection and Its Application in Genomic Data Analysis March 9, 2004 Lei Yu Arizona State University.
Artificial Intelligence Term Project #3 Kyu-Baek Hwang Biointelligence Lab School of Computer Science and Engineering Seoul National University
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Gene expression profiling identifies molecular subtypes of gliomas
ANALYSIS AND VISUALIZATION OF TIME-VARYING DATA USING ‘ACTIVITY MODELING’ By Salil R. Akerkar Advisor Dr Bernard P. Zeigler ACIMS LAB (University of Arizona)
Identification of network motifs in lung disease Cecily Swinburne Mentor: Carol J. Bult Ph.D. Summer 2007.
Bioinformatics for Stem Cell Lecture 1 Debashis Sahoo, PhD.
ICBP, Stanford University 1 Implication Networks from Large Gene-expression Datasets Debashis Sahoo PhD Candidate, Electrical Engineering, Stanford University.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.
Graph and Topological Structure Mining on Scientific Articles Fan Wang, Ruoming Jin, Gagan Agrawal and Helen Piontkivska The Ohio State University The.
Using Bayesian Networks to Analyze Expression Data By Friedman Nir, Linial Michal, Nachman Iftach, Pe'er Dana (2000) Presented by Nikolaos Aravanis Lysimachos.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Agenda Introduction to microarrays
Boundary Recognition in Sensor Networks by Topology Methods Yue Wang, Jie Gao Dept. of Computer Science Stony Brook University Stony Brook, NY Joseph S.B.
Gene expression analysis
S. F. Molaeezadeh-31 may 2008Gene expression modeling through positive Boolean functions 1 Seminar Title: Gene expression modeling through positive Boolean.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
Bioinformatics for Stem Cell Lecture 2 Debashis Sahoo, PhD.
Prostate Cancer CAD Michael Feldman, MD, PhD Assistant Professor Pathology University Pennsylvania.
Sylvia K. Plevritis, PhD Professor Department of Radiology and
Computing Co-Expression Relationships Wen-Dar Lin.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
Extracting binary signals from microarray time-course data Debashis Sahoo 1, David L. Dill 2, Rob Tibshirani 3 and Sylvia K. Plevritis 4 1 Department of.
Paper Review on Cross- species Microarray Comparison Hong Lu
Equivalent Opposite PTPRC low  CD19 low FAM60A low  NUAK1 high XIST high  RPS4Y1 low COL3A1 high  SPARC high Boolean analysis of large gene-expression.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Boolean Analysis of High-Throughput Biological Datasets
AN INTRODUCTION TO GENE EXPRESSION ANALYSIS BY MICROARRAY TECHNIQUE (PART II) DR. AYAT B. AL-GHAFARI MONDAY 10 TH OF MUHARAM 1436.
Twitter Community Discovery & Analysis Using Topologies Andrew McClain Karen Aguar.
IENG-385 Statistical Methods for Engineers SPSS (Statistical package for social science) LAB # 1 (An Introduction to SPSS)
Kuby Immunology, 7e: Chapter 2
MAMMALIAN BLOOD SMEARS SLIDE M
DEPARTMENT OF COMPUTER SCIENCE
Impact of Formal Methods in Biology and Medicine Final Review
Impact of Formal Methods in Biology and Medicine
Impact of Formal Methods in Biology and Medicine
MiDReG: Mining Developmentally Regulated Genes
Department of Computer Science
Volume 41, Issue 1, Pages (July 2014)
Markov Random Fields Presented by: Vladan Radosavljevic.
Department of Computer Science
HSC DC - precursors DC Lymphoid precursor Lymphoid Myeloid precursor
Robust inference of biological Bayesian networks
Interpretation of Similar Gene Expression Reordering
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Lab-Specific Gene Expression Signatures in Pluripotent Stem Cells
Michael F. Gurish, PhD, Joshua A. Boyce, MD 
Planting trees in random graphs (and finding them back)
Department of Computer Science
HIF-1α is not required for the classic transcriptional response to hypoxia. HIF-1α is not required for the classic transcriptional response to hypoxia.
Erythrocyte What type of cell is this?.
Introduction to Biological Systems
Presentation transcript:

Stanford University Boolean Analysis of Large Gene-expression Datasets Debashis Sahoo PhD Candidate, Electrical Engineering Joint work with David Dill, Andrew Gentles, Rob Tibshirani, Sylvia Plevritis

Stanford University Outline Standard microarray work flow Data collection and preprocessing Boolean analysis Biological insights Conclusion and future work

Stanford University Microarray Work Flow mRNAHybridizationScanning Image processingNormalizationData analysis

Stanford University Data Collection There are thousands of microarray freely available GEO ArrayExpress SMD Celsius

Stanford University Preprocessing Get original RAW CEL files for one platform together. Typical number of CEL files : 2,000-11,000 Use RMA to normalize the CEL files Need a memory efficient algorithm Generates expression values for each probeset

Stanford University Existing Methods Correlation analysis Conditional probability Mutual information

Stanford University Boolean Analysis Get RAW DataNormalize Determine thresholds Discover Boolean relationshipsNew Biology

Stanford University Example

Stanford University Determine threshold Sort the gene expressions Use StepMiner to determine the threshold

Stanford University Determine threshold Its hard to determine a threshold for this gene. StepMiner usually puts a threshold in the middle for this case.

Stanford University Discover Boolean Relationships Analyze scatter plots between two genes. Divide the space into four different regions using the thresholds (quadrants). Determine sparse quadrants. Determine the Boolean relationships. WNT5A high PAX5 low

Stanford University Statistical Tests Compute the expected number of points under the independence model Compute maximum likelihood estimate of the error rate statistic = (expected – observed) expected √ a 00 (a 00 + a 01 ) a 00 (a 00 + a 10 ) + () 1 2 error rate = a 00 a 01 a 11 a 10

Stanford University Boolean Relationships Tightly co-regulated genes forms two sparse quadrants. There are six possible Boolean relationships Equivalent Opposite A lowB low A lowB high A highB low A high B high

Stanford University Boolean Relationships Equivalent Opposite PTPRC low CD19 low XIST high RPS4Y1 low COL3A1 high COL1A1 highFAM60A low NUAK1 high SymmetricAsymmetric

Stanford University Boolean Implication Network Directed graph Nodes: For each gene A A high A low Edges: A high to B low A high B low A high B low A low B high C high C low

Stanford University New Biology This slide is under construction!!

Stanford University Biological Insights Gender Organ Tissue DevelopmentDifferentiationCo-expression

Stanford University Example Application Immunology B Cell differentiation Goal: Discover genes that mark unique B Cell precursors

Stanford University Differentiation Tree Hematopoietic stem cell differentiation is a tree Root: HSC Leaf Lymphocytes B Cell, T Cell, NK cell, Dendritic cell Erythrocytes Granulocytes: Basophil, Neutrophil, Eosinophil Monocytes: Dendritic cell Thrombocytes

Stanford University KIT high A high B low B220 low CD19 low KIT A B B220 CD19 A high B low

Stanford University Conclusion Boolean analysis Directly visible on the scatter plot. Enables discovery of asymmetric relationship. Follow biology. Potential application to Immunology Future work Cancer progression New biology

Stanford University Acknowledgements The Felsher Lab:  Natalie Wu  Cathy Shachaf  Dean Felsher Funding: ICBP Program (NIH grant: 5U56CA )  Leonore A Herzenberg  James Brooks  Joe Lipsick  Gavin Sherlock  Howard Chang  Stuart Kim

Stanford University The END