Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.

Slides:



Advertisements
Similar presentations
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Advertisements

Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Gene regulation in cancer 11/14/07. Overview The hallmark of cancer is uncontrolled cell proliferation. Oncogenes code for proteins that help to regulate.
Timothy H. W. Chan, Calum MacAulay, Wan Lam, Stephen Lam, Kim Lonergan, Steven Jones, Marco Marra, Raymond T. Ng Department of Computer Science, University.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.
Predictive sub-typing of subjects Retrospective and prospective studies Exploration of clinico-genomic data Identify relevant gene expression patterns.
The Model To model the complex distribution of the data we used the Gaussian Mixture Model (GMM) with a countable infinite number of Gaussian components.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
The Hardwiring of development: organization and function of genomic regulatory systems Maria I. Arnone and Eric H. Davidson.
DIMACS Workshop on Machine Learning Techniques in Bioinformatics 1 Cancer Classification with Data-dependent Kernels Anne Ya Zhang (with Xue-wen.
1 April, 2005 Chapter C4.1 and C5.1 DNA Microarrays and Cancer.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Comprehensive Gene Expression Analysis of Prostate Cancer Reveals Distinct Transcriptional Programs Associated With Metastatic Disease Kevin Paiz-Ramirez.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
Gene expression profiling identifies molecular subtypes of gliomas
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Classification of multiple cancer types by multicategory support vector machines using gene expression data.
Whole Genome Expression Analysis
From motif search to gene expression analysis
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Learning Regulatory Networks that Represent Regulator States and Roles Keith Noto and Mark Craven K. Noto and M. Craven, Learning Regulatory.
Finish up array applications Move on to proteomics Protein microarrays.
ChIP-on-Chip and Differential Location Analysis Junguk Hur School of Informatics October 4, 2005.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Reconstruction of Transcriptional Regulatory Networks
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Small RNAs and their regulatory roles. Presented by: Chirag Nepal.
Inferring transcriptional and microRNA-mediated regulatory programs in glioblastma Setty, M., et al.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Apostolos Zaravinos and Constantinos C Deltas Molecular Medicine Research Center and Laboratory of Molecular and Medical Genetics, Department of Biological.
Analysis of the yeast transcriptional regulatory network.
Supplementary Figure S1 eQTL prior model modified from previous approaches to Bayesian gene regulatory network modeling. Detailed description is provided.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
Journal Club Meeting Sept 13, 2010 Tejaswini Narayanan.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
Class 23, 2001 CBCl/AI MIT Bioinformatics Applications and Feature Selection for SVMs S. Mukherjee.
Anthony Gitter Cancer Bioinformatics (BMI 826/CS 838) May 5, 2015
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Classification of tissues and samples 指導老師:藍清隆 演講者:張許恩、王人禾.
BIOBASE Training TRANSFAC ® Containing data on eukaryotic transcription factors, their experimentally-proven binding sites, and regulated genes ExPlain™
The regulation of Caspase 8 chIP-seq motifs mRNA expression DNA methylation.
Department of Mathematics, Northern New Mexico College1
Global Transcriptional Dysregulation in Breast Cancer
Gene Expression Classification
Microarray Technology and Applications
Albert Xue, Binbin Huang, Jianrong Wang
Loyola Marymount University
Volume 20, Issue 5, Pages (November 2014)
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Volume 20, Issue 5, Pages (November 2014)
Loyola Marymount University
By Wenfei Jin Presenter: Peter Kyesmu
BIOBASE Training TRANSFAC® ExPlain™
Volume 26, Issue 12, Pages e5 (March 2019)
Loyola Marymount University
Loyola Marymount University
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.

Background ● How do we classify processes/expression related to disease/phenotype (separating signal/data)? ● How do we use all of the data available to us – sequences, expression, regulation? ● Present case study of acute leukemia and breast cancer (normal vs. diseased cells).

Summary of Contributions ● Constructing sample-specific regulatory networks. ● Identify links between transcription factors and regulated genes that differentiate healthy states from diseased states. ● Generalize to simultaneous changes in functionality of multiple regulatory links, pointing to a regulatory gene / emanating from one TF.

Summary of Contributions ● Examine distances in transcriptional networks for subsets of genes that characterize diseased state. ● Observation that genes that optimally classify samples are concentrated in neighborhoods. ● Genes that are deregulated in diseased sttes exhibit high connectivity. ● TF-regulated gene links and centrality of genes can be used to characterize diseased cells.

Background ● Current work largely focuses on identification of individual differentially expressed genes, or co- regulated gene sets. ● There is significant work on module identification (graph models, SVD, connected components, etc.) ● There is work on expression patterns of genes that can classify tumor types. ● There is some work on transcription networks prior to this work as well [TRANSFAC/CREME]

Constructing Disease Cell Networks ● Intersect connectivity network representing TF binding to gene promoter regions, with co-expression networks representing TF target gene co-expression. ● Use TRANSFAC to relate known TF binding sites to promoter regions of genes and known TF-target gene interactions. ● For data derived from each microarray (Sample or patient), construct a co-expression network such that each TF-gene pair is assigned +1 or -1 based on up/down co-regulation.

Constructing Disease Cell Networks ● Intersection of connectivity and individual co- expression networks gives condition specific (CS) regulatory networks. ● CS networks derived from 6 gene expression studies using 3 types of datasets – normal cell lineages, tumor vs. normal tissues, and disease specific tumors associated with variable climical outcomes. ● 4821 genes and 196 Tfs on early Affy arrays and genes and 233 Tfs on newer arrays.

Constructing Disease Cell Networks

Classifying based on network features. ● Assume that each disease sample has a distinct regulatory network (pattern of activated links that gives rise to its expression profile). ● Examine how different aspects of network structure characterize different phenotypes.

Classifying based on network features. Link Based Approach ● Examine differences between patient samples by analyzing activity status of regulatory links ● Construct networks unique to patients ● Yields complete discriminatory networks.

Classifying based on network features. Degree Based Approach ● “Centrality” of individual genes in networks ● Degree – number of TFs activating or suppressing a particular gene (in degree), or number of genes regulated by a single TF (out degree). ● Use genome wide degree profile – identifying nodes with largest changes in centrality (rewiring) will assist is in detecting hotspots.

Classifying based on network features. Sample Classification ● Create regulatory networks for every sample and apply a classifier.  Rank features to identify set of TF-gene links  Use training sets to identify features and rank links, genes, and degree of nodes that undergo most substantial changes ● Acute lymphoblastic leukemia vs. acute mueloid leukemia ● Two different myeloid leukemia types ● Different matched cell types (renal-cell carcinoma vs. normal)

Classifying based on network features. Sample Classification ● Create regulatory networks for every sample and apply a classifier.  Rank features to identify set of TF-gene links  Use training sets to identify features and rank links, genes, and degree of nodes that undergo most substantial changes ● Acute lymphoblastic leukemia vs. acute mueloid leukemia ● Two different myeloid leukemia types ● Different matched cell types (renal-cell carcinoma vs. normal)

Classifying based on network features. Sample Classification ● Pass top links to train a basic classifier ● Cross validate.

Classifying based on network features.

Classification Techniques

Classification results