Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.

Slides:



Advertisements
Similar presentations
IDENTIFYING CAUSAL GENES AND DYSREGULATED PATHWAYS IN COMPLEX DISEASES Nov. 6 th, 2010 YOO-AH KIM NIH / NLM / NCBI.
Advertisements

Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Kwee Yong, UCL Cancer Institute
RIP – T RANSCRIPT E XPRESSION L EVELS. O UTLINE RNA Immuno-Precipitation (RIP) NGS on RIP & its alternatives Alternate splicing Transcription as a graph.
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
METHODS FOR HAPLOTYPE RECONSTRUCTION
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene- induced Signaling Anthony Gitter Cancer Bioinformatics.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Systems Biology Biological Sequence Analysis
Gene expression analysis summary Where are we now?
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Systems Biology Biological Sequence Analysis
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Cis-regulation Trans-regulation 5 Objective: pathway reconstruction.
Radiogenomics in glioblastoma multiforme
Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Networks and Interactions Boo Virk v1.0.
ResponseNet revealing signaling and regulatory networks linking genetic and transcriptomic screening data CSE Fall.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Apostolos Zaravinos and Constantinos C Deltas Molecular Medicine Research Center and Laboratory of Molecular and Medical Genetics, Department of Biological.
Supplementary Figure S1 eQTL prior model modified from previous approaches to Bayesian gene regulatory network modeling. Detailed description is provided.
Bioinformatics lectures at Rice University Li Zhang Lecture 9: Networks and integrative genomic analysis
COMPUTATIONAL ANALYSIS OF MULTILEVEL OMICS DATA FOR THE ELUCIDATION OF MOLECULAR MECHANISMS OF CANCER Presented by Azeez Ayomide Fatai Supervisor: Junaid.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Identifying Causal Genes and Dysregulated Pathways in Complex Diseases Discussion leader: Nafisah Islam Scribe: Matthew Computational Network Biology BMI.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
De novo discovery of mutated driver pathways in cancer Discussion leader: Matthew Bernstein Scribe: Kun-Chieh Wang Computational Network Biology BMI 826/Computer.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
Fast test for multiple locus mapping By Yi Wen Nisha Rajagopal.
A Yeast Synthetic Network for In Vivo Assessment of Reverse-Engineering and Modeling Approaches Cantone, I., Marucci, L., Iorio, F., Ricci, M., Belcastro,
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
CSE280Stefano/Hossein Project: Primer design for cancer genomics.
Stitching the Tutorials Together Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
Seojin Bang. The goal of this review paper is.. To address problems and computational solutions that arise in analysis of omics data. To highlight fundamental.
EQTLs.
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Reverse-engineering transcription control networks timothy s
Dept of Biomedical Informatics University of Pittsburgh
Ingenuity Knowledge Base
A bioinformatic analysis of microRNAs role in osteoarthritis
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Schedule for the Afternoon
Cyclin E1 Is Amplified and Overexpressed in Osteosarcoma
Volume 58, Issue 4, Pages (May 2015)
GWAS-eQTL signal colocalisation methods
Presentation transcript:

Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be presented at RECOMB 2010 in Lisbon, August 2010

We developed a novel multistep algorithm to identify causal genes and associated dysregulated pathways by integrating several levels of analysis and data, including gene expression, genomic alterations and molecular interactions. Specifically, we aim to identify pathways, starting from genes that are located in areas of genomic alterations in human gliomas to potential target genes by following molecular interactions such as protein-protein interactions, phosphorylation events and protein-transcription factor interactions. Target genes Disease cases Causal genes pathways Outline

‘Target genes’ are genes differentially expressed in disease samples. The set of target genes is chosen so that to minimize the total number of target genes, while ensuring that every disease case is associated with a chosen minimal number of DE genes. Selecting target genes

eQTL mapping Search for a cause of observed DE by associating target genes with alterations in genetic loci. Loci are locally clustered into sets of consecutive loci (tag loci) with correlated copy number values, to reduce computational costs and alleviate multiple hypothesis testing problem. Linear regression between expression value of a gene and copy number of tag locus. Tag loci with significant p-values are chosen for subsequent steps.

Identifying candidate causal genes For each target gene and associated tag locus find candidate causal genes in the genomic region of that tag locus. Gene is more likely to be causal if there exists a path in the underlying interaction network connecting it with corresponding target gene. Problem of finding a path through a network is modeled as current flow in electric circuit. Compute the current flow from the target gene to its potential causal genes as a function of gene expression correlation at edges and target gene. Current flow is obtained by solving a system of linear equations. Candidate causal genes within loci are selected based on empirical p-values.

Identifying final causal genes We want to explain all disease cases with a minimal number of causal genes. The set of causal genes is chosen so that to minimize their total number, while ensuring minimal number of targets covered by causal genes in every disease case.

Dysregulated pathways Using current flow solution, determine path with maximum current from causal gene to target gene.

Algorithm refinements 1.Allow the target genes to only interact with transcription factors 2.Account for directions of protein-DNA interactions and phosphorylation events. Heuristic solution: solve undirected version, remove edges used in opposite direction, solve the linear system again, etc., until only a small number of directed edges are used incorrrectly.

Application to glioma data Data: -mRNA expression data for 321 glioma patients and 32 controls -Copy number alteration data for the same patients and controls -interaction network – pooled data from databases of protein-protein interactions, phosphorylation (networKIN, phosphoELM) and protein-DNA interactions (TRED)

Application to glioma data

PTEN, a tumor supressor is a causal gene, subnetwork with paths to all its targets. Enriched in genes involved in cell cycle and ER overload response. CDK4, a target gene connected to its causal genes. Enriched in genes involved in cell cycle processes and positive regulation of cell proliferation.