Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene- induced Signaling Anthony Gitter Cancer Bioinformatics.

Slides:



Advertisements
Similar presentations
Control Case Common Always active
Advertisements

Statistical methods and tools for integrative analysis of perturbation signatures Mario Medvedovic Laboratory for Statistical Genomics and Systems Biology.
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Gene regulatory network
Gene regulation in cancer 11/14/07. Overview The hallmark of cancer is uncontrolled cell proliferation. Oncogenes code for proteins that help to regulate.
Inferring causal genomic alterations in breast cancer using gene expression data (Linh M Tran et.al) By Linglin Huang.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
Simultaneous Identification of Multiple Driver Pathways in Cancer Mark D. M. Leiserson, et.al.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Biological networks Construction and Analysis. Recap Gene regulatory networks –Transcription Factors: special proteins that function as “keys” to the.
The Hardwiring of development: organization and function of genomic regulatory systems Maria I. Arnone and Eric H. Davidson.
6. Gene Regulatory Networks
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Case study reveals transcription factor (TF) modules, dynamic TF binding and an expanded role for cell cycle regulators Mapping the DNA Damage Response.
Epistasis Analysis Using Microarrays Chris Workman.
Case study reveals transcription factor (TF) modules, dynamic TF binding and an expanded role for cell cycle regulators Mapping the DNA Damage Response.
TP Tumor-Stroma Interactions Tumor and plasma proteomics insights into tumor stroma interactions 109 luminal human breast cancer specimens+plasma o high.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Network Analysis and Application Yao Fu
“An Extension of Weighted Gene Co-Expression Network Analysis to Include Signed Interactions” Michael Mason Department of Statistics, UCLA.
ResponseNet revealing signaling and regulatory networks linking genetic and transcriptomic screening data CSE Fall.
P. falciparum Life Cycle & Pathogenesis of Malaria Miller et al., Nature  Molecular and genetic.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Supplementary Figure S1 eQTL prior model modified from previous approaches to Bayesian gene regulatory network modeling. Detailed description is provided.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Network visualization –Cytoscape Cool.
Bioinformatics lectures at Rice University Li Zhang Lecture 9: Networks and integrative genomic analysis
Cancer- abnormal cell growth (cell growth not under "normal" control) Benign tumors are self-limiting- have contact inhibition Malignant tumors can metastasize.
Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Lecture 11. Topics in Omic Studies (Cancer Genomics, Transcriptomics and Epignomics) The Chinese University of Hong Kong CSCI5050 Bioinformatics and Computational.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
De novo discovery of mutated driver pathways in cancer Discussion leader: Matthew Bernstein Scribe: Kun-Chieh Wang Computational Network Biology BMI 826/Computer.
Anthony Gitter Cancer Bioinformatics (BMI 826/CS 838) May 5, 2015
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Motif Search and RNA Structure Prediction Lesson 9.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Seojin Bang. The goal of this review paper is.. To address problems and computational solutions that arise in analysis of omics data. To highlight fundamental.
High-throughput genomic profiling of tumor-infiltrating leukocytes
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
GCN5 Regulates FGF Signaling and Activates Selective MYC Target Genes during Early Embryoid Body Differentiation  Li Wang, Evangelia Koutelou, Calley.
Identifying Signaling Pathways
Multidimensional Drug Profiling By Automated Microscopy
Relationship between Genotype and Phenotype
Relationship between Genotype and Phenotype
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Systematic Analysis Reveals that Cancer Mutations Converge on Deregulated Metabolism of Arachidonate and Xenobiotics  Francesco Gatto, Almut Schulze,
Schedule for the Afternoon
Widespread modulation of epigenetic landscape for differentiated organoids is driven by Hnf4g Widespread modulation of epigenetic landscape for differentiated.
Anastasia Baryshnikova  Cell Systems 
Volume 10, Issue 1, Pages (January 2018)
Hepatic Dysfunction Caused by Consumption of a High-Fat Diet
Relationship between Genotype and Phenotype
Marijn T.M. van Jaarsveld, Difan Deng, Erik A.C. Wiemer, Zhike Zi 
EZH2-driven lung cancer as a molecularly distinct entity.
No Driver behind the Wheel? Targeting Transcription in Cancer
Presentation transcript:

Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene- induced Signaling Anthony Gitter Cancer Bioinformatics (BMI 826/CS 838) April 16, 2015 All figures and quotes from Huang2013 unless noted otherwiseHuang2013

Prize-Collecting Steiner Tree (PCST) Pathways needed to determine how heterogeneous drivers lead to cancer-related phenotypes PCST focuses on learning pathway structure

Pathway structure important for therapeutics Vogelstein2013 Kinase inhibitors target specific pathway members

Glioblastoma model Phosphorylation changes (88 proteins) DNase hypersensitivity changes (~13000 regions) Gene expression changes (1623 genes) U87H – High EGFRvIIIU87DK – Dead Kinase

Pathways for integrating data Low correlation between phosphorylation and transcription Provide complementary information

Two stages: define prizes, solve PCST OmicsIntegrator

Stage 1: phosphorylation prizes Prize based on phosphorylation fold change Proteins with prizes called terminals p: protein prize ph : phosphorylation prize

Stage 1: TF prizes Find differentially expressed genes Find differential DNase peaks Create gene X TF score matrix DNA binding motif affinity for TF τ at score for peak i in condition c proximal to gene g Peaks mapped to gene g in U87DK cellsPeaks mapped to gene g in U87H cells

Stage 1: TF prizes continued Test each TF independently for association with differentially expressed gene g Are coefficients in linear regression significantly different from 0? Regression coefficient, use t-test statistic for prize Binding affinity for TF τ near gene gGene g log 2 expression fold change

Stage 2: identify subnetwork

Solving PCST Use off-the-shelf Steiner tree solver Solver creates an integer program Penalty for omitted prize nodes Cost of selected edgesPrize vs. edge cost tradeoff

EGRFvIII signaling pathway

Comparing with other network approaches Also compare to xenograft phosphorylation

Using pathway structure for validation Which proteins to test? What are appropriate negative controls?

Three tiers of proteins to test

Inhibitor viability screening U87H cells very sensitive to inhibiting high-ranked targets

Inhibitor viability screening 3 of 4 top targets more toxic in U87H 1 of 3 lower targets more toxic, requires high dose

ChIP-seq to explore EP300 role EP300 is a Steiner node, top-ranked TF Find its targets include epithelial-mesenchymal transformation markers

PCST versus other pathway approaches MEMo / Multi-Dendrix (mutual exclusivity) RPPA regression GSEA / PARADIGM (pathway enrichment / activity) HotNet / NBS (network diffusion) ActiveDriver (phosphorylation impact)

Multi-sample Prize-Collecting Steiner Forest (Multi-PCSF) PCST => PCSF: allow multiple disjoint trees PCSF => Multi-PCSF: jointly optimize pathways for many related samples (e.g. tumors) Approximate optimization with belief propagation instead of integer program vivi vjvj vlvl vkvk

PCST formulation Protein-protein interactions Alteration No data Penalty for omitted prize nodes Cost of selected edgesPrize vs. edge cost tradeoff

Multi-PCSF formulation Alteration Important in related tumors No data Artificial prizes to share information Original Steiner tree objective Explaining individuals’ data vs. similarity tradeoff

Mutation or proteomic change Important in related tumors Steiner node Breast cancer tumor TCGA-AN- A0AR

Unique pathways with common core Mutation or proteomic change Important in related tumors Steiner node

Appendix

Alternative pathway identification algorithms Steiner tree/forest (related to PCST) Prize-collecting Steiner forest (PCSF)PCSF Belief propagation approximation (msgsteiner)msgsteiner k-shortest paths Ruths2007 Shih2012 Integer programs Signaling-regulatory Pathway INferencE (SPINE)SPINE Chasman2014

Alternative pathway identification algorithms continued Path-based objectives Physical Network Models (PNM)PNM Maximum Edge Orientation (MEO)MEO Signaling and Dynamic Regulatory Events Miner (SDREM)SDREM Maximum flow ResponseNet Hybrid approaches PathLinker: random walk + shortest paths ANAT: shortest path + Steiner tree ANAT

Recent developments in pathway discovery Multi-task learning: jointly model several related biological conditions ResponseNet extension: SAMNetSAMNet Steiner forest extension: Multi-PCSFMulti-PCSF SDREM extension: MT-SDREMMT-SDREM Temporal data ResponseNet extension: TimeXNetTimeXNet Pathway synthesis