CELL INDEX DATABASE (CELLX): A WEB TOOL FOR CANCER PRECISION MEDICINE Pacific Symposium on Biocomputing (PSB) 2015 January.

Slides:



Advertisements
Similar presentations
Biomarker Analyses in CLEOPATRA: A Phase III, Placebo-Controlled Study of Pertuzumab in HER2- Positive, First-Line Metastatic Breast Cancer (MBC) Baselga.
Advertisements

Comparison of Mutations and Protein Expression in Potentially Actionable Targets in 5500 Triple Negative vs. non-Triple Negative Breast Cancers Joyce A.
Data Integration for Cancer Genomics. Personalized Medicine Tumor Board Question: given all we know about a patient, what is the “optimal” treatment?
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Introduction to Glioblastoma Chris Plaisier Introduction to Systems Biology Course Institute for Systems Biology.
1 Research Data Marts In Support Of Cancer Personalized Medicine Jack London, PhD and Devjani Chatterjee, PhD Jefferson Kimmel Cancer Center, Philadelphia.
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
Supplementary Figure 1. Somatic mutation spectrum # Substitutions # Substitutions per Mb b c a Repeats Pseudogenes Whole genome Splice sites Non-coding.
Sage Bionetworks Mission Sage Bionetworks is a non-profit organization with a vision to create a “commons” where integrative bionetworks are evolved by.
Enabling biomarker validation in breast cancer molecular subtypes: sensitivity and specificity of array-based subtype classification in 983 patients Balázs.
Vs. home.ccr.cancer.gov Personalized medicine-The goal.
Data Analysis Summary. Elephant in the room General Comments General understanding that informatics is integral in medical sequencing and other –omics.
Karl Clauser Proteomics and Biomarker Discovery Breast Cancer Proteomics and the use of TCGA Mutational Data - Broad Institute update/issues Karl Clauser.
Translational Genomics Research Institute | The Sarcoma Data Portal: Making High Content Sarcoma Datasets Available For All Users Jonathan.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
CANDID: A candidate gene identification tool Part 2 Janna Hutz March 26, 2007.
Supporting Scientific Collaboration Online SCOPE Workshop at San Diego Supercomputer Center March 19-22, 2008.
The Stanley Neuropathology Consortium Integrative Database: A novel web-based tool for exploring neuropathological traits, gene expression and associated.
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Gene Expression Omnibus (GEO)
Introduction to caIntegrator caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
SUPPLEMENTAL FIGURES AND TABLES. Supplementary Table 1: List of new and improved features in GSEA-P version 2 Java software. Examples and screenshots.
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
Pan-cancer analysis of prognostic genes Jordan Anaya Omnes Res, In this study I have used publicly available clinical and.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Supplementary Figure S1 Inhibition of AKT and S6 phosphorylation. After 4 h treatment with 1 µM of GDC-0941, EVSA-T parental, resistant Clone1, Clone2,
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Introduction to Oncomine Xiayu Stacy Huang. Oncomine is a cancer-specific microarray database and has a web-based data-mining platform aimed at facilitating.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Date of download: 6/18/2016 Copyright © 2016 American Medical Association. All rights reserved. From: Association of BRCA1 and BRCA2 Mutations With Survival,
An Overview of The Cancer Genome Atlas (TCGA)
GEO (Gene Expression Omnibus) Deepak Sambhara Georgia Institute of Technology 21 June, 2006.
 Cancer  Compound perturbations  Gene perturbations  Tumor development  Cancer metastasis  Cancer treatments Altered Caspase-8 Expression.
Pichai Raman on behalf of cBioPortal Team Wednesday, May 25, 16
Recurrent copy number alterations in prostate cancer: an in silico meta-analysis of publicly available genomic data  Julia L. Williams, Peter A. Greer,
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
Integrated genomic and proteomic analysis identifies PTEN loss and AKT/MTOR as drivers of resistance to MEK inhibitors in NSCLC cells Dianren Xia1, Lauren.
Iorio et al., 2016, Cell 166, 1-15 These oncogenic alterations were investigated as possible predictors of differential drug sensitivity across 1,001 cancer.
Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
The PedcBioPortal & DiseaseXpress
Multiple Myeloma Research Foundation
Dept of Biomedical Informatics University of Pittsburgh
Impact of Formal Methods in Biology and Medicine
Volume 152, Issue 1, Pages e4 (January 2017)
Impact of Formal Methods in Biology and Medicine
Fig. 8. Recurrent copy number amplification of BRD4 gene was observed across common cancers. Recurrent copy number amplification of BRD4 gene was observed.
A Targeted High-Throughput Next-Generation Sequencing Panel for Clinical Screening of Mutations, Gene Amplifications, and Fusions in Solid Tumors  Rajyalakshmi.
Strategy Description Discovery Validation Application
The Genomics of Cancer and Molecular Testing:
Correlation Between Gene Expression and Prognostic Biomarkers in Small Cell Bladder Cancer (SCBC) Vadim S Koshkin1, Andrew Dhawan1, Ming Hu1, Jordan Reynolds1,
Comprehensive Characterization of Oncogenic Drivers in Asian Lung Adenocarcinoma  Shiyong Li, BS, Yoon-La Choi, MD, PhD, Zhuolin Gong, PhD, Xiao Liu, PhD,
Covering the Cover Gastroenterology
Volume 145, Issue 3, Pages (June 2017)
Volume 152, Issue 1, Pages e4 (January 2017)
Volume 31, Issue 2, Pages (February 2017)
Cyclin E1 Is Amplified and Overexpressed in Osteosarcoma
Personalized Medicine: Patient-Predictive Panel Power
Volume 24, Issue 8, Pages (August 2018)
Altered Caspase-8 Expression
LATS2-associated gene expression pattern is down-regulated specifically in lumB breast tumors. LATS2-associated gene expression pattern is down-regulated.
Knowledge-Guided Sample Clustering
Cancer Cell Line Encyclopedia
Figure 1. Identification of three tumour molecular subtypes in CIT and TCGA cohorts. We used CIT multi-omics data ( Figure 1. Identification of.
Stephen Bridgett, James Campbell, Christopher J. Lord, Colm J. Ryan 
PD-L1 expression correlates with T-cell markers and an IFN response signature in human melanomas. PD-L1 expression correlates with T-cell markers and an.
MYC expression is correlated with dasatinib sensitivity in cancer cell lines and in vivo. MYC expression is correlated with dasatinib sensitivity in cancer.
Global analysis of the chemical–genetic interaction map.
Asociación entre el ARNm de PD1 y la respuesta a tratamiento con anti-PD1 en monoterapia en múltiples tipos de cánceres Tomás Pascual*, Laia Paré*, Elia.
Presentation transcript:

CELL INDEX DATABASE (CELLX): A WEB TOOL FOR CANCER PRECISION MEDICINE Pacific Symposium on Biocomputing (PSB) 2015 January 4-8, 2015 The Big Island of Hawaii Keith Ching Senior Principal Scientist, Computational Biology Pfizer, Oncology Research Unit, San Diego, CA

What is CELLX? Web interface to a database of molecular profiling data Cell Lines ( CCLE, Broad, Sanger, GSK, Pfizer ) TCGA – The Cancer Genome Atlas Published studies ( GSE from NCBI GEO ) GTEx - Genotype-Tissue Expression project Custom data ( internal studies ) Datatypes Microarray expression RNA-Seq expression (RSEM) mutation (COSMIC, TCGA, CCLE) Copy Number Variation (CNV) Compound activity (limited) Protein array, RPPA (limited) Meta data, annotations. Pfizer Confidential │ 2

Architecture Demo: Open source YouTube tutorials : Pfizer Confidential │ 3 mysql Apache/ Tomcat Rserve Amazon Web Services minimum requirements: t2.micro vm, 1GB RAM, 1 CPU 150 GB disk space Perl Java

Why CELLX ? For each analysis, half the time is spent on data collection and formatting. –getting most recent dataset. –matching identifiers, merging datatypes Analyses developed to answer a specific question are abstracted and generalized. As new data is generated, the same analysis will be repeated over and over. Pfizer Confidential │ 4

Generalized query For target gene X: –what kinds of alterations mutation, fusion, amplification, deletion, over/under expression –where are alterations found cell lines, primary samples, PDX models –what gene alterations associate (or not) with gene X alterations KRAS mutation, ALK fusion, CCND1 amplification, PD1 expression –what sample characteristics associate with gene X alterations tissue type, subtype, compound sensitivity For target genes W, X, Y, Z –which tumor types have W and X alterations but not Y or Z. Pfizer Confidential │ 5

Precision Medicine Support pre-clinical and translational programs for late-stage targeted oncology agents. ( small molecules or antibodies ) –cell line or Patient Derived Xenograft (PDX) selection mutation status, CNV amp or del, high/low expression –cell line / PDX correlates with agent activity. tissue type, mutation, CNV, expression, meta data –understanding the size / frequency of potential responder indications presence / absence of biomarkers one or more constraints ( tissue type, subtype, subgroup, viral status) –hypothesis testing confirming literature reports, investigator results in public datasets. –easy data access, merging for custom analyses adding custom analyses as new queries Pfizer Confidential │ 6

Expression Pfizer Confidential │ 7

CNV Pfizer Confidential │ 8

Exp vs CNV Pfizer Confidential │ 9

Matrix Pfizer Confidential │ 10

Pfizer Confidential │ 11

Expression / mutation Pfizer Confidential │ 12

Breast Cell line panel screening – CDK4i IC50 values Palbociclib* Gene expression Sens vs. Resist CNV / mutations RB1 *Finn RS, et.al Breast Cancer Res. 2009;11(5):R77. doi: /bcr2419. CCNE1

Metadata test vs. expression of RB1

Meta association with EGFR expression Pfizer Confidential │ 15

RB1, CDKN2A, CCND1 in TCGA breast Pfizer Confidential │ 16

Cutoffs Pfizer Confidential │ 17

Across all TCGA Pfizer Confidential │ 18

Genes correlated with RB1 expression Pfizer Confidential │ 19 TCGA-BRCA-RSEM

Pfizer Confidential │ 20 GLI1

Pfizer Confidential │ 21 Data: George Kan Classes: Kai Wang ACRG HCC – ACVRL1 correlation

Multiple correlations across TCGA (PD-L1) runs correlation across 32 TCGA datasets summary table of number of times a gene appears zip file of each correlation table Pfizer Confidential │ 22 top 100 genes per dataset top 1000 genes per dataset

CD274 / JAK2 / PDCD1LG2 same locus 9p24 Pfizer Confidential │ 23 ACRG157T IGV

Survival Genomewide rank of gene expression and survival. Pfizer Confidential │ 24 TCGA-HNSC

Acknowledgments Paul Rejto : Exec Dir Precision Med CompBio Kai Wang – ACRG subclasses Zhengyan (George) Kan – ACRG data Julio Fernandez – CCLE data Wenyan Zhong – requirements, exp, cnv, mutation correlations Jarek Kostrowicki – R optimization Tao Xi – Tumor vs. normal plots Zhou Zhu – METABRIC data Pfizer Confidential │ 25 Requirements Oncology Business Unit Jean-François Martini : Sr. Dir Biomarker reports, venn, freq Maria Koehler : VP multiple datatype scatter plot Integrative Biology and Biochem Kim Arndt : VP IBB biomarker frequencies by subtype