The PedcBioPortal & DiseaseXpress

Slides:



Advertisements
Similar presentations
Data Visualization in Molecular Biology Alexander Lex July 29, 2013.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
TCGA(The cancer genome atlas) catalogue genetic mutations responsible for cancer, using genome sequencing and bioinformatics The TCGA is sequencing the.
Overview of Biomedical Informatics Rakesh Nagarajan.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Evaluating cell lines as tumor models by comparison of genomic profiles Domcke, S. et al. Nat. Commun 4:2126.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Call in: Participant Passcode: Centra: Meeting ID: ICR_meetinghttp://ncicb.centra.com April 1, 2009 caArray.
CceHUB A Knowledge Discovery Environment for Cancer Care Engineering Research Ann Christine Catlin HUBzero Workshop November 7, 2008.
Radiogenomics in glioblastoma multiforme
Sage Bionetworks Mission Sage Bionetworks is a non-profit organization with a vision to create a “commons” where integrative bionetworks are evolved by.
SP Transcription Factor network of cell migration Summary Hypothesis: We hypothesize that, by generating a network model for transcription factor regulation.
Data Analysis Summary. Elephant in the room General Comments General understanding that informatics is integral in medical sequencing and other –omics.
1 Jack London, PhD Research Professor, Cancer Biology Thomas Jefferson University Informatics Shared Resource Director Sidney Kimmel Cancer Center at Jefferson.
TCGA The Cancer Genome Atlas Project January 24, 2008.
Sage Bionetworks A non-profit organization with a vision to enable networked team approaches to building better models of disease BIOMEDICINE INFORMATION.
Sage Bionetworks A non-profit organization with a vision to enable networked team approaches to building better models of disease BIOMEDICINE INFORMATION.
COMPUTATIONAL ANALYSIS OF MULTILEVEL OMICS DATA FOR THE ELUCIDATION OF MOLECULAR MECHANISMS OF CANCER Presented by Azeez Ayomide Fatai Supervisor: Junaid.
Using Predictive Classifiers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
Using Frequent Pattern Mining to Find Co-mutated Genes in Breast Cancer Zachary Stanfield 4/7/2015.
Center for Causal Discovery (CCD) of Biomedical Knowledge from Big Data University of Pittsburgh Carnegie Mellon University Pittsburgh Supercomputing Center.
Introduction to caIntegrator caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Introduction to Oncomine Xiayu Stacy Huang. Oncomine is a cancer-specific microarray database and has a web-based data-mining platform aimed at facilitating.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Multi-scale network biology model & the model library 多尺度网络生物学模型 -- 兼论模型库的建立与应用 Jianghui Xiong 熊江辉
Pichai Raman on behalf of cBioPortal Team Wednesday, May 25, 16
Towards a unified MOD resource: An Overview
David Amar, Tom Hait, and Ron Shamir
To develop the scientific evidence base that will lessen the burden of cancer in the United States and around the world. NCI Mission Key message:
Recurrent copy number alterations in prostate cancer: an in silico meta-analysis of publicly available genomic data  Julia L. Williams, Peter A. Greer,
014 - Evaluation of Immunosignature Profile in Medulloblastoma
Sungkyunkwan University, School of Medicine.
NCT: Gaining Medical Insights and Enhancing Care for Cancer Patients with SAP HANA® Organization National Center for Tumor Diseases (NCT) Heidelberg, part.
Solutions to Clinical Data Visualization and Analysis
An Artificial Intelligence Approach to Precision Oncology
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
SI-II A SI-II. Expression analysis of bladder cancer functionally active genes and significantly mutated genes. Comprehensive transcriptome profiling of.
Data challenges in the pharmaceutical industry
Sarah Leary, MD MS CBTTC 5/25/2016
ATOM Accelerating Therapeutics for Opportunities in Medicine
Multiple Myeloma Research Foundation
NCI’s Genomics Data Commons (GDC) & NCI Cloud Pilots
Dept of Biomedical Informatics University of Pittsburgh
Knowledge l Action l Impact
The cBio Cancer Genomics Portal.
Strategy Description Discovery Validation Application
Using Spotfire for Proteomic Analysis
Preclinical Accelerators of Precision Medicine Todd R. Golub, M.D.
The Genetic Basis for Cancer Treatment Decisions
Recurrence-Associated Long Non-coding RNA Signature for Determining the Risk of Recurrence in Patients with Colon Cancer  Meng Zhou, Long Hu, Zicheng.
Florian T. Merkle, Kevin Eggan  Cell Stem Cell 
Volume 58, Issue 4, Pages (May 2015)
100,000 Genomes Project & mainstreaming genomic medicine
How Can Precision Medicine in Oncology Refine Best Treatment Approach?
A New Approach to Clinical Trials
Tracing the Innate Genetic Evolution and Spatial Heterogeneity in Treatment Naïve Lung Cancer Lesions Jihye Kim Translational Bioinformatics and Cancer.
Altered Caspase-8 Expression
Volume 26, Issue 12, Pages e5 (March 2019)
Landscape of genomic alterations in 444 tumors from 429 patients with mCRPC. Landscape of genomic alterations in 444 tumors from 429 patients with mCRPC.
Knowledge-Guided Sample Clustering
Figure 1. Identification of three tumour molecular subtypes in CIT and TCGA cohorts. We used CIT multi-omics data ( Figure 1. Identification of.
Stephen Bridgett, James Campbell, Christopher J. Lord, Colm J. Ryan 
Medical Informatics and Explainable AI
Targetable alterations and pathways in TNBCs after NAC
One potential implementation for the use of outlier kinase profiling and targeting for clinical management of pancreatic cancer in a precision medicine.
Molecular characterization of esophagogastric tumors.
Presentation transcript:

The PedcBioPortal & DiseaseXpress CBTTC Investigator Meeting Pichai Raman on behalf of D3B Monday, May 8, 17

Downstream Analytics and Visualization Focus on RNA-Seq with common standard of processing & normalization DiseaseXpress Raw Data SBG Cavatica PedcBioPortal Processed Data Takes all data types with a focus on integrative analysis & summary

Downstream Analytics and Visualization PedcBioPortal DiseaseXpress Original cBioPortal developed at MSKCC for TCGA data and other large-scale cancer profiling efforts Lowers barrier to access and visualize complex genomic data for research Hypothesis driven analysis cBioPortal Development now shared across 5 teams : DFCI, MSKCC, Princess Margaret, and CHOP, the Hyve PedcBioPortal : CHOP Implementation has a focus on Pediatric cancer data sets Data Mining D3B Powered application that has greater than 20K samples all processed the same way Leverages work performed by UCSD and the broad and builds upon it by providing web-api access and additional data sets Accessible by R CURL DEXTER Cohort analysis and summary Machine Learning Pan-cancer Analysis Sample level views What genes are most often mutated in my cohort? And what is the association of that mutation with survival? What are some potential CAR therapy targets for Group 3 Medulloblastomas? Are these targets expressed in adult tumors as well? What are the known actionable genetic lesions for this sample? Are there gene expression signatures that are predictive of survival or relapse in brain cancers? What are the hotspot mutations or domains most commonly mutated on this gene?

PedcBio - Key Functionality & Features PedcBioPortal+ has a number of visualizations based on one of three entry points Study View Display of frequent / recurrent mutations or lesions within a study When creating virtual co-horts of molecular subtypes will be able to quickly identify “potental” drivers Patient View Get an overview of all of a patients genetic lesions, connections to Path Reports, clinical trials, drugs, etc.. Has COSMIC data as well as internal statistics to aid in determining if a mutation is likely causal Gene View Look at gene data (mutation / expression etc..) across or within study Correlate genes to other genes within a study or compare to normal tissue expression Can be used to identify targets for immunotherapy

PedcBio - Visualizations & Analytics - Gene Determine mutational hotspots on a gene by looking at mutational landscape across the gene Compare copy number vs mRNA or Risk/Stage vs mRNA CNA vs mRNA Mutation Mapper Single Gene View Look at relationships between genes and identify other potentially interesting candidates by association analysis Determine if genes are mutually exclusive or recurrently associated. Find patients without canonical lesions. OncoPrints Network Multi-Gene Views Survival Analysis

PedcBio - Visualizations & Analytics - Patient Global view of tumor sample lesions & CNA PedcBio - Visualizations & Analytics - Patient Relevant mutations Copy Number Changes

PedcBio - Visualizations & Analytics – Study Summary Survival Analysis Relevant mutations Cohort Characteristics

PedcBio - Current Development & Connectivity Virtual Cohort Creation Biorepository Integration Synthetic or virtual cohorts can serve to bridge together data form different studies to focus on subsets of patients or samples. Examples Include Patients with rare cancers & diseases Patients belonging to specific population or demographic Cases originating from a particular locale Users can move seemlessly between visualization on the portal and samples in the biorepository applications Connecting to other applications on the cloud currently

Downstream Analytics and Visualization PedcBioPortal DiseaseXpress Original cBioPortal developed at MSKCC for TCGA data and other large-scale cancer profiling efforts Lowers barrier to access and visualize complex genomic data for research cBioPortal Development now shared across 5 teams : DFCI, MSKCC, Princess Margaret, and CHOP, the Hyve PedcBioPortal : CHOP Implementation has a focus on Pediatric cancer data sets D3B Powered application that has greater than 20K samples all processed the same way Leverages work performed by UCSD and the broad and builds upon it by providing web-api access and additional data sets Accessible by R CURL DEXTER

DiseaseXpress - Overview

DiseaseXpress – Boxplots by Study / Disease Collapse a study to represent one box Set the collapsed study or other disease as reference box

DiseaseXpress – Scatterplot of 2 Genes Disease specific correlation estimates And P-values

DiseaseXpress – Current Development & Connectivity Cancer Pathway Analysis (single sample & cohort) Addition of all available public RNA-Seq datasets (GEO, ArrayExpress) Increasing speed of access Enhancing annotation to include tags Tie to variant, fusion, and copy number warehouses

Acknowledgements Karthik Kalletla / Anthony Cros : Building DiseaseXpress datastore and engine Komal Rathi : Developing DEXTER Shiny Application DAMEU Bo Zhang Allison Heath Yuankun Zhu D3B Team Members

BACKUPS

Discovery Data Cavatica Mission Pipelines Hypotheses Visualizations To develop a sustainable application eco-system that supports many of the aspects associated with basic & translational research. To be the premier portal for pediatric disease research and serve as a central hub to promote collaborative research between investigators to support sharing and creation of … Pipelines Hypotheses Data Visualizations Algorithms

Services Applications Users Cavatica Eco-System Services Applications Users Bio Repository [ HARVEST, …] Physical Specimens Raw Data Storage & Processing [SBG, …] FASTQ / BAM Files Processed Data Informatics [GitHub LFS, …] Matrices, Data Frames Data Visualizations & Analytics [cBioPortal & DiseaseXpress] Graphs, tables, summaries Data Scientists Data Tracker Investigators / Post-Doc User Management System Data Engineers