Integrative Analysis of Pathology, Radiology and High Throughput Molecular Data Joel Saltz MD, PhD Director Center for Comprehensive Informatics.


Objectives
–Reproducible anatomic/functional characterization at gross level (Radiology) and fine level (Pathology)
–Integration of anatomic/functional characterization with multiple types of “omic” information
–Create categories of jointly classified data to describe pathophysiology, predict prognosis, response to treatment
–Data modeling standards, semantics
–Data research issues

In Silico Program Objectives (from NCI)
In silico is an expression used to mean "performed on computer or via computer simulation." (Wikipedia)
In silico science centers: support investigator-initiated, hypothesis-driven research in the etiology, treatment, and prevention of cancer using in silico methods
–Generating and publishing novel cancer research findings leveraging caBIG tools and infrastructure
–Identifying novel bioinformatics processes and tools to exploit existing data resources
Encouraging the development of additional data resources and caBIG analytic services
Assessing the capabilities of current caBIG tools
Emory, Columbia, Georgetown, Fred Hutchinson Cancer, Translational Genomics Research Institute

In Silico Center for Brain Tumor Research
Specific Aims:
1. Influence of necrosis/hypoxia on gene expression and genetic classification.
2. Molecular correlates of high resolution nuclear morphometry.
3. Gene expression profiles that predict glioma progression.
4. Molecular correlates of MRI enhancement patterns.

Integrative Analysis: Tumor Microenvironment
–Structural and functional differentiation within tumor
–Molecular pathways are time and space dependent
–“Field effects” – gradient of genetic, epigenetic changes
–Radiology, microscopy, high throughput genetic, genomic, epigenetic studies, flow cytometry, microCT, nanotechnologies …
–Create biomarkers to understand disease progression, response to treatment
Tumors are organs consisting of many interdependent cell types
From John E. Niederhuber, M.D., Director, National Cancer Institute, NIH; presented at Integrating and Leveraging the Physical Sciences to Open a New Frontier in Oncology, Feb 2008

Informatics Requirements
Parallel initiatives: Pathology, Radiology, “omics”
Exploit synergies between all initiatives to improve ability to forecast survival & response.
[Diagram labels: Radiology Imaging, Pathologic Features, “Omic” Data, Patient Outcome]

In Silico Center for Brain Tumor Research
Key Data Sets
–REMBRANDT: Gene expression and genomics data set of all glioma subtypes
–The Cancer Genome Atlas (TCGA): Rich “omics” set of GBM, digitized Pathology and Radiology
–Vasari Feature Set: Standardized annotation of gliomas of all subtypes

TCGA Research Network Digital Pathology Neuroimaging

Distinguishing Among the Gliomas “There are also many cells which appear to be transitions between gigantic oligodendroglia and astrocytes. It is impossible to classify them as belonging in either group” Bailey P, Bucy PC. Oligodendrogliomas of the brain. J Pathol Bacteriol 1929: 32:735

Nuclear Qualities: Oligodendroglioma, Astrocytoma

Progression to GBM Anaplastic Astrocytoma (WHO grade III) Glioblastoma (WHO grade IV)

TCGA Neuropathology Attributes
120 TCGA specimens; 3 Reviewers
Presence and Degree of:
Microvascular hyperplasia
–Complex/glomeruloid
–Endothelial hyperplasia
Necrosis
–Pseudopalisading pattern
–Zonal necrosis
Inflammation
–Macrophages/histiocytes
–Lymphocytes
–Neutrophils
Differentiation:
–Small cell component
–Gemistocytes
–Oligodendroglial
–Multi-nucleated/giant cells
–Epithelial metaplasia
–Mesenchymal metaplasia
Other Features
–Perineuronal/perivascular satellitosis
–Entrapped gray or white matter
–Micro-mineralization

Feature Extraction TCGA Whole Slide Images Jun Kong

Astrocytoma vs Oligodendroglioma
Overlap in genetics, gene expression, histology
Assess nuclear size (area and perimeter), shape (eccentricity, circularity, major axis, minor axis, Fourier shape descriptor and extent ratio), intensity (average, maximum, minimum, standard error) and texture (entropy, energy, skewness and kurtosis).
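
A few of the size and shape features named on this slide can be sketched in plain Python. This is only an illustration, not the pipeline used in the talk: the function name `nuclear_shape_features`, the binary-mask input, the edge-counting perimeter and the moment-based eccentricity are all assumptions chosen for brevity.

```python
import math


def nuclear_shape_features(mask):
    """Compute simple size/shape features for one segmented nucleus.

    `mask` is a hypothetical input format: a list of rows, where truthy
    entries mark pixels that belong to the nucleus.
    """
    pixels = [(r, c) for r, row in enumerate(mask)
              for c, v in enumerate(row) if v]
    if not pixels:
        raise ValueError("mask contains no foreground pixels")
    area = len(pixels)
    foreground = set(pixels)
    # Perimeter estimate: count pixel edges that face the background.
    perimeter = sum(
        1
        for r, c in pixels
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1))
        if (r + dr, c + dc) not in foreground
    )
    rows = [r for r, _ in pixels]
    cols = [c for _, c in pixels]
    # Extent ratio: object area divided by bounding-box area.
    box_area = (max(rows) - min(rows) + 1) * (max(cols) - min(cols) + 1)
    extent = area / box_area
    # Eccentricity from the eigenvalues of the pixel covariance matrix.
    mr, mc = sum(rows) / area, sum(cols) / area
    s_rr = sum((r - mr) ** 2 for r in rows) / area
    s_cc = sum((c - mc) ** 2 for c in cols) / area
    s_rc = sum((r - mr) * (c - mc) for r, c in pixels) / area
    half_trace = (s_rr + s_cc) / 2
    root = math.sqrt(((s_rr - s_cc) / 2) ** 2 + s_rc ** 2)
    lam_max, lam_min = half_trace + root, half_trace - root
    if lam_max > 0:
        eccentricity = math.sqrt(max(0.0, 1 - lam_min / lam_max))
    else:
        eccentricity = 0.0
    return {
        "area": area,
        "perimeter": perimeter,
        "circularity": 4 * math.pi * area / perimeter ** 2,
        "eccentricity": eccentricity,
        "extent": extent,
    }
```

With this edge-counting perimeter a digital square scores a circularity of π/4 rather than 1, so values are only comparable between nuclei measured the same way.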

Nuclear Qualities Which features carry most prognostic significance? Which features correlate with genetic alterations?

Machine-based Classification of TCGA GBMs (J Kong)
–Whole slide scans from 14 TCGA GBMs (69 slides)
–7 purely astrocytic in morphology; 7 with 2+ oligo component
–399,233 nuclei analyzed for astro/oligo features
–Cases were categorized based on ratio of oligo/astro cells
TCGA Gene Expression Query: c-Met overexpression

Nuclear Feature Analysis: TCGA
Using the parallel computation infrastructure of Sun Grid Engine, we analyzed 4096x4096 image tiles from 213 whole-slide TCGA images of permanent tissue sections.
Approximately 90 million nuclei segmented.
79 patients: 57 are diagnosed as GBM (‘oligo 0’), 17 are classified as GBM with ‘oligo 1’, and 5 as GBM with ‘oligo 2+’.
With each data file including all nuclear features from one patient, all nuclei were classified, with the colors blue, green, and red representing nuclei scored as 1~3, 4~6, and 7~10, respectively.
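
The tiling and color coding described on this slide can be sketched roughly as follows. The helper names `tile_origins` and `score_color` are hypothetical, and the sketch assumes integer scores from 1 to 10; how the actual jobs were distributed over Sun Grid Engine is not shown in the slides.

```python
def tile_origins(width, height, tile=4096):
    """Top-left corners of the tiles covering a width x height slide.

    Tiles on the right and bottom edges may extend past the image and
    would need cropping or padding by the caller.
    """
    return [(x, y)
            for y in range(0, height, tile)
            for x in range(0, width, tile)]


def score_color(score):
    """Map a nuclear score (1-10) to the display color used on the slide."""
    if not 1 <= score <= 10:
        raise ValueError("score must be between 1 and 10")
    if score <= 3:
        return "blue"
    if score <= 6:
        return "green"
    return "red"
```

Each tile origin is an independent unit of work, which is what makes this kind of analysis easy to spread over a batch scheduler.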

Nuclear Feature Analysis: TCGA
The ratio of nuclei classified ≤5 to >5 was computed for each of the 79 patients.
Ratios associated with the ‘oligo 0’ and ‘oligo 2+’ patient populations were compared with a two-sample t-test (p=0.0145).
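
The per-patient ratio and the group comparison can be sketched with the standard library alone. The slide does not say which form of the two-sample t-test was used; the Welch (unequal variance) form below is an assumption, and the function names are hypothetical. Only the test statistic and degrees of freedom are computed; turning them into a p-value needs a t-distribution CDF from a statistics package.

```python
import math
from statistics import mean, variance


def low_high_ratio(scores, threshold=5):
    """Ratio of nuclei scored <= threshold to nuclei scored > threshold."""
    low = sum(1 for s in scores if s <= threshold)
    high = len(scores) - low
    if high == 0:
        raise ValueError("no nuclei scored above the threshold")
    return low / high


def welch_t(a, b):
    """Welch two-sample t statistic and approximate degrees of freedom."""
    va = variance(a) / len(a)
    vb = variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    dof = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, dof
```

In use, `low_high_ratio` would be applied to each patient's nuclei, and `welch_t` to the resulting lists of ratios for the two patient groups.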

Discriminating Features (Grade 1 vs. Grade 7-10)

Determine the influence of tumor microenvironment on gene expression profiling and genetic classification using TCGA data Necrosis = Severe Hypoxia Microvascular Hyperplasia GBM

3 Gene Families are Altered in GBM: RTK, p53 and RB

GBM: necrosis, hypoxia, angiogenesis and gene expression
Does the presence or degree of necrosis within digitized frozen section slides correlate with specific gene expression patterns or determine algorithm-based unsupervised clustering of GBM gene expression categories?
Does the presence or degree of necrosis influence the type of angiogenesis or pro-angiogenic gene expression patterns within human gliomas?

GBM: % Necrosis

TCGA: GBM Frozen Sections 179 cases were assessed for % necrosis on frozen section slides for TCGA quality assurance. Cox-based regression analysis for % necrosis vs. gene expression (795 probe sets; 647 distinct genes)

Network Analysis based on % Necrosis Carlos Moreno David Gutman

Aim 4. Identify correlates of MRI enhancement patterns in astrocytic neoplasms with underlying vascular changes and gene expression profiles. No enhancement Normal Vessels Stable lesion ? Rim-enhancement Vascular Changes Rapid progression

Angiogenesis Segmentation
[Figure: angiogenic segmentation pipeline. Stages shown: H&E image, color deconvolution (hematoxylin image, eosin image), eosin intensity image, spatial normalization, density calculation, density image, boundary smoothing, object ID, segmented vessels]
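
The color deconvolution step named in this pipeline can be illustrated for a single pixel. This is a sketch under stated assumptions, not the code behind the figure: the stain vectors are the commonly quoted Ruifrok and Johnston values for hematoxylin, eosin and a residual (DAB) channel, the background intensity is assumed to be 255, and `unmix_pixel` is a hypothetical name.

```python
import math

# Assumed stain vectors (rows): optical density per unit of stain in R, G, B.
STAINS = (
    (0.65, 0.70, 0.29),  # hematoxylin
    (0.07, 0.99, 0.11),  # eosin
    (0.27, 0.57, 0.78),  # residual channel
)


def _inverse_3x3(m):
    """Invert a 3x3 matrix via its adjugate."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    return (
        ((e * i - f * h) / det, (c * h - b * i) / det, (b * f - c * e) / det),
        ((f * g - d * i) / det, (a * i - c * g) / det, (c * d - a * f) / det),
        ((d * h - e * g) / det, (b * g - a * h) / det, (a * e - b * d) / det),
    )


_INV = _inverse_3x3(STAINS)


def unmix_pixel(rgb, background=255.0):
    """Return (hematoxylin, eosin, residual) amounts for one RGB pixel."""
    # Beer-Lambert: optical density is -log10 of transmitted intensity.
    od = [-math.log10(max(v, 1.0) / background) for v in rgb]
    # Solve od = amounts x STAINS for the stain amounts.
    return tuple(sum(od[k] * _INV[k][j] for k in range(3)) for j in range(3))
```

Applying `unmix_pixel` to every pixel and keeping the second component would give an eosin intensity image of the kind the figure lists as an intermediate.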

States of Angiogenesis: Endothelial Hypertrophy, Complex Microvascular Hyperplasia, Endothelial Hyperplasia
Lee Cooper, Sharath Cholleti

Vessel Characterization Bifurcation detection

Vasari Imaging Criteria (Adam Flanders, TJU; Dan Rubin, Stanford; Lori Dodd, NCI)
Require standardized validated feature sets to describe de novo disease.
Fundamental obstacle to new imaging criteria as treatment biomarkers is lack of standard terminology:
–To define a comprehensive set of imaging features of cancer
–For reporting imaging results
–To provide a more quantitative, reproducible basis for assessing baseline disease and treatment response

Defining Rich Set of Qualitative and Quantitative Image Biomarkers
Community-driven ontology development project; collaboration with ASNR
Imaging features (5 categories)
–Location of lesion
–Morphology of lesion margin (definition, thickness, enhancement, diffusion)
–Morphology of lesion substance (enhancement, PS characteristics, focality/multicentricity, necrosis, cysts, midline invasion, cortical involvement, T1/FLAIR ratio)
–Alterations in vicinity of lesion (edema, edema crossing midline, hemorrhage, pial invasion, ependymal invasion, satellites, deep WM invasion, calvarial remodeling)
–Resection features (extent of nCE tissue, CE tissue, resected components)

Results: Reader Agreement
High inter-observer agreement among the three readers
–(kappa = 0.68, p<0.001)
Percentage agreement was also high for most features individually
–22 of 30 features (73%) had agreement greater than 50%
–Twelve features (40%) had >80% agreement
–No feature had less than 20% agreement
Feature agreement rose substantially when used with tolerance (+/- 1).
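
Both kinds of agreement reported here can be sketched in a few lines. The slide does not state which kappa statistic was used; Fleiss' kappa, a common choice for more than two readers, is assumed below. The function names and the input layout (one row per case, one rating per reader) are hypothetical.

```python
from collections import Counter


def fleiss_kappa(ratings):
    """Fleiss' kappa for rows of categorical ratings, one row per case.

    Every row must contain the same number of ratings (one per reader).
    """
    n_cases = len(ratings)
    n_raters = len(ratings[0])
    category_totals = Counter()
    agreement_sum = 0.0
    for row in ratings:
        counts = Counter(row)
        category_totals.update(counts)
        # Fraction of reader pairs that agree on this case.
        agreement_sum += (
            sum(c * c for c in counts.values()) - n_raters
        ) / (n_raters * (n_raters - 1))
    observed = agreement_sum / n_cases
    expected = sum(
        (total / (n_cases * n_raters)) ** 2
        for total in category_totals.values()
    )
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one category is used")
    return (observed - expected) / (1 - expected)


def percent_agreement(ratings, tolerance=0):
    """Share of cases where all ordinal ratings lie within `tolerance`."""
    agreeing = sum(1 for row in ratings if max(row) - min(row) <= tolerance)
    return agreeing / len(ratings)
```

Calling `percent_agreement(ratings, tolerance=1)` corresponds to the +/- 1 tolerance mentioned on the slide and only makes sense for features scored on an ordinal scale.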

Preliminary Relationships of Features to Survival
Cox proportional hazards models were fit to each of the thirty features related to overall survival.
Features associated with lower survival included (p<.0001):
–Proportion of enhancing tissue at baseline.
–Thick or nodular enhancement characteristics.
–Contralateral hemisphere invasion.
Proportion of non contrast enhancing tumor (nCET) had positive correlation with survival.
Tumor size at baseline had no relationship to survival.

Coupling in silico methodology with a clinical trial: Will treatment work, and if not, why not?
Example: Avastin and Glioblastoma as in RTOG-0825 (plus institutional accrual)
Treatment: Radiation therapy and Avastin (anti-angiogenesis)
Predict and Explain: Genetic, gene expression, microRNA, Pathology, Imaging
Imaging/RT reproducibility, Integration with EMR, PACS, RT systems

Crucial to Leverage Institutional Data: Ohio State Information Warehouse Infrastructure (Overview)
[Diagram: data integration across Acquisition/Transfer, Information Warehouse and User Access. Labels shown: CPOE, OR system, Patient Mgmt, Dictated reports, Pathology reports, Daily ADT, Lab, Respiratory, Blood, Endoscopy, Cardiology, Siemens Img, Real time, Patient Billing, Practice Plans, Pt Satisfaction, Weekly, Monthly, Cancer Genetics, Wound Images, Tissue, Pulmonary, Genomic Data, Web, Multi-Dimensional Analysis & Data Mining, Ad-hoc Query, Error Report, Benchmarking, De-Identification, Honest Broker, Wound Center, Research, Scorecards & Dashboards, Image Analysis, Text Mining, NLP, Business, Clinical, External, Meta Data]

Annotations and Imaging Markup (AIM)
Annotations and Imaging Markup Developer (AIM) provides a standard for medical image annotation and markup for images used in the research space, and in particular, the image based cancer clinical trial.
It is notable that there is no existing standard for radiology annotation and markup. The caBIG® program is working with almost every standards body such as DICOM to elicit consensus regarding use of AIM as the accepted standard for radiology annotation and markup, and is positioned to extend AIM to digital pathology.
Example: The pixel at the tip of the arrow [coordinates (x,y)] in this image [DICOM: ] represents the Ascending Thoracic Aorta [SNOMED:A ]
AIM Data Service Emory (AIME) is a caGrid data service that manages AIM documents in XML databases. AIME supports query, enumerationQuery, queryByTransfer, submit and submitByTransfer methods.

MicroAIM
–Provide a semantically enabled data model for pathology and microscopy image markups and observations
–Goal: interoperable data exchange and knowledge sharing
–Ongoing collaborative efforts to standardize via caBIG and Association for Pathology Informatics
–microAIM data service provides comprehensive query support, and can be efficiently implemented either through an XML-based or a relational-based approach.

Pathology AIM; Semantic Annotation and Spatial Reasoning
–Observation on a single or multiple objects
–Support multiple objects and associate different observations to each object
–Calculation on a single or multiple objects
–Class of type to represent mask or field value
–Represent provenance information for computed markup and annotation
–Identify all instances of a set of cell types
–Complex spatial queries: Quantify areas of glomeruloid microvascular proliferation within X microns of pseudopalisading necrosis
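
The complex spatial query in the last item can be illustrated with a deliberately simplified sketch. It is not the Pathology AIM data model or query interface: regions are assumed to be plain dictionaries with hypothetical `centroid_um` and `area_um2` keys, and distance is measured centroid to centroid instead of boundary to boundary.

```python
import math


def area_near(targets, anchors, max_distance_um):
    """Total area of target regions within a distance of any anchor region.

    For the query on the slide, `targets` would be regions annotated as
    glomeruloid microvascular proliferation and `anchors` regions of
    pseudopalisading necrosis.
    """
    total = 0.0
    for target in targets:
        tx, ty = target["centroid_um"]
        near = any(
            math.hypot(tx - ax, ty - ay) <= max_distance_um
            for ax, ay in (anchor["centroid_um"] for anchor in anchors)
        )
        if near:
            total += target["area_um2"]
    return total
```

A production system would push this into a spatial index or database query, since comparing every pair of regions does not scale to whole-slide annotation sets.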

Use of caBIG Tools in Emory In Silico Brain Tumor Research Center
[Diagram labels: Osirix, XIP/AVT, Clear Canvas, NBIA, AIME, TCGA Portal, Modified Workstation, Extended AIME Service, Enterprise Architecture 2.0, caB2B, caIntegrator 2; CBIIT Research Team (Carl Schaefer, Jinghui Zhang); TBPT, ICR, VCDE/ARCH; Science; CIP: Research & Clinical Trials; In vivo Imaging]

Data Science Research Challenges Driven by In Silico Discovery Research
–Data integration that targets multiple data sources with conflicting metadata and conflicting data
–Efficient methods for semantic query that targets questions involving complex multi-scale features associated with petascale and exascale ensembles of highly annotated images
–Computer assisted annotation and markup for very large datasets
–Systems to support combinations of structured and irregular accesses to exascale datasets

Data Science Research Challenges
–Structural and semantic metadata management: how to manage tradeoff between flexibility and curation
–Data and semantic modeling infrastructures and policies able to scale to handle distributed systems with an aggregate of 10^9 or more data models/concepts
–Three dimensional (time dependent) reconstruction, feature detection and annotation of 3-D microscopy imagery
–Workflow infrastructure for large scale data intensive computations

Final Data Science Challenge: Large Dataset Size
–Basic small mouse is 10 cm^3
–1 μ resolution – very roughly bytes/mouse
–Molecular data (spatial location) multiply by 10^2
–Vary genetic composition, environmental manipulation, systematic mechanisms for varying genetic expression; multiply by 10^3
Total: bytes per big science animal experiment
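
The scaling argument on this slide is simple arithmetic and can be written out. The byte counts on the slide did not survive extraction, so this sketch does not restate them: the bytes stored per voxel is left as an explicit parameter, the function names are hypothetical, and any total depends entirely on the value chosen for that parameter.

```python
UM_PER_CM = 10_000  # 1 cm = 10^4 micrometres


def voxel_count(volume_cm3, resolution_um):
    """Number of cubic voxels of side `resolution_um` in `volume_cm3`."""
    return volume_cm3 * UM_PER_CM ** 3 / resolution_um ** 3


def experiment_bytes(volume_cm3, resolution_um, bytes_per_voxel,
                     molecular_factor=10 ** 2, variation_factor=10 ** 3):
    """Rough storage estimate following the multipliers on the slide.

    `bytes_per_voxel` is an assumption supplied by the caller; the two
    factors are the 10^2 (molecular data) and 10^3 (genetic and
    environmental variation) multipliers from the slide.
    """
    base = voxel_count(volume_cm3, resolution_um) * bytes_per_voxel
    return base * molecular_factor * variation_factor
```

A 10 cm^3 volume at 1 μm resolution contains 10^13 voxels, so the two multipliers add five further orders of magnitude to whatever per-voxel storage is assumed.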

Thanks to:
In silico center team: Dan Brat (Science PI), Tahsin Kurc, Ashish Sharma, Tony Pan, David Gutman, Jun Kong, Sharath Cholleti, Carlos Moreno, Chad Holder, Erwin Van Meir, Daniel Rubin, Tom Mikkelsen, Adam Flanders, Joel Saltz (Director)
caGrid Knowledge Center: Joel Saltz, Mike Caliguiri, Steve Langella co-Directors; Tahsin Kurc, Himanshu Rathod Emory leads
caBIG In vivo imaging team: Eliot Siegel, Paul Mulhern, Adam Flanders, David Channon, Daniel Rubin, Fred Prior, Larry Tarbox and many others
In vivo imaging Emory team: Tony Pan, Ashish Sharma, Joel Saltz
Emory ATC Supplement team: Tim Fox, Ashish Sharma, Tony Pan, Edi Schreibmann, Paul Pantalone
Digital Pathology R01: Foran and Saltz; Jun Kong, Sharath Cholleti, Fusheng Wang, Tony Pan, Tahsin Kurc, Ashish Sharma, David Gutman (Emory), Wenjin Chen, Vicky Chu, Jun Hu, Lin Yang, David J. Foran (Rutgers)

Thanks!