Tumor Heterogeneity: From biological concepts to computational methods Bo Li, PhD Dana Farber Cancer Institute Harvard Statistics Department.

Slides:



Advertisements
Similar presentations
Making Sense of Novel Prognostics: NOTCH1, SF3B1 Jennifer R Brown, MD PhD Director, CLL Center Dana-Farber Cancer Institute October 24, 2014.
Advertisements

Acquisition of tumour multidrug resistance inevitable in most advanced solid tumours – Failing to cure the majority of advanced solid tumours – Declining.
Supervisor: VS 高志平 Reporter: R4 張妙而.  Mutations in nucleophosmin 1 ( NPM1 ) gene, one of the most common gene mutations (25%-30%) in AML  NPM1 mut co-occurs.
Data Integration for Cancer Genomics. Personalized Medicine Tumor Board Question: given all we know about a patient, what is the “optimal” treatment?
Bioinformatics lectures at Rice University Li Zhang Lecture 10: Networks and integrative genomic analysis-2 Genome instability and DNA copy number data.
Yanxin Shi 1, Fan Guo 1, Wei Wu 2, Eric P. Xing 1 GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH Data RECOMB 2007 Presentation.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
By: Katie Adolphsen, Robin Aldrich, Brandon Hu, Nate Havko.
Introduction Integrative Analysis of Genomic Variants in Carcinogenesis Syed Haider, Arek Kasprzyk, Pietro Lio Artificial Intelligence and Computational.
Glioblastoma Multiforme (GBM) – Subtype Analysis Lance Parsons.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Evaluating cell lines as tumor models by comparison of genomic profiles Domcke, S. et al. Nat. Commun 4:2126.
Master in Advanced Genetics
DNA Microarrays Examining Gene Expression. Prof. GrossBiology 4 DNA MicroArrays DNA MicroArrays use hybridization technology to examine gene expression.
Re-Examination of the Design of Early Clinical Trials for Molecularly Targeted Drugs Richard Simon, D.Sc. National Cancer Institute linus.nci.nih.gov/brb.
Introduction to Glioblastoma Chris Plaisier Introduction to Systems Biology Course Institute for Systems Biology.
Gene expression profiling identifies molecular subtypes of gliomas
Genetic Alterations of TP53 Gene in Brain Astrocytic Tumours Methodology Θ Eighty-three brain tumor biopsies were collected and used in this study. Thirty.
Molecular Biomarkers in Radiotherapy of Cervical Cancer A collaboration project between Department of Gynecologic Oncology and Department of Radiation.
Radiogenomics in glioblastoma multiforme
Chapter 7 Essential Concepts in Molecular Pathology Companion site for Molecular Pathology Author: William B. Coleman and Gregory J. Tsongalis.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Genetics-multistep tumorigenesis genomic integrity & cancer Sections from Weinberg’s ‘the biology of Cancer’ Cancer genetics and genomics Selected.
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Michael Birrer Ian McNeish New Developments in Biology and Targets of Epithelial Ovarian Cancer.
Inferring transcriptional and microRNA-mediated regulatory programs in glioblastma Setty, M., et al.
Irradiation of stem cell niches in the periventricular and sub granular zones in gbm : A Prospective study Akram K S, Monica I, Deepa J, Kesava R, Fayaz.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Ranjit Ganta, Raj Acharya, Shruthi Prabhakara Department of Computer Science and Engineering, Penn State University DATA WAREHOUSE FOR BIO-GEO HEALTH CARE.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
MCB 317 Genetics and Genomics Topic 11 Genomics. Readings Genomics: Hartwell Chapter 10 of full textbook; chapter 6 of the abbreviated textbook.
COMPUTATIONAL ANALYSIS OF MULTILEVEL OMICS DATA FOR THE ELUCIDATION OF MOLECULAR MECHANISMS OF CANCER Presented by Azeez Ayomide Fatai Supervisor: Junaid.
Computational Identification of Tumor heterogeneity
Maxwell Lee National Cancer Institute Center for Cancer Research High-dimension Data Analysis Group March 19, 2014 Integrated Studies Of Breast, Esophageal,
Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.
ICNCT-16, June 2014, Helsinki Glioma heterogeneity and the L-Amino acid transporter-1 (LAT1): A first step to stratified BPA-based BNCT? D. Ngoga 1 ; C.
Lecture 11. Topics in Omic Studies (Cancer Genomics, Transcriptomics and Epignomics) The Chinese University of Hong Kong CSCI5050 Bioinformatics and Computational.
Prof. Yechiam Yemini (YY) Computer Science Department Columbia University (c)Copyrights; Yechiam Yemini; Lecture 2: Introduction to Paradigms 2.3.
Jin MENG Shen FU (DPD 08) Biology 2 - Head/Neck and CNS Tumors
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin Hoadley, KA et al. Cell 158(4):
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Samuel Aparicio, B.M., B.Ch., Ph.D., and Carlos Caldas, M.D.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Advances and challenges in computational modeling and statistical learning of biological systems Qi Liu Department of Biomedical Informatics Vanderbilt.
Tumor Genome Sequencing Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST512.
Natural History, Response to Treatment
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
Cancer Genomics and Class Discovery
Transcriptional heterogeneity of breast cancer subtypes,
Sensitivity Analysis of the MGMT-STP27 Model and Impact of Genetic and Epigenetic Context to Predict the MGMT Methylation Status in Gliomas and Other.
Gene expression.
Microarray Technology and Applications
Sensitivity Analysis of the MGMT-STP27 Model and Impact of Genetic and Epigenetic Context to Predict the MGMT Methylation Status in Gliomas and Other.
Tao Wang Assistant Professor Quantitative Biomedical Research Center
Volume 17, Issue 1, Pages (January 2010)
Diagnostic approaches to measure the impact of cancer therapies on clonal evolution. Diagnostic approaches to measure the impact of cancer therapies on.
A, relationship between weighted mean chromosome copy number and weighted Genome Instability Index (wGII). A, relationship between weighted mean chromosome.
Volume 4, Issue 3, Pages (August 2013)
SNP Arrays in Heterogeneous Tissue: Highly Accurate Collection of Both Germline and Somatic Genetic Information from Unpaired Single Tumor Samples  Guillaume.
Utilizing NGS-Data to Evaluate Anti-PD-1 Treatment
Altered Caspase-8 Expression
Figure 1. Identification of three tumour molecular subtypes in CIT and TCGA cohorts. We used CIT multi-omics data ( Figure 1. Identification of.
High-risk neuroblastoma molecular subtypes classification and inference of master regulators. High-risk neuroblastoma molecular subtypes classification.
Highly metastatic PDAC cells have a unique gene signature, which is not preserved in metastases but predicts poor patient outcome. Highly metastatic PDAC.
Presentation transcript:

Tumor Heterogeneity: From biological concepts to computational methods Bo Li, PhD Dana Farber Cancer Institute Harvard Statistics Department

Background Tumor heterogeneity: difference between tumors What it affects: – Diagnosis – Prognosis – Selection of treatment – Drug resistance

Levels of Tumor Heterogeneity Attolini et al., 2010 Burell et al., 2013 tumor/normal mixing inter-tumor heterogeneity 3 intra-tumor heterogeneity tumor subclones

Tumor microenvironment Junttila et al., 2013, Nature

Tumor Evolution as a Darwinian Process Greaves and Maley, 2012, Nature Key facts: tumor cell population is heterogeneous tumor genome harbors somatic aberrations 5 Darwin’s notebook, 1837

Clonal expansion model 1976 Nowell, 1976, Science

Vogelstein model Fearon and Vogelstein, 1990

Key factors to study tumor heterogeneity Sampling procedures – Ideal but expensive: single cell profiling – Practical: multi-regional sampling or longitudinal sampling – Most commonly used: bulk tissue collected from end-stage tumor Data types – sequencing data on DNA or RNA – SNP array data – mRNA expression profiling – DNA methylation array, etc. Examples of large cancer studies: – The Cancer Genome Atlas (TCGA): ~10,000 samples collected from over 30 types of cancer, mostly in the US – International Cancer Genome Consortium (ICGC): ~ 11,000 samples from 50 types of cancer, worldwide 8

Sampling procedures Single cell sequencing Multi-region sampling Gerlinger et al., 2012Nawy et al., 2014

Computational inference of tumor purity Pathological inference is semi-quantitative, empirical and low throughput. Tumor purity inference – DNA copy number variation based – DNA methylation based – Gene expression based (ESTIMATE,Yoshihara et al., 2013)

DNA COPY NUMBER VARIATION BASED PURITY ESTIMATION

Two-way mixing hypothesis euploid cells AmpDel AGP ~ 7/9 9/9 6/9 7/9 7/9 3/9 CN-LOH aneuploid cells 12 AGP=Aneuploid Genome Proportion, surrogate for tumor purity

I II III IV n T =n A +n B Using allele-specific SNP array data to infer AGP Normal (AB) Amp (AAB) Deletion (A or B) Copy Neutral LOH (AA or BB) Homozygous deletion (0) Balanced amplification (AABB) High-fold amplification (AAAB) High-fold amplification (AAAAB) BAF-LRR plot log 2 (n T )-1 |0.5-n B /n T | 0% Euploid Mixing (AGP=1)40% Euploid Mixing (AGP=0.6) AB A or B AB 13 Illumina 550K data on tumor/normal pairs Intensity (logR) B Allele Frequency (BAF) n B /n T I II III IV

GBM Molecular Subtypes Glioblastoma Multiforme (GBM) is a malignant brain cancer, with median survival time ~18 months. GBM is heterogeneous among patients in histology, molecular signatures and clinical outcome. Two of the studies have attempted to classify GBM tumors into molecular subtypes. 14 Phillps et al, 2006Verhaak et al, 2010 ProneuralProliferativeMesenchymalProneuralNeuralClassicalMesenchymal Median Age of Onset Survival (month) Two classification schemes are not consistent: Proneural subtypes have different clinical and demographical features. Subtypes do not show significant survival difference in Verhaak et al..

AGP correlates with gene expression pattern Same samples analyzed by Verhaak et al, 2010, used 128 with both gene expression and SNP array data available. Li et al, 2012, Clin Can Res 15 Discovery of a new GBM subtype

Revised Classification for GBM For the remaining 108 ‘Typical’ GBMs, we performed a two- step consensus clustering, and identified three subtypes, with significant survival difference. P=

Integer rounding ASCAT ABOSLUTE Baysian inference HMM PICNIC oncoSNP MixHMM GPHMM PSCN pennCNV tumor LOH-based inference BACOM qpure Pattern recognition AttiyehGAP TAPS CHAT GenoCNA

Discussion Multiple subclones co- exist in the tumor cell population How to estimate tumor purity? What happens if a subclone has more events than the dominant clone?

DNA methylation based purity estimation Rationale: DNA methylation is dichotomized at each locus. Sites differentially methylated in tumor or normal cells are informative for purity estimation. Zheng et al., 2014, Genome Biology

Single cell analysis Low throughput: – FISH (spectrum karyotyping) – FACS (Cytof) High throughput (NGS): – DNA Genome evolution, subclonality etc – RNA Gene expression heterogeneity

DNA sequencing Navin et al., 2011, Nature

RNA sequencing Subtype clustering Patel et al., 2014, Science

Future directions Experimental design is critical. Longitudinal + multi- regional sampling Heterogeneity of the tumor microenvironment Immune infiltration Fibroblast Endothelial cells