Connecting Cancer Genomics to Cancer Biology using Proteomics

Slides:



Advertisements
Similar presentations
Big Data & the CPTAC Data Portal Nathan Edwards, Peter McGarvey Mauricio Oberti, Ratna Thangudu Shuang Cai, Karen Ketchum Georgetown University & ESAC.
Advertisements

Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Generalized Protein Parsimony and Spectral Counting for Functional Enrichment Analysis Nathan Edwards Department of Biochemistry and Molecular & Cellular.
Network-based stratification of tumor mutations Matan Hofree.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Evaluating cell lines as tumor models by comparison of genomic profiles Domcke, S. et al. Nat. Commun 4:2126.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Proteomics Informatics – Data Analysis and Visualization (Week 13)
Karl Clauser Proteomics and Biomarker Discovery Breast Cancer Proteomics and the use of TCGA Mutational Data - Broad Institute update/issues Karl Clauser.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Cancer Genome Consortium
Panel Discussion: Reference Databases Nathan Edwards Georgetown University Medical Center.
Quantification of Membrane and Membrane- Bound Proteins in Normal and Malignant Breast Cancer Cells Isolated from the Same Patient with Primary Breast.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Proteogenomic Novelty in 105 TCGA Breast Tumors
Clinical Proteomic Tumor Analysis Consortium: Ontology Considerations
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Virginia G. Piper Center for Personalized Diagnostics Chromosome 10 Joshua LaBaer, M.D., Ph.D. Konstantinos Petritis, Ph.D. Virginia G. Piper Center for.
NCRI Cancer Conference November 1, 2015.
Hereditary Cancer Predisposition: Updates in Genetic Testing
To develop the scientific evidence base that will lessen the burden of cancer in the United States and around the world. NCI Mission Key message:
Genetics Journal Club Sumeet A. Khetarpal 10 December 2015.
Algorithms and Computation: Bottom-Up Data Analysis Workflows
EQTLs.
Press report 13/10/ publications selected.
Cancer Genomics and Class Discovery
Transcriptional heterogeneity of breast cancer subtypes,
Introduction to Bioinformatics February 13, 2017
Global Transcriptional Dysregulation in Breast Cancer
Dept of Biomedical Informatics University of Pittsburgh
Protein Inference by Generalized Protein Parsimony reduces False Positive Proteins in Bottom-Up Workflows Nathan J. Edwards, Department of Biochemistry.
The cBio Cancer Genomics Portal.
Fig. 8. Recurrent copy number amplification of BRD4 gene was observed across common cancers. Recurrent copy number amplification of BRD4 gene was observed.
Proteomic-based integrated subject-specific networks in cancer
Tao Wang Assistant Professor Quantitative Biomedical Research Center
Proteomics Informatics David Fenyő
Eukaryotic Genomes: The Organization and Control.
Genomic alterations in breast cancer cell line MDA-MB-231.
Genomic alterations in non–small cell lung cancer, breast cancer, and colorectal cancer. Genomic alterations in non–small cell lung cancer, breast cancer,
Relationship between Genotype and Phenotype
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Volume 5, Issue 4, Pages e5 (October 2017)
Anne L. Sapiro, Patricia Deng, Rui Zhang, Jin Billy Li  Cell Reports 
Cancer Evolution: No Room for Negative Selection
Volume 18, Issue 1, Pages (January 2017)
Schematic of cellular role categories of theoretical (open bars) and identified proteins on a 2-D electrophoresis gel, pH 4–7 (black bars), in L. casei.
Frequency distribution of the GRAVY of the theoretical proteins (open bars) and of 110 genes encoding proteins identified on a 2-D electrophoresis gel,
Schematic model of effector pathways that mediate tumor suppression by p53. Schematic model of effector pathways that mediate tumor suppression by p53.
High correlation of expression changes of NMD-regulated genes identified by both the pSILAC screen and previously reported global RNA screens after UPF1.
Testing the effectiveness of the three-step peptide fractionation method.A, μLC mass chromatograms of SCX fractions for an acidic FFE fraction. Testing.
Altered pathways in prostate cancer.
Relative quantification of cis and trans PSP gp10040–42/47–52 variants
Differential expression levels of archaella operon genes and respiratory chain genes in Δsaci_pp2a and Δsaci_ptp. Differential expression levels of archaella.
Volume 29, Issue 5, Pages (May 2016)
Proteomics Informatics David Fenyő
Breast Cancer Subtype Identification Using RNA-Seq Data
Figure 1. Identification of three tumour molecular subtypes in CIT and TCGA cohorts. We used CIT multi-omics data ( Figure 1. Identification of.
Volume 26, Issue 10, Pages e6 (March 2019)
Targetable alterations and pathways in TNBCs after NAC
Frequently mutated genes in colorectal cancer.
Concordance between the genomic landscape identified by whole-exome sequencing of plasma cfDNA and tumor; DNA and recurrence of KDR/VEGFR2 oncogenic mutations.
The ovarian cancer cell lines modestly recapitulate the spectrum of mutations found in primary ovarian tumors. The ovarian cancer cell lines modestly recapitulate.
Highly metastatic PDAC cells have a unique gene signature, which is not preserved in metastases but predicts poor patient outcome. Highly metastatic PDAC.
Molecular characterization of esophagogastric tumors.
Molecular Therapy - Nucleic Acids
Presentation transcript:

Connecting Cancer Genomics to Cancer Biology using Proteomics Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center

NCI: CPTAC Phase II Clinical Proteomic Tumor Analysis Consortium (CPTAC): 2011-2016 Comprehensive study of genomically characterized (TCGA) cancer biospecimens by bottom-up mass-spectrometry-based proteomics workflows Follows Clinical Proteomics Technology Assessment Consortium (CPTAC Phase I) CPTAC 3.0 just starting…

NCI: CPTAC Phase II

CPTAC-II Data Coordinating Center (DCC) Partnership between ESAC Inc. and Georgetown ICBI & UIS Subha Madhavan (ICBI), Peter McGarvey (ICBI), Nathan Edwards (BMCB) ESAC: Karen Ketchum (PM), Mauricio Oberti, Ratna Thangudu, Shuang Cai, many others… Recently awarded - CPTAC 3.0 DCC Simina Boca (ICBI), Shaojun Tang (ICBI)

CPTAC Data Portal All data is publicly released… …subject to responsible use guidelines Consortium has 15 months to publish first global analysis Data available in the meantime. http://grg.tn/cptac Edwards et al., J. Proteome Res., 2015, 14 (6).

CPTAC Data Portal

Three CPTAC-TCGA Studies: Big (Proteomics) Data Per study: ~ 100 samples, ~ 1 TB of data, ~ 10M-50M spectra Breast, Ovarian, Colorectal Cancer Tumor samples only! Global protein abundance, plus phospho- and glyco- sub-proteomes From TCGA: Exome, RNA-Seq, copy-number Same tumors! Connect genomics to proteomics!

Big Question for CPTAC-II Henry Rodrigez, Office of Cancer Clinical Proteomics Research, NCI (10/12/2016) Despite the investment in TCGA (34 cancer types, 12,000 individuals), “missing biology” Can additional (cancer) biology be elucidated from deep proteomic analysis? YES! New molecular cancer subtypes, new drug targets, new biological insight. CPTAC/TCGA Colorectal cancer study Zhang, et al. , Nature 513, 382 (2014).

Non-synonymous Germline & Somatic Variants 800 single amino-acid variants in 86 tumor samples. Mostly germline mutations, relatively few somatic mutations. Proteins with somatic variants are less abundant. Some variants are germline in some tumors, somatic in others. Sometimes, variant peptides are observed without accompanying RNA-Seq evidence – tumor heterogeneity or sampling depth for low abundance mRNA? Zhang, et al. , Nature 513, 382 (2014).

Protein abundance and mRNA expression are poorly correlated The correlation of a gene’s mRNA expression (RNA-Seq) and protein abundance across samples is not strong. Few genes have strong correlation. Unfortunate, as we have been using mRNA expression as a surrogate for protein abundance for years. mRNA expression is better thought of as the /rate/ of protein production. Zhang, et al. , Nature 513, 382 (2014).

…except for proteins in metabolic pathways? Metabolic pathways are enriched with genes that have strong mRNA-protein correlation, suggesting that the proteins are made on demand when the relevant metabolite is available, and the proteins discarded quickly once they are not needed. Housekeeping genes where a constant amount of protein is required only make proteins when needed to “top-up”, so highly uncorrelated. Zhang, et al. , Nature 513, 382 (2014).

Protein abundance is mostly unaffected by copy-number var. Copy number variation has a much stronger cis and trans effect on mRNA expression. Copy number variation effect on protein abundance is buffered by feedback mechanisms. Gross changes such as at 20q still produce an effect in both mRNA and proteins, especially a trans effect, but affect different genes. Zhang, et al. , Nature 513, 382 (2014).

New colorectal cancer subtypes Proteomic subtypes, defined using consensus clustering, from protein abundance data Note the relative lack of concordance with mRNA based sub-types. Some interesting structure to the proteomics subtypes – lack of TP53 and 18q loss genomic features in subtype B, etc. Notice too, that the “black” TCGA-subtype is isolated in subtypes B and C and partitioned by the proteins that distinguish them. Zhang, et al. , Nature 513, 382 (2014).

CPTAC 3.0 6 new cancer types, 200 tumors / study, matched normals. Proteogenomics is the next big thing: Cancer Moonshot (Biden) APOLLO Program (VA) 7 international collaborations with NCI Should be fun!