mRNASeq analysis using TCGA HNSC data
Vinay Kartha, Monti lab rotation project, 11/25/2013


Expression data
- mRNASeqv2 (Illumina HiSeq 2000)
- Samples with data available: 340
- Each sample has 6 associated files:
  - junction_quantification.txt
  - rsem.genes.results
  - rsem.genes.normalized_results
  - rsem.isoforms.results
  - rsem.isoforms.normalized_results
  - bt.exon_quantification.txt
- Dataset reduction:
  - Raw expression matrix: 20,531 genes
  - Non-zero expression matrix: 20,200 genes
  - Filtered expression matrix (CV >= 1.25): 7,091 genes
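The dataset reduction above can be sketched roughly as follows. This is a minimal illustration, not the code used for the slides: it assumes "non-zero" means dropping genes with no expression in any sample, and that CV uses the sample standard deviation. The function name `cv_filter` and the toy values are hypothetical.

```python
from statistics import mean, stdev

def cv_filter(expr, threshold=1.25):
    """Keep genes whose coefficient of variation (SD / mean) meets the threshold.

    expr maps gene name -> list of expression values across samples.
    Assumption: genes with zero mean (no expression anywhere) are dropped first.
    """
    kept = {}
    for gene, values in expr.items():
        m = mean(values)
        if m <= 0:
            continue  # the "non-zero expression matrix" step
        if stdev(values) / m >= threshold:
            kept[gene] = values
    return kept

# Toy matrix with made-up values.
toy = {
    "GENE_A": [0.0, 0.0, 0.0, 0.0],     # all zero, removed
    "GENE_B": [10.0, 11.0, 9.0, 10.0],  # low CV, removed
    "GENE_C": [0.0, 0.0, 1.0, 50.0],    # high CV, kept
}
filtered = cv_filter(toy)
print(sorted(filtered))  # ['GENE_C']
```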

QC
- Scatter plot of mean vs SD of expression for non-zero expression data
- Threshold: CV = std dev / mean = 1.25
- CV-filtered data: N = 340; n = 7091

QC
- Box plot of CV-filtered expression data across all samples
- Panels: Log-transformed*, Asinh-transformed
- * Pseudocount of 0.01 added
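For reference, the two transforms compared in the box plots can be sketched as below. The log base is an assumption (base 2, in line with the log2 threshold used later in the deck), and the function names are hypothetical. The point of the sketch is how each transform treats zero counts.

```python
import math

def log2_transform(values, pseudocount=0.01):
    """log2(x + pseudocount); the pseudocount keeps zeros finite."""
    return [math.log2(v + pseudocount) for v in values]

def asinh_transform(values):
    """Inverse hyperbolic sine; defined at zero, so no pseudocount is needed."""
    return [math.asinh(v) for v in values]

values = [0.0, 1.0, 100.0]
print(log2_transform(values))       # zero maps to log2(0.01), about -6.64
print(log2_transform(values, 1.0))  # with a pseudocount of 1, zero maps to 0.0
print(asinh_transform(values))      # zero maps to 0.0
```

A small pseudocount pushes zero counts far below the rest of the distribution, whereas a pseudocount of 1 or the asinh transform keeps them at zero.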

QC

QC
- CV = std dev / mean = 1.25
- x = y

QC
- Box plot of MAD-filtered expression data across all samples
- Log-transformed*
- * Pseudocount of 1 added

Clustered gene expression profile

Sample clustering based on grade/stage?
- See if expression is associated with clinical/phenotypic variables of interest
- Grade:
  - GX: Grade cannot be assessed (undetermined grade)
  - G1: Well differentiated (low grade)
  - G2: Moderately differentiated (intermediate grade)
  - G3: Poorly differentiated (high grade)
  - G4: Undifferentiated (high grade)
- Stage:
  - SI, SII, and SIII: Higher numbers indicate more extensive disease: larger tumor size and/or spread of the cancer beyond the organ in which it first developed to nearby lymph nodes and/or tissues or organs adjacent to the location of the primary tumor
  - SIV: Cancer has spread to distant organs and tissues
- For more information, see:

Sample clustering based on grade/stage?
- Fisher’s exact test (k = 2)
- Histological Grade table columns: Cluster, G1, G2, G3, G4, GX, NA, Total
- Pathological Stage table columns: Cluster, S1, S2, S3, S4A, S4B, NA, Total
- p = 2.98e-04 (< 0.05)
- p = (< 0.05)
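As a rough illustration of the test named above, here is a two-sided Fisher's exact test for a 2x2 table. This is a simplification: the slide's tables cross two clusters with several grade or stage categories, which needs the r x c generalisation. The sketch assumes the categories have been collapsed into two groups, and the function name is hypothetical.

```python
from math import comb

def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact p-value for the table [[a, b], [c, d]].

    Rows could be the two clusters; columns two collapsed grade groups
    (an assumption made only for this sketch).
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables no more probable than the observed one.
    return sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))

print(fisher_exact_2x2(3, 1, 1, 3))    # 34/70, about 0.486
print(fisher_exact_2x2(10, 0, 0, 10))  # 2/184756, about 1.08e-05
```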

Differential Expression with respect to Grade/Stage
- 340 samples (Total)
- TCGA sample vial codes: 01A B 2 11A 37
- Histological Grade distribution among samples:
  - G1 30, G2 203, G3 87, G4 6, GX 13, NA 1
  - G0 37, G1 25, G2 185, G3 77, G4 6, GX 9, NA 1
- Pathological Stage distribution among samples:
  - S0 37, SI 16, SII 47, SIII 41, SIVA 147, SIVB 6, NA 46
  - SI 18, SII 62, SIII 46, SIVA 162, SIVB 6, NA 46

Differential Expression with respect to Grade/Stage
- Cannot adjust expression for certain factors (Race/Ethnicity) due to missing phenotypic information
- Remove non-white patients and samples with missing Grade/Stage information
- Grade:
  - Before: G0 37, G1 25, G2 185, G3 77, G4 6, GX 9, NA 1
  - After: G0 32, G1 24, G2 156, G3 68, G4 6 (Total = 286)
- Stage:
  - Before: S0 37, SI 16, SII 47, SIII 41, SIVA 147, SIVB 6, NA 46
  - After: S0 32, SI 14, SII 42, SIII 34, SIVA 130, SIVB 3 (Total = 255)

Adjust for gender?
- Panels: DE wrt Grade; DE wrt Stage
- Don’t want to adjust for gender when it is associated with very few genes

Percentile-based gene filtering prior to DE testing
- Further reduce gene space prior to DE testing by filtering on each gene’s 90th percentile of expression
- Roughly divide # genes in half by choosing threshold log2(90th percentile) value
- Grade (N = 286): 90th percentile >= 10.5; n = 5046
- Stage (N = 255): 90th percentile >= 10.5; n = 5019
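A minimal sketch of this filter is shown below. It assumes the threshold is applied to log2 of each gene's 90th percentile with a pseudocount of 1. The function name, the pseudocount, and the interpolation method are assumptions rather than details taken from the slides.

```python
import math
from statistics import quantiles

def percentile_filter(expr, threshold=10.5, pseudocount=1.0):
    """Keep genes whose log2(90th percentile + pseudocount) meets the threshold.

    expr maps gene name -> list of expression values across samples.
    """
    kept = {}
    for gene, values in expr.items():
        # quantiles(..., n=10) returns the 9 decile cut points; index 8 is the 90th percentile.
        p90 = quantiles(values, n=10, method="inclusive")[8]
        if math.log2(p90 + pseudocount) >= threshold:
            kept[gene] = values
    return kept

# Toy data with made-up values.
toy = {
    "HIGH": [2000.0] * 10,           # log2(2001) is about 10.97, kept
    "LOW": [100.0] * 10,             # log2(101) is about 6.66, removed
    "SPIKE": [0.0] * 9 + [5000.0],   # one outlier sample is not enough, removed
}
print(sorted(percentile_filter(toy)))  # ['HIGH']
```

Filtering on a high percentile rather than the mean keeps genes that are strongly expressed in at least a subset of samples, while a single extreme sample does not rescue an otherwise silent gene.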

Differential Expression testing
- Perform DE wrt Grade (N=286; n=5046) and Stage (N=255; n=5019)
  - Tumor vs Normal (G0 vs G1+ ; S0 vs S1+)
  - Within Grade/Stage comparison (G1 vs G2+ ; S2- vs S3+; excluding controls)
- Permutation-based t-test with sliding ‘time-points’ and sample pooling
  - S3- vs S4A+ => (S1+S2+S3) vs (S4A + S4B)
- ‘diffanal’ function from diffanal.R (CBM repository)
- Number of permutations: 1000
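The idea of a permutation-based t-test on pooled groups can be sketched as follows for a single gene. This is not the `diffanal` implementation, whose internals are not shown in the slides. The Welch statistic, the p-value formula, the function names, and the toy values are all assumptions made for illustration.

```python
import math
import random
from statistics import mean, variance

def welch_t(x, y):
    """Welch t statistic for two groups of expression values."""
    diff = mean(x) - mean(y)
    se = (variance(x) / len(x) + variance(y) / len(y)) ** 0.5
    if se == 0:
        # Degenerate case: no variation within either group.
        return 0.0 if diff == 0 else math.copysign(math.inf, diff)
    return diff / se

def permutation_t_test(x, y, n_perm=1000, seed=0):
    """Two-sided permutation p-value: shuffle group labels and recompute |t|."""
    rng = random.Random(seed)
    observed = abs(welch_t(x, y))
    pooled = list(x) + list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if abs(welch_t(pooled[:len(x)], pooled[len(x):])) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)  # add-one correction avoids p = 0

# Toy expression of one gene by stage (made-up values).
by_stage = {
    "S1": [5.1, 4.9], "S2": [5.3, 5.0], "S3": [4.8, 5.2],
    "S4A": [7.9, 8.2, 8.1, 7.7], "S4B": [8.0, 8.3],
}
# S3- vs S4A+ pools (S1 + S2 + S3) against (S4A + S4B).
low = by_stage["S1"] + by_stage["S2"] + by_stage["S3"]
high = by_stage["S4A"] + by_stage["S4B"]
p = permutation_t_test(low, high)
print(p)
```

In a genome-wide run, this would be repeated per gene and followed by a multiple-testing correction, which the sketch omits.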

DE testing by grade

Comparison   No. DE genes
G
G2+          617
G3+          943
G4           456

DE testing by grade

DE testing by stage

Comparison   No. DE genes
S1+          327
S2+          0
S3+          0
S4A+         0

DE testing by stage

DE genes: G0 vs G1+

DE genes: G1 vs G2+

DE genes: G2- vs G3+

DE genes: G3- vs G4

AhR targets

Variation of expression across grade
- DPAGT1

Variation of expression across grade
- TAZ

Variation of expression across grade
- YAP1

Variation of expression across grade
- PDGFRB

Sliding windows
- Tool takes Time factors in the order in which they appear in the ‘Time’ column
- Does not pull corresponding factors in order of Time point levels
- For example:
  - Time: G3 G2 … …
  - Levels: G3 G2 …
- Results in incorrect ordering of groups prior to sliding window DE testing
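The ordering problem described above can be reproduced in a few lines. This sketch does not use the tool's code: `levels_by_appearance` and `sliding_splits` are hypothetical helpers, and the label column is made up. It only shows how taking levels in order of appearance changes which groups get pooled.

```python
def levels_by_appearance(labels):
    """Unique labels in the order they first occur in the column."""
    return list(dict.fromkeys(labels))

def sliding_splits(ordered_levels):
    """Every 'lower vs upper' split of an ordered list of levels."""
    return [
        (ordered_levels[:i], ordered_levels[i:])
        for i in range(1, len(ordered_levels))
    ]

time_column = ["G3", "G2", "G1", "G3", "G4", "G2"]  # made-up sample labels
declared_order = ["G1", "G2", "G3", "G4"]           # intended grade order

by_appearance = levels_by_appearance(time_column)
present = set(time_column)
by_declared = [lv for lv in declared_order if lv in present]

print(sliding_splits(by_appearance)[0])  # (['G3'], ['G2', 'G1', 'G4'])
print(sliding_splits(by_declared)[0])    # (['G1'], ['G2', 'G3', 'G4'])
```

With appearance order, the first window compares G3 against everything else instead of G1 vs G2+, so sorting the levels explicitly before building the windows avoids the mis-grouping.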

Future work
- Perform GSEA/hyper-enrichment and pathway analyses
- Perform Oral cancer-specific analyses
- Restrict anatomic sub-types to include only:
  - Alveolar Ridge
  - Base of tongue
  - Buccal Mucosa
  - Floor of mouth
  - Hard Palate
  - Hypopharynx
  - Larynx
  - Lip
  - Oral cavity
  - Oral tongue
  - Oropharynx
  - Tonsil