Head and Neck Cancer: microRNA analysis

Slides:



Advertisements
Similar presentations
Zeroing in on Non-Small Cell Lung Cancer: Integrating Targeted Therapies into Practice.
Advertisements

1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Glioblastoma Multiforme (GBM) – Subtype Analysis Lance Parsons.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
. Differentially Expressed Genes, Class Discovery & Classification.
Cluster Analysis Hierarchical and k-means. Expression data Expression data are typically analyzed in matrix form with each row representing a gene and.
Analysis of microarray data
Radiogenomics in glioblastoma multiforme
Molecular classification of renal cell carcinoma subtypes using microRNA signatures Zaravinos A 1, Lambrou GI 2, Mourmouras N 3, Delakas D 3, Deltas C.
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
Apostolos Zaravinos and Constantinos C Deltas Molecular Medicine Research Center and Laboratory of Molecular and Medical Genetics, Department of Biological.
UBio Training Courses Micro-RNA web tools Gonzalo
SP Cancer Metastasis Summary Hypothesis: We hypothesize that miRNAs regulate breast cancer cell invasiveness and metastasis by synergistically targeting.
OncomiR and tumor-suppressor miR expression in triple-negative primary breast cancer. Jelena Radojicic 1, Apostolos Zaravinos 2, Thomas Vrekoussis 1, Maria.
Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
MRNA Expression Experiment Measurement Unit Array Probe Gene Sequence n n n Clinical Sample Anatomy Ontology n 1 Patient 1 n Disease n n ProjectPlatform.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
The Broad Institute of MIT and Harvard Differential Analysis.
Modeling the cell cycle regulation by the RB/E2F pathway Laurence Calzone Service de Bioinformatique U900 Inserm / Ecoles de Mines / Institut Curie Collaborative.
Pan-cancer analysis of prognostic genes Jordan Anaya Omnes Res, In this study I have used publicly available clinical and.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
Stitching the Tutorials Together Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.
MRNASeq analysis using TCGA HNSC data Vinay Kartha Monti lab rotation project 11/25/2013.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Triple negative (TN) vs. positive (TP) breast cancer (BC) analysis with public genomic data yields AGR3 as a biomarker Anita Umesh, Jenny Park, James Shima,
 We investigated for biomarkers that distinguish metastatic or recurring disease with non-metastatic disease, with a particular focus on breast cancer.
Date of download: 6/3/2016 Copyright © 2016 American Medical Association. All rights reserved. From: Novel Integrative Methods for Gene Discovery Associated.
Advances and challenges in computational modeling and statistical learning of biological systems Qi Liu Department of Biomedical Informatics Vanderbilt.
Cancer: Staging and Grading What is meant by the term “biopsy”? How do tumors behave differently from one another ? Examples of the stages of cancer and.
K. Brennan, J.L. Koenig, A.J. Gentles, J.B. Sunwoo, O. Gevaert
Cancer Genomics and Class Discovery
Fig. 1 Increased MIR155HG expression correlates with glioma grade and mesenchymal transition and confers a poor prognosis in GBM patients. (A) The level.
ANIMAL TARGET PREDICTION - TIPS
Gene expression.
Volume 15, Pages (February 2017)
Volume 69, Issue 6, Pages (June 2016)
Functional Genomics Analysis Reveals a MYC Signature Associated with a Poor Clinical Prognosis in Liposarcomas  Dat Tran, Kundan Verma, Kristin Ward,
M. Fu, G. Huang, Z. Zhang, J. Liu, Z. Zhang, Z. Huang, B. Yu, F. Meng 
Impact of Formal Methods in Biology and Medicine Final Review
Impact of Formal Methods in Biology and Medicine
Volume 69, Issue 6, Pages (June 2016)
Characterization of microRNA transcriptome in tumor, adjacent, and normal tissues of lung squamous cell carcinoma  Jun Wang, MD, PhD, Zhi Li, MD, PhD,
Consensus of Melanoma Gene Expression Subtypes Converges on Biological Entities  Martin Lauss, Jeremie Nsengimana, Johan Staaf, Julia Newton-Bishop, Göran.
Volume 5, Issue 4, Pages e5 (October 2017)
Tumor intrinsic subtype is reflected in cancer-adjacent tissue.
Volume 23, Issue 11, Pages (June 2018)
Revealing Global Regulatory Perturbations across Human Cancers
Volume 31, Issue 2, Pages (February 2017)
Volume 4, Issue 3, Pages (August 2013)
Volume 17, Issue 8, Pages (November 2016)
Volume 24, Issue 12, Pages e5 (September 2018)
Volume 24, Issue 8, Pages (August 2018)
Revealing Global Regulatory Perturbations across Human Cancers
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Volume 1, Issue 1, Pages (July 2015)
Volume 26, Issue 12, Pages e5 (March 2019)
Figure 1. Identification of three tumour molecular subtypes in CIT and TCGA cohorts. We used CIT multi-omics data ( Figure 1. Identification of.
Volume 2, Issue 3, Pages (March 2016)
Supplementary Figure S1
EN1 expression in breast cancer and clinical outcome.
One-way hierarchical cluster analysis of SAM-identified genes using the TMEV software to see the data substructure. One-way hierarchical cluster analysis.
Volume 28, Issue 3, Pages e7 (July 2019)
Subtype classification of breast functional screening results.
Volume 28, Issue 4, Pages e6 (July 2019)
Presentation transcript:

Head and Neck Cancer: microRNA analysis Amy Li Monti Lab Rotation Boston University 11/25/13

Dataset Head and Neck Cancer Dataset from The Cancer Genome Atlas (TCGA) Contains large and well-documented data in many cancer subtypes DNA-methylation, SNP Array, RNA-seq, miRNA-seq, low pass DNA-seq, Reverse Phase Protein Array Normalized miRNAseq data 463 samples 39 patients: tumor tissue and adjacent normal tissue 385 patients: tumor tissue only 1046 miRNA Clinical information 360 samples, 71 clinical attributes (ie. anatomic subdivision, gender, grade, race, stage)

Goals Identify miRNA markers of: Cancer status: Normal vs. tumor Cancer progression: Differentially expressed in each stage or grade Integrate miRNA expression with gene expression data Gene set enrichment in miRNA targets mRNA data (Vinay)

Tumor Progression Classification Tumor Grade: Assigned based on how abnormal the tumor cells looks under a microscope Ranges from G1 (well-differentiated) to G4 (undifferentiated) Well differentiated tumor cells from a lower grade resemble normal cells, tend to spread slowly, and is generally indicative of better prognosis Tumor Stage: Based on size or extent (reach) of the primary tumor http://www.cancer.gov/cancertopics/factsheet/detection/tumor-grade

Analysis Overview Exploratory data analysis Unsupervised clustering Mean vs. standard deviation Boxplots of miRNA expression Data filtering Clinical demographics Unsupervised clustering Heatmaps Fisher test for association between clusters and sample attribute assignment (tumor status, grade, stage, etc) Tests for confounders Association between grade and other attributes, ie: ethnicity, gender, smoking history, age, alcohol consumption Differential Analysis Look for differentially expressed genes with respect to grade or stage miRNA targets and Gene Set Enrichment Analysis Identify sets of miRNA targets and see whether such gene sets are enriched with respect to disease phenotype

Exploratory Data Analysis: Mean vs. standard deviation For each gene: plot mean vs standard deviation Linear relationship

Exploratory Data Analysis: Boxplot of Expression For each gene: made boxplot of expression across the samples Reordered based on median

Exploratory Data Analysis: Data filtering Sample filtering: Samples without clinical labels Gene filtering: Lowly expressed genes Row maximum Genes with constant expression Standard deviation Full Matrix 1046 × 463 Filtered Matrix 692 × 393

Exploratory Data Analysis: Clinical Demographics Grade: G1: well-differentiated G4: Undifferentiated

Exploratory Data Analysis: Clinical Demographics Oral: oral cavity, oral tongue,, bucal mucosa

Exploratory Data Analysis: Clinical Demographics

Unsupervised Clustering: Paired samples: Tumor vs. Adjacent Normal 38 1 Tumor 2 37 Fisher Test: Test for association between Cluster Assignment and Actual Class Label P-val ~ 0 Row: pearson, ward Col: pearson, ward

Unsupervised Clustering: Grades: G1, G2, G3, G4 Fisher Test: Tested for association between grades and cluster assignments for total number of clusters ranging from 2 to 5 P-vals not significant in all cases Row: pearson, ward Col: euclidean, ward

Tests for confounders Tested for association between grade and the putative confounding variable using Fisher test (discrete variables) or ANOVA (continuous variables) Ethnicity (p=0.57), race (p=0.84), gender (p=0.09), age (p=0.55), alcohol consumption (p=0.63) Correct gene expression for gender using a linear regression model prior to performing differential analysis

Data Processing for Differential Analysis Sample Filtering Removed samples without clinical labels Removed samples sequenced on IlluminaGA (kept IlluminaHiseq samples) Removed samples with minority races (kept “white”) Gene filtering Removed miRNAs with low expression (90% quantile < 100) Removed miRNAs with constant expression (sd < 0.1) Attribute Filtering Grade: Removed GX and “Not Available” Kept G1, G2, G3, G4 Stage: Removed “Not Available” Kept S1, S2, S3, S4A, S4B

Differential Analysis: Grade diffAnal.R Performs permutation tests to identify significant genes differentially regulated in one of two classes Normalized expression matrix corrected for gender Class label: grade attribute binarized to “low” vs. “high” Run diffAnal for each high vs. low cutoff: G0 (adjacent normal) vs. G1-G4 (tumor) G1 vs. G2-G4 G1-G2 vs. G3-G4 G1-G3 vs. G4

Differential Analysis: Grade : G0 vs. G1-G4

Differential Analysis: Grade : G1 vs. G2-G4

Differential Analysis: Grade : G1-G2 vs. G3-G4

Differential Analysis: Grade : G1-G3 vs. G4

Differential Analysis: Trends Found more significant markers for tumors vs. normal than for distinguishing between low and high grades Performed same analysis for stage, significant markers for stage are weaker than that of grade For both grade and stage, most significant markers found by diffAnal show upregulation in the later disease state.

Differential Analysis: Tumor Classification Marker 148 total genes (90% quantile > 100) used for diffAnal 65 significant genes upregulated in tumors 37 significant genes downregulated in tumors Cutoff: FDR < 0.01

Differential Analysis: Cancer Progression Marker for Grades Cancer progression marker will satisfy ALL of: Tumor classification marker Significant FDR in 2/3 runs of diffAnal Monotonous increase or decrease across grades 4 miRNA markers identified (all are upregulated with increasing grade) G1-_vs_G2+_fdr G2-_vs_G3+_fdr G3-_vs_G4+_fdr hsa-mir-106b 0.03 0.01 0.04 hsa-mir-15b 0.05 0.81 hsa-mir-582 hsa-mir-151 0.61 -0.25 hsa-mir-196b 0.18 hsa-mir-10a 0.32 -0.96 hsa-mir-374a -0.6 hsa-mir-128-2 0.26 hsa-mir-25 0.02 hsa-mir-128-1 0.47 0.17 hsa-mir-28 0.44

Differential Analysis: Cancer Progression Marker for Grades

Finding miRNA Targets miRWalk Targetscan mirBase

Finding miRNA Targets miRNA AhR Targets Targets (54) (162) miRWalk “Validated targets” module Targets for differentially expressed miRNAs: 162 targets Intersect targets found by miRWalk with AhR targets 6 matches: NQO1, NFE2, IL1B, TNF, TGFB1, MYC miRNA Targets (162) AhR Targets (54) miRNA markers (4) (6)

Work in Progress Gene set enrichment analysis: Consider targets of strong miRNA markers as a gene set Is there an enrichment of this defined gene set in certain disease phenotypes, ie. high grade? Pathway analysis: Which pathways are these miRNA markers involved in? Modeling tumor progression: Explore other definitions of tumor progression markers