Emergent Biology Through Integration and Mining Of Microarray Datasets Lance D. Miller GIS Microarray & Expression Genomics.

Slides:



Advertisements
Similar presentations
TOP2A IS AN INDEPENDENT PREDICTOR OF SURVIVAL IN UNSELECTED BREAST CANCER Amit Pancholi Molecular Profiling of Breast Cancer: Predictive Markers of Long.
Advertisements

Analysis of Microarray Genomic Data of Breast Cancer Patients Hui Liu, MS candidate Department of statistics Prof. Eric Suess, faculty mentor Department.
Most Random Gene Expression Signatures are Significantly Associated with Breast Cancer Outcome Venet, et al. PLoS Computational Biology, 2011 Molly Carroll.
Hepatocellular carcinoma (HCC) is the fifth most frequent cancer in the world and the third most common cause of cancer mortality.
Microarray analysis as a prognostic and predictive tool: are we ready? Enzo Medico Laboratory of Functional Oncogenomics Institute for Cancer Research.
MiRNA-drug resistance mechanisms Summary Hypothesis: The interplay between miRNAs, signaling pathways and epigenetic and genetic alterations are responsible.
Welcome to MATH:7450 (22M:305) Topological Data Analysis Office hours: MWF 15: :20 GMT (10: :20 CDT), M 2:00 - 3:00 am GMT (9pm - 10pm CDT)
Expression profiles for prognosis and prediction Laura J. Van ‘t Veer The Netherlands Cancer Institute, Amsterdam.
Microarrays Dr Peter Smooker,
Glioblastoma Multiforme (GBM) – Subtype Analysis Lance Parsons.
Gene expression patterns of breast cancer phenotype revealed by molecular profiling Gabriela Alexe, IBM Research DIMACS Workshop on Detecting and Processing.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Alternative Splicing As an introduction to microarrays.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
MammaPrint, the story of the 70-gene profile
Comprehensive Gene Expression Analysis of Prostate Cancer Reveals Distinct Transcriptional Programs Associated With Metastatic Disease Kevin Paiz-Ramirez.
Metastatic Breast Cancer: One Size Does Not Fit All Clifford Hudis, M.D. Chief, Breast Cancer Medicine Service MSKCC.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
2nd Quebec Conference on Therapeutic Resistance in Cancer Bienvenue !!!!!
Expression profiling of peripheral blood cells for early detection of breast cancer Introduction Early detection of breast cancer is a key to successful.
PrognoScan A new database for meta-analysis of the prognostic value of genes 1 Hideaki Mizuno, Kunio Kitada, Kenta Nakai, Akinori Sarai BMC Med Genomics.
A Bioinformatics Meta-analysis of Differentially Expressed Genes in Colorectal Cancer Simon Chan, Thursday Trainee Seminar – October 11.
Identification of network motifs in lung disease Cecily Swinburne Mentor: Carol J. Bult Ph.D. Summer 2007.
A 14-gene prognosis signature predicts metastasis risk in node-negative, estrogen receptor-positive, Tamoxifen-treated breast cancer in different ethnogeographic.
1. Abstract SAGE Serial analysis of gene expression (SAGE) is a method of large-scale gene expression analysis.that involves sequencing small segments.
Radiogenomics in glioblastoma multiforme
Chapter 7 Essential Concepts in Molecular Pathology Companion site for Molecular Pathology Author: William B. Coleman and Gregory J. Tsongalis.
Analysis of Molecular and Clinical Data at PolyomX Adrian Driga 1, Kathryn Graham 1, 2, Sambasivarao Damaraju 1, 2, Jennifer Listgarten 3, Russ Greiner.
The virochip (UCSF) is a spotted microarray. Hybridization of a clinical RNA (cDNA) sample can identify specific viral expression.
Enabling biomarker validation in breast cancer molecular subtypes: sensitivity and specificity of array-based subtype classification in 983 patients Balázs.
Exagen Diagnostics, Inc., all rights reserved Biomarker Discovery in Genomic Data with Partial Clinical Annotation Cole Harris, Noushin Ghaffari.
1.DATABASE construction  n=1,715  Median OS=40.0 months, age: 64+/-10 yrs  Histology (adeno/squamous/large): 50% / 45% / 5%  Stage 1/2/3/4: 63% / 27%
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
1 Decision tree based classifications of heterogeneous lung cancer data Student: Yi LI Supervisor: Associate Prof. Jiuyong Li Data: 15 th May 2009.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Identification of network motifs in lung disease Cecily Swinburne Mentor: Carol J. Bult Ph.D. Summer 2007.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
SP Cancer Metastasis Summary Hypothesis: We hypothesize that miRNAs regulate breast cancer cell invasiveness and metastasis by synergistically targeting.
Sage / DREAM Breast Cancer Prognosis Challenge The goal of the breast cancer prognosis challenge is to assess the accuracy of computational models designed.
Gene Expression Signatures for Prognosis in NSCLC, Coupled with Signatures of Oncogenic Pathway Deregulation, Provide a Novel Approach for Selection of.
PaLS: Pathways and Literature Strainer Filtering common literature, ontology terms and pathway information. Andrés Cañada Pallarés Instituto Nacional de.
Lecture 8. Functional Genomics: Gene Expression Profiling using DNA microarrays. Part II Clark EA, Golub TR, Lander ES, Hynes RO.(2000) Genomic analysis.
Maxwell Lee National Cancer Institute Center for Cancer Research High-dimension Data Analysis Group March 19, 2014 Integrated Studies Of Breast, Esophageal,
PREDICTING OUTCOME IN OSTEOSARCOMA USING A GENOME-WIDE APPROACH N Gokgoz,T Yan, M Ghert, S Eskandarian W He, R Parkes, SB Bull, RS Bell, IL Andrulis and.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
A comparative study of survival models for breast cancer prognostication based on microarray data: a single gene beat them all? B. Haibe-Kains, C. Desmedt,
The human protease CLIP-CHIP: Genomic analysis of all 715 human protease and inhibitor gene transcripts in human breast carcinoma Reinhild Kappelhoff ,Tom.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5.
Research Aspects. Research Aspects. 1Microarrays cDNAgDNACpGDNA uRNA probesOligonucleotideantibody 2Bioinformatics Databasealgorithmsoftware Combined gene.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
Use of gene expression to identify heterogeneity of metastatic behavior among high-grade pleomorphic soft tissue sarcomas Keith Skubitz 1, Princy Francis.
ArrayExpress Ugis Sarkans EMBL - EBI
 We investigated for biomarkers that distinguish metastatic or recurring disease with non-metastatic disease, with a particular focus on breast cancer.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 3.
Estrogen-Regulated Genes Predict Survival in Hormone Receptor–Positive Breast Cancers J Clin Oncol 24: Daniel S. Oh, Melissa A. Troester,
Yu-fang huang, yi-hui wu, cheng-yang chou*
Deep Learning Analysis of Gene Expression Data for Breast Cancer Classification AS Y.P. Manawadu.
Van’t Veer et al, Nature 415: (2002)
Volume 143, Issue 3, Pages e2 (September 2012)
Molecular prognostication of liver cancer: End of the beginning
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Altered Caspase-8 Expression
EN1 expression in breast cancer and clinical outcome.
Isolation of large soft agar clones from HBEC3p53,KRAS and HBEC3p53,KRAS,MYC identifies tumorigenic and nontumorigenic clones and genome-wide mRNA expression.
Presentation transcript:

Emergent Biology Through Integration and Mining Of Microarray Datasets Lance D. Miller GIS Microarray & Expression Genomics

Mining of expression data to understand the molecular composition of human cancers and to define components of the tumor molecular profile with mechanistic and clinical importance. FOCUS:

2001, PNAS

Molecular classes are predictive of outcome overall survival:relapse-free survival:

70-gene prognosis classifier for predicting risk of distant metastasis within 5 years Van’t veer, et. al.

Sotiriou, et. al.

Though each tumor is molecularly unique, there exist common transcriptional cassettes that underly biological and clinical properties of tumors that may be of diagnostic, prognostic and therapeutic significance.

GOAL: Mining of expression data to understand the molecular composition of human cancers and to define components of the tumor molecular profile with mechanistic and clinical importance.

The GIS Perpetual Array Platform

Integration of Independent Datasets Perou et. al., 1999Sorlie et. al., 2001West et. al., 2001

Meta-Analysis of Breast Cancer Datasets: dataset source sample size array format 1. Miller-Liu: unpublished 61 tumors: 39 ER+, 22 ER- 19K spotted oligo 2. Sotiriou-Liu: submitted: PNAS 99 tumors: 34 ER+, 65 ER- 7.6K spotted cDNA 3. Gruvberger-Meltzer: Cancer Research 47 tumors: 23 ER+, 24 ER- 6.7K spotted cDNA 4. Sorlie-Borrensen-Dale: PNAS 74 tumors: 56 ER+, 18 ER- 8.1K spotted cDNA 5. van’t Veer-Friend: Nature 98 tumors: 59 ER+, 39 ER- 25K spotted oligo 6. West-Nevins: PNAS 49 tumors: 25 ER+, 24 ER- 7.1K Affymetrix total: 428 tumors, ~73,500 probes (Adaikalavan Ramasamy et. al.)

META MADB: The Construct 1.Extract and Format the Data 2.Link sample/probe info via unique keys 3.Log Transform and Normalize 4.Filter Genes and Arrays 5.Apply Statistical Tests Building the Matrix Creating a Universe 1.Apply UniGene ID as Unifying Key 2.Remove Gene Redundancy 3.Extract p values, d values, z-scores 4.Set p value threshold 5.Merge Datasets

META MADB

d values (difference of average expression) T1T1 T2T2 T3T3 T4T4 T5T5 …Tn…Tn T1T1 T2T2 T3T3 T4T4 T5T5 …Tn…Tn gene 1 : e1e1 e2e2 e3e3 e4e4 e5e5 …en…en e1e1 e2e2 e3e3 e4e4 e5e5 …en…en d = average e [ER+]average e [ER-] / ER+ER-

Identifying Grade-Specific Genes in Hepatocellular Carcinoma Sample: 10 cases of each class Sample collection: HBV(+) Array: Human 19K Oligonucleotide array Analysis : 50 arrays OAHAAHG1G2G3 HCC Progression Pre-neoplastic lesions Adenomatous hyperplasia ordinaryatypical HCC Grade 1, 2, 3

Identifying Grade-Specific Genes in Hepatocellular Carcinoma

Identifying Grade-Specific Genes in Hepatocellular Carcinoma

Breast Cancer Grade-Associated Genes as Predictors of HCC Grade? HCC BC

Breast Cancer Grade-Associated Genes as Predictors of HCC Grade? HCC

Estrogen Responsive Genes in vitro (Chin-Yo Lin)

(p<0.001) Estrogen-Responsive in vitro and ER Status-Associated in vivo E2E2 + ICIE2 + CHX

Identifying Cancer-Linked Genes in Epithelial Adenocarcinomas Datasets: 3 gastric, 3 prostate, 2 liver, 1 lung

selection at p< Genes that Distinguish Tumor from Normal at p<0.001 in at least 3 of the 4 Tumor Types

database components: internal and external datasets derived from: - tumor studies (clinical samples) - in vitro, pathway studies (eg, timecourse) - SAGE data - mouse studies (in vitro/in vivo) An Integrated Database for Pan-Cancer Meta-Analysis of Gene Expression Data Summary

 Derive expression signatures for all major factors known or suspected to have prognostic value  Determine the reliability of expression signatures in outcome prediction  Expand integrated database for pan- cancer meta-analysis  Integrate expression profiling into clinical decision making Future Directions

Acknowledgements Catholic University of Korea Suk-Woo Nam Jung Yong Lee GIS Adai Ramasamy Liza Vergara Phil Long Chin-Yo Lin Benjamin Mow