Improving gene expression similarity measurement using pathway-based analytic dimension Changwon Keum BMDRC.

Presentation transcript:

Improving gene expression similarity measurement using pathway-based analytic dimension Changwon Keum BMDRC

Accumulated gene expression data in the public repository GEO (NCBI)

Search the database
– Search by annotation
– Search by contents

Dataset vs. individual sample (profile)
– Microarray database search at the data set level (case vs. control): data set level similarity measure
– Microarray database search at the individual profile level: profile level similarity measure

Raw gene expression profile based similarity search
– Used by Cellmontage: Spearman correlation coefficient
– Limitations: cross-platform comparison, cross-experiment comparison
Example expression values:
      S1   S2
G1     6   30
G2    16   25
G3    22   15
G4    20   10
G5    10    5
Ranks within each sample and rank difference d:
      S1   S2    d
G1     5    1    4
G2     3    2    1
G3     1    3    2
G4     2    4    2
G5     4    5    1
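The five-gene example on this slide can be worked through in a few lines. The sketch below is only an illustration of the Spearman coefficient, not Cellmontage's code: the helper names `ranks` and `spearman_rho` are hypothetical, and the shortcut formula used here assumes there are no tied values.

```python
def ranks(values):
    """Rank values in descending order (largest value gets rank 1).

    Assumes no ties, as in the slide's example.
    """
    order = sorted(values, reverse=True)
    return [order.index(v) + 1 for v in values]


def spearman_rho(x, y):
    """Spearman correlation via rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Expression values for genes G1..G5 in samples S1 and S2 (from the slide).
s1 = [6, 16, 22, 20, 10]
s2 = [30, 25, 15, 10, 5]

rank_s1 = ranks(s1)  # [5, 3, 1, 2, 4]
rank_s2 = ranks(s2)  # [1, 2, 3, 4, 5]
abs_d = [abs(a - b) for a, b in zip(rank_s1, rank_s2)]  # [4, 1, 2, 2, 1]

rho = spearman_rho(s1, s2)  # sum(d^2) = 26, so rho is approximately -0.3
print(rank_s1, rank_s2, abs_d, round(rho, 3))
```

Because only ranks enter the formula, the coefficient is unaffected by any monotone rescaling of a sample's intensities. It still requires both samples to report the same genes, which is one reason cross-platform comparison is listed as a limitation.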

Pathway expression profile based similarity measure
Step 1. Convert each gene expression profile (G1, G2, G3, ...) to a pathway expression profile (Pathway1, Pathway2, Pathway3)
Step 2. Spearman correlation test on the pathway expression profiles
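The two steps can be sketched as follows. The slides do not say how a gene profile is converted into a pathway profile, so the scoring rule here (mean of the within-sample gene ranks, scaled by the number of measured genes) is purely an assumption for illustration. The pathway memberships, the sample values, and the function names are likewise hypothetical.

```python
from statistics import mean


def average_ranks(values):
    """Ascending ranks (smallest value gets rank 1); ties share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = shared
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    num = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    den = (sum((a - mx) ** 2 for a in rx) * sum((b - my) ** 2 for b in ry)) ** 0.5
    return num / den


def to_pathway_profile(expr, pathways):
    """Step 1 (assumed rule): score each pathway by its members' mean scaled rank."""
    genes = list(expr)
    scaled = [r / len(genes) for r in average_ranks([expr[g] for g in genes])]
    gene_score = dict(zip(genes, scaled))
    profile = {}
    for name, members in pathways.items():
        measured = [gene_score[g] for g in members if g in gene_score]
        if measured:  # skip pathways with no measured member genes
            profile[name] = mean(measured)
    return profile


# Hypothetical pathway memberships and two samples from different platforms.
pathways = {"Pathway1": ["G1", "G2"], "Pathway2": ["G3", "G4"], "Pathway3": ["G5", "G6"]}
sample_a = {"G1": 6, "G2": 16, "G3": 22, "G4": 20, "G5": 10, "G6": 18}
sample_b = {"G1": 150, "G2": 300, "G3": 900, "G5": 400}  # fewer genes, other scale

profile_a = to_pathway_profile(sample_a, pathways)
profile_b = to_pathway_profile(sample_b, pathways)

# Step 2: Spearman correlation over the pathways present in both profiles.
shared = sorted(set(profile_a) & set(profile_b))
rho = spearman([profile_a[p] for p in shared], [profile_b[p] for p in shared])
print(shared, rho)
```

The point of the design is that the two samples no longer need to share a gene set or an intensity scale, since they are compared only on the pathways that both can score.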

Cell type classification
Sample  Experiment  Platform  Cell type
Sam1    Exp1        Plat1     Breast
Sam2    Exp1        Plat1     Breast
Sam3    Exp1        Plat2     Breast
Sam4    Exp2        Plat3     Breast
Sam5    Exp2        Plat3     Breast
Samples with cell type
– Annotated by the Cellmontage group
– For 42 cell types with multiple samples
Query: cross-platform, cross-experiment
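One way to read the cross-platform and cross-experiment queries is as a restriction on which annotated samples a query may be matched against. The sketch below assumes exactly that: a cross-platform query is compared only with samples from a different platform, and a cross-experiment query only with samples from a different experiment. The slides do not spell out the evaluation protocol, so this interpretation and the names `Sample` and `candidates` are hypothetical.

```python
from typing import NamedTuple


class Sample(NamedTuple):
    name: str
    experiment: str
    platform: str
    cell_type: str


# The five annotated samples shown on the slide.
SAMPLES = [
    Sample("Sam1", "Exp1", "Plat1", "Breast"),
    Sample("Sam2", "Exp1", "Plat1", "Breast"),
    Sample("Sam3", "Exp1", "Plat2", "Breast"),
    Sample("Sam4", "Exp2", "Plat3", "Breast"),
    Sample("Sam5", "Exp2", "Plat3", "Breast"),
]


def candidates(query, samples, mode):
    """Return the samples a query may be matched against under the given mode."""
    if mode == "cross-platform":
        return [s for s in samples if s.platform != query.platform]
    if mode == "cross-experiment":
        return [s for s in samples if s.experiment != query.experiment]
    raise ValueError(f"unknown mode: {mode}")


query = SAMPLES[0]  # Sam1
cross_platform = [s.name for s in candidates(query, SAMPLES, "cross-platform")]
cross_experiment = [s.name for s in candidates(query, SAMPLES, "cross-experiment")]
print(cross_platform)    # ['Sam3', 'Sam4', 'Sam5']
print(cross_experiment)  # ['Sam4', 'Sam5']
```

Under this reading, a query counts as correctly classified when the most similar remaining candidate carries the same cell type annotation as the query.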

Classification accuracy

CGSEP vs. PEPC: Thalamus (all), Liver (cross-platform)

Similarity score for TP?

Details of cross-experiment classification

GEFERENCE: reference database of gene expression
– Search for similar gene expression profiles
– Meta-analysis

Marker validation
– Extract a sample from the patient
– Gene expression profiling
– Search GEFERENCE
– Matched reference individual with clinical information

Acknowledgement
– Jung Hoon Woo
– Members at BMDRC
– KFDA for funding
Thanks for your attention!