Classification of microarray samples Tim Beißbarth Mini-Group Meeting 8.7.2002.

Slides:



Advertisements
Similar presentations
CSCE555 Bioinformatics Lecture 15 classification for microarray data Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Advertisements

My name is Dustin Boswell and I will be presenting: Ensemble Methods in Machine Learning by Thomas G. Dietterich Oregon State University, Corvallis, Oregon.
A gene expression analysis system for medical diagnosis D. Maroulis, D. Iakovidis, S. Karkanis, I. Flaounas D. Maroulis, D. Iakovidis, S. Karkanis, I.
Capturing Best Practice for Microarray Gene Expression Data Analysis Gregory Piatetsky-Shapiro Tom Khabaza Sridhar Ramaswamy Presented briefly by Joey.
Achim Tresch Computational Biology ‘Omics’ - Analysis of high dimensional Data.
Model Assessment, Selection and Averaging
Feature/Model Selection by Linear Programming SVM, Combined with State-of-Art Classifiers: What Can We Learn About the Data Erinija Pranckeviciene, Ray.
Wenting Zhou, Weichen Wu, Nathan Palmer, Emily Mower, Noah Daniels, Lenore Cowen, Anselm Blumer Tufts University Microarray Data.
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
Classification: Support Vector Machine 10/10/07. What hyperplane (line) can separate the two classes of data?
Classification of Microarray Data. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
Principle of Locality for Statistical Shape Analysis Paul Yushkevich.
Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.
Classification 10/03/07.
Classification of Microarray Data. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
The Chicken Project Dimension Reduction-Based Penalized logistic Regression for cancer classification Using Microarray Data By L. Shen and E.C. Tan Name.
Feature Selection and Its Application in Genomic Data Analysis March 9, 2004 Lei Yu Arizona State University.
1 Masterseminar „A statistical framework for the diagnostic of meningioma cancer“ Chair for Bioinformatics, Saarland University Andreas Keller Supervised.
Evaluation of Results (classifiers, and beyond) Biplav Srivastava Sources: [Witten&Frank00] Witten, I.H. and Frank, E. Data Mining - Practical Machine.
Ordinal Decision Trees Qinghua Hu Harbin Institute of Technology
Guidelines on Statistical Analysis and Reporting of DNA Microarray Studies of Clinical Outcome Richard Simon, D.Sc. Chief, Biometric Research Branch National.
1 Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data Presented by: Tun-Hsiang Yang.
Clustering and Classification In Gene Expression Data Carlo Colantuoni Slide Acknowledgements: Elizabeth Garrett-Mayer, Rafael Irizarry,
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
An Evaluation of Gene Selection Methods for Multi-class Microarray Data Classification by Carlotta Domeniconi and Hong Chai.
Model Assessment and Selection Florian Markowetz & Rainer Spang Courses in Practical DNA Microarray Analysis.
Gene expression profiling identifies molecular subtypes of gliomas
A Multivariate Biomarker for Parkinson’s Disease M. Coakley, G. Crocetti, P. Dressner, W. Kellum, T. Lamin The Michael L. Gargano 12 th Annual Research.
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
Classification of multiple cancer types by multicategory support vector machines using gene expression data.
Whole Genome Expression Analysis
Gene Expression Profiling Illustrated Using BRB-ArrayTools.
Analysis and Management of Microarray Data Dr G. P. S. Raghava.
Evaluation of Supervised Learning Algorithms on Gene Expression Data CSCI 6505 – Machine Learning Adan Cosgaya Winter 2006 Dalhousie University.
Prediction model building and feature selection with SVM in breast cancer diagnosis Cheng-Lung Huang, Hung-Chang Liao, Mu- Chen Chen Expert Systems with.
Supervised Classification. Selection bias in gene extraction on the basis of microarray gene-expression data Ambroise and McLachlan Proceedings of the.
Diagnosis of multiple cancer types by shrunken centroids of gene expression Course: Topics in Bioinformatics Presenter: Ting Yang Teacher: Professor.
Diagnosis using computers. One disease Three therapies.
Sample classification using Microarray Data. AB We have two sample entities malignant vs. benign tumor patient responding to drug vs. patient resistant.
The Broad Institute of MIT and Harvard Classification / Prediction.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
1/15 Strengthening I-ReGEC classifier G. Attratto, D. Feminiano, and M.R. Guarracino High Performance Computing and Networking Institute Italian National.
Usman Roshan Machine Learning, CS 698
Microarray Workshop 1 Introduction to Classification Issues in Microarray Data Analysis Jane Fridlyand Jean Yee Hwa Yang University of California, San.
Gene Expression Profiling. Good Microarray Studies Have Clear Objectives Class Comparison (gene finding) –Find genes whose expression differs among predetermined.
Class Prediction and Discovery Using Gene Expression Data Donna K. Slonim, Pablo Tamayo, Jill P. Mesirov, Todd R. Golub, Eric S. Lander 발표자 : 이인희.
Classification Course web page: vision.cis.udel.edu/~cv May 12, 2003  Lecture 33.
Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks From Nature Medicine 7(6) 2001 By Javed.
Today Ensemble Methods. Recap of the course. Classifier Fusion
BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
Classification Derek Hoiem CS 598, Spring 2009 Jan 27, 2009.
A presentation on the topic For CIS 595 Bioinformatics course
Chapter 6 Classification and Prediction Dr. Bernard Chen Ph.D. University of Central Arkansas.
Feature (Gene) Selection MethodsSample Classification Methods Gene filtering: Variance (SD/Mean) Principal Component Analysis Regression using variable.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Molecular Classification of Cancer Class Discovery and Class Prediction by Gene Expression Monitoring.
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Classification of tissues and samples 指導老師:藍清隆 演講者:張許恩、王人禾.
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks From Nature Medicine 7(6) 2001 By Javed.
Predictive Automatic Relevance Determination by Expectation Propagation Y. Qi T.P. Minka R.W. Picard Z. Ghahramani.
Classification with Gene Expression Data
Mammogram Analysis – Tumor classification
Gene Expression Classification
Claudio Lottaz and Rainer Spang
Claudio Lottaz and Rainer Spang
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
Presentation transcript:

Classification of microarray samples Tim Beißbarth Mini-Group Meeting

Papers in PNAS May 2002  Diagnosis of multiple cancer types by shrunken centroids of gene expression Robert Tibshirani,Trevor Hastie, Balasubramanian Narasimhan, and Gilbert Chu  Selection bias in gene extraction on the basis of microarray gene- expression data Christphe Ambroise, and Geoffrey J. McLachlan

DNA Microarray Hybridization

Tables of Expression Data Tissue information Gene 1 Expression levels Gene 2 Table of expression levels:

The Classification Problem Classification Methods: Support Vector Machines, Neural Networks, Fishers linear descriminant, etc.

Heat map of the chosen 43 genes.

Steps in classification  Feature selection  Training a classification rule Problem:  For microarray data there are many more features (genes) than there are training samples and conditions to be classified.  Therefore usually a set of features which discriminates the conditions perfectly can be found (overfitting)

Feature selection  Criterion is independent of the prediction rule (filter approach)  Criterion depends on the prediction rule (wrapper approach) Goal:  Feature set must not be to small, as this will produce a large bias towards the training set.  Feature set must not be to large, as this will include noise which does not have any discriminatory power.

Methods to evaluate classification  Split Training-Set vs. Test-Set: Disadvantage: Looses a lot of training data.  M-fold cross-validation: Divide in M subsets, Train on M-1 subsets, Test on 1 subset Do this M-times and calculate mean error Special case: m=n, leave-one out cross-validation  Bootstrap Important!!!  Feature selection needs to be part of the testing and may not be performed on the complete data set. Otherwise a selection bias is introduced.

Tibshirani et al, PNAS, 2002

Conclusions  One needs to be very carefull when interpreting test and cross-validation results.  The feature selection method needs to be included in the testing.  10-fold cross-validation or bootstrap with external feature selection.  Feature selection has more influence on the classification result than the classification method used.

The End