A presentation on the topic For CIS 595 Bioinformatics course

Slides:



Advertisements
Similar presentations
Support Vector Machine & Its Applications
Advertisements

Lecture 9 Support Vector Machines
ECG Signal processing (2)
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
CSCE555 Bioinformatics Lecture 15 classification for microarray data Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Support Vector Machine & Its Applications Mingyue Tan The University of British Columbia Nov 26, 2004 A portion (1/3) of the slides are taken from Prof.
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
Classification / Regression Support Vector Machines
SVM—Support Vector Machines
Support vector machine
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Classification and Decision Boundaries
Support Vector Classification (Linearly Separable Case, Primal) The hyperplanethat solves the minimization problem: realizes the maximal margin hyperplane.
Support Vector Machines Kernel Machines
Classification Problem 2-Category Linearly Separable Case A- A+ Malignant Benign.
Support Vector Machine (SVM) Classification
Sketched Derivation of error bound using VC-dimension (1) Bound our usual PAC expression by the probability that an algorithm has 0 error on the training.
Support Vector Machines
Ti MACHINE VISION SUPPORT VECTOR MACHINES Maxim Mikhnevich Pavel Stepanov Pankaj Sharma Ivan Ryzhov Sergey Vlasov
The Chicken Project Dimension Reduction-Based Penalized logistic Regression for cancer classification Using Microarray Data By L. Shen and E.C. Tan Name.
Lecture outline Support vector machines. Support Vector Machines Find a linear hyperplane (decision boundary) that will separate the data.
DIMACS Workshop on Machine Learning Techniques in Bioinformatics 1 Cancer Classification with Data-dependent Kernels Anne Ya Zhang (with Xue-wen.
What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.
Lecture 10: Support Vector Machines
Optimization Theory Primal Optimization Problem subject to: Primal Optimal Value:
Linear hyperplanes as classifiers Usman Roshan. Hyperplane separators.
Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Data mining for credit card fraud: A comparative study.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
Support vector machines for classification Radek Zíka
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
1/15 Strengthening I-ReGEC classifier G. Attratto, D. Feminiano, and M.R. Guarracino High Performance Computing and Networking Institute Italian National.
SVM Support Vector Machines Presented by: Anas Assiri Supervisor Prof. Dr. Mohamed Batouche.
Support Vector Machine PNU Artificial Intelligence Lab. Kim, Minho.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
1 Chapter 6. Classification and Prediction Overview Classification algorithms and methods Decision tree induction Bayesian classification Lazy learning.
CISC667, F05, Lec22, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Support Vector Machines I.
Support Vector Machine Data Mining Olvi L. Mangasarian with Glenn M. Fung, Jude W. Shavlik & Collaborators at ExonHit – Paris Data Mining Institute University.
컴퓨터 과학부 김명재.  Introduction  Data Preprocessing  Model Selection  Experiments.
Linear hyperplanes as classifiers Usman Roshan. Hyperplane separators.
Support Vector Machines and Gene Function Prediction Brown et al PNAS. CS 466 Saurabh Sinha.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
1 Classification and Feature Selection Algorithms for Multi-class CGH data Jun Liu, Sanjay Ranka, Tamer Kahveci
Support Vector Machine Debapriyo Majumdar Data Mining – Fall 2014 Indian Statistical Institute Kolkata November 3, 2014.
CZ5225: Modeling and Simulation in Biology Lecture 7, Microarray Class Classification by Machine learning Methods Prof. Chen Yu Zong Tel:
Support Vector Machines. Notation Assume a binary classification problem. –Instances are represented by vector x   n. –Training examples: x = (x 1,
Text Classification using Support Vector Machine Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata.
Support Vector Machines (SVM): A Tool for Machine Learning Yixin Chen Ph.D Candidate, CSE 1/10/2002.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
SVMs in a Nutshell.
Lecture 14. Outline Support Vector Machine 1. Overview of SVM 2. Problem setting of linear separators 3. Soft Margin Method 4. Lagrange Multiplier Method.
Machine Learning and Data Mining: A Math Programming- Based Approach Glenn Fung CS412 April 10, 2003 Madison, Wisconsin.
Support Vector Machine (SVM) Presented by Robert Chen.
Introduction to Machine Learning Prof. Nir Ailon Lecture 5: Support Vector Machines (SVM)
Support Vector Machines Reading: Textbook, Chapter 5 Ben-Hur and Weston, A User’s Guide to Support Vector Machines (linked from class web page)
Support Vector Machine & Its Applications. Overview Intro. to Support Vector Machines (SVM) Properties of SVM Applications  Gene Expression Data Classification.
SUPPORT VECTOR MACHINES
Support Vector Machine
PREDICT 422: Practical Machine Learning
Mammogram Analysis – Tumor classification
An Introduction to Support Vector Machines
Pawan Lingras and Cory Butz
Support Vector Machines Introduction to Data Mining, 2nd Edition by
Support Vector Machines
COSC 4335: Other Classification Techniques
Recitation 6: Kernel SVM
Machine Learning Week 3.
Other Classification Models: Support Vector Machine (SVM)
COSC 4368 Machine Learning Organization
Presentation transcript:

Classification of microarray gene expression data using support vector machines (SVM) A presentation on the topic For CIS 595 Bioinformatics course by Despina Kontos Spring 2003 – Temple University

Overview… What are microarray gene expression data? What are Support Vectors Machines? How can we use them to utilize these gene expression data? CLASSIFICATION EXPERIMENTS !!!

Microarrays… What are they anyway?? Gene expression levels on tissue or cell for varying environment conditions

Microarrays… From a machine learning point of view… Genes Experiment g-1 g-2 …… g-n ex-1 ex-2 ……. ex-m Tissue classification Function classification

Support Vector Machines (SVM) Linear classifiers Attempt to avoid overfitting by finding the optimal hyperplane that separates the data HOW??? By maximizing the Margin.. Support Vectors Introduced by V.Vapnic and co-workers in 1995

Support Vector Machines (SVM) And what about datasets that are not linearly separable?? Map the data into higher dimensional space and make linear classification there (theorem!!)

Support Vector Machines (SVM) Some mathematical formulations… We need ONLY the support vectors for computations!! We can use KERNEL functions to avoid computations in higher dimensional space

Some experiments… M.P.S.Brown, W.N.Grundy, D.Lin, N.Cristianini, C.W.Sugnet, T.S.Furey, M.Ares Jr. and D.Haussler,“Knowledge-based analysis of microarray gene expression data by using support vector machines", Proc.Natl.Acad.Sci.USA,97, 1, pp.262-267, 2000. Classification of gene function from microarray data using SVM 2,476 genes 79 DNA hybridization experiments 6 gene function families Genes Experiment g-1 g-2 …… g-n ex-1 ex-2 ……. ex-m SVM provided optimal classification!!! F1 F2 F3 ... Function Classification

Gene expression data on tissue More experiments… T.furey, N.Cristianini, N. Duffy, D. Bednarski, M. Schummer and D Haussler, “Support Vector Machine Classification and Validation of Cancer Tissue Samples Using Microarray Expressioin Data”, Bioinformatics, 2000. Gene expression data on tissue 97,802 DNA clones 31 tissue samples Cancer ovarian Normal ovarian Normal non-ovarian Genes Experiment g-1 g-2 …… g-n ex-1 ex-2 ……. ex-m Cancer Not Cancer ... Tissue Classification

Conclusions Microarray gene expression data are a very useful format of biological information (..expensive to obtain!!) SVM new and very promising classification apprach A lot of research still to be done on Biological information processing using techniques developed in fields such as Machine Learning, Data Mining, etc..

Additional resources.. N. Cristianini. ICML'01 tutorial, 2001 Osuna, R. Freund, and F. Girosi. Support vector machines: Training and applications. In A.I. Memo. MIT A.I. Lab, 1996 N. Cristianini. ICML'01 tutorial, 2001 http://www.kernel-machines.org/ http://research.microsoft.com/users/jplatt/svm.html http://www.isis.ecs.soton.ac.uk/resources/svminfo/

THANK YOU!!!!!