Robust Optimization and Applications in Machine Learning.

Slides:



Advertisements
Similar presentations
(SubLoc) Support vector machine approach for protein subcelluar localization prediction (SubLoc) Kim Hye Jin Intelligent Multimedia Lab
Advertisements

Generative Models Thus far we have essentially considered techniques that perform classification indirectly by modeling the training data, optimizing.
Classification / Regression Support Vector Machines

SVM—Support Vector Machines
Support vector machine
The multi-layered organization of information in living systems
20,000 GENES IN HUMAN GENOME; WHAT WOULD HAPPEN IF ALL THESE GENES WERE EXPRESSED IN EVERY CELL IN YOUR BODY? WHAT WOULD HAPPEN IF THEY WERE EXPRESSED.
N.U.S. - January 13, 2006 Gert Lanckriet U.C. San Diego Classification problems with heterogeneous information sources.
Robust Multi-Kernel Classification of Uncertain and Imbalanced Data
Support Vector Machines
Support Vector Machine
Support Vector Machines (SVMs) Chapter 5 (Duda et al.)
Identifying Extracellular Plant Proteins Based on Frequent Subsequences of Amino Acids Y. Wang, O. Zaiane, R. Goebel.
Rutgers CS440, Fall 2003 Support vector machines Reading: Ch. 20, Sec. 6, AIMA 2 nd Ed.
Support Vector Machines Formulation  Solve the quadratic program for some : min s. t.,, denotes where or membership.  Different error functions and measures.
MURI Meeting July 2002 Gert Lanckriet ( ) L. El Ghaoui, M. Jordan, C. Bhattacharrya, N. Cristianini, P. Bartlett.
Lecture 10: Support Vector Machines
Optimization Theory Primal Optimization Problem subject to: Primal Optimal Value:
Comparing Kernel-based Learning Methods for Face Recognition Zhiguo Li
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Affinity chromatography/mass spec Bait protein GST Page 252.
Selective Transfer Machine for Personalized Facial Action Unit Detection Wen-Sheng Chu, Fernando De la Torre and Jeffery F. Cohn Robotics Institute, Carnegie.
Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.
A statistical framework for genomic data fusion William Stafford Noble Department of Genome Sciences Department of Computer Science and Engineering University.
Does gene order matter? Cis-regulatory elements, proteins, and messengers are integrated into biological circuits. Does gene location in the genome affect.
Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction.
ResponseNet revealing signaling and regulatory networks linking genetic and transcriptomic screening data CSE Fall.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Inferring strengths of protein-protein interactions from experimental data using linear programming Morihiro Hayashida, Nobuhisa Ueda, Tatsuya Akutsu Bioinformatics.
SVM Support Vector Machines Presented by: Anas Assiri Supervisor Prof. Dr. Mohamed Batouche.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
Least Squares Support Vector Machine Classifiers J.A.K. Suykens and J. Vandewalle Presenter: Keira (Qi) Zhou.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
1 Kernel based data fusion Discussion of a Paper by G. Lanckriet.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
Exploring Alternative Splicing Features using Support Vector Machines Feature for Alternative Splicing Alternative splicing is a mechanism for generating.
Nonlinear Data Discrimination via Generalized Support Vector Machines David R. Musicant and Olvi L. Mangasarian University of Wisconsin - Madison
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Exploring Alternative Splicing Features.
Predicting protein function from heterogeneous data
1 CISC 841 Bioinformatics (Fall 2007) Kernel engineering and applications of SVMs.
Robust Optimization and Applications Laurent El Ghaoui IMA Tutorial, March 11, 2003.
Class 23, 2001 CBCl/AI MIT Bioinformatics Applications and Feature Selection for SVMs S. Mukherjee.
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
Large-scale Prediction of Yeast Gene Function Introduction to Bio-Informatics Winter Roi Adadi Naama Kraus
Prediction of Protein Binding Sites in Protein Structures Using Hidden Markov Support Vector Machine.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features 王荣 14S
Chapter 6. Classification and Prediction Classification by decision tree induction Bayesian classification Rule-based classification Classification by.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
Gist 2.3 John H. Phan MIBLab Summer Workshop June 28th, 2006.
Detecting Protein Function and Protein-Protein Interactions from Genome Sequences TuyetLinh Nguyen.
Sequence Kernel Association Tests (SKAT) for the Combined Effect of Rare and Common Variants 統計論文 奈良原.
1 CISC 841 Bioinformatics (Fall 2008) Review Session.
Support Vector Machines Reading: Textbook, Chapter 5 Ben-Hur and Weston, A User’s Guide to Support Vector Machines (linked from class web page)
1 C.A.L. Bailer-Jones. Machine Learning. Support vector machines Machine learning, pattern recognition and statistical data modelling Lecture 9. Support.
BNFO 615 Fall 2016 Usman Roshan NJIT. Outline Machine learning for bioinformatics – Basic machine learning algorithms – Applications to bioinformatics.
Overview of two hybrid system for identifying and characterizing protein–protein interactions. Shown are the basic components and operation of the two.
PREDICT 422: Practical Machine Learning
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
Overview of two hybrid system for identifying and characterizing protein–protein interactions. Shown are the basic components and operation of the two.
Geometrical intuition behind the dual problem
Robust Optimization and Applications in Machine Learning
Support Vector Machines Introduction to Data Mining, 2nd Edition by
Minimax Probability Machine (MPM)
Machine Learning Week 3.
COSC 4368 Machine Learning Organization
Kernel Methods for large-scale Genomics Data Analysis
Presentation transcript:

Robust Optimization and Applications in Machine Learning

Time-series prediction via linear least-squares

Predicted output

Properties of solution

Non-linear prediction and kernels

Properties of solution

What is a kernel, anyway? SVM, LR, LS, MPM, PCA, CCA, FDA…

Example: 2 nd -order polynomial kernel

A classical way to use kernels

Transduction framework

Important property of kernel matrices

Kernel optimization in least-squares

Kernel optimization for least-squares

Kernel optimization via SDP or SOCP

A non-classical way to use kernels

Kernel optimization in other problems

Kernel optimization in SVM classifiers

Kernel optimization in SVM classifiers (cont’d)

Link with robust optimization

Kernel optimization and data fusion mRNA expression data upstream region data (TF binding sites) protein-protein interaction data hydrophobicity data sequence data (gene, protein)

Challenge

Example of a Kernel for Genomic Data: Pairwise Comparison Kernel

protein 2 Example of a Kernel for Genomic Data: Linear Interaction Kernel

Exampe of a Kernel for Genomic Data: Diffusion Kernel

Learning the Optimal Kernel K

Integrate constructed kernels Learn a linear mix Large margin classifier (SVM) Maximize the margin

Yeast Protein Function Prediction

MRF SDP/SVM (binary) SDP/SVM (enriched) Yeast Protein Function Prediction

MRF SDP/SVM (binary) SDP/SVM (enriched) Yeast Protein Function Prediction

Part 3: summary