Semi-supervised Learning

Slides:



Advertisements
Similar presentations
Learning Riemannian metrics for motion classification Fabio Cuzzolin INRIA Rhone-Alpes Computational Imaging Group, Pompeu Fabra University, Barcellona.
Advertisements

Machine Learning for Vision-Based Motion Analysis Learning pullback metrics for linear models Oxford Brookes Vision Group Oxford Brookes University 17/10/2008.
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Classification / Regression Support Vector Machines
Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.
Clustering Clustering of data is a method by which large sets of data is grouped into clusters of smaller sets of similar data. The example below demonstrates.
The UNIVERSITY of Kansas EECS 800 Research Seminar Mining Biological Data Instructor: Luke Huan Fall, 2006.
1 A Survey on Distance Metric Learning (Part 1) Gerry Tesauro IBM T.J.Watson Research Center.
A Probabilistic Framework for Semi-Supervised Clustering
Discriminative and generative methods for bags of features
6/26/2006CGI'06, Hangzhou China1 Sub-sampling for Efficient Spectral Mesh Processing Rong Liu, Varun Jain and Hao Zhang GrUVi lab, Simon Fraser University,
Lecture 21: Spectral Clustering
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Principal Component Analysis
1 Machine Learning: Symbol-based 10d More clustering examples10.5Knowledge and Learning 10.6Unsupervised Learning 10.7Reinforcement Learning 10.8Epilogue.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Segmentation. Methods Region Growing Split and Merge Clustering.
What is Cluster Analysis
Incorporating User Provided Constraints into Document Clustering Yanhua Chen, Manjeet Rege, Ming Dong, Jing Hua, Farshad Fotouhi Department of Computer.
Supervised Distance Metric Learning Presented at CMU’s Computer Vision Misc-Read Reading Group May 9, 2007 by Tomasz Malisiewicz.
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
The Implicit Mapping into Feature Space. In order to learn non-linear relations with a linear machine, we need to select a set of non- linear features.
Feature Selection and Its Application in Genomic Data Analysis March 9, 2004 Lei Yu Arizona State University.
Improving the MI registration metric Narendhran Vijayakumar 04/25/2008.
CSC 4510 – Machine Learning Dr. Mary-Angela Papalaskari Department of Computing Sciences Villanova University Course website:
Unsupervised Learning Reading: Chapter 8 from Introduction to Data Mining by Tan, Steinbach, and Kumar, pp , , (
Xiaoxiao Shi, Qi Liu, Wei Fan, Philip S. Yu, and Ruixin Zhu
Active Learning An example From Xu et al., “Training SpamAssassin with Active Semi- Supervised Learning”
Lecture 20: Cluster Validation
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Local Fisher Discriminant Analysis for Supervised Dimensionality Reduction Presented by Xianwang Wang Masashi Sugiyama.
Lecture 27: Recognition Basics CS4670/5670: Computer Vision Kavita Bala Slides from Andrej Karpathy and Fei-Fei Li
Clustering.
Properties of Kernels Presenter: Hongliang Fei Date: June 11, 2009.
Cluster Analysis.
Date: 2011/1/11 Advisor: Dr. Koh. Jia-Ling Speaker: Lin, Yi-Jhen Mr. KNN: Soft Relevance for Multi-label Classification (CIKM’10) 1.
Optimal Reverse Prediction: Linli Xu, Martha White and Dale Schuurmans ICML 2009, Best Overall Paper Honorable Mention A Unified Perspective on Supervised,
Principal Component Analysis and Linear Discriminant Analysis for Feature Reduction Jieping Ye Department of Computer Science and Engineering Arizona State.
Unsupervised Streaming Feature Selection in Social Media
Clustering, performance evaluation, and Term Project 1.Term Project 2.Resource for review.
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
Intelligent Numerical Computation1 Center:Width:.
Advanced Artificial Intelligence Lecture 8: Advance machine learning.
Semi-Supervised Learning William Cohen. Outline The general idea and an example (NELL) Some types of SSL – Margin-based: transductive SVM Logistic regression.
Document Clustering with Prior Knowledge Xiang Ji et al. Document Clustering with Prior Knowledge. SIGIR 2006 Presenter: Suhan Yu.
COMP24111 Machine Learning K-means Clustering Ke Chen.
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Experience Report: System Log Analysis for Anomaly Detection
Data Mining: Basic Cluster Analysis
Clustering I Data Mining Soongsil University.
Semi-Supervised Clustering
Linli Xu Martha White Dale Schuurmans University of Alberta
Data Driven Attributes for Action Detection
Constrained Clustering -Semi Supervised Clustering-
Restricted Boltzmann Machines for Classification
Geometrical intuition behind the dual problem
Metric Learning for Clustering
Mikhail Bilenko, Sugato Basu, Raymond J. Mooney
Research Areas Christoph F. Eick
Jianping Fan Dept of Computer Science UNC-Charlotte
Chapter 10.
Machine Learning Week 3.
Concave Minimization for Support Vector Machine Classifiers
Vector Spaces, Subspaces
Unsupervised Learning: Clustering
Label propagation algorithm
Progress Report Alvaro Velasquez.
Presentation transcript:

Semi-supervised Learning COMP 790-90 Seminar Spring 2011

Similarity Based Methods Questions: given a set of points and the class labels, can we learn a distance matrix such that intra-cluster distance are minimized and inter-cluster distance are maximized?

Distance metric learning Define a new distance measure of the form: Linear transformation of the original data

Distance metric learning

Semi-Supervised Clustering Example Similarity Based

Semi-Supervised Clustering Example Distances Transformed by Learned Metric

Semi-Supervised Clustering Example Clustering Result with Trained Metric

Evaluation Source: E. Xing, et al. Distance metric learning

Evaluation Source: E. Xing, et al. Distance metric learning

Additional Readings Combining Similarity and Search-Based Semi-Supervised Clustering “Comparing and Unifying Search-Based and Similarity-Based Approaches to Semi-Supervised Clustering”, Basu, et al. Ontology based semi-supervised clustering “A framework for ontology-driven subspace clustering”, Liu et al.

References UT machine learning group http://www.cs.utexas.edu/~ml/publication/unsupervised.html Semi-supervised Clustering by Seeding http://www.cs.utexas.edu/users/ml/papers/semi-icml-02.pdf Constrained K-means clustering with background knowledge http://www.litech.org/~wkiri/Papers/wagstaff-kmeans-01.pdf Some slides are from Jieping Ye at Arizona State