Three steps are separately conducted

Slides:



Advertisements
Similar presentations
Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao, Wei Fan, Jing Jiang, Jiawei Han l Motivate Solution Framework Data Sets Synthetic.
Advertisements

PEBL: Web Page Classification without Negative Examples Hwanjo Yu, Jiawei Han, Kevin Chen- Chuan Chang IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING,
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
ICONIP 2005 Improve Naïve Bayesian Classifier by Discriminative Training Kaizhu Huang, Zhangbing Zhou, Irwin King, Michael R. Lyu Oct
1 Semi-supervised learning for protein classification Brian R. King Chittibabu Guda, Ph.D. Department of Computer Science University at Albany, SUNY Gen*NY*sis.
Self Taught Learning : Transfer learning from unlabeled data Presented by: Shankar B S DMML Lab Rajat Raina et al, CS, Stanford ICML 2007.
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Learning Maximum Likelihood Bounded Semi-Naïve Bayesian Network Classifier Kaizhu Huang, Irwin King, Michael R. Lyu Multimedia Information Processing Laboratory.
Efficient Convex Relaxation for Transductive Support Vector Machine Zenglin Xu 1, Rong Jin 2, Jianke Zhu 1, Irwin King 1, and Michael R. Lyu 1 4. Experimental.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Spatial Semi- supervised Image Classification Stuart Ness G07 - Csci 8701 Final Project 1.
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
Discriminative Naïve Bayesian Classifiers Kaizhu Huang Supervisors: Prof. Irwin King, Prof. Michael R. Lyu Markers: Prof. Lai Wan Chan, Prof. Kin Hong.
Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.
1 PageSim: A Link-based Similarity Measure for the World Wide Web Zhenjiang Lin, Irwin King, and Michael, R., Lyu Computer Science & Engineering, The Chinese.
Finite mixture model of Bounded Semi- Naïve Bayesian Network Classifiers Kaizhu Huang, Irwin King, Michael R. Lyu Multimedia Information Processing Laboratory.
Cross Validation Framework to Choose Amongst Models and Datasets for Transfer Learning Erheng Zhong ¶, Wei Fan ‡, Qiang Yang ¶, Olivier Verscheure ‡, Jiangtao.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
Relaxed Transfer of Different Classes via Spectral Partition Xiaoxiao Shi 1 Wei Fan 2 Qiang Yang 3 Jiangtao Ren 4 1 University of Illinois at Chicago 2.
What is the Best Multi-Stage Architecture for Object Recognition Kevin Jarrett, Koray Kavukcuoglu, Marc’ Aurelio Ranzato and Yann LeCun Presented by Lingbo.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
Active Learning for Class Imbalance Problem
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Learning to Classify Short and Sparse Text & Web with Hidden Topics from Large- scale Data Collections Xuan-Hieu PhanLe-Minh NguyenSusumu Horiguchi GSIS,
Employing Active Learning to Cross-Lingual Sentiment Classification with Data Quality Controlling Shoushan Li †‡ Rong Wang † Huanhuan Liu † Chu-Ren Huang.
ICML2004, Banff, Alberta, Canada Learning Larger Margin Machine Locally and Globally Kaizhu Huang Haiqin Yang, Irwin King, Michael.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Transfer Learning for Image Classification Group No.: 15 Group member : Feng Cai Sauptik Dhar Sauptik.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Question Routing in Community Question Answering: Putting Category in Its Place 1 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong 2 AT&T Labs.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
A MULTI CLOUD SERVICE CO-DEPLOYMENT MECHANISM Yu Kang, Zibin Zheng, and Michael R. Lyu {ykang, zbzheng, Department of Computer Science.
Bing LiuCS Department, UIC1 Chapter 8: Semi-supervised learning.
1 Effect of Spatial Locality on An Evolutionary Algorithm for Multimodal Optimization EvoNum 2010 Ka-Chun Wong, Kwong-Sak Leung, and Man-Hon Wong Department.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu Yu Kang, Yangfan Zhou, Zibin Zheng, and Michael R. Lyu {ykang,yfzhou,
Weakly Supervised Training For Parsing Mandarin Broadcast Transcripts Wen Wang ICASSP 2008 Min-Hsuan Lai Department of Computer Science & Information Engineering.
Consensus Group Stable Feature Selection
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Wenyuan Dai, Ou Jin, Gui-Rong Xue, Qiang Yang and Yong Yu Shanghai Jiao Tong University & Hong Kong University of Science and Technology.
Feature Selection and Weighting using Genetic Algorithm for Off-line Character Recognition Systems Faten Hussein Presented by The University of British.
Validation methods.
Text Categorization by Boosting Automatically Extracted Concepts Lijuan Cai and Tommas Hofmann Department of Computer Science, Brown University SIGIR 2003.
Self-taught Clustering – an instance of Transfer Unsupervised Learning † Wenyuan Dai joint work with ‡ Qiang Yang, † Gui-Rong Xue, and † Yong Yu † Shanghai.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
The Chinese University of Hong Kong Learning Larger Margin Machine Locally and Globally Dept. of Computer Science and Engineering The Chinese University.
Data Science Credibility: Evaluating What’s Been Learned
Experience Report: System Log Analysis for Anomaly Detection
A Collaborative Quality Ranking Framework for Cloud Components
Mining Data Semantics (MDS'2011) Workshop
Bridging Domains Using World Wide Knowledge for Transfer Learning
Semi-Supervised Clustering
Sofus A. Macskassy Fetch Technologies
WSRec: A Collaborative Filtering Based Web Service Recommender System
Discriminative Training of Chow-Liu tree Multinet Classifiers
Project Implementation for ITCS4122
PEBL: Web Page Classification without Negative Examples
Zhenjiang Lin, Michael R. Lyu and Irwin King
Outline Background Motivation Proposed Model Experimental Results
Identify Different Chinese People with Identical Names on the Web
GANG: Detecting Fraudulent Users in OSNs
Concave Minimization for Support Vector Machine Classifiers
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
Mingzhen Mo and Irwin King
Semi-Automatic Data-Driven Ontology Construction System
Web Page Classification with Heterogeneous Data Fusion
University of Wisconsin - Madison
MAS 622J Course Project Classification of Affective States - GP Semi-Supervised Learning, SVM and kNN Hyungil Ahn
Machine Learning: Lecture 5
Presentation transcript:

Three steps are separately conducted The Chinese Univ. of Hong Kong University of Bristol Supervised Self-taught Learning: Actively Transferring Knowledge from Unlabeled Data Kaizhu Huang1, Zenglin Xu2, Irwin King2 , Michael R. Lyu2, and Colin Campbell1 1 Department of Engineering Mathematics University of Bristol, UK {K.Huang, C.Campbell}@bristol.ac.uk 2 Department of Computer Science and Engineering The Chinese University of Hong Kong {zlxu, king, lyu}@cse.cuhk.edu.hk Background Semi-supervised Learning (SSL)- Unlabeled data share the same set of categories as the labeled data. Transfer Learning (TL) – Supervised Learning with an additional labeled data set which is very similar to the training data Self-taught Learning (STL) – Unlabeled data could be random data unnecessarily sharing the same categories as the labeled data motivations & Framework Proposed Supervised STL Framework Steps of STL Learning basis by sparse coding from unlabeled data ( which are even randomly downloaded data) Representing labeled data by the basis obtained in step 1 Learning a classification function based on certain algorithms Sparse Coding Classifier learning Problems of STL Advantages of SSTL Three steps are separately conducted Irrelevant Basis may be extracted and could hurt the classification Basis selection is interacted with the classifier learning in a supervised fashion. Only useful basis will be extracted! Experiment results Data Repositories: 4 Subsets from WebKB 3, Reuters-21578 4, and Ohsumed Setup 1. Training data: 4 or 10 labeled samples randomly chosen for each category. Remaining data are considered as test data. 2. 1000 webpages searched by GOOGLE as unlabeled data 3. Randomly run the training and test 10 times. The average accuracies are returned as the final performance. Parameters are tuned via cross validation. Contribution & Conclusion The first study that performs Self-taught learning in a supervised way Able to learn basis and classification function simultaneously Iterative optimization with convergence guaranteed Significantly improve the classification accuracy of STL IJCNN 2009, Atlanta, U.S.A. June 14-19, 2009