Presentation is loading. Please wait.

Presentation is loading. Please wait.

Bridging Domains Using World Wide Knowledge for Transfer Learning

Similar presentations


Presentation on theme: "Bridging Domains Using World Wide Knowledge for Transfer Learning"— Presentation transcript:

1 Bridging Domains Using World Wide Knowledge for Transfer Learning
Evan Wei Xiang, Bin Cao, Derek Hao Hu, and Qiang Yang TKDE, 2010 presented by Wen-Chung Liao, 2010/05/12

2 Outlines Motivation Objectives Methodology Experiments Conclusions
Comments

3 Motivation Supervised learning, require sufficient labeled instances
It is not easy or feasible to obtain new labeled data in a domain of interest To solve this problem, transfer learning techniques capture the shared knowledge from some related domains (source domains ) where labeled data are available use the knowledge to improve the performance of data mining tasks in a target domain. domain adaptation techniques, However, transfer learning may not work well when the difference (information gap) between the source and target domains is large.

4 Objectives To solve this problem, introduce a bridge between the two different domains by leveraging additional knowledge sources Wikipedia or the Open Directory Project (ODP) treat the two domains  from a single underlying distribution “domain adaptation problem”  classification problem under the supervised setting or a semisupervised (transductive) setting. Introduces a novel domain adaptation algorithm called BIG (Bridging Information Gap). we apply semisupervised learning (SSL) to domain adaption problems based on the use of the auxiliary data (bridge). the labeled data from the source domain the unlabeled data from the target domain an auxiliary data source such as the Wikipedia.

5 Support vector machines (SVMs)

6

7

8

9 Methodology SVM TSVM NP-Hard
Information Gap with No Background Knowledge Available SVM Information Gap with Background Knowledge TSVM Selecting the set of unlabeled data {xi} from K to minimize the margin NP-Hard

10

11 Experiments

12

13

14

15 Conclusions THREE MAJOR CONTRIBUTIONS FUTURE WORK
1) We view the problem from a new perspective, i.e., we consider the problem of transfer learning as one of filling in the information gap based on a large document corpus. 2) we show that we can successfully bridge the source and target domains using well developed semisupervised learning algorithms. 3) We propose a minmargin algorithm that can effectively identify and reduce the information gap between two domains. FUTURE WORK First, we plan to validate the effectiveness of our approach through other semisupervised learning algorithms and other relational knowledge bases We plan to extend our approach to be able to consider heterogeneous transfer learning Finally, we will try to develop online TSVM methods for incremental cross-domain transductive learning.

16 Comments Advantage Shortage Applications
new perspective Shortage Applications Web and document data mining applications information retrieval spam detection online advertisement Web search


Download ppt "Bridging Domains Using World Wide Knowledge for Transfer Learning"

Similar presentations


Ads by Google