Chang WangChang Wang, Sridhar mahadevanSridhar mahadevan.

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.
Chapter 5: Introduction to Information Retrieval
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Multi-AbstractionRetrievalMulti-AbstractionRetrieval MotivationMotivation ExperimentsExperiments Overall Framework Multi-Abstraction Concern Localization.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Frustratingly Easy Domain Adaptation
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
1 Minimum Ratio Contours For Meshes Andrew Clements Hao Zhang gruvi graphics + usability + visualization.
Multiple Instance Learning
Learning to Rank: New Techniques and Applications Martin Szummer Microsoft Research Cambridge, UK.
Scott Wen-tau Yih Joint work with Kristina Toutanova, John Platt, Chris Meek Microsoft Research.
Image Categorization by Learning and Reasoning with Regions Yixin Chen, University of New Orleans James Z. Wang, The Pennsylvania State University Published.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Learning from Multiple Outlooks Maayan Harel and Shie Mannor ICML 2011 Presented by Minhua Chen.
(ACM KDD 09’) Prem Melville, Wojciech Gryc, Richard D. Lawrence
Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date:
Introduction n Keyword-based query answering considers that the documents are flat i.e., a word in the title has the same weight as a word in the body.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Leveraging Conceptual Lexicon : Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
Predictive Modeling with Heterogeneous Sources Xiaoxiao Shi 1 Qi Liu 2 Wei Fan 3 Qiang Yang 4 Philip S. Yu 1 1 University of Illinois at Chicago 2 Tongji.
Review of the web page classification approaches and applications Luu-Ngoc Do Quang-Nhat Vo.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
The Necessity of Combining Adaptation Methods Cognitive Computation Group, University of Illinois Experimental Results Title Ming-Wei Chang, Michael Connor.
Event Detection using Customer Care Calls 04/17/2013 IEEE INFOCOM 2013 Yi-Chao Chen 1, Gene Moo Lee 1, Nick Duffield 2, Lili Qiu 1, Jia Wang 2 The University.
Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Employing Active Learning to Cross-Lingual Sentiment Classification with Data Quality Controlling Shoushan Li †‡ Rong Wang † Huanhuan Liu † Chu-Ren Huang.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.
Modern Topics in Multivariate Methods for Data Analysis.
Source-Selection-Free Transfer Learning
Transfer Learning Motivation and Types Functional Transfer Learning Representational Transfer Learning References.
Datasets on the GRID David Adams PPDG All Hands Meeting Catalogs and Datasets session June 11, 2003 BNL.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Cluster-specific Named Entity Transliteration Fei Huang HLT/EMNLP 2005.
Geodesic Flow Kernel for Unsupervised Domain Adaptation Boqing Gong University of Southern California Joint work with Yuan Shi, Fei Sha, and Kristen Grauman.
Authors: Marius Pasca and Benjamin Van Durme Presented by Bonan Min Weakly-Supervised Acquisition of Open- Domain Classes and Class Attributes from Web.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
HAITHAM BOU AMMAR MAASTRICHT UNIVERSITY Transfer for Supervised Learning Tasks.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Ranking of Database Query Results Nitesh Maan, Arujn Saraswat, Nishant Kapoor.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Copyright Paula Matuszek Kinds of Machine Learning.
The Building Adapter: Towards Quickly Applying Building Analytics at Scale Dezhi Hong, Hongning Wang, *Jorge Ortiz, Kamin Whitehouse University of Virginia,
A Supervised Machine Learning Algorithm for Research Articles Leonidas Akritidis, Panayiotis Bozanis Dept. of Computer & Communication Engineering, University.
BOOTSTRAPPING INFORMATION EXTRACTION FROM SEMI-STRUCTURED WEB PAGES Andrew Carson and Charles Schafer.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
 Effective Multi-Label Active Learning for Text Classification Bishan yang, Juan-Tao Sun, Tengjiao Wang, Zheng Chen KDD’ 09 Supervisor: Koh Jia-Ling Presenter:
Efficient Text Categorization with a Large Number of Categories Rayid Ghani KDD Project Proposal.
Ontology Based Annotation of Text Segments Presented by Ahmed Rafea Samhaa R. El-Beltagy Maryam Hazman.
Web-Mining Agents: Transfer Learning TrAdaBoost R. Möller Institute of Information Systems University of Lübeck.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Scalable Person Re-identification on Supervised Smoothed Manifold
Bridging Domains Using World Wide Knowledge for Transfer Learning
Semi-Supervised Clustering
Intro to Machine Learning
Cross Domain Distribution Adaptation via Kernel Mapping
Transfer Learning in Astronomy: A New Machine Learning Paradigm
Introductory Seminar on Research: Fall 2017
Learning to Rank Shubhra kanti karmaker (Santu)
Presenter: Hajar Emami
Statistical NLP: Lecture 9
Prepared by: Mahmoud Rafeek Al-Farra
Statistical NLP : Lecture 9 Word Sense Disambiguation
Presentation transcript:

Chang WangChang Wang, Sridhar mahadevanSridhar mahadevan

 Problem Introduction  Problem Definition  Who cares  Previous work & challenges  Contribution  A glance at methods

 Example problem  Input: Three collections of documents in  English (sufficient labels)  Italian (sufficient labels)  Arabic (few labels).  Target: Assign labels to the Arabic documents.  A way: find a common feature space for 3 domains  Shared labels, (sports, military)  No shared documents. (no instance correspondence)  No words translations are available.

English docs Italian Docs Arabic docs Shared label set: {sports, military} No corresponding instances or words Question : Can we construct a common feature space so that we can use English docs and Italian docs to help classify Arabic docs? docword1word2…label 102sports 220military sports docparola1parola2…etichetta 120sports 202military sports doc1 كلمة 2 كلمة … ملصق 102sports 220military 310? 401?

English docs Italian docs Arabic docs docword1word2…label 102sports 220military sports docparola1parola2…etichetta 120sports 202military sports doc1 كلمة 2 كلمة … ملصق 102sports 220military 310? 401? docfeature1feature2…label 102sports 220military sports military sports military 1110? 1201? Common feature space

 Given K input datasets in different domains, with different features, but all of the datasets shared the same label set.  Source domain have sufficient labeled instances.  Target domain have few labeled instances.  Question: Can we construct a common feature space?  So all instances in different domain can be mapped to the same feature space, so that we can perform learning task?

Source k Source 1 Target Common feature space Learning

 Search engine  classify docs, rank docs, find docs topics  Businessman  Customer clustering  Biologist  Match protein

 Target domain have little labels  No instance correspondence  Source domain and target domain have different feature space

 Most work assumes that the source domain and the target domain have the same features.  Manifold regularization  Do not leverage source domain information  Transfer learning based on manifold alignment: use both label and unlabeled instance to learn mapping  require small amount of instance correspondence.

 Transfer learning perspective  Can work on different feature space  Cope with multiple input domain  Can combine with existing domain adaption methods  Manifold alignment perspective  Need no instance correspondence  Use label to learn alignment

English docs Italian docs Arabic docs docword1word2…label 102sports 220military sports docparola1parola2…etichetta 120sports 202military sports doc1 كلمة 2 كلمة … ملصق 102sports 220military 310? 401? docfeature1feature2…label 102sports 220military sports military sports military 1110? 1201? Common feature space