Translated Learning Wenyuan Dai, Yuqiang Chen, Gui-Rong Xue, Qiang Yang, and Yong Yu. Translated Learning. In Proceedings of Twenty- Second Annual Conference.

Slides:



Advertisements
Similar presentations
Self-Paced Learning for Semantic Segmentation
Advertisements

Latent Variables Naman Agarwal Michael Nute May 1, 2013.
A Survey on Transfer Learning Sinno Jialin Pan Department of Computer Science and Engineering The Hong Kong University of Science and Technology Joint.
Welcome. ROUFUN NAHAR Assistant Teacher( English) Jamalpur Govt. Girls’ High School Jamalpur. ROUFUN NAHAR Assistant Teacher( English) Jamalpur Govt.
Convex Point Estimation using Undirected Bayesian Transfer Hierarchies Gal Elidan, Ben Packer, Geremy Heitz, Daphne Koller Computer Science Dept. Stanford.
An Introduction of Support Vector Machine
Co-clustering based classification for Out-of-domain Documents
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
Learning on Probabilistic Labels Peng Peng, Raymond Chi-wing Wong, Philip S. Yu CSE, HKUST 1.
Proximal Methods for Sparse Hierarchical Dictionary Learning Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski, Francis Bach Presented by Bo Chen,
Data Visualization STAT 890, STAT 442, CM 462
Frustratingly Easy Domain Adaptation
Jun Zhu Dept. of Comp. Sci. & Tech., Tsinghua University This work was done when I was a visiting researcher at CMU. Joint.
Self Taught Learning : Transfer learning from unlabeled data Presented by: Shankar B S DMML Lab Rajat Raina et al, CS, Stanford ICML 2007.
Multiple Instance Learning
Sample Selection Bias Lei Tang Feb. 20th, Classical ML vs. Reality  Training data and Test data share the same distribution (In classical Machine.
Maximum Entropy Model LING 572 Fei Xia 02/07-02/09/06.
Duyu Tang, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, Bing Qin
Bing LiuCS Department, UIC1 Learning from Positive and Unlabeled Examples Bing Liu Department of Computer Science University of Illinois at Chicago Joint.
Maximum Entropy Model LING 572 Fei Xia 02/08/07. Topics in LING 572 Easy: –kNN, Rocchio, DT, DL –Feature selection, binarization, system combination –Bagging.
Online Stacked Graphical Learning Zhenzhen Kou +, Vitor R. Carvalho *, and William W. Cohen + Machine Learning Department + / Language Technologies Institute.
Online Learning Algorithms
A k-Nearest Neighbor Based Algorithm for Multi-Label Classification Min-Ling Zhang
1 Towards Heterogeneous Transfer Learning Qiang Yang Hong Kong University of Science and Technology Hong Kong, China
Selected Applications of Transfer Learning
Lecture 2: Bayesian Decision Theory 1. Diagram and formulation
Dinosaurs. Millions of years ago, long before there were any people, there were dinosaurs. The largest dinosaurs were over 30 meters long and up to 15.
Sampletalk Technology Presentation Andrew Gleibman
Welcome. ROUFUN NAHAR Assistant Teacher( English) Jamalpur Govt. Girls High School ROUFUN NAHAR Assistant Teacher( English) Jamalpur Govt. Girls High.
1 Towards Heterogeneous Transfer Learning Qiang Yang Hong Kong University of Science and Technology Hong Kong, China
Treatment Learning: Implementation and Application Ying Hu Electrical & Computer Engineering University of British Columbia.
Multiple Instance Real Boosting with Aggregation Functions Hossein Hajimirsadeghi and Greg Mori School of Computing Science Simon Fraser University International.
A Language Independent Method for Question Classification COLING 2004.
Group Sparse Coding Samy Bengio, Fernando Pereira, Yoram Singer, Dennis Strelow Google Mountain View, CA (NIPS2009) Presented by Miao Liu July
Pseudo-supervised Clustering for Text Documents Marco Maggini, Leonardo Rigutini, Marco Turchi Dipartimento di Ingegneria dell’Informazione Università.
Transfer Learning Motivation and Types Functional Transfer Learning Representational Transfer Learning References.
Welcome. Md. Ikramul Haque B.Sc, M.A in English Assistant Teacher( English) Mirzapur Islamia Dakhil Madrasah Dapunia, Pabna sadar, Pabna.
Virtual Vector Machine for Bayesian Online Classification Yuan (Alan) Qi CS & Statistics Purdue June, 2009 Joint work with T.P. Minka and R. Xiang.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Classification Derek Hoiem CS 598, Spring 2009 Jan 27, 2009.
Word Translation Disambiguation Using Bilingial Bootsrapping Paper written by Hang Li and Cong Li, Microsoft Research Asia Presented by Sarah Hunter.
T. REX Tyrannosaurus Rex Tyranno = Tyrant Saurus = Lizard Rex = King
Introduction to LDA Jinyang Gao. Outline Bayesian Analysis Dirichlet Distribution Evolution of Topic Model Gibbs Sampling Intuition Analysis of Parameter.
Bing LiuCS Department, UIC1 Chapter 8: Semi-supervised learning.
Optimal Dimensionality of Metric Space for kNN Classification Wei Zhang, Xiangyang Xue, Zichen Sun Yuefei Guo, and Hong Lu Dept. of Computer Science &
© 2013 Adobe Systems Incorporated. All Rights Reserved. Adobe Confidential. Learning Safe Strategies in Digital Marketing Mohammad Ghavamzadeh Adobe research.
Exploring in the Weblog Space by Detecting Informative and Affective Articles Xiaochuan Ni, Gui-Rong Xue, Xiao Ling, Yong Yu Shanghai Jiao-Tong University.
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Wenyuan Dai, Ou Jin, Gui-Rong Xue, Qiang Yang and Yong Yu Shanghai Jiao Tong University & Hong Kong University of Science and Technology.
Carnegie Mellon School of Computer Science Language Technologies Institute CMU Team-1 in TDT 2004 Workshop 1 CMU TEAM-A in TDT 2004 Topic Tracking Yiming.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Self-taught Clustering – an instance of Transfer Unsupervised Learning † Wenyuan Dai joint work with ‡ Qiang Yang, † Gui-Rong Xue, and † Yong Yu † Shanghai.
Machine Learning Lecture 1: Intro + Decision Trees Moshe Koppel Slides adapted from Tom Mitchell and from Dan Roth.
Information Retrieval in Practice
Jonatas Wehrmann, Willian Becker, Henry E. L. Cagnini, and Rodrigo C
Information Retrieval (in Practice)
True/False questions (3pts*2)
Introduction to Machine Learning
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Group Norm for Learning Latent Structural SVMs
Deep Learning based Machine Translation
Distributed Representation of Words, Sentences and Paragraphs
Variational Knowledge Graph Reasoning
I. Research Overview and Outcome
Overview of Machine Learning
John Lafferty, Chengxiang Zhai School of Computer Science
Word embeddings Text processing with current NNs requires encoding into vectors. One-hot encoding: N words encoded by length N vectors. A word gets a.
Word embeddings (continued)
Hal Daumé III Microsoft Research University of Maryland
Presentation transcript:

Translated Learning Wenyuan Dai, Yuqiang Chen, Gui-Rong Xue, Qiang Yang, and Yong Yu. Translated Learning. In Proceedings of Twenty- Second Annual Conference on Neural Information Processing Systems (NIPS 2008), December 8, 2008, Vancouver, British Columbia, Canada.

Definition Translated Learning – Learning across Different Feature Spaces

Applications Cross-language Classification – Rigutini et al., WI2005; Ling et al., WWW2008; … Text-aided Image Classification – We submitted two papers to AAAI2008 & ICML2008 respectively. Future work – Text to Music – Text to Video – Image to Video –…–…

Related Work Cross-language Classification – Rigutini et al., WI2005 English to Italian – Ling et al., WWW2008 English to Chinese Most cross-language classification approaches relies on machine translation. – Ad hoc – Machine translation is difficult in most scenarios. E.g. text-to-picture translation

Basic Idea Instance-level machine translation relies on understanding instances, at least basically. – Machine translation in NLP is an easy special case, since it is based on sentence understanding. Classification models are usually on the feature- level. Translating classification models is also on the feature-level. – could be much easier than instance-level translation

Human Learning Example Task: tyrannosaurus vs stegosaurus – htyrannosaurus: bipedal carnivore with a massive skull balanced by long, heavy tail. Its forelimbs were small and retained only two digits. – stegosaurus: quadruped ornithischian dinosaur of four long bony spikes on a flexible tail and two rows of upright triangular bony plates running along the back.

Model-level Translation Learning InputOutput Learning InputOutput Elephants are big mammals on earth... massive hoofed mammal of Africa... translating learning models

Naive Bayesian Approach Incorporating translator difficult to estimate

Risk Minimization Approach Loss function

Inference Assume there is no prior difference among all the classes

Model Estimation KL-divergence Negative of cosine Negative of PCC

Experimental Results

Outline Introduction Related Work Our Research Future Work

More applications – Cross-language classification Using dictionaries as translators – Text to music, video, … – Image to video Improving translator estimation – Integrating text classification and translator estimation into one optimization model

Questions