Multi-Task Learning for Boosting with Application to Web Search Ranking Olivier Chapelle et al. Presenter: Wei Cheng.

Slides:



Advertisements
Similar presentations
A Survey on Transfer Learning Sinno Jialin Pan Department of Computer Science and Engineering The Hong Kong University of Science and Technology Joint.
Advertisements

Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Personalized Query Classification Bin Cao, Qiang Yang, Derek Hao Hu, et al. Computer Science and Engineering Hong Kong UST.
An Introduction of Support Vector Machine
Ziming Zhang*, Ze-Nian Li, Mark Drew School of Computing Science Simon Fraser University Vancouver, Canada {zza27, li, AdaMKL: A Novel.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Decision Tree under MapReduce Week 14 Part II. Decision Tree.
Personal Name Classification in Web queries Dou Shen*, Toby Walker*, Zijian Zheng*, Qiang Yang**, Ying Li* *Microsoft Corporation ** Hong Kong University.
Discriminative and generative methods for bags of features
Transfer Learning for WiFi-based Indoor Localization
Retrieving Actions in Group Contexts Tian Lan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010.
A New Biclustering Algorithm for Analyzing Biological Data Prashant Paymal Advisor: Dr. Hesham Ali.
Chang WangChang Wang, Sridhar mahadevanSridhar mahadevan.
1 Transfer Learning Algorithms for Image Classification Ariadna Quattoni MIT, CSAIL Advisors: Michael Collins Trevor Darrell.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
OCFS: Optimal Orthogonal Centroid Feature Selection for Text Categorization Jun Yan, Ning Liu, Benyu Zhang, Shuicheng Yan, Zheng Chen, and Weiguo Fan et.
Sample-Separation-Margin Based Minimum Classification Error Training of Pattern Classifiers with Quadratic Discriminant Functions Yongqiang Wang 1,2, Qiang.
Sparse vs. Ensemble Approaches to Supervised Learning
1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.
Ensemble Learning (2), Tree and Forest
Relaxed Transfer of Different Classes via Spectral Partition Xiaoxiao Shi 1 Wei Fan 2 Qiang Yang 3 Jiangtao Ren 4 1 University of Illinois at Chicago 2.
TransRank: A Novel Algorithm for Transfer of Rank Learning Depin Chen, Jun Yan, Gang Wang et al. University of Science and Technology of China, USTC Machine.
Cao et al. ICML 2010 Presented by Danushka Bollegala.
A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.
Yu-Gang Jiang, Yanran Wang, Rui Feng Xiangyang Xue, Yingbin Zheng, Hanfang Yang Understanding and Predicting Interestingness of Videos Fudan University,
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Transfer Learning Part I: Overview Sinno Jialin Pan Sinno Jialin Pan Institute for Infocomm Research (I2R), Singapore.
Dual Coordinate Descent Algorithms for Efficient Large Margin Structured Prediction Ming-Wei Chang and Scott Wen-tau Yih Microsoft Research 1.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Transfer Learning Motivation and Types Functional Transfer Learning Representational Transfer Learning References.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
MODEL ADAPTATION FOR PERSONALIZED OPINION ANALYSIS MOHAMMAD AL BONI KEIRA ZHOU.
Considering Cost Asymmetry in Learning Classifiers Presented by Chunping Wang Machine Learning Group, Duke University May 21, 2007 by Bach, Heckerman and.
Dual Transfer Learning Mingsheng Long 1,2, Jianmin Wang 2, Guiguang Ding 2 Wei Cheng, Xiang Zhang, and Wei Wang 1 Department of Computer Science and Technology.
Institute of Computing Technology, Chinese Academy of Sciences 1 A Unified Framework of Recommending Diverse and Relevant Queries Speaker: Xiaofei Zhu.
Group ID: 19 ZHU Wenya & LIN Dandan Predicting student performance from book-borrowing records.
Multiple Instance Learning for Sparse Positive Bags Razvan C. Bunescu Machine Learning Group Department of Computer Sciences University of Texas at Austin.
Ariadna Quattoni Xavier Carreras An Efficient Projection for l 1,∞ Regularization Michael Collins Trevor Darrell MIT CSAIL.
Wenyuan Dai, Ou Jin, Gui-Rong Xue, Qiang Yang and Yong Yu Shanghai Jiao Tong University & Hong Kong University of Science and Technology.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
A Novel Relational Learning-to- Rank Approach for Topic-focused Multi-Document Summarization Yadong Zhu, Yanyan Lan, Jiafeng Guo, Pan Du, Xueqi Cheng Institute.
Self-taught Clustering – an instance of Transfer Unsupervised Learning † Wenyuan Dai joint work with ‡ Qiang Yang, † Gui-Rong Xue, and † Yong Yu † Shanghai.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Nawanol Theera-Ampornpunt, Seong Gon Kim, Asish Ghoshal, Saurabh Bagchi, Ananth Grama, and Somali Chaterji Fast Training on Large Genomics Data using Distributed.
Transfer and Multitask Learning Steve Clanton. Multiple Tasks and Generalization “The ability of a system to recognize and apply knowledge and skills.
Learning by Loss Minimization. Machine learning: Learn a Function from Examples Function: Examples: – Supervised: – Unsupervised: – Semisuprvised:
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Day 17: Duality and Nonlinear SVM Kristin P. Bennett Mathematical Sciences Department Rensselaer Polytechnic Institute.
Computer Science and Engineering Jianye Yang 1, Ying Zhang 2, Wenjie Zhang 1, Xuemin Lin 1 Influence based Cost Optimization on User Preference 1 The University.
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Bridging Domains Using World Wide Knowledge for Transfer Learning
Learning Recommender Systems with Adaptive Regularization
Boosting and Additive Trees (2)
Predicting Long-Term Impact of CQA Posts: A Comprehensive Viewpoint
Source: Procedia Computer Science(2015)70:
Asymmetric Gradient Boosting with Application to Spam Filtering
Machine Learning Week 1.
Combining Species Occupancy Models and Boosted Regression Trees
Fuzzy Support Vector Machines
Open-Category Classification by Adversarial Sample Generation
Review-Level Aspect-Based Sentiment Analysis Using an Ontology
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks
An Efficient Projection for L1-∞ Regularization
Presentation transcript:

Multi-Task Learning for Boosting with Application to Web Search Ranking Olivier Chapelle et al. Presenter: Wei Cheng

Outline Motivation Backgrounds Algorithm – From svm to boosting using L1 regularization – Є-boosting for optimization – Overall algorithm Evaluation Overall review and new research points discussion

Motivation Different/same search engine(s) for different countries? Domain specific engine is better! e.g. ‘gelivable’ (very useful) Domain specific engine is better! e.g. ‘gelivable’ (very useful)

Motivation Should we train ranking model separately? – Corps in some domains might be too small to train a good model – Solution: Multi-task learning

Backgrounds Sinno Jialin Pan and Qiang Yang, TKDE’2010

Backgrounds Sinno Jialin Pan and Qiang Yang, TKDE’2010

Backgrounds Why transfer learning works? Sinno Jialin Pan et al. At WWW’10

Backgrounds Why transfer learning works?(continue)

Backgrounds Why transfer learning works?(continue)

Backgrounds LearnerA Input: Target: LearnerB Dog/human Girl/boy traditional learning

Backgrounds Joint Learning Task Input: Target: Dog/human Girl/boy Multi-task learning

Algorithm The algorithm aims at designing an algorithm based on gradient boosted decision trees Inspired by svm based multi-task solution and boosting-trick. Using Є-boosting for optimization

Algorithm From svm to boosting using L1 regularization Previous svm based multi-task learning:

Algorithm Svm(kernel-trick)---  boosting (boosting trick) Pick set of non-linear functions(e.g., decision trees, regression trees,….) H |H|=J Apply every single function to each data point X  ф(X)

Є-boosting for optimization Using L1 regulization Using Є-boosting

Algorithm

Evaluation Datasets

Evaluation

Overall review and new research point discussion Contributions: – Propose a novel multi-task learning method based on gradient boosted decision tree, which is useful for web-reranking applications. (e.g., personalized search). – Have a thorough evaluation on we-scale datasets. New research points: – Negative transfer: – Effective grouping: flexible domain adaptation

Q&A Thanks!