Thesis Proposal PrActive Learning: Practical Active Learning, Generalizing Active Learning for Real-World Deployments.

Slides:



Advertisements
Similar presentations
Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao Wei Fan Jing JiangJiawei Han University of Illinois at Urbana-Champaign IBM T. J.
Advertisements

Actively Transfer Domain Knowledge Xiaoxiao Shi Wei Fan Jiangtao Ren Sun Yat-sen University IBM T. J. Watson Research Center Transfer when you can, otherwise.
Systematic Data Selection to Mine Concept Drifting Data Streams Wei Fan IBM T.J.Watson.
Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao, Wei Fan, Jing Jiang, Jiawei Han l Motivate Solution Framework Data Sets Synthetic.
A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions Jing Gao Wei Fan Jiawei Han Philip S. Yu University of Illinois.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Active Learning with Feedback on Both Features and Instances H. Raghavan, O. Madani and R. Jones Journal of Machine Learning Research 7 (2006) Presented.
Imbalanced data David Kauchak CS 451 – Fall 2013.
Extraction and Transfer of Knowledge in Reinforcement Learning A.LAZARIC Inria “30 minutes de Science” Seminars SequeL Inria Lille – Nord Europe December.
Yue Han and Lei Yu Binghamton University.
Proactive Learning: Cost- Sensitive Active Learning with Multiple Imperfect Oracles Pinar Donmez and Jaime Carbonell Pinar Donmez and Jaime Carbonell Language.
Retrieving Actions in Group Contexts Tian Lan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
ACM Multimedia th Annual Conference, October , 2004
Maximizing Classifier Utility when Training Data is Costly Gary M. Weiss Ye Tian Fordham University.
Part I: Classification and Bayesian Learning
Learning from Multiple Outlooks Maayan Harel and Shie Mannor ICML 2011 Presented by Minhua Chen.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
1 Efficiently Learning the Accuracy of Labeling Sources for Selective Sampling by Pinar Donmez, Jaime Carbonell, Jeff Schneider School of Computer Science,
Learning to Learn By Exploiting Prior Knowledge
Anomaly detection Problem motivation Machine Learning.
Hybrid-ε-greedy for Mobile Context- Aware Recommender System Djallel Bouneffouf, Amel Bouzeghoub & Alda Lopes Gançarski Institut Télécom, Télécom SudParis,
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
Where Are the Nuggets in System Audit Data? Wenke Lee College of Computing Georgia Institute of Technology.
Maximizing long-term ROI for Active Learning Systems
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Overcoming Dataset Bias: An Unsupervised Domain Adaptation Approach Boqing Gong University of Southern California Joint work with Fei Sha and Kristen Grauman.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Cost-Sensitive Bayesian Network algorithm Introduction: Machine learning algorithms are becoming an increasingly important area for research and application.
Computer-aided modelling of constraint problems C. Bessiere, LIRMM-CNRS, Montpellier, France In collaboration with: E. Freuder, B. O’Sullivan, U. College.
Universit at Dortmund, LS VIII
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Data Mining and Machine Learning Lab Unsupervised Feature Selection for Linked Social Media Data Jiliang Tang and Huan Liu Computer Science and Engineering.
Transfer Learning Motivation and Types Functional Transfer Learning Representational Transfer Learning References.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Intelligent Database Systems Lab Presenter: Wu, Jhen-Wei Authors: Fabian Bürger, Josef Pauli ICPRAM. Representation Optimization with Feature Selection.
Learning from Positive and Unlabeled Examples Investigator: Bing Liu, Computer Science Prime Grant Support: National Science Foundation Problem Statement.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Chapter 13 (Prototype Methods and Nearest-Neighbors )
Data Mining for Surveillance Applications Suspicious Event Detection Dr. Bhavani Thuraisingham.
Xiangnan Kong,Philip S. Yu An Ensemble-based Approach to Fast Classification of Multi-label Data Streams Dept. of Computer Science University of Illinois.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
A PAC-Bayesian Approach to Formulation of Clustering Objectives Yevgeny Seldin Joint work with Naftali Tishby.
Predicting Consensus Ranking in Crowdsourced Setting Xi Chen Mentors: Paul Bennett and Eric Horvitz Collaborator: Kevyn Collins-Thompson Machine Learning.
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Interactive Data Mining and Business Applications Rayid Ghani Collaboration with Chad Cumby, Divna Djordjevic, Andy Fano, Marko Krema, Mohit Kumar, Abhimanyu.
Mining Concept-Drifting Data Streams Using Ensemble Classifiers Haixun Wang Wei Fan Philip S. YU Jiawei Han Proc. 9 th ACM SIGKDD Internal Conf. Knowledge.
Jian Li Institute for Interdisciplinary Information Sciences Tsinghua University Multi-armed Bandit Problems WAIM 2014.
Mining Statistically Significant Co-location and Segregation Patterns.
Introduction to Machine Learning, its potential usage in network area,
Contextual Intelligence as a Driver of Services Innovation
Intro to Machine Learning
Data Mining to Predict and Prevent Errors in Health Insurance Claims Processing Mohit Kumar, Rayid Ghani and Zhu-Song Mei Copyright © 2010 Accenture All.
KDD 2004: Adversarial Classification
Tingdan Luo 05/02/2016 Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem Tingdan Luo
Damiano Bolzoni, Sandro Etalle, Pieter H. Hartel
Overview of Machine Learning
The use of Neural Networks to schedule flow-shop with dynamic job arrival ‘A Multi-Neural Network Learning for lot Sizing and Sequencing on a Flow-Shop’
Knowledge Transfer via Multiple Model Local Structure Mapping
Active Learning with Unbalanced Classes & Example-Generation Queries
Concave Minimization for Support Vector Machine Classifiers
Jia-Bin Huang Virginia Tech
Anirban Laha and Vikas C. Raykar, IBM Research – India.
Presentation transcript:

Thesis Proposal PrActive Learning: Practical Active Learning, Generalizing Active Learning for Real-World Deployments

Generic example system flow for interactive classification problems Large volume (in millions) of transactions coming in Majority transactions automatically cleared Minority transactions flagged for manual processing Transactions processed successfully Domain specific transaction processing Credit Card Fraud transactions High false positive rates for typical rule- based/hypothesis systems Rule Based System to Flag Transactions for Manual Intervention Hypothesis/Rule- based system for flagging exceptions

Generic example system flow for interactive classification problems Large volume (in millions) of transactions coming in Majority transactions automatically cleared Minority transactions flagged for auditing Transactions processed successfully Domain specific transaction processing Machine Learning model Goal: Optimize Return On Investment of Auditor’s time over long term Common Characteristics Skewed class distribution (minority events) Concept/Feature drift Expensive domain experts Biased sampling of labeled historical data Lots of unlabeled data Lower false positive rates based on learning model Introduce Learning Model to Flag Transactions for Manual Intervention

Interactive Classification Applications Fraud detection Network Intrusion detection Video Surveillance Information Filtering / Recommender Systems Error prediction/Quality Control

Classifier trained from labeled data Human (user/expert) in the loop using the results but also providing feedback at a cost Goal: Maximize the Return on Investment which is equivalent to the productivity of the human Interactive Classification Setting Unlabeled + Labeled Data Trained Classifier Ranked List scored by classifier

Factorization of the problem Cost (Time of human expert) Exploration (Future classifier performance) Exploitation (Relevancy to the expert) Exploration-Exploitation Tradeoffs Cost-Sensitive Active Learning Standard Ranking / Relevance Feedback Active Learning Cost-Sensitive Exploitation

Labeled Data (1,…,t-1) Trained Classifier (1,…,t-1) Ranked List Cost (Time of human expert) Exploration (Future classifier performance) Exploitation (Relevancy to the expert) Labeled Data (t) Unlabeled Data (t) Interactive Classification-High Level Picture

Thesis Contributions Problem Statement: How to generalize active learning to incorporate differential utility of a labeled example(dynamic/variable exploitation), dynamic cost of labeling an example, concept drift in a unified framework that makes the deployment of such learning systems practical Contributions – Generalization of Active Learning along the following dimensions Differential utility of a labeled example Dynamic cost of labeling an example Tackling concept drift Cost-Sensitive Exploitation A unified framework to solve these considerations jointly – First solution: Optimizing joint utility function based on cost, exploration utility and exploitation utility – Second solution: Using Upper Confidence Bound approach with contextual multi-armed bandit setup to incorporate the different factors – Empirical Evaluation of the proposed framework Using evaluation metric motivated by real business tasks Datasets – Synthetic dataset – Real world dataset: Health Insurance Claims Rework Comparison with multiple baselines based on underlying factors

Situating the thesis work wrt related work Active Learning Cost-sensitive Proactive Learning Unreliable Oracle Oracle variation PrActive Learning Differential Utility Dynamic cost Concept Drift Efficiency & Representation Feature level feedback Feature acquisition Batch active learning