Interactive Data Mining and Business Applications Rayid Ghani Collaboration with Chad Cumby, Divna Djordjevic, Andy Fano, Marko Krema, Mohit Kumar, Abhimanyu.

Slides:



Advertisements
Similar presentations
T HE V ALUE OF E NTERPRISE S EARCH Robert Gill & Pieter-Jan De Boeck.
Advertisements

Context-Aware Mobile Music Recommendation for Daily Activities
Clustering and Load Balancing Optimization for Redundant Content Removal Shanzhong Zhu (Ask.com) Alexandra Potapova, Maha Alabduljalil (Univ. of California.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
A Revolution In Customer Engagement:
1. Abstract 2 Introduction Related Work Conclusion References.
About ISoft … What is Decision Tree? Alice Process … Conclusions Outline.
Chapter 2 Health Care Information Systems: A Practical Approach for Health Care Management 2nd Edition Wager ~ Lee ~ Glaser.
Maximizing Classifier Utility when Training Data is Costly Gary M. Weiss Ye Tian Fordham University.
REACTION REACTION Workshop Overview Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal.
Hierarchical Subquery Evaluation for Active Learning on a Graph Oisin Mac Aodha, Neill Campbell, Jan Kautz, Gabriel Brostow CVPR 2014 University College.
Hypertext Categorization using Hyperlink Patterns and Meta Data Rayid Ghani Séan Slattery Yiming Yang Carnegie Mellon University.
Meta Learning and Active Learning: Meta Learning and Active Learning: Collaborative Knowledge Discovery in Distributed Systems Dr Yonghong Peng Department.
Multimedia Data Mining Arvind Balasubramanian Multimedia Lab (ECSS 4.416) The University of Texas at Dallas.
Online Communities Academic Publishing Perspective.
Major Tasks in Data Preprocessing(Ref Chap 3) By Prof. Muhammad Amir Alam.
Business Intelligence
A Joint Model of Feature Mining and Sentiment Analysis for Product Review Rating Jorge Carrillo de Albornoz Laura Plaza Pablo Gervás Alberto Díaz Universidad.
Fraud Detection McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Management Information Systems By Effy Oz & Andy Jones
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Where Are the Nuggets in System Audit Data? Wenke Lee College of Computing Georgia Institute of Technology.
© 2010 IBM Corporation © 2011 IBM Corporation September 6, 2012 NCDHHS FAMS Overview for Behavioral Health Managed Care Organizations.
Maximizing long-term ROI for Active Learning Systems
© Negnevitsky, Pearson Education, Introduction, or what is data mining? Introduction, or what is data mining? Data warehouse and query tools Data.
Focused Matrix Factorization for Audience Selection in Display Advertising BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO,
Thesis Proposal PrActive Learning: Practical Active Learning, Generalizing Active Learning for Real-World Deployments.
Personalized Interaction with Web Resources First Sino-German Symposium on KNOWLEDGE HANDLING: REPRESENTATION, MANAGEMENT AND PERSONALIZED APPLICATION.
Introduction, or what is data mining? Introduction, or what is data mining? Data warehouse and query tools Data warehouse and query tools Decision trees.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Adaptive Mining Techniques for Data Streams using Algorithm Output Granularity Mohamed Medhat Gaber, Shonali Krishnaswamy, Arkady Zaslavsky In Proceedings.
Knowledge and Learning in Complex Business Systems Zuobing Xu University of California, Santa Cruz (Silicon Valley Center) Ram Akella, Kristin Fridgeirsdottir,
Personalizing Java based Answers for Hundreds of Millions of Users Anurag Gupta Senior Architect, Yahoo Answers & Groups
Chapter 6 Management of Information Systems and Management Innovation.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
DDM Kirk. LSST-VAO discussion: Distributed Data Mining (DDM) Kirk Borne George Mason University March 24, 2011.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Embedded Lab. Park Yeongseong.  Introduction  Problem Formulation  Approach Overview  AOI(Area Of Interest) Extraction  CallStack Pattern Mining.
Hypertext Categorization using Hyperlink Patterns and Meta Data Rayid Ghani Séan Slattery Yiming Yang Carnegie Mellon University.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Sigmafine 4.3 Roberto Linares, Ph.D. Sigmafine Group Lead.
N VISUAL ANALYTICS FOR HEALTHCARE: BIG DATA, BIG DECISIONS David Gotz Healthcare Analytics Research Group IBM T.J. Watson Research Center.
Location-based Social Networks 6/11/20161 CENG 770.
 What is a CRM  Uses of a CRM  What is Data Mining  Data Mining Tasks  How a CRM Utilizes Data Mining  Companies who Use CRM Data Mining.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Presenter: Siddharth Krishna Sinha Instructor: Jing Gao
Fragmented Services: 7+ Average number of health related vendors employees need to interact with. Lack Engagement: 57% of large employers say that a lack.
Model Discovery through Metalearning
Kyriaki Dimitriadou, Brandeis University
Machine Learning overview Chapter 18, 21
Machine Learning overview Chapter 18, 21
Contextual Intelligence as a Driver of Services Innovation
FINTECH IN CYBER RISK.
Data Mining to Predict and Prevent Errors in Health Insurance Claims Processing Mohit Kumar, Rayid Ghani and Zhu-Song Mei Copyright © 2010 Accenture All.
Future-oriented Benchmarking Through Social Media Analysis
Crossing the gap between multimedia data and semantics
Best Social Media Marketing Company in Chennai
Microsoft /7/2018 1:14 PM BRK2045 Explore new personal, intelligence powered search experiences in SharePoint, Delve and Office 365 Alina Skarbovsky.
כריית מידע -- מבוא ד"ר אבי רוזנפלד.
Contributing to the Integrity of the System
CLIENT ZipDial.
כריית נתונים.
Using Data Mining To Improve Company Strategies
Opening Weka Select Weka from Start Menu Select Explorer Fall 2003
Sergei Ananyan, Ph.D. Healthcare Fraud Detection through Data Mining Your Knowledge Partner TM (c) Megaputer Intelligence.
PolyAnalyst Web Report Training
Microarray Data Set The microarray data set we are dealing with is represented as a 2d numerical array.
Wipro Promax POI Tech lightning talk
PolyAnalyst™ text mining tool Allstate Insurance example
Interactive Information Retrieval
Presentation transcript:

Interactive Data Mining and Business Applications Rayid Ghani Collaboration with Chad Cumby, Divna Djordjevic, Andy Fano, Marko Krema, Mohit Kumar, Abhimanyu Lad, Yiming Yang

Accenture Technology LabsRayid Ghani Tradeoffs Cost (Time of human expert) Exploration (Future classifier performance) Exploitation (Relevancy to the expert) Exploration-Exploitation Tradeoffs Cost-Sensitive Active Learning Standard Ranking / Relevance Feedback Active Learning

Accenture Technology LabsRayid Ghani Case Studies Product Attribute Discovery & Extraction Health Insurance: Error Detection in Claims Social Media: Sentiment Analysis Knowledge Management: Form Filling

Accenture Technology LabsRayid Ghani Tradeoffs in Interactive Data Mining Cost (Time of human expert) Exploration (Future classifier performance) Exploitation (Relevancy to the expert)

Accenture Technology LabsRayid Ghani System Demo 4

Accenture Technology LabsRayid Ghani More Like This strategy Labeled Data Ranked List scored by classifier Select Top m% claims Rank Online Strategy Cluster

Accenture Technology LabsRayid Ghani Live System Results 90% relative improvement in accuracy over standard system 27% reduction in audit time ~$10 Million savings/year for a typical insurance company

Accenture Technology LabsRayid Ghani Summary Interactive Data Mining settings are prevalent in many business applications Challenge: efficiently calculate the incremental cost and benefit of any information that passes between expert and data mining system Allows users to control and manage tradeoffs making adoption easier and faster