SCORE AS YOU LIFT (SAYL) A Statistical Relational Learning Approach to Uplift Modeling Houssam Nassif 1, Finn Kuusisto 1, Elizabeth S. Burnside 1, David.

Slides:



Advertisements
Similar presentations
Statistical Machine Learning- The Basic Approach and Current Research Challenges Shai Ben-David CS497 February, 2007.
Advertisements

Lecture 3 Validity of screening and diagnostic tests
Imbalanced data David Kauchak CS 451 – Fall 2013.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
View Learning: An extension to SRL An application in Mammography Jesse Davis, Beth Burnside, Inês Dutra Vítor Santos Costa, David Page, Jude Shavlik &
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Clinical Trial Designs for the Evaluation of Prognostic & Predictive Classifiers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.
Departments of Medicine and Biostatistics
Evaluating Diagnostic Accuracy of Prostate Cancer Using Bayesian Analysis Part of an Undergraduate Research course Chantal D. Larose.
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
Model and Variable Selections for Personalized Medicine Lu Tian (Northwestern University) Hajime Uno (Kitasato University) Tianxi Cai, Els Goetghebeur,
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
ROC Curves.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
How do we know whether a marker or model is any good? A discussion of some simple decision analytic methods Carrie Bennette on behalf of Andrew Vickers.
CSCI 5582 Fall 2006 CSCI 5582 Artificial Intelligence Lecture 22 Jim Martin.
Ordinal Decision Trees Qinghua Hu Harbin Institute of Technology
© Jesse Davis 2006 View Learning Extended: Learning New Tables Jesse Davis 1, Elizabeth Burnside 1, David Page 1, Vítor Santos Costa 2 1 University of.
Discriminant Analysis Testing latent variables as predictors of groups.
Learning at Low False Positive Rate Scott Wen-tau Yih Joshua Goodman Learning for Messaging and Adversarial Problems Microsoft Research Geoff Hulten Microsoft.
Uncovering Age-Specific Invasive and DCIS Breast Cancer Rules Using Inductive Logic Programming Houssam Nassif, David Page, Mehmet Ayvaci, Jude Shavlik,
Comparison of MRI Perfusion and PET-CT in Differentiating Brain Tumor Progression from Radiation Injury after Cranial Irradiation T. Jonathan Yang, M.D.
Chapter 13: Inference in Regression
1/24 Learning to Extract Genic Interactions Using Gleaner LLL05 Workshop, 7 August 2005 ICML 2005, Bonn, Germany Mark Goadrich, Louis Oliphant and Jude.
UOG Journal Club: January 2013
197 Case Study: Predicting Breast Cancer Invasion with Artificial Neural Networks on the Basis of Mammographic Features MEDINFO 2004, T02: Machine Learning.
Criteria for Assessment of Performance of Cancer Risk Prediction Models: Overview Ruth Pfeiffer Cancer Risk Prediction Workshop, May 21, 2004 Division.
On ranking in survival analysis: Bounds on the concordance index
Designing Efficient Cascaded Classifiers: Tradeoff between Accuracy and Cost Vikas Raykar Balaji Krishnapuram Shipeng Yu Siemens Healthcare KDD 2010 TexPoint.
Age Stratified Risk Prediction of Invasive versus In-situ Breast Cancer: A Logistic Regression Model Mehmet Ayvaci 1,2 Oguzhan Alagoz 1,Jagpreet Chhatwal.
EDRN Approaches to Biomarker Validation DMCC Statisticians Fred Hutchinson Cancer Research Center Margaret Pepe Ziding Feng, Mark Thornquist, Yingye Zheng,
The PCA3 Assay improves the prediction of initial biopsy outcome and may be indicative of prostate cancer aggressiveness de la Taille A, Irani J, Graefen.
Learning Ensembles of First-Order Clauses for Recall-Precision Curves A Case Study in Biomedical Information Extraction Mark Goadrich, Louis Oliphant and.
Integrating Machine Learning and Physician Knowledge to Improve the Accuracy of Breast Biopsy Inês Dutra University of Porto, CRACS & INESC-Porto LA Houssam.
Metabolic Syndrome and Recurrence within the 21-Gene Recurrence Score Assay Risk Categories in Lymph Node Negative Breast Cancer Lakhani A et al. Proc.
Extracting BI-RADS Features from Portuguese Clinical Texts H. Nassif, F. Cunha, I.C. Moreira, R. Cruz- Correia, E. Sousa, D. Page, E. Burnside, and I.
Screening and Diagnostic Testing Sue Lindsay, Ph.D., MSW, MPH Division of Epidemiology and Biostatistics Institute for Public Health San Diego State University.
Transfer in Reinforcement Learning via Markov Logic Networks Lisa Torrey, Jude Shavlik, Sriraam Natarajan, Pavan Kuppili, Trevor Walker University of Wisconsin-Madison,
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
Skewing: An Efficient Alternative to Lookahead for Decision Tree Induction David PageSoumya Ray Department of Biostatistics and Medical Informatics Department.
Department of Computer Sciences
BOOSTING David Kauchak CS451 – Fall Admin Final project.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Use of Candidate Predictive Biomarkers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.
Mebi 591D – BHI Kaggle Class Baselines kaggleclass.weebly.com/
Using Predictive Classifiers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
Computational Intelligence: Methods and Applications Lecture 16 Model evaluation and ROC Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Poster Title ABSTRACT #59 Cell cycle progression genes differentiate indolent from aggressive prostate cancer. Steven Stone 1 Jack Cuzick 2, Julia Reid.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
A comparative study of survival models for breast cancer prognostication based on microarray data: a single gene beat them all? B. Haibe-Kains, C. Desmedt,
The Correlational Research Strategy Chapter 12. Correlational Research The goal of correlational research is to describe the relationship between variables.
Linear Discriminant Analysis and Logistic Regression.
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
1 Solving ILP Problems in the EELA infrastructure Inês Dutra Departamento de Ciência de Computadores Universidade do Porto, Portugal.
Information Extraction for Clinical Data Mining: A Mammography Case Study H. Nassif, R. Woods, E. Burnside, M. Ayvaci, J. Shavlik and D. Page University.
Data Analytics CMIS Short Course part II Day 1 Part 4: ROC Curves Sam Buttrey December 2015.
Automatic extraction of BI-RADS breast tissue composition classes from mammography reports Bethany Percha (Stanford) Houssam Nassif (U. Wisconsin) Jafi.
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
David Amar, Tom Hait, and Ron Shamir
#4826 Cancer/testis antigen expression pattern is a potential biomarker for prostate cancer aggressiveness Luciane T. Kagohara1, Prakash Kulkarni1, Takumi.
Effects of User Similarity in Social Media Ashton Anderson Jure Leskovec Daniel Huttenlocher Jon Kleinberg Stanford University Cornell University Avia.
An Artificial Intelligence Approach to Precision Oncology
A Long Noncoding RNA Signature That Predicts Pathological Complete Remission Rate Sensitively in Neoadjuvant Treatment of Breast Cancer  Gen Wang, Xiaosong.
Evaluating Classifiers (& other algorithms)
Published online September 20, 2017 by JAMA Surgery
Evaluating Classifiers
Predicting Breast Cancer by Applying Deep Learning to Linked Health Records and Mammograms The algorithm predicted future breast cancer based on the index.
The Burden of Cancer in Nova Scotia an evaluation of loss in expectation of life Ron Dewar Registry and Analytics Presented to the joint NAACCR.
Evidence Based Diagnosis
Presentation transcript:

SCORE AS YOU LIFT (SAYL) A Statistical Relational Learning Approach to Uplift Modeling Houssam Nassif 1, Finn Kuusisto 1, Elizabeth S. Burnside 1, David Page 1, Jude Shavlik 1, and Vítor Santos Costa 2 1 University of Wisconsin, Madison, USA 2 CRACS-INESC TEC and DCC-FCUP, University of Porto, Portugal

The Task Identify patients with breast cancer who may be good candidates for watchful waiting Produce interpretable classifiers, for insight on the problem 9/24/2013Score As You Lift

Breast Cancer Stages In Situ Earlier stage Cancer is localized Invasive Later stage Cancer has invaded surrounding tissue 9/24/2013Score As You Lift

Breast Cancer Age Difference Older Cancer tends to progress less aggressively Patient has less time for cancer progression Younger Cancer tends to progress more aggressively Patient has more time for cancer progression 9/24/2013Score As You Lift

Overtreatment Problem Who is treated? Everyone Can we reduce overtreatment in older patients with in situ cancer? 9/24/2013Score As You Lift

Watchful Waiting Candidates Older In Situ Sufficiently different from that of younger patients 9/24/2013Score As You Lift

Problem Definition Given: a Multi-Relational DB Labeling Find: Watchful Waiting Candidates Interpretable Explanations 9/24/2013Score As You Lift Subject Group Control Group Positive Class Negative Class OlderYoungerIn SituInvasive

Differential Prediction New Problem? Psychology: results of a test on two populations Medicine: ADR Marketing: Uplift Modeling 9/24/2013Score As You Lift

Model Filtering (MF) 1. Use ILP to learn theories 2. Learn a theory T O for older 3. T = { Rules in T O that do badly on younger } 9/24/2013Score As You Lift

Differentially Predictive Search (DPS) Use ILP as before But change search: Change node scoring to incorporate case/controls + on cases - on controls 9/24/2013Score As You Lift

Uplift Modeling How to evaluate different approaches? Key Work from Marketing Idea: Performance on Cases – Performance on Controls Robust Measure of Performance: Area under Lift Curve 9/24/2013Score As You Lift

Lift Curve Lift Curve: Fraction E% of Examples to correctly classify P positive examples Lift(0) = 0 Lift(1) = Pos Similar to ROC curve 9/24/2013Score As You Lift

AUL and Uplift 9/24/2013Score As You Lift Uplift SC (x) = Lift S (x) – Lift C (x) Uplift SC (0) = 0 Uplift SC (1) = Pos S - Pos C AU_UPLIFT(S, C) = AUL(S) – AUL(C)

Lift and Uplift 9/24/2013Score As You Lift

Maximise UPLIFT: SAYL Can we do better than DPS? SAYU was designed to max an external theory score Can we use SAYU for a differential score? 9/24/2013Score As You Lift

SAYU SAYU learns a classifier from ILP rules Classifiers are NBAYES, TAN It learns incrementally, by adding rules Rule in if it improves score on tune test 9/24/2013Score As You Lift

SAYL Separates cases (subjects) and controls Learns one classifier for each Scores on uplift 9/24/2013Score As You Lift

SAYL Algorithm 9/24/2013Score As You Lift

Evaluation How does it perform? Do rules interact? Are they useful? 9/24/2013Score As You Lift

Dataset OlderYounger In SituInvasiveIn SituInvasive InterventionSubject Group Control Group Positive Class Negative Class TimeOlderYoungerIn SituInvasive 9/24/2013Score As You Lift

Performance – Uplift Curves 9/24/2013Score As You Lift

Performance - Significance AlgorithmUplift AUCLift (Older) AUC Lift (Younger) AUC Rules Avg # DPS p-value SAYL DPS MF Baseline /24/2013Score As You Lift

Example Older TAN Model 9/24/2013Score As You Lift

Example Younger TAN Model 9/24/2013Score As You Lift

Example Rules 1. Current study combined BI-RADS increased up to 3 points over previous mammogram. 2. Patient had previous in situ biopsy at same location. 3. Breast BI-RADS score = 4. 9/24/2013Score As You Lift

Conclusions SAYL maximizes area under the uplift curve. SAYL significantly outperforms previous ILP methods on breast cancer application. Output rules are clinically interpretable. 9/24/2013Score As You Lift

Future Work Investigate clinical relevance further. Investigate uplift maximization using different metrics. Investigate use of initial TAN model structure. 9/24/2013Score As You Lift

Questions?

Uplift and AUC AUL is proportional to AUC AUL = P ( skew/2 + (1-skew)AUC) 0 <= skew <= linear and monotonic UPLIFT(s, c) = C 1 + C 2 (AUC s – AUC c ) 9/24/2013Score As You Lift

Marketing Customer Groups Persuadables Customers who will respond only when targeted. Sure Things Customers who will respond even when not targeted. Lost Causes Customers who will not respond, regardless of whether they were targeted or not. Sleeping Dogs Customers who will not respond as a result of being targeted. 9/24/2013Score As You Lift

Marketing Ideal Ranking 9/24/2013Score As You Lift

Marketing Dataset 9/24/2013Score As You Lift TargetControl ResponseNo ResponseResponseNo Response Persuadables Sure Things Sleeping Dogs Lost Causes Sleeping Dogs Sure Things Persuadables Lost Causes

In Situ vs. Invasive Dataset 9/24/2013Score As You Lift OlderYounger In SituInvasiveIn SituInvasive Indolent In Situ Aggressive In Situ Always ExciseAggressive In SituAlways Excise