4Saarland University, 5Cornell Tech

Slides:

Advertisements

Similar presentations

Random Forest Predrag Radenković 3237/10

Advertisements

Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.

Discriminant Analysis Database Marketing Instructor:Nanda Kumar.

Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.

Linear Regression. PSYC 6130, PROF. J. ELDER 2 Correlation vs Regression: What’s the Difference? Correlation measures how strongly related 2 variables.

Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.

Decision Tree Rong Jin. Determine Milage Per Gallon.

Multiple Regression Involves the use of more than one independent variable. Multivariate analysis involves more than one dependent variable - OMS 633 Adding.

Multiple Regression – Basic Relationships

1 Measurement Adapted from The Research Methods Knowledge Base, William Trochim (2006). & Methods for Social Researchers in Developing Counries, The Ahfad.

CORRELATIO NAL RESEARCH METHOD. The researcher wanted to determine if there is a significant relationship between the nursing personnel characteristics.

Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.

Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.

Chapter 1: Introduction to Statistics

Moderation & Mediation

Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.

Analyzing and Interpreting Quantitative Data

Bug Localization with Machine Learning Techniques Wujie Zheng

Zhangxi Lin ISQS Texas Tech University Note: Most slides are from Decision Tree Modeling by SAS Lecture Notes 5 Auxiliary Uses of Trees.

Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)

Universit at Dortmund, LS VIII

Patterns of Event Causality Suggest More Effective Corrective Actions Abstract: The Occurrence Reporting and Processing System (ORPS) has used a consistent.

Henry Domenico Vanderbilt University Medical Center.

Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.

HLM Models. General Analysis Strategy Baseline Model - No Predictors Model 1- Level 1 Predictors Model 2 – Level 2 Predictors of Group Mean Model 3 –

Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.

Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.

One-Way Analysis of Covariance (ANCOVA)

C OMPARING A SSOCIATION R ULES AND D ECISION T REES FOR D ISEASE P REDICTION Carlos Ordonez.

1 MGT 511: Hypothesis Testing and Regression Lecture 8: Framework for Multiple Regression Analysis K. Sudhir Yale SOM-EMBA.

Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.

Data Mining and Decision Support

CLUSTERING AND SEGMENTATION MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.

10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.

Nemours Biomedical Research Statistics April 9, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.

(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.

NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.

Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.

26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.

Machine Learning with Spark MLlib

Carina Omoeva, FHI 360 Wael Moussa, FHI 360

Regression Analysis Module 3.

Supplementary Table 1. PRISMA checklist

Data mining and statistical learning, lecture 1b

Multiple Regression Analysis and Model Building

Martin Rajman, Martin Vesely

ASPIRE Workshop 5: Application of Biostatistics

ASPIRE Workshop 5: Application of Biostatistics

ECE 5424: Introduction to Machine Learning

Applications of IScore (using R)

NBA Draft Prediction BIT 5534 May 2nd 2018

Data Mining Practical Machine Learning Tools and Techniques

Survey phases, survey errors and quality control system

Prepared by Lee Revere and John Large

Dr. Morgan C. Wang Department of Statistics

Building a predictive model to enhance students' self-driven engagement Moletsane Moletsane T: +27(0) | |

Stratified Sampling for Data Mining on the Deep Web

Code search & recommendation engines

Multiple Regression – Split Sample Validation

Chapter 13 Additional Topics in Regression Analysis

Decision trees MARIO REGIN.

DESIGN OF EXPERIMENTS by R. C. Baker

ASPIRE Workshop 5: Application of Biostatistics

Enhancing Causal Inference in Observational Studies

Enhancing Causal Inference in Observational Studies

Presentation transcript:

4Saarland University, 5Cornell Tech FairTest: Discovering unwarranted associations in data-driven applications IEEE EuroS&P April 28th, 2017 Florian Tramèr1, Vaggelis Atlidakis2, Roxana Geambasu2, Daniel Hsu2, Jean-Pierre Hubaux3, Mathias Humbert4, Ari Juels5, Huang Lin3 1Stanford University, 2Columbia University, 3École Polytechnique Fédérale de Lausanne, 4Saarland University, 5Cornell Tech

“Unfair” associations + consequences

“Unfair” associations + consequences These are software bugs: need to actively test for them and fix them (i.e., debug) in data-driven applications… just as with functionality, performance, and reliability bugs.

Unwarranted Associations Model Data-driven application User inputs Application outputs Protected inputs

Limits of preventative measures What doesn’t work: Hide protected attributes from data-driven application. Aim for statistical parity w.r.t. protected classes and service output. Foremost challenge is to even detect these unwarranted associations.

A Framework for Unwarranted Associations Specify relevant data features: Protected variables (e.g., Gender, Race, …) “Utility”: a function of the algorithm’s output (e.g., Price, Error rate, …) Explanatory variables (e.g., Qualifications) Contextual variables (e.g., Location, Job, …) Find statistically significant associations between protected attributes and utility Condition on explanatory variables Not tied to any particular statistical metric (e.g., odds ratio) Granular search in semantically meaningful subpopulations Efficiently list subgroups with highest adverse effects

FairTest: a testing suite for data-driven apps Finds context-specific associations between protected variables and application outputs, conditioned on explanatory variables Bug report ranks findings by assoc. strength and affected pop. size location, click, … prices, tags, … Data-driven application User inputs Application outputs Protected vars. Context vars. FairTest Association bug report for developer Explanatory vars. race, gender, … zip code, job, … qualifications, …

A data-driven approach Core of FairTest is based on statistical machine learning FairTest Find context-specific associations Statistically validate associations Data Training data Test data Ideally sampled from relevant user population Statistical machine learning internals: top-down spatial partitioning algorithm confidence intervals for assoc. metrics …

Reports for Fairness bugs Example: simulation of location based pricing scheme Test for disparate impact on low-income populations Low effect over whole US population High effects in specific sub- populations

Association-Guided Decision Trees Goal: find most strongly affected user sub-populations … Occupation A B C Split into sub-populations with Increasingly strong associations between protected variables and application outputs … Age < 50 ≥ 50

Association-Guided Decision Trees Efficient discovery of contexts with high associations Outperforms previous approaches based on frequent itemset mining Easily interpretable contexts by default Association-metric agnostic Greedy strategy (some bugs could be missed) Metric Use Case Binary ratio/difference Binary variables Mutual Information Categorical variables Pearson Correlation Scalar variables Regression High dimensional outputs Plugin your own! ???

Example: healthcare application Predictor of whether patient will visit hospital again in next year (from winner of 2012 Heritage Health Prize Competition) FairTest findings: strong association between age and prediction error rate Hospital re-admission predictor age, gender, # emergencies, … Will patient be re-admitted? Association may translate to quantifiable harms (e.g., if model is used to adjust insurance premiums)

Debugging with FairTest Are there confounding factors? Do associations disappear after conditioning? ⇒ Adaptive Data Analysis! Example: the healthcare application (again) Estimate prediction confidence (target variance) Does this explain the predictor’s behavior? Yes, partially High confidence in prediction FairTest helps developers understand & evaluate potential association bugs.

Other applications studied using FairTest Image tagger based on ImageNet data ⇒ Large output space (~1000 labels) ⇒ FairTest automatically switches to regression metrics ⇒ Tagger has higher error rate for pictures of black people Simple movie recommender system ⇒ Men are assigned movies with lower ratings than women ⇒ Use personal preferences as explanatory factor ⇒ FairTest finds no significant bias anymore

Closing remarks Thanks! The Unwarranted Associations Framework Captures a broader set of algorithmic biases than in prior work Principled approach for statistically valid investigations FairTest The first end-to-end system for evaluating algorithmic fairness Developers need better statistical training and tools to make better statistical decisions and applications. http://arxiv.org/abs/1510.02377 Thanks!