Algorithmic Transparency with Quantitative Input Influence

Slides:



Advertisements
Similar presentations
Machine Learning Contest Group 5. Problem Description age: continuous. workclass: Private, Self-emp-not-inc, Self-emp-inc, Federal-gov, Local-gov, State-gov,
Advertisements

EBI Statistics 101.
Statistical Methods Chichang Jou Tamkang University.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Equal Pay. Equal Remuneration Convention, 1951 (No. 100) ARTICLE 2 1.Each Member shall, by means appropriate to the methods in operation for determining.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Inferential Statistics
Accountability through Information Flow Experiments Michael Carl Tschantz UC Berkeley Amit Datta, CMU Anupam Datta, CMU Jeannette M. Wing, MSR
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Prediction model building and feature selection with SVM in breast cancer diagnosis Cheng-Lung Huang, Hung-Chang Liao, Mu- Chen Chen Expert Systems with.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
… Algo 1 Algo 2 Algo 3 Algo N Meta-Learning Algo.
Classification Tree Interaction Detection. Use of decision trees Segmentation Stratification Prediction Data reduction and variable screening Interaction.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Which socio-demographic living arrangement helps to reach 100? Michel POULAIN & Anne HERM Orlando 8 January 2014.
INTRODUCTION TO RESEARCH METHODS IN ECONOMICS Topic 5 Data Collection Strategies These slides are copyright © 2010 by Tavis Barr. This work is licensed.
Man-Yee Kan, University of Oxford Heather Laurie, University of Essex Who is doing the housework in multicultural.
Multivariate Analysis - Introduction. What is Multivariate Analysis? The expression multivariate analysis is used to describe analyses of data that have.
Quantitative Literacy Assessment
Algorithmic Transparency & Quantitative Influence
National Association of Governmental Labor Officials
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Jane Mariara, Andy McKay, Andy Newell and Cinzia Rienzo Presented by:
Taking a deeper dive into your survey data with key driver analysis
Olga Maslovskaya, Gabriele Durrant, Peter WF Smith
Chapter 7. Classification and Prediction
Bagging and Random Forests
Rebel A. Cole DePaul University For presentation at the
A Comparison of Two Nonprobability Samples with Probability Samples
Differentially Private Verification of Regression Model Results
Genetic and Environmental Influences on Intelligence
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
18734: Foundations of Privacy Information Flow Experiments
Experimental Design, Data collection, and sampling Techniques
Health during transition to adulthood in the population with learning disabilities: analysis of Scotland’s Census (2011) Ms Genevieve Young-Southward1,
Bi-variate #1 Cross-Tabulation
Multiple Regression Analysis with Qualitative Information
COMP61011 : Machine Learning Ensemble Models
Basic machine learning background with Python scikit-learn
U.S. Hispanic Population: 2000
Micro-level Social and Structural Syndemic of HIV Risk among Nepalese Female Sex Workers Keshab Deuba, Sarah Anderson, Anna Mia Ekström, Satish Raj Pandey,
Chapter 2 Looking at Data— Relationships
Machine Learning Week 1.
Combining Base Learners
Multiple Regression Analysis with Qualitative Information
Human Diversity Why learn about human diversity?
Tabulations and Statistics
Towards Accountable Machine Learning Systems
Correlational Studies
Data Availability, Accessibility and Quality
Do Now: Mr. Buckley gathered some information on his class
Predicting Income of Customers
WHO MAKES THE INTERNET SO POPULAR?
Emily A. Davis & David E. Szwedo James Madison University Introduction
Heredity vs. Environment: Which is More Important?
The European Statistical Training Programme (ESTP)
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Seth Neel University of Pennsylvania EC 2018: MD4SG Workshop
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Classification Breakdown
Chapter: 9: Propensity scores
Leverage Consensus Partition for Domain-Specific Entity Coreference
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Chapter 1: Exploring Data
Derek Hoiem CS 598, Spring 2009 Jan 27, 2009
Azure Machine Learning
Multivariate Analysis - Introduction
Misc Internal Validity Scenarios External Validity Construct Validity
A Machine Learning Analysis of US Census Salary Data.
Presentation transcript:

Algorithmic Transparency with Quantitative Input Influence Anupam Datta Carnegie Mellon University 18734: Foundations of Privacy

Machine Learning Systems are Opaque ? Credit Classifier User data Decisions

Machine Learning Systems are Opaque Credit Classifier User data Decisions

Algorithmic Transparency| Decisions with Explanations [Datta, Sen, Zick IEEE Symposium on Security and Privacy 2016] How much influence do various inputs (features) have on a given classifier’s decision about individuals or groups? Age 27 Workclass Private Education Preschool Marital Status Married Occupation Farming-Fishing Relationship to household income Other Relative Race White Gender Male Capital gain $41310 ….. Negative Factors: Occupation Education Level Positive Factors: Capital Gain

Challenge | Correlated Inputs Example: Credit decisions Conclusion: Measures of association not informative Classifier (uses only income) Age Decision Income

Challenge | General Class of Transparency Queries Individual Which input had the most influence in my credit denial? Group What inputs have the most influence on credit decisions of women? Disparity What inputs influence men getting more positive outcomes than women?

Result | Quantitative Input Influence (QII) A technique for measuring the influence of an input of a system on its outputs. Causal Intervention Deals with correlated inputs Quantity of Interest Supports a general class of transparency queries

Key Idea 1| Causal Intervention Classifier (uses only income) Age 21 44 63 28 Decision Income $90K $10K $100K $20K Replace feature with random values from the population, and examine distribution over outcomes.

QII for Individual Outcomes 𝑈 𝑖 Inputs: 𝑖∈𝑁 Classifier Outcome 𝑥 1 𝑥 2 … 𝑥 𝑖 𝑐 𝑋 𝑥 1 𝑥 2 … 𝑥 𝑖 𝑐 𝑋 −𝑖 𝑈 𝑖 𝑢 𝑖 𝐏𝐫 𝑐 𝑋 =1 𝑋= 𝒙 Joe ] 𝐏𝐫 𝑐 𝑋 −𝑖 𝑈 𝑖 =1 𝑋= 𝒙 Joe ] Causal Intervention: Replace feature with random values from the population, and examine distribution over outcomes.

Key Idea 2| Quantity of Interest Various statistics of a system: Classification outcome of an individual 𝐏𝐫 𝑐 𝑋 =𝑐( 𝒙 0 ) 𝑋= 𝒙 0 ] − 𝐏𝐫 𝑐 𝑋 −𝑖 𝑈 𝑖 =𝑐( 𝒙 0 ) 𝑋= 𝒙 0 ] Classification outcomes of a group of individuals 𝐏𝐫 𝑐 𝑋 =1 𝑋 is female] − 𝐏𝐫 𝑐 𝑋 −𝑖 𝑈 𝑖 =1 𝑋 is female] Disparity between classification outcomes of groups 𝐏𝐫 𝑐 𝑋 =1 𝑋 is male]−𝐏𝐫 𝑐 𝑋 =1 𝑋 is female] − 𝐏𝐫 𝑐 𝑋 −𝑖 𝑈 𝑖 =1 𝑋 is male]−𝐏𝐫 𝑐 𝑋 −𝑖 𝑈 𝑖 =1 𝑋 is female]

QII | Definition 𝜄 𝒬 𝒜 𝑖 = 𝒬 𝒜 𝑋 − 𝒬 𝒜 ( 𝑋 −𝑖 𝑈 𝑖 ) The Quantitative Input Influence (QII) of an input 𝑖 on a quantity of interest 𝒬 𝒜 (𝑋) of a system 𝒜 is the difference in the quantity of interest when the input is replaced with random value via an intervention. 𝜄 𝒬 𝒜 𝑖 = 𝒬 𝒜 𝑋 − 𝒬 𝒜 ( 𝑋 −𝑖 𝑈 𝑖 )

Result | Quantitative Input Influence (QII) A technique for measuring the influence of an input of a system on its outputs. Causal Intervention Deals with correlated inputs Quantity of Interest Supports a general class of transparency queries

Result | Quantitative Input Influence (QII) A technique for measuring the influence of an input of a system on its outputs. Causal Intervention Deals with correlated inputs Quantity of Interest Supports a general class of transparency queries

Challenge | Single Inputs have Low Influence Classifier Only accept old, high-income individuals Young Age Decision Low Income

Naïve Approach | Set QII Replace 𝑋 𝑆 with independent random value from the joint distribution of inputs 𝑆⊆𝑁. 𝜄 𝑆 =𝒬 𝑋 −𝒬( 𝑋 −𝑆 𝑈 𝑆 ) 𝑢 𝑖 𝑢 1 𝑆 𝑥 1 𝑥 2 … 𝑥 𝑖 𝑥 1 𝑥 2 … 𝑥 𝑖 𝑐 𝑐

Need to aggregate Marginal QII across all sets Not all features are equally important within a set. Marginal QII: Influence of age and income over only income. 𝜄 {age, income} −𝜄 {income} But age is a part of many sets! Need to aggregate Marginal QII across all sets 𝜄 {age, gender, job} −𝜄 {gender, job} 𝜄 {age} −𝜄 {} 𝜄 {age, gender} −𝜄 {gender} 𝜄 {age, job} −𝜄 {job} 𝜄 {age, gender, job} −𝜄 {gender, job} 𝜄 {age, gender, income} −𝜄 {gender, income} 𝜄 {age, gender, income,job} −𝜄 {gender, income,job}

Key Idea 3| Set QII is a Cooperative Game 𝑁: set of agents 𝑣(𝑆): value of set 𝑆 Example of cooperative game Voting: 𝑣(𝑆) = does motion pass if voters in 𝑆 vote yes? Marginal contribution of voter 𝑖: 𝑚 𝑖 𝑆 =𝑣 𝑆∪ 𝑖 −𝑣(𝑆) Our setting Input features are agents Influence of feature set 𝑆 , i.e. Set QII 𝜄 𝑆 is 𝑣 𝑆 Marginal QII is 𝑚 𝑖 𝑆 =𝑣 𝑆∪ 𝑖 −𝑣(𝑆)

Shapley Value | Aggregating Marginal Contributions Marginal QII of feature i wrt set S [Shapley’53] The Shapley Value Only measure that satisfies a set of axioms These axioms are reasonable in our setting. Aggregate over all sets

Shapley Axioms (Symmetry) Equal marginal contribution implies equal influence Example: cloned features (Dummy) Zero marginal contribution implies zero influence Example: features never touched by ML model (Monotonicity) Consistently higher marginal contribution across games yields higher influence Necessary to compare feature influence scores of individuals

Shapley Axioms | Math (Symmetry) For all 𝑖,𝑗 and 𝑆⊆𝑁\{𝑖,𝑗}, 𝑣 𝑆∪{𝑖} =𝑣(𝑆∪{𝑗}), implies 𝜙 𝑖 𝑣 = 𝜙 𝑗 𝑣 (Dummy) For all 𝑖,𝑆⊆𝑁, 𝑣 𝑆∪ 𝑖 =𝑣 𝑆 , implies 𝜙 𝑖 𝑣 =0 (Monotonicity) For two games 𝑣 1 , 𝑣 2 , if for all 𝑆, 𝑚 𝑖 𝑆, 𝑣 1 ≥ 𝑚 𝑖 (𝑆, 𝑣 2 ), then 𝜙 𝑖 𝑣 1 ≥ 𝜙 𝑖 𝑣 2 .

Experiments | Test Applications arrests Predictive policing using the National Longitudinal Survey of Youth (NLSY) Features: Age, Gender, Race, Location, Smoking History, Drug History Classification: History of Arrests ~8,000 individuals income Income prediction using a benchmark census dataset Features: Age, Gender, Relationship, Education, Capital Gains, Ethnicity Classification: Income >= 50K ~30,000 individuals Implemented with Logistic Regression, Kernel SVM, Decision Trees, Decision Forest

Personalized Explanation | Mr X Age 23 Workclass Private Education 11th Marital Status Never married Occupation Craft repair Relationship to household income Child Race Asian-Pac Island Gender Male Capital gain $14344 Capital loss $0 Work hours per week 40 Country Vietnam Age 23 Workclass Private Education 11th Marital Status Never married Occupation Craft repair Relationship to household income Child Race Asian-Pac Island Gender Male Capital gain $14344 Capital loss $0 Work hours per week 40 Country Vietnam Can assuage concerns of discrimination. income income

Personalized Explanation | Mr Y Age 27 Workclass Private Education Preschool Marital Status Married Occupation Farming-Fishing Relationship to household income Other Relative Race White Gender Male Capital gain $41310 Capital loss $0 Work hours per week 24 Country Mexico Age 27 Workclass Private Education Preschool Marital Status Married Occupation Farming-Fishing Relationship to household income Other Relative Race White Gender Male Capital gain $41310 Capital loss $0 Work hours per week 24 Country Mexico Explanations of superficially similar people can be different. income

Related Work | Influence Measures Randomized Causal Intervention Feature Selection: Permutation Importance [Breiman 2001] [Kononenko and Strumbelj 2010] Importance of Causal Relations [Janzing et al. 2013] Can be viewed as instances of our transparency schema Do not consider marginal influence or general quantities of interest Associative Measures Quantitative Information Flow: Appropriate for secrecy FairTest [Tramèr et al. 2015] Indirect Influence [Adler et al. 2016] Measure potential indirect use Correlated inputs hide causality

Related Work | Interpretable Models Interpretability-by-Design Regularization for simplicity (Lasso) Bayesian Rule Lists, Falling Rule Lists [Letham et al. 2015, Wang and Rudin 2015] Possible accuracy loss, but shows promising performance characteristics Individual explanations can still be useful Interpretable approximate models [Baehrens et al. 2010] LIME [Ribeiro et al. 2016] Can provide richer explanantions Causal relation to original model can be unclear

Result | Quantitative Input Influence (QII) A technique for measuring the influence of an input of a system on its outputs. Causal Intervention Deals with correlated inputs Quantity of Interest Supports a general class of transparency queries Cooperative Game Computes joint and aggregate marginal influence Performance QII measures can be approximated efficiently

Thanks!