Shapley Value Regression

Driver Analysis Motivation: Don’t Ask Why
Why not just ask respondents directly why they purchase a particular product? Consumers are generally unaware of why they do what they do when it comes to product purchase decisions. Respondents will give you the answers they think you want to hear. You get their justifications for the purchase, not their motivation. We recommend using a derived importance method through driver analysis.

Driver Analysis Motivation: Stated Importance
Why not ask respondents to state the importance of product/service attributes? The traditional approach of asking respondents to rate the importance of attributes on a scale requires no tradeoff, so “everything is really important”. There is no differentiation between what is truly important to the business and what is marginal. Tradeoff methods such as Conjoint and/or MaxDiff are more suitable, but are often more costly and time-consuming.

SV Regression Background
Shapley Value Regression belongs to the family of techniques used in driver analysis models. In driver analysis, we seek to understand the motivation behind consumer behaviour by observing the pattern of associations and correlations between consumers' decisions and their perception of, and experience with, the product or service being offered. If we are interested in what drives consumer purchase decisions, we look for correlations between the purchase decision and consumers' perception of and experience with the product. If we are interested in what drives customer satisfaction, we look for correlations between the overall customer satisfaction rating and satisfaction with key service points.

SV Regression Background
Shapley Value Regression deals effectively with the multicollinearity that is often present in market research data. Multicollinearity occurs when the various aspects of consumer perception are so strongly correlated that the results of a driver analysis run with traditional approaches become unstable. As with other methods used in driver analysis, variables with little variation will not show up as key drivers. Table-stakes attributes are therefore unlikely to show up as key drivers. For example, all airlines are safe, so safety is not a key driver of airline choice among travelers.

SV Regression Advantages & Disadvantages
Key Advantages
- Effectively deals with the correlated nature of market research data (i.e. multicollinearity).
- It is inherently stable and can be used as a tracking tool.
- Clients who are used to running regression will be open to this approach.
Key Disadvantages
- Requires complete data for every variable. Missing data must be replaced with the mean or another value. Alternatively, a reduced base size must be used in the analysis.
- Does not distinguish between drivers of satisfaction and drivers of dissatisfaction.
- Not predictive. The analysis is based on observation of consumers' current behavior only and does not predict their future behavior.

SV Regression: Correlation & Causation

SV Regression Methodology
Model Specification
- The dependent variable is the response variable you want to study. It has to be at least ordinal, i.e. have three or more ordered response categories. Interval or ratio scales are preferred.
- Independent variables are the potential drivers that could influence the response variable. They can be metric or non-metric. Example: brand association questions (“Which of the following brands do you associate with these attributes?”).
What sample size do you need?
- We require at least 10 cases of data for each potential driver. If there are 20 potential drivers, we need at least n=200 cases of data.
- As the ratio falls below 10:1, we risk overfitting the model to the sample, making the results too specific to the sample and less generalizable.
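The 10:1 rule of thumb for sample size is simple arithmetic, and a small helper makes it easy to check before running an analysis. This is a minimal sketch; the function names and the `cases_per_driver` parameter are illustrative assumptions, not part of any particular tool.

```python
def minimum_sample_size(n_drivers, cases_per_driver=10):
    """Smallest number of complete cases under the cases-per-driver rule."""
    return n_drivers * cases_per_driver


def meets_rule_of_thumb(n_cases, n_drivers, cases_per_driver=10):
    """True when the sample has at least `cases_per_driver` cases per driver."""
    return n_cases >= minimum_sample_size(n_drivers, cases_per_driver)


# The slide's own example: 20 potential drivers need at least 200 cases.
required = minimum_sample_size(20)
print(required)                       # 200
print(meets_rule_of_thumb(150, 20))   # False: the ratio is below 10:1
```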

SV Regression Example Output
R2: how much of the variation in the response variable the potential drivers together explain.
Relative Importance: the relative importance of each potential driver (sums to 100%).

SV Regression Application: Example: Quadrant Map
The relative importance of attributes can be combined with other information to create quadrant maps.

SV Regression Application: Example: Quadrant Map
(Quadrant map with Importance and Performance as the two axes.)

                   Low Performance                    High Performance
High Importance    Top Priority for Enhancement       Strength
Low Importance     Second Priority for Enhancement    Maintenance
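The quadrant assignment can be sketched as a small classification function. The quadrant labels come from the slide; everything else is an assumption made for illustration: the attribute names and numbers are hypothetical, and splitting "high" from "low" at the mean across attributes is just one possible choice of cutoff.

```python
from statistics import mean


def quadrant(importance, performance, importance_cut, performance_cut):
    """Map one attribute to a quadrant label given the two cutoffs."""
    high_importance = importance >= importance_cut
    high_performance = performance >= performance_cut
    if high_importance and high_performance:
        return "Strength"
    if high_importance:
        return "Top Priority for Enhancement"
    if high_performance:
        return "Maintenance"
    return "Second Priority for Enhancement"


# Hypothetical data: (relative importance in %, mean performance rating).
attributes = {
    "Value for money": (40.0, 3.1),
    "Staff friendliness": (30.0, 4.4),
    "Store layout": (20.0, 3.0),
    "Opening hours": (10.0, 4.5),
}

# Assumed cutoffs: the mean importance and mean performance across attributes.
importance_cut = mean(imp for imp, _ in attributes.values())
performance_cut = mean(perf for _, perf in attributes.values())

quadrants = {
    name: quadrant(imp, perf, importance_cut, performance_cut)
    for name, (imp, perf) in attributes.items()
}
for name, label in quadrants.items():
    print(f"{name}: {label}")
```

Other cutoffs (a median, or a target score agreed with the client) would slot into the same function without changing the logic.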

Appendix

SV Regression Details
The Shapley Value principle was developed to evaluate an ordering of the worth of players in a multi-player cooperative game. The key to understanding its utility is that it represents the worth of each player over all possible combinations of players. In Shapley Value Regression, we extend this to the problem of the comparative usefulness of potential drivers. SV regression assigns each potential driver a value calculated over all possible combinations of the other drivers in regressions. We run Ordinary Least Squares (OLS) regressions for all possible combinations of explanatory variables, and each driver's contribution is measured by R-square.
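The procedure described here can be sketched in a few lines of NumPy. This is an illustrative implementation under stated assumptions, not the code behind the slides: it uses the standard Shapley weighting from cooperative game theory with R-square as the payoff of each subset of drivers, and the function names and simulated data are hypothetical.

```python
from itertools import combinations
from math import factorial

import numpy as np


def r_squared(X, y, cols):
    """R-square of an OLS fit of y on the chosen columns plus an intercept."""
    if not cols:
        return 0.0  # an intercept-only model explains no variance
    A = np.column_stack([np.ones(len(y)), X[:, list(cols)]])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return 1.0 - ss_res / ss_tot


def shapley_values(X, y):
    """Shapley value of each column of X, with R-square as the payoff."""
    k = X.shape[1]
    # Fit every subset of drivers once and cache its R-square.
    r2 = {
        subset: r_squared(X, y, subset)
        for size in range(k + 1)
        for subset in combinations(range(k), size)
    }
    sv = np.zeros(k)
    for j in range(k):
        others = [i for i in range(k) if i != j]
        for size in range(k):
            # Standard Shapley weight for a coalition of this size.
            weight = factorial(size) * factorial(k - size - 1) / factorial(k)
            for subset in combinations(others, size):
                with_j = tuple(sorted(subset + (j,)))
                sv[j] += weight * (r2[with_j] - r2[subset])
    return sv, r2[tuple(range(k))]


# Simulated data with two deliberately correlated drivers (x1 and x2).
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = 0.8 * x1 + 0.6 * rng.normal(size=n)
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])
y = 1.0 * x1 + 0.5 * x2 + 0.2 * x3 + rng.normal(size=n)

sv, r2_full = shapley_values(X, y)
print("Shapley values:", np.round(sv, 3))
print("Overall R2:", round(r2_full, 3))
```

The Shapley values sum to the overall R-square of the full model. With k drivers the sketch fits 2^k regressions, so the exhaustive approach is only practical for a modest number of drivers.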

SV Regression Output - Details
R2 (Coefficient of Determination): measures the proportion of the variance of the response variable that is explained by all the potential drivers. It varies between 0 and 1. The higher the R2, the stronger the association between the potential drivers and the response variable.
SV (Shapley Value): the contribution of each potential driver to the overall R2 of the regression.
sdSV: standard deviations of the Shapley Values.
Relative Importance: the Shapley Values rebased so that they sum to 100%. The relative importance of an item = SV / overall R2.
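The rebasing step is a one-line calculation. The sketch below applies the formula SV / overall R2 to hypothetical Shapley values; the driver names and numbers are made up for illustration and are chosen so that the values sum to an assumed overall R2 of 0.60.

```python
def relative_importance(shapley, overall_r2):
    """Rebase Shapley values to percentages of the overall R2."""
    return {name: 100.0 * sv / overall_r2 for name, sv in shapley.items()}


# Hypothetical Shapley values for three drivers.
shapley = {
    "Value for money": 0.30,
    "Staff friendliness": 0.20,
    "Store layout": 0.10,
}
overall_r2 = 0.60  # assumed overall R2, equal to the sum of the values above

importance = relative_importance(shapley, overall_r2)
for name, pct in importance.items():
    print(f"{name}: {pct:.1f}%")
```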