Presentation is loading. Please wait.

Presentation is loading. Please wait.

WEMBA, Regression Analysis Market Intelligence Julie Edell Britton Session 9 October 9, 2009.

Similar presentations


Presentation on theme: "WEMBA, Regression Analysis Market Intelligence Julie Edell Britton Session 9 October 9, 2009."— Presentation transcript:

1 WEMBA, Regression Analysis Market Intelligence Julie Edell Britton Session 9 October 9, 2009

2 Today’s Agenda Announcements WEMBA C Multiple Regression Conjoint & dummy variable regression Multiple R2Y|X1, X2 vs.  r2Y|X(i) Uncorrelated predictors Correlated predictors Promotion analysis Course Evaluations

3 Announcements Submit Nestle Contadina slides by 8 am tomorrow, Sat., 10/10

4 4 A Model of School Choice Values Perceptions Individual Differences & Constraints Become a Fuqua Student Assumes that behavior is driven by differences in: Values (Importance of key attributes) Perceptions (Duke and Competition on key attributes) Individual Differences & Constraints (travel, cost, etc.)

5 5 The Funnel Matriculate Admitted Opt Out Apply Selected Out Attend Information Session Opt Out Do not attend Information Session Opt Out

6 6 The Analysis Approach Sample groups that differ in behavior Compare the groups on relevant dimensions: Perceptions Values Individual Differences & Constraints Infer that any difference found between groups are partly responsible for differences in behavior

7 7 7 Disproportionate Stratified Random Sample Matriculate Admitted Opt Out Apply Selected Out Attend Information SessionDo not attend Information Session Not Apply n = 26 (of 158, 16.5%) Not Apply n = 24 (of 173, 14%) n = 30 (of 60, 50%) n = 26 (of 60, 52%) n = 42 (of 56, 75%) n = 12 (of 56, 25%) n = 56

8 8 What Drives Application to Duke? Compare Applied to Did Not Apply Perceptions (Duke – Competitor) Constraints Financial assistance program % Cost Time to travel to Duke Can’t use attendance at Information Session as a factor here without going to population data What did you learn?

9 9 What Drives Application to Duke? * = chi-square test statistic FACTOR AppliedDid Not Applyt-stat p-value (2-tail) Perceptions (Duke – Competitor) Faculty Reputation.500.200 -2.102.039 Cost-.667-.273 1.777.080 Mix of Face to Face and Distance Learning.09.36 1.737.087 Demographics % Cost You Pay52.1672.60 2.771.007 How Long to Travel Duke2.343.05 1.911.059 Company Helps Pay85.7%72.0% 3.038*.081

10 10 What Drives Acceptance? Compare accepted to did not accept, conditional having applied Perceptions (Duke – Competitor) Constraints Financial assistance program % Cost paid by employer Time to travel to Duke Can consider info session attendance here What did you learn?

11 11 FACTOR AcceptedDid Not Accept t-stat p-value (2-tail) Perceptions (Duke – Competitior) Faculty Reputation.679.083 2.947.005 Teaching Quality.462-.231 3.469.001 Core Quality.464-.077 3.203.003 Electives.421-.923 3.599.001 Student Quality.759.167 2.614.013 Technology.500.077 1.714.096 Advance Career.667-.154 4.176.000 Cost-.448-1.154 2.202.033 Student Reputation.900.385 2.130.039 Demographics % Cost You Pay46.4668.86 1.845.070 How Long to Travel Duke2.0973.064 1.978.053 Attended an Information Session59.5%39.7% 2.406*.108 What Drives Acceptance? * = chi-square test statistic

12 12 Acceptance by Sponsorship % paid by company: Student = 53.5 NonStudent = 31.1

13 13 The Impact of Info Sessions? Compare those who attended to those who did not attend on perceptions of Duke What did you learn?

14 14 The Impact of Info Sessions? FACTOR Info SessionNo Info Session t-stat p-value (2-tail) Perceptions (Duke) Ability to Shift Work3.073.35 1.887.063 Cost2.692.391.962.053 Half the perceptual factors (9 of 17) were in the wrong direction! Information sessions don’t seem to be doing much good at all. *Only positive evidence is that, in overall population, probability of applying was 16.5% for those who attend an info session vs. 9.5% for those who did not, χ 2 = 8.70, p <.005. *No significant effect on matriculation / acceptance: 59.5% of those who attended accepted v. 39.7% of those who did not attend, χ 2 = 2.41, p =.11

15 15 Who to Target? FACTOR AppliedDid Not Applyt-stat p-value (2-tail) % Cost You Pay 52.1672.60 2.771.007 How Long to Travel Duke 2.343.05 1.911.059 Company Helps Pay 85.7%72.0% 3.038*.081 FACTOR AcceptedDid Not Accept t-stat p-value (2-tail) % Cost You Pay 46.4668.86 1.845.070 How Long to Travel Duke 2.0973.064 1.978.053

16 16 What Should be Emphasized in Information Sessions? Attributes that are important and where we do well and/or where our competition does not do well. Importance Focus on attributes that predict applying & accepting Rank order attribute importance Look at important attributes where perceptions (Duke – Comp) is positive.

17 17 Quasi-MAAM for Communication Content: Demonstrated Importance Duke Advantage Duke Disadvantage Important Faculty Rep, Cost Advance Career, Teaching Quality, Electives, Core Quality, Faculty Rep, Student Quality, School Rep BragMisperception (fix Marcom) or Real Problem (fix Product)? UnimportantSave Your Breath Ignore

18 18 Quasi-MAAM for Communication Content: Using Importance Ratings Duke Advantage Duke Disadvantage Important Continue Career, School Rep, Forwarding Career, Teaching Quality, Faculty Reputation, Other Students BragMisperception (fix Marcom) or Real Problem (fix Product)? UnimportantSave Your Breath Ignore

19 19 WEMBA Takeaways Be backward in your analysis process too Outline your analysis before you begin Think about tables needed, then do the analysis to make them Real data is imperfect…do the best you can Survey data is correlational, not causal The funnel approach applies to many business problems

20 20 Multiple Regression Simple linear regression, with more than one predictor. a = intercept: predicted value of y if x 1 = x 2 = …x k = 0 b 1 =Slope of y on x 1 given that x 2 …x k are already in equation R =multiple correlation = correlation of Y-hat with Y (0< R < 1) R 2 = % variance in Y explained by best linear regression equation.

21 21 Conjoint as Dummy Variable Regression (Y) (X 1 )(X 2 ) (X 3 ) Rating Size16oz Pepsi Caffeine 7 0 0 0 9 0 0 1 3 0 1 0 4 0 1 1 8 1 0 0 10 1 0 1 5 1 1 0 6 1 1 1

22 22 Bivariate Correlation Matrix

23 23 R 2 Y|X1,X2,X3 = r 2 Y|X1 +r 2 Y|X2 +r 2 Y|X3 Because Dummies are Uncorrelated.976 =[(.327) 2 +(-.873) 2 +.327) 2 ]

24 24 Dummy Variable Regression

25 25 A Framework for Understanding Multicollinearity Major Problem in Multiple Regression: Assessing the (unique) contribution of the individual predictors. ab Variance in x 1 Variance in x 2 Variance in y c e

26 26 The area in c causes ambiguity in specifying the contributions of x 1 and x 2 to explaining y. Should we attribute all of this variance to x 1 ? All to x 2 ? Somehow split it? Two different ways exist of assessing the contribution of x1 to explaining y. Multicollinearity

27 27 2 Ways to Deal with Overlap (see slide 26)

28 28 Understand Expenditures in Milan Food Run two regressions Total in householdWeekly food expenditures Any kids 6- 18? (0=no, 1=yes) Annual income

29 29 But Check Correlations Between Predictors

30 30 Zero order coefficients tell us: r bc = +.43 r bi = +.40 r bj = +.37 More people in household, more weekly food expenditures If any kids 6-18, more weekly food expenditures All strongly statistically significant with 498 df Higher income, more spent on food Which two should predict Weekly expenditures best?

31 31 Multicollinearity,  r2, and R2 ModelPredictor 1Predictor 2Sum of r 2 Actual R 2 1HHSizeKid 6-18.346.205 2HHSizeIncomeK.327.282 3IncomeKIncome$.274.140

32 32

33 33 “Partial Effect” Milan Food Problem Run two regressions Weekly food expenditures Any children under 6? (0=no, 1=yes) Total in household Any children under 6? (0=no, 1=yes)

34 34

35 35 Doritos XL Models. Effects of own price (& price promotion) & price promotions of other sizes (SM, XXL, 3XL) on sales of XL size

36 36

37 37 Only Own XL Price Significant

38 38 Drop 3 XL from the model

39 39 Promotion Models (p. 195) If coupon dummy (1 = yes, 0 = no) and promo dummy (1=yes, 0=no) are perfectly correlated and each is correlated, say, r =.5 with weekly sales, R 2 (sales |coupon, promo) =.25 <<.25 +.25. Coefficients on coupon, promo would be indistinguishable from zero (nonsignificant), with huge standard errs.

40 40 Regression Analysis Data WeekY = SalesX1 = CouponX2 =Promotion 1100000 2290011 3110000 ………… 52310011

41 41 Promotion Models Omitted Variable Bias Promo, Coupons each boost 1000 units, but Coupon omitted & r = 1 with promo, coefficient for promo will be 2000. (P. 175-181) Multicollinearity & Overloaded Models

42 42 Sales Promotion Analysis, p. 187

43 43 Takeaways Multiple Regression Dummy Variable Regression for Conjoint (uncorrelated predictors) Correlated predictors make it difficult to assess each predictor’s unique contribution. Common in promotion analysis because it is common to pull multiple promotional levers simultaneously. 2 Solutions: Drop a predictor (omitted variable bias so reinterpret coefficients) Leave both in (inflated Standard Errors, hard to assess impact of each)


Download ppt "WEMBA, Regression Analysis Market Intelligence Julie Edell Britton Session 9 October 9, 2009."

Similar presentations


Ads by Google