Regression & factor analyses …. Regression example - revisited uOur example: nA financial company wishes to ascertain what the drivers of satisfaction.

Slides:



Advertisements
Similar presentations
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Advertisements

The %LRpowerCorr10 SAS Macro Power Estimation for Logistic Regression Models with Several Predictors of Interest in the Presence of Covariates D. Keith.
Introduction to Predictive Modeling with Examples Nationwide Insurance Company, November 2 D. A. Dickey.
Experimental design and analyses of experimental data Lesson 2 Fitting a model to data and estimating its parameters.
Topic 15: General Linear Tests and Extra Sum of Squares.
Machine Learning in Practice Lecture 7 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
EPI 809/Spring Probability Distribution of Random Error.
Topic 3: Simple Linear Regression. Outline Simple linear regression model –Model parameters –Distribution of error terms Estimation of regression parameters.
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Multiple regression analysis
Chapter 11: Inferential methods in Regression and Correlation
Be humble in our attribute, be loving and varying in our attitude, that is the way to live in heaven.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
OLS versus MLE Example YX Here is the data:
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
Example 16.3 Estimating Total Cost for Several Products.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy variable classification with two categories Original citation:
Chapter Outline 5.0 PROC GLM
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
1 Experimental Statistics - week 4 Chapter 8: 1-factor ANOVA models Using SAS.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
Topic 7: Analysis of Variance. Outline Partitioning sums of squares Breakdown degrees of freedom Expected mean squares (EMS) F test ANOVA table General.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Regression Examples. Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Lecture 6 Correlation and Regression STAT 3120 Statistical Methods I.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
A Casual Tutorial on Sample Size Planning for Multiple Regression Models D. Keith Williams M.P.H. Ph.D. Department of Biostatistics.
Why Design? (why not just observe and model?) CopyrightCopyright © Time and Date AS / Steffen Thorsen All rights reserved. About us | Disclaimer.
Services Providers Conjoint Hotel A $$$ *** Friendly Customized Value___ Hotel B $$$ *** Friendly Standardized Value ___ Hotel C $$$ *** Not Friendly Customized.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Topic 13: Multiple Linear Regression Example. Outline Description of example Descriptive summaries Investigation of various models Conclusions.
Three Statistical Issues (1) Observational Study (2) Multiple Comparisons (3) Censoring Definitions.
1 Experimental Statistics - week 14 Multiple Regression – miscellaneous topics.
Topic 25: Inference for Two-Way ANOVA. Outline Two-way ANOVA –Data, models, parameter estimates ANOVA table, EMS Analytical strategies Regression approach.
PSYC 3030 Review Session April 19, Housekeeping Exam: –April 26, 2004 (Monday) –RN 203 –Use pencil, bring calculator & eraser –Make use of your.
Simple Linear Regression. Data available : (X,Y) Goal : To predict the response Y. (i.e. to obtain the fitted response function f(X)) Least Squares Fitting.
1 Experimental Statistics - week 12 Chapter 12: Multiple Regression Chapter 13: Variable Selection Model Checking.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Variable Selection 1 Chapter 8 Variable Selection Terry Dielman Applied Regression Analysis:
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Regression & factor analyses. Where regression can go wrong uAn example: nA financial company wishes to ascertain what the drivers of satisfaction are.
1 Chapter 5 : Volatility Models Similar to linear regression analysis, many time series exhibit a non-constant variance (heteroscedasticity). In a regression.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
1 Forecasting/ Causal Model MGS Forecasting Quantitative Causal Model Trend Time series Stationary Trend Trend + Seasonality Qualitative Expert.
1 Experimental Statistics - week 13 Multiple Regression Miscellaneous Topics.
Experimental Statistics - week 9
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
MBF1413 | Quantitative Methods Prepared by Dr Khairul Anuar 8: Time Series Analysis & Forecasting – Part 1
Regression & factor analyses. Where regression can go wrong uAn example: nA financial company wishes to ascertain what the drivers of satisfaction are.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
INTRODUCTION TO MULTIPLE REGRESSION MULTIPLE REGRESSION MODEL 11.2 MULTIPLE COEFFICIENT OF DETERMINATION 11.3 MODEL ASSUMPTIONS 11.4 TEST OF SIGNIFICANCE.
Sports Medicine Example. Background Does Sports Med Fellowship have an impact? Study Design –Compare people with fellowship to those without on ABFM exams.
Stats Methods at IC Lecture 3: Regression.
Multiple Regression.
Regression & factor analyses …
Multiple Regression.
MBF1413 | Quantitative Methods Prepared by Dr Khairul Anuar
Simple Linear Regression
Machine Learning in Practice Lecture 7
Chapter 13 Excel Extension: Now You Try!
Presentation transcript:

Regression & factor analyses …

Regression example - revisited uOur example: nA financial company wishes to ascertain what the drivers of satisfaction are for their service: They are: EXPERT="experts" Q30A2 ="Take the time to understand who you are" Q30A3 ="Communicate clearly, in plain language" Q30A6 ="Go out of their way to tailor the best deal" Q30A7 ="Have the knowledge and authority to make" Q30A8 ="Have a positive, can-do approach" Q30A11 ="Understand your business and the market" Q30A12 ="Are proactive with ideas on how to get t" Q30A13 ="Are prompt and reliable in handling any" Q30A14 ="Treat you with respect and listen" Q30A15 ="Keep in regular contact to keep you updated" Q32A1 ="The competitiveness of their fees and rates" Q32A2 ="Offering a flexible range of lending/rep" Q32A3 ="How easy it is to take out a commercial" Q32A4 ="The features and benefits of their comments" Q32A5 ="Providing a full range of commercial product" Q32A6 ="Being fair and reasonable in their lending“ Q24 ="Q3a. AMP BANKING OVERALL RATING“ NB: this is the response nThese were all on a 10 point scale

Let’s do a factor analysis proc factor data = hold.model rotate =varimax fuzz=.3 nfact=3; var expert Q30A2 Q30A3 Q30A6 Q30A7 Q30A8 Q30A11 Q30A12 Q30A13 Q30A14 Q30A15 Q32A1 Q32A2 Q32A3 Q32A4 Q32A5 Q32A6; run ; Rotated Factor Pattern Factor1 Factor2 Factor3 EXPERT STAFF - Experts in Commercial Finance Ma Q30A2 Take the time to understand who you are Q30A3 Communicate clearly, in plain language Q30A6 Go out of their way to tailor the best d... Q30A7 Have the knowledge and authority to make Q30A8 Have a positive, can-do approach to doin Q30A11 Understand your business and the market Q30A12 Are proactive with ideas on how to get t Q30A13 Are prompt and reliable in handling any Q30A14 Treat you with respect and listen to wha Q30A15 Keep in regular contact to keep you upda Q32A1 The competitiveness of their fees and ra Q32A2 Offering a flexible range of lending/rep Q32A3 How easy it is to take out a commercial Q32A4 The features and benefits of their comme Q32A5 Providing a full range of commercial and Q32A6 Being fair and reasonable in their lendi Values less than 0.5 are not printed.

Let’s do a factor analysis proc factor data = hold.model rotate =varimax fuzz=.5 nfact=4; var expert Q30A2 Q30A3 Q30A6 Q30A7 Q30A8 Q30A11 Q30A12 Q30A13 Q30A14 Q30A15 Q32A1 Q32A2 Q32A3 Q32A4 Q32A5 Q32A6; run ; Rotated Factor Pattern Factor1 Factor2 Factor3 Factor4 EXPERT STAFF - Experts in Commercial Finance Ma Q30A2 Take the time to understand who you are Q30A3 Communicate clearly, in plain language Q30A6 Go out of their way to tailor the best d Q30A7 Have the knowledge and authority to make Q30A8 Have a positive, can-do approach to doin Q30A11 Understand your business and the market Q30A12 Are proactive with ideas on how to get t Q30A13 Are prompt and reliable in handling any Q30A14 Treat you with respect and listen to wha Q30A15 Keep in regular contact to keep you upda Q32A1 The competitiveness of their fees and ra Q32A2 Offering a flexible range of lending/rep Q32A3 How easy it is to take out a commercial Q32A4 The features and benefits of their comme Q32A5 Providing a full range of commercial and Q32A6 Being fair and reasonable in their lendi Values less than 0.5 are not printed.

Let’s go for three factors uCommunication: uProducts: uExpertise:

How do we go about regressing these? uFirst save the factor output to a file and rename : proc factor data = hold.model out =hold.model* outstat =hold.modelfac** rotate =varimax fuzz=.5 nfact=3; var expert Q30A2 Q30A3 Q30A6 Q30A7 Q30A8 Q30A11 Q30A12 Q30A13 Q30A14 Q30A15 Q32A1 Q32A2 Q32A3 Q32A4 Q32A5 Q32A6; run; data hold.model; set hold.model; rename factor1 = comms factor2 = prod factor3 = expt; run; * This just put output for Factor1-3 on the end of the file hold.model; ** this yields all the stats used in the FA

Regressing the factors proc reg data =hold.model; model Q24 = comms prod expt; run; proc reg data =hold.model; model Q24 = comms prod expt; run; Dependent Variable: Q24 Q3a. AMP BANKING OVERALL RATING Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable Label DF Estimate Error t Value Pr > |t| Intercept Intercept <.0001 COMMS <.0001 PROD <.0001 EXPT <.0001

Conclude We conclude that: uNote also the orthogonality (linear indepedence of the factors) Pearson Correlation Coefficients, N = 300 Prob > |r| under H0: Rho=0 COMMS PROD EXPT COMMS PROD EXPT Note also that usual regression checks should apply (not done here - but will need to be inspected by you!)

Getting to the actual attributes uThis is all very well to recommend more emphasis on communication - but just which components do we need to look at? uEasy look at the combination of regression coefficients with the scoring parameters for each driver: COMMS <.0001 PROD <.0001 EXPT <.0001 and Standardized Scoring Coefficients Comms Prod Expt EXPERT STAFF - Experts in Commercial Finance Ma Q30A2 Take the time to understand who you are Q30A3 Communicate clearly, in plain language Q30A6 Go out of their way to tailor the best d Q30A7 Have the knowledge and authority to make Q30A8 Have a positive, can-do approach to doin Q30A11 Understand your business and the market Q30A12 Are proactive with ideas on how to get t Q30A13 Are prompt and reliable in handling any Q30A14 Treat you with respect and listen to wha Q30A15 Keep in regular contact to keep you upda Q32A1 The competitiveness of their fees and ra Q32A2 Offering a flexible range of lending/rep Q32A3 How easy it is to take out a commercial Q32A4 The features and benefits of their comme Q32A5 Providing a full range of commercial and Q32A6 Being fair and reasonable in their lendi

Getting to the actual attributes…  The scoring algorithm tells us how much each standardised attribute { (x-  } contributes to each factor score  So one way to see the importance of each attribute is looking at the change in modelled score as each attribute incerases by a value of 1 ( ie 1  uThe works out to be Importance for attrib i =  i b j F ij nEasy to compute in Excel (cut and paste output into excel – hint: use the ‘Text to columns’.., options in the ‘Data’ FAlternatively export hold.modelfac to excel via.csv option

Getting to the actual attributes… NB: compute importance using this type of code: =SUMPRODUCT(C6:E6,C$3:E$3) where C6:E6 is the attribute say and C$3:E$3 are the beta’s.

Conclusions uNote how things have changed: