17-1 COMPLETE BUSINESS STATISTICS by AMIR D. ACZEL & JAYAVEL SOUNDERPANDIAN 6 th edition (SIE)

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

Chapter Nineteen Factor Analysis.
Bivariate Regression Analysis
Chapter 17 Overview of Multivariate Analysis Methods
Multivariate distributions. The Normal distribution.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Linear Discriminant Function LDF & MANOVA LDF & Multiple Regression Geometric example of LDF & multivariate power Evaluating & reporting LDF results 3.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Statistics for Managers Using Microsoft® Excel 5th Edition
Discrim Continued Psy 524 Andrew Ainsworth. Types of Discriminant Function Analysis They are the same as the types of multiple regression Direct Discrim.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Discriminant Analysis – Basic Relationships
Simple Linear Regression Analysis
Goals of Factor Analysis (1) (1)to reduce the number of variables and (2) to detect structure in the relationships between variables, that is to classify.
7-2 Estimating a Population Proportion
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation. The sample covariance matrix: where.
Discriminant Analysis Testing latent variables as predictors of groups.
Discriminant analysis
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Example of Simple and Multiple Regression
1 Statistical Tools for Multivariate Six Sigma Dr. Neil W. Polhemus CTO & Director of Development StatPoint, Inc.
Introduction to Linear Regression and Correlation Analysis
Multiple Discriminant Analysis and Logistic Regression.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
The Multiple Correlation Coefficient. has (p +1)-variate Normal distribution with mean vector and Covariance matrix We are interested if the variable.
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Chapter Eighteen Discriminant Analysis Chapter Outline 1) Overview 2) Basic Concept 3) Relation to Regression and ANOVA 4) Discriminant Analysis.
Analysis of Variance (ANOVA) and Multivariate Analysis of Variance (MANOVA) Session 6.
Discriminant Analysis
18-1 © 2007 Prentice Hall Chapter Eighteen 18-1 Discriminant and Logit Analysis.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
Copyright © 2010 Pearson Education, Inc Chapter Eighteen Discriminant and Logit Analysis.
Review for Final Examination COMM 550X, May 12, 11 am- 1pm Final Examination.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Lecture 12 Factor Analysis.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Discriminant Analysis Dr. Satyendra Singh Professor and Director University of Winnipeg, Canada
Machine Learning 5. Parametric Methods.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Two-Group Discriminant Function Analysis. Overview You wish to predict group membership. There are only two groups. Your predictor variables are continuous.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Unit 7 Statistics: Multivariate Analysis of Variance (MANOVA) & Discriminant Functional Analysis (DFA) Chat until class starts.
D/RS 1013 Discriminant Analysis. Discriminant Analysis Overview n multivariate extension of the one-way ANOVA n looks at differences between 2 or more.
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Basics of Multivariate Probability
BINARY LOGISTIC REGRESSION
Correlation, Bivariate Regression, and Multiple Regression
EXPLORATORY FACTOR ANALYSIS (EFA)
Correlation and Simple Linear Regression
Multiple Discriminant Analysis and Logistic Regression
Chapter 15 Linear Regression
Hypothesis testing and Estimation
Chapter_19 Factor Analysis
Parametric Methods Berlin Chen, 2005 References:
Lecture Slides Elementary Statistics Twelfth Edition
Presentation transcript:

17-1 COMPLETE BUSINESS STATISTICS by AMIR D. ACZEL & JAYAVEL SOUNDERPANDIAN 6 th edition (SIE)

17-2 Chapter 17 Multivariate Analysis

17-3 The Multivariate Normal Distribution Discriminant Analysis Principal Components and Factor Analysis Using the Computer Multivariate Analysis 17

17-4 Describe a multivariate normal distribution Explain when a discriminant analysis could be conducted Interpret the results of a discriminant analysis Explain when a factor analysis could be conducted Differentiate between principal components and factors Interpret factor analysis results LEARNING OUTCOMES 17 After studying this chapter, you should be able to:

17-5 A k-dimensional (vector) random variable X: X = (X 1, X 2, X 3..., X k ) A realization of a k-dimensional random variable X: x = (x 1, x 2, x 3..., x k ) A joint cumulative probability distribution function of a k-dimensional random variable X: F(x 1, x 2, x 3..., x k ) = P(X 1  x 1, X 2  x 2,..., X k  x k ) 17-2 The Multivariate Normal Distribution

17-6 The Multivariate Normal Distribution

17-7 f(x 1,x 2 ) x1x1 x2x2 Picturing the Bivariate Normal Distribution

17-8 In a discriminant analysis, observations are classified into two or more groups, depending on the value of a multivariate discriminant function. X2X2 X1X1 Group 1 Group 2 11 22 Line L As the figure illustrates, it may be easier to classify observations by looking at them from another direction. The groups appear more separated when viewed from a point perpendicular to Line L, rather than from a point perpendicular to the X 1 or X 2 axis. The discriminant function gives the direction that maximizes the separation between the groups Discriminant Analysis

17-9 Group 1Group 2 C Cutting Score The form of the estimated predicted equation: D = b 0 +b 1 X 1 +b 2 X b k X k where the b i are the discriminant weights. b 0 is a constant. The intersection of the normal marginal distributions of two groups gives the cutting score, which is used to assign observations to groups. Observations with scores less than C are assigned to group 1, and observations with scores greater than C are assigned to group 2. Since the distributions may overlap, some observations may be misclassified. The model may be evaluated in terms of the percentages of observations assigned correctly and incorrectly. The Discriminant Function

17-10 Discriminant 'Repay' 'Assets' 'Debt' 'Famsize'. Group 0 1 Count Summary of Classification Put into....True Group.... Group Total N N Correct Proport N = 32 N Correct = 23 Prop. Correct = Linear Discriminant Function for Group 0 1 Constant Assets Debt Famsize Discriminant 'Repay' 'Assets' 'Debt' 'Famsize'. Group 0 1 Count Summary of Classification Put into....True Group.... Group Total N N Correct Proport N = 32 N Correct = 23 Prop. Correct = Linear Discriminant Function for Group 0 1 Constant Assets Debt Famsize Discriminant Analysis: Example 17-1 (Minitab)

17-11 Summary of Misclassified Observations Observation True Pred Group Sqrd Distnc Probability Group Group 4 ** ** ** ** ** ** ** ** ** Summary of Misclassified Observations Observation True Pred Group Sqrd Distnc Probability Group Group 4 ** ** ** ** ** ** ** ** ** Example 17-1: Misclassified Observations

set width 80 2 data list free / assets income debt famsize job repay 3 begin data 35 end data 36 discriminant groups = repay(0,1) 37 /variables assets income debt famsize job 38 /method = wilks 39 /fin = 1 40 /fout = 1 41 /plot 42 /statistics = all Number of cases by group Number of cases REPAY Unweighted Weighted Label Total set width 80 2 data list free / assets income debt famsize job repay 3 begin data 35 end data 36 discriminant groups = repay(0,1) 37 /variables assets income debt famsize job 38 /method = wilks 39 /fin = 1 40 /fout = 1 41 /plot 42 /statistics = all Number of cases by group Number of cases REPAY Unweighted Weighted Label Total Example 17-1: SPSS Output (1)

D I S C R I M I N A N T A N A L Y S I S On groups defined by REPAY Analysis number 1 Stepwise variable selection Selection rule: minimize Wilks' Lambda Maximum number of steps Minimum tolerance level Minimum F to enter … Maximum F to remove Canonical Discriminant Functions Maximum number of functions Minimum cumulative percent of variance Maximum significance of Wilks' Lambda Prior probability for each group is D I S C R I M I N A N T A N A L Y S I S On groups defined by REPAY Analysis number 1 Stepwise variable selection Selection rule: minimize Wilks' Lambda Maximum number of steps Minimum tolerance level Minimum F to enter … Maximum F to remove Canonical Discriminant Functions Maximum number of functions Minimum cumulative percent of variance Maximum significance of Wilks' Lambda Prior probability for each group is Example 17-1: SPSS Output (2)

Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda ASSETS INCOME DEBT FAMSIZE JOB * * * * * * * * * * * ** * * * * * * * * * * * * * * * * * * * * * At step 1, ASSETS was included in the analysis. Degrees of Freedom Signif. Between Groups Wilks' Lambda Equivalent F Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda ASSETS INCOME DEBT FAMSIZE JOB * * * * * * * * * * * ** * * * * * * * * * * * * * * * * * * * * * At step 1, ASSETS was included in the analysis. Degrees of Freedom Signif. Between Groups Wilks' Lambda Equivalent F Example 17-1: SPSS Output (3)

Variables in the Analysis after Step Variable Tolerance F to Remove Wilks' Lambda ASSETS Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda INCOME DEBT FAMSIZE JOB At step 2, DEBT was included in the analysis. Degrees of Freedom Signif. Between Groups Wilks' Lambda Equivalent F Variables in the Analysis after Step Variable Tolerance F to Remove Wilks' Lambda ASSETS Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda INCOME DEBT FAMSIZE JOB At step 2, DEBT was included in the analysis. Degrees of Freedom Signif. Between Groups Wilks' Lambda Equivalent F Example 17-1: SPSS Output (4)

Variables in the Analysis after Step Variable Tolerance F to Remove Wilks' Lambda ASSETS DEBT Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda INCOME FAMSIZE JOB At step 3, FAMSIZE was included in the analysis. Degrees of Freedom Signif. Between Groups Wilks' Lambda Equivalent F Variables in the Analysis after Step Variable Tolerance F to Remove Wilks' Lambda ASSETS DEBT Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda INCOME FAMSIZE JOB At step 3, FAMSIZE was included in the analysis. Degrees of Freedom Signif. Between Groups Wilks' Lambda Equivalent F Example 17-1: SPSS Output (5)

Variables in the Analysis after Step Variable Tolerance F to Remove Wilks' Lambda ASSETS DEBT FAMSIZE Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda INCOME JOB Summary Table Action Vars Wilks' Step Entered Removed in Lambda Sig. Label 1 ASSETS DEBT FAMSIZE Variables in the Analysis after Step Variable Tolerance F to Remove Wilks' Lambda ASSETS DEBT FAMSIZE Variables not in the Analysis after Step Minimum Variable Tolerance Tolerance F to Enter Wilks' Lambda INCOME JOB Summary Table Action Vars Wilks' Step Entered Removed in Lambda Sig. Label 1 ASSETS DEBT FAMSIZE Example 17-1: SPSS Output (6)

17-18 Classification function coefficients (Fisher's linear discriminant functions) REPAY = 0 1 ASSETS DEBT FAMSIZE (Constant) Unstandardized canonical discriminant function coefficients Func 1 ASSETS DEBT FAMSIZE (Constant) Classification function coefficients (Fisher's linear discriminant functions) REPAY = 0 1 ASSETS DEBT FAMSIZE (Constant) Unstandardized canonical discriminant function coefficients Func 1 ASSETS DEBT FAMSIZE (Constant) Example 17-1: SPSS Output (7)

17-19 Case Mis Actual Highest Probability 2nd Highest Discrim Number Val Sel Group Group P(D/G) P(G/D) Group P(G/D) Scores ** ** ** ** ** ** ** ** ** Case Mis Actual Highest Probability 2nd Highest Discrim Number Val Sel Group Group P(D/G) P(G/D) Group P(G/D) Scores ** ** ** ** ** ** ** ** ** Example 17-1: SPSS Output (8)

17-20 Classification results - No. of Predicted Group Membership Actual Group Cases Group % 28.6% Group % 72.2% Percent of "grouped" cases correctly classified: 71.88% Classification results - No. of Predicted Group Membership Actual Group Cases Group % 28.6% Group % 72.2% Percent of "grouped" cases correctly classified: 71.88% Example 17-1: SPSS Output (9)

17-21 All-groups Stacked Histogram Canonical Discriminant Function | | F | | r e | 2 | q | 2 | u | 2 | e n | | c | | y | | | | X X out out Class Centroids 2 1 All-groups Stacked Histogram Canonical Discriminant Function | | F | | r e | 2 | q | 2 | u | 2 | e n | | c | | y | | | | X X out out Class Centroids 2 1 Example 17-1: SPSS Output (10)

17-22 First Component Second Component x y Total Variance Remaining After Extraction of First Second Third Component 17-4 Principal Components and Factor Analysis

17-23 The k original X i variables written as linear combinations of a smaller set of m common factors and a unique component for each variable: X 1 = b 11 F 1 + b 12 F b 1m F m + U 1 X 1 = b 21 F 1 + b 22 F b 2m F m + U 2. X k = b k1 F 1 + b k2 F b km F m + U k The F j are the common factors. Each U i is the unique component of variable X i. The coefficients b ij are called the factor loadings. Total variance in the data is decomposed into the communality, the common factor component, and the specific part. The k original X i variables written as linear combinations of a smaller set of m common factors and a unique component for each variable: X 1 = b 11 F 1 + b 12 F b 1m F m + U 1 X 1 = b 21 F 1 + b 22 F b 2m F m + U 2. X k = b k1 F 1 + b k2 F b km F m + U k The F j are the common factors. Each U i is the unique component of variable X i. The coefficients b ij are called the factor loadings. Total variance in the data is decomposed into the communality, the common factor component, and the specific part. Factor Analysis

17-24 Rotation of Factors

17-25 Factor Loadings Satisfaction with: Communality Information Variety Closure Pay Factor Loadings Satisfaction with: Communality Information Variety Closure Pay Factor Analysis of Satisfaction Items