Outline of Today’s Discussion 1.Introduction to Discriminant Analysis 2.Assumptions for Discriminant Analysis 3.Discriminant Analysis in SPSS.

Slides:



Advertisements
Similar presentations
MANOVA (and DISCRIMINANT ANALYSIS) Alan Garnham, Spring 2005
Advertisements

Discriminant Analysis and Classification. Discriminant Analysis as a Type of MANOVA  The good news about DA is that it is a lot like MANOVA; in fact.
Sociology 680 Multivariate Analysis Logistic Regression.
Correlation and Linear Regression.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Chapter 17 Overview of Multivariate Analysis Methods
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
1 Canonical Analysis Introduction Assumptions Model representation An output example Conditions Procedural steps Mechanical steps - with the use of attached.
Statistical Methods Chichang Jou Tamkang University.
Discrim Continued Psy 524 Andrew Ainsworth. Types of Discriminant Function Analysis They are the same as the types of multiple regression Direct Discrim.
Discriminant Analysis – Basic Relationships
Discriminant Analysis Objective Classify sample objects into two or more groups on the basis of a priori information.
Analysis of Variance & Multivariate Analysis of Variance
Lecture 5: Simple Linear Regression
Multiple Regression – Basic Relationships
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Discriminant Analysis Testing latent variables as predictors of groups.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
SPSS Session 4: Association and Prediction Using Correlation and Regression.
Repeated Measures ANOVA Used when the research design contains one factor on which participants are measured more than twice (dependent, or within- groups.
Discriminant analysis
Example of Simple and Multiple Regression
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Inferential Statistics: SPSS
One-Way Manova For an expository presentation of multivariate analysis of variance (MANOVA). See the following paper, which addresses several questions:
SPSS Series 1: ANOVA and Factorial ANOVA
CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
Chapter Eighteen Discriminant Analysis Chapter Outline 1) Overview 2) Basic Concept 3) Relation to Regression and ANOVA 4) Discriminant Analysis.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
Discriminant Analysis
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
 Muhamad Jantan & T. Ramayah School of Management, Universiti Sains Malaysia Data Analysis Using SPSS.
17-1 COMPLETE BUSINESS STATISTICS by AMIR D. ACZEL & JAYAVEL SOUNDERPANDIAN 6 th edition (SIE)
Statistical Analysis of Data1 of 38 1 of 42 Department of Cognitive Science Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 MANOVA Multivariate.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
Multiple Discriminant Analysis
Adjusted from slides attributed to Andrew Ainsworth
Review for Final Examination COMM 550X, May 12, 11 am- 1pm Final Examination.
Linear Discriminant Analysis (LDA). Goal To classify observations into 2 or more groups based on k discriminant functions (Dependent variable Y is categorical.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
ANCOVA. What is Analysis of Covariance? When you think of Ancova, you should think of sequential regression, because really that’s all it is Covariate(s)
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
ANOVA, Regression and Multiple Regression March
Outline of Today’s Discussion
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Two-Group Discriminant Function Analysis. Overview You wish to predict group membership. There are only two groups. Your predictor variables are continuous.
Outline of Today’s Discussion 1.Seeing the big picture in MR: Prediction 2.Starting SPSS on the Different Models: Stepwise versus Hierarchical 3.Interpreting.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
ANCOVA.
Unit 7 Statistics: Multivariate Analysis of Variance (MANOVA) & Discriminant Functional Analysis (DFA) Chat until class starts.
ANCOVA (adding covariate) MANOVA (adding more DVs) MANCOVA (adding DVs and covariates) Group Differences: other situations…
D/RS 1013 Discriminant Analysis. Discriminant Analysis Overview n multivariate extension of the one-way ANOVA n looks at differences between 2 or more.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
MANOVA Lecture 12 Nuance stuff Psy 524 Andrew Ainsworth.
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Multivariate vs Univariate ANOVA: Assumptions. Outline of Today’s Discussion 1.Within Subject ANOVAs in SPSS 2.Within Subject ANOVAs: Sphericity Post.
Methods of Presenting and Interpreting Information Class 9.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
BINARY LOGISTIC REGRESSION
MANOVA Dig it!.
Logistic Regression APKC – STATS AFAC (2016).
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Discriminant Analysis – Basic Relationships
Linear Discriminant Analysis
Multivariate Statistics
Presentation transcript:

Outline of Today’s Discussion 1.Introduction to Discriminant Analysis 2.Assumptions for Discriminant Analysis 3.Discriminant Analysis in SPSS

Part 1 Introduction To Discriminant Analysis

Definition 1.Discriminant Analysis - A procedure for (1) determining whether two or more mutually exclusive groups can be distinguished from each other based on linear combinations of predictor variables, and (2) determining which variables contribute to that separation. 2.Generic Linear Discriminant Analysis Equation: Score = aX 1 + bX 2 + cX 3 + dX 4 + intercept 3.Why is the equation above said to be linear?

Definition 1.Generic Linear Discriminant Analysis Equation: Score = aX 1 + bX 2 + cX 3 + dX 4 + intercept 2.To the extent that the scores resulting from the Linear Discriminant Analysis (LDA) form two or more distinct (mutually exclusive) groups, the LDA is successful. 3.Example: Scores ranging from… 10 to 20  Group 1 60 to 70  Group 2

Definition 1.Additional LDAs could be used to sort scores on separate, ORTHOGONAL, dimensions of variability! 2.Dim 2 Score = eX 5 + fX 6 + gX 7 + hX 8 + intercept 3.Dim 3 Score = eX 9 + fX 10 + gX 11 + hX 12 + intercept 4.Dim 4 Score = iX 13 + jX 14 + kX 15 + lX 16 + intercept

Questions Addressed by LDA 1.What are the linear combinations of predictors whose average values best separate the groups? (On average, which combos work best?) 2.Are the linear combinations useful in actually classifying cases into groups? (What’s the accuracy?) 3.Is there a smaller subset of predictors that captures the differences between the groups? (What’s the parsimony?)

Sample LDA Research Questions 1.Predict which universities will fail financially, based on endowment size, debt, operating budget, and alumni support. 2.Predict which students should skip PSYC 100, based on SAT scores, grades in HS psyc classes, HS class rank, and Psyc AP score. 3.Predict which clients will relapse, based on education level, income, number of mental health hospitalizations, family stability, frequency of contact with drug / alcohol abusers. 4.Predict which children on the autistic spectrum will benefit from Applied Behavior Analysis (ABA), based on family SES, parental attachment scores, IQ, parental motivation, number of siblings.

Relation to Other Stats 1.LDA is similar to the ANOVA family of stats because… 2.It is a linear model. 3.It is based on means (it is parametric). 4.It can be univariate (anova) or multivariate (manova).

Relation to Other Stats 1.In the special case of prediction for exactly two groups, LDA is similar to multiple regression. 2.In the two-group case, the LDA coefficients and multiple regression coefficients will be proportional to each other (will differ from each other by only a simple multiplicative scaling factor) if the MR is computed with “Group” as the DV (criterion). This proportionality is only true in the 2-group case. 3.Informally, LDA is the “reverse” of MR and (M)ANOVA. In MR and (M)ANOVA, we typically use group as an IV, not a DV. In LDA, one or more DVs are used to predict group membership, a typical IV!

Relation to Other Stats 1.LDA is also similar to Factor Analysis in that both procedures use linear combinations of measured variables to “extract” (latent) ORTHOGONAL trends. 2.However, in LDA the researcher *must* explicitly identify variables as DVs or IVs. 3.By contrast, in Factor Analysis, variables are just variables; researchers do not identify DVs or IVs as such!

Part 2 Assumptions for Discriminant Analysis

Discriminant Analysis Assumptions 1.Perhaps counter-intuitively, a discriminant analysis requires a data set for which each participant’s group membership is already known. 2.So why bother “predicting” group membership if it’s already known? Answer: We use the current known- memberships to derive LDA equations for classifying future cases whose group membership is NOT known. (We’re building a model!)

Discriminant Analysis Assumptions 1.If the cases in your data set are not already assigned to groups, you can’t use LDA.  2.Note: There is an alternate multivariate procedure called Cluster Analysis, which discerns groups (“clusters”) in data sets where group membership is NOT known. 3.We won’t do a cluster analysis (nor its mighty big-sibling, hierarchical cluster analysis) this semester. 

Discriminant Analysis Assumptions 1.LDA’s assumption set is similar to that of many other linear models in the ANOVA family. 2.Assumptions: Normalcy Equal Variance Independence Note: The LDA and other ANOVA-like linear models can be robust to violations of the normalcy assumption to the extent that the groups have equal n. No such luck with the other two assumptions.

Discriminant Analysis Assumptions 1.If your LDA data set does not meet the normalcy or equal variance assumptions, there is another statistical procedures that might help... 2.Logistic Regression can predict binary group membership, like LDA, but without assuming either normalcy or equal variance. 3.Note: If the assumptions underlying LDA are met, LDA is often more sensitive (more likely to produce significance) than LR is. We won’t learn LR this semester. 

Part 3 Discriminant Analysis In SPSS

Discriminant Analysis in SPSS 1.Assume we would like to build an LDA to address the issue of criminal recidivism. 2.We will use a data set to classify 20 current prison inmates based on whether they had or had not been previously incarcerated. 3.In our SPSS Variable View, we will use a “1” for the group that had NOT been previously incarcerated, and a “2” for prior incarceration. 4.We will use the values on the following variables for the prediction: An Alcohol Abuse Test A Drug Abuse Test An Anger Test

Discriminant Analysis in SPSS 1.Click Analyze, Classify, and Discriminant. 2.Click your group variable and move it to the Grouping Variable box. 3.Click Define Range, and enter your minimum & maximum values. 4.Click Continue. 5.Click on the variables to be used for prediction; move to Independent box. 6.Click Statistics and click Means and univariate ANOVAs. 7.Click all four options in the Matrices box. 8.Click continue and Classify. 9.Click All Groups Equal (if equal n, or group size proportional to N). Else, click “Compute Group from Group Sizes”. 10.Click Combined Groups in the Plot Box and Within Groups in the Covariance Matrix box. 11.Click Summary Table in the Display box and Continue. 12.Click Save and Predicted Group Membership. 13.Click Continue and OK.

Discriminant Analysis in SPSS First of Three Helpful Output Boxes (occurs in the “Discriminant” Section) Here, you might informally ask yourself, “based on the means, which variables distinguish the groups?”

Discriminant Analysis in SPSS Second of Three Helpful Output Boxes (occurs under “Analysis 1. Summary of Cannonical Discriminant Functions”) Wilks’ Lambda also occurs in MANOVA. It reflects the proportion of variance NOT explained by the model. Lower Wilks’ Lambda scores are better. Our model’s predictions are significantly better than chance! ( p = )

Discriminant Analysis in SPSS Third of Three Helpful Output Boxes (occurs under “Classification Statistics”) Our model has 80% accuracy…beats a coin flip!

Discriminant Analysis in SPSS Hey Look! A New Variable Appears Automatically In Our Data View! (Better than Christmas!) The new “Dis_1” variable shows us what group (1 vs 2) was predicted for each case. Our Linear Discriminant Analysis misclassified cases 6,8,10, & 20.

Discriminant Analysis in SPSS 1.To summarize, we used a discriminant analysis to assess how well three variables (alcohol score, drug score, anger score) could predict whether an inmate had been previously incarcerated. 2.The Wilks’ Lambda was significant, meaning that our discriminant function produced a significant difference between the two target groups. 3.Our linear classifier (our LDA) “grouped” the inmates with an accuracy of 80%. 4.More generally, we can use LDA to predict membership in a wide range of groups…including clinically relevant DSM groups! (And there was much rejoicing!)