Multivariate Data Exploration with Stata: Evaluation and Wish List Stephen Soldz Boston Graduate School of Psychoanalysis

Slides:



Advertisements
Similar presentations
Factor Analysis Continued
Advertisements

Exploratory Factor Analysis
PCA + SVD.
Overview of field trial analysis procedures National Research Coordinators Meeting Windsor, June 2008.
Lecture 7: Principal component analysis (PCA)
Psychology 202b Advanced Psychological Statistics, II April 7, 2011.
LISA Short Course Series Multivariate Analysis in R Liang (Sally) Shan March 3, 2015 LISA: Multivariate Analysis in RMar. 3, 2015.
Principal Components An Introduction Exploratory factoring Meaning & application of “principal components” Basic steps in a PC analysis PC extraction process.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Principal Components An Introduction exploratory factoring meaning & application of “principal components” Basic steps in a PC analysis PC extraction process.
Factor Analysis Purpose of Factor Analysis Maximum likelihood Factor Analysis Least-squares Factor rotation techniques R commands for factor analysis References.
Factor Analysis Research Methods and Statistics. Learning Outcomes At the end of this lecture and with additional reading you will be able to Describe.
Factor Analysis Purpose of Factor Analysis
Canonical correlations
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Bayesian belief networks 2. PCA and ICA
Principal Components An Introduction
Statistics for Marketing & Consumer Research Copyright © Mario Mazzocchi 1 Correspondence Analysis Chapter 14.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Chapter 2 Dimensionality Reduction. Linear Methods
DATA MINING from data to information Ronald Westra Dep. Mathematics Knowledge Engineering Maastricht University.
Additive Data Perturbation: data reconstruction attacks.
Canonical Correlation Analysis and Related Techniques Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia.
Introduction to SAS Essentials Mastering SAS for Data Analytics Alan Elliott and Wayne Woodward SAS ESSENTIALS -- Elliott & Woodward1.
Chapter 9 Factor Analysis
Factor Analysis Psy 524 Ainsworth. Assumptions Assumes reliable correlations Highly affected by missing data, outlying cases and truncated data Data screening.
N– variate Gaussian. Some important characteristics: 1)The pdf of n jointly Gaussian R.V.’s is completely described by means, variances and covariances.
Thursday AM  Presentation of yesterday’s results  Factor analysis  A conceptual introduction to: Structural equation models Structural equation models.
Factor Analysis Revealing the correlational structure among variables Understanding & Reducing Complexity.
Factor Analysis ( 因素分析 ) Kaiping Grace Yao National Taiwan University
Principal Component vs. Common Factor. Varimax Rotation Principal Component vs. Maximum Likelihood.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
Why to reduce the number of the features? Having D features, we want to reduce their number to n, where n
Principal Components Analysis and Factor Analysis by Dr. Winai Bodhisuwan.
Math 5364/66 Notes Principal Components and Factor Analysis in SAS Jesse Crawford Department of Mathematics Tarleton State University.
Chapter 7 Multivariate techniques with text Parallel embedded system design lab 이청용.
Introduction to Multivariate Analysis of Variance, Factor Analysis, and Logistic Regression Rubab G. ARIM, MA University of British Columbia December 2006.
Dimension Reduction in Workers Compensation CAS predictive Modeling Seminar Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc.
Lecture 12 Factor Analysis.
Exploratory Factor Analysis. Principal components analysis seeks linear combinations that best capture the variation in the original variables. Factor.
Tutorial I: Missing Value Analysis
Factor Analysis I Principle Components Analysis. “Data Reduction” Purpose of factor analysis is to determine a minimum number of “factors” or components.
Chapter 4 –Dimension Reduction Data Mining for Business Analytics Shmueli, Patel & Bruce.
Feature Extraction 主講人:虞台文. Content Principal Component Analysis (PCA) PCA Calculation — for Fewer-Sample Case Factor Analysis Fisher’s Linear Discriminant.
Feature Extraction 主講人:虞台文.
Principal Components Analysis ( PCA)
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
FACTOR ANALYSIS & SPSS. First, let’s check the reliability of the scale Go to Analyze, Scale and Reliability analysis.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
CSE 554 Lecture 8: Alignment
EXPLORATORY FACTOR ANALYSIS (EFA)
COMP 1942 PCA TA: Harry Chan COMP1942.
School of Computer Science & Engineering
Factor Analysis An Alternative technique for studying correlation and covariance structure.
Factor analysis Advanced Quantitative Research Methods
Principal Component Analysis (PCA)
Applied Statistics Using SAS and SPSS
Measuring latent variables
Bayesian belief networks 2. PCA and ICA
Measuring latent variables
Descriptive Statistics vs. Factor Analysis
Multivariate Statistics
Factor Analysis An Alternative technique for studying correlation and covariance structure.
Principal Component Analysis
Applied Statistics Using SPSS
Lecture 8: Factor analysis (FA)
因子分析.
Exploratory Factor Analysis. Factor Analysis: The Measurement Model D1D1 D8D8 D7D7 D6D6 D5D5 D4D4 D3D3 D2D2 F1F1 F2F2.
Factor Analysis.
Presentation transcript:

Multivariate Data Exploration with Stata: Evaluation and Wish List Stephen Soldz Boston Graduate School of Psychoanalysis

Principal Components Analysis Purpose: Data exploration and data reduction Available in Stata Base ado ( pca ) Built-in ( factor, pcf ) score will produce component scores Issues/Limitations pca just a wrapper for (now undocumented) pc option to factor, which user cannot access and modify Confusing documentation on difference between pca and factor, pcf (i.e., scaling of eigenvectors) Does not directly allow pca of correlation/ covariance matrix – must use corr2data, introducing error Does not allow rotate to “protect” user – seems patronizing and uncharacteristic of Stata

Exploratory Factor Analysis Purpose: Data exploration and data reduction Available in Stata Built-in factor allows principal factors (with and without iteration of communalities), maximum likelihood Built-in rotate allows varimax (with and without Horst correction) and promax Issues/Limitations  factor, pfi (prinipal factors with iteration) does not allow specification of number of times to iterate – this directly conflicts with Gorsuch (1983) recommendation that communalities be iterated only 3-4 times  As factor built-in, users cannot modify or build on it rotate options very limited (only varimax and promax) and users cannot modify, though they could access eigenvectors ( matix_get ) and write their own

Exploratory Factor Analysis, Continued Available in Stata Built-in factor allows principal factors (with and without iteration of communalities), maximum likelihood Built-in rotate allows varimax (with and without Horst correction) and promax Issues/Limitations  rotate not well documented, so not clear if one could, e.g., rotate canonical correlations as suggested by Cliff & Krus (1976).

Correspondence Analysis Purpose: Data exploration and reduction of categorical data Available in Stata User-written coranal (correspondence analysis) User-written mca (multiple correspondence analysis) Issues/Limitations Graphics broken in Stata 8 Statalist question as to whether mca is producing correct output Few variations implemented

Optimal Scaling Purpose: Data exploration, reduction, and transformation Available in Stata None (that I’m aware of) Issues/Limitations

Multidimensional Scaling Purpose: Data exploration Available in Stata None (that I’m aware of) Issues/Limitations

Conclusion Stata is weak in”multivariate exploratory data analysis” procedures. Many existing procedures are inflexible and not extensible, or user-contributed and not currently maintained. Stata lags behind SPSS, SAS, S-Plus, and R in this area.