Statistical Software “SAMMIF” Okayama Univ. of Science Graduate School, Okayama Univ. Kurashiki Univ. of Science and the Arts Okayama Univ. Yuichi MORI.

Slides:



Advertisements
Similar presentations
Analysis of multivariate transformations. Transformation of the response in regression The normalized power transformation is: is the geometric mean of.
Advertisements

I OWA S TATE U NIVERSITY Department of Animal Science Using Basic Graphical and Statistical Procedures (Chapter in the 8 Little SAS Book) Animal Science.
Aggregating local image descriptors into compact codes
PCA for analysis of complex multivariate data. Interpretation of large data tables by PCA In industry, research and finance the amount of data is often.
Confirmatory Factor Analysis
Outline input analysis input analyzer of ARENA parameter estimation
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
12-1 Multiple Linear Regression Models Introduction Many applications of regression analysis involve situations in which there are more than.
1 A Common Measure of Identity and Value Disclosure Risk Krish Muralidhar University of Kentucky Rathin Sarathy Oklahoma State University.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Psychology 202b Advanced Psychological Statistics, II April 5, 2011.
The Simple Linear Regression Model: Specification and Estimation
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Linear statistical models 2008 Model diagnostics  Residual analysis  Outliers  Dependence  Heteroscedasticity  Violations of distributional assumptions.
Independent Component Analysis (ICA) and Factor Analysis (FA)
A Structural Equation Modeling Approach to Students’ Homework Assignment Web Sites Usage Emel DIKBAS TORUN Hacettepe University Eralp ALTUN Ege University.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
AMOS TAKING YOUR RESEARCH TO THE NEXT LEVEL Mara Timofe Research Intern.
Factor Analysis Psy 524 Ainsworth.
G Lecture 61 G SEM Lecture 6 An Example Measures of Fit Complex nonrecursive models How can we tell if a model is identified? Direct and.
Statistics for Marketing & Consumer Research Copyright © Mario Mazzocchi 1 Correspondence Analysis Chapter 14.
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
Basic R Programming for Life Science Undergraduate Students Introductory Workshop (Session 1) 1.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
The Multiple Correlation Coefficient. has (p +1)-variate Normal distribution with mean vector and Covariance matrix We are interested if the variable.
Computational Methods of Scientific Programming Lecturers Thomas A Herring, Room A, Chris Hill, Room ,
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
1 Variable selection for factor analysis and structural equation models Yutaka Kano & Akira Harada Osaka University International Symposium on Structural.
R2WinBUGS: Using R for Bayesian Analysis Matthew Russell Rongxia Li 2 November Northeastern Mensurationists Meeting.
Hyperparameter Estimation for Speech Recognition Based on Variational Bayesian Approach Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee and Keiichi.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
Anaregweek11 Regression diagnostics. Regression Diagnostics Partial regression plots Studentized deleted residuals Hat matrix diagonals Dffits, Cook’s.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Advanced Statistics Factor Analysis, II. Last lecture 1. What causes what, ξ → Xs, Xs→ ξ ? 2. Do we explore the relation of Xs to ξs, or do we test (try.
This supervised learning technique uses Bayes’ rule but is different in philosophy from the well known work of Aitken, Taroni, et al. Bayes’ rule: Pr is.
June 26, 2003 Dept. of Computer Science Memorial University of Newfoundland Structuring Interactive Cluster Analysis Wayne Oldford University of Waterloo.
Worked Example Using R. > plot(y~x) >plot(epsilon1~x) This is a plot of residuals against the exploratory variable, x.
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
CpSc 881: Machine Learning
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
Bayesian Speech Synthesis Framework Integrating Training and Synthesis Processes Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda Nagoya Institute.
Examples. Path Model 1 Simple mediation model. Much of the influence of Family Background (SES) is indirect.
ALISON BOWLING CONFIRMATORY FACTOR ANALYSIS. REVIEW OF EFA Exploratory Factor Analysis (EFA) Explores the data All measured variables are related to every.
Tutorial I: Missing Value Analysis
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
1 Tom Edgar’s Contribution to Model Reduction as an introduction to Global Sensitivity Analysis Procedure Accounting for Effect of Available Experimental.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
Bursts modelling Using WinBUGS Tim Watson May 2012 :diagnostics/ :transformation/ :investment planning/ :portfolio optimisation/ :investment economics/
CSE 4705 Artificial Intelligence
Regression Models for Linkage: Merlin Regress
CHAPTER 3 Describing Relationships
Chapter 6 Diagnostics for Leverage and Influence
Chapter 15 Confirmatory Factor Analysis
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
COMP 1942 PCA TA: Harry Chan COMP1942.
Inference for the mean vector
CJT 765: Structural Equation Modeling
Ch11 Curve Fitting II.
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
Topic 11: Matrix Approach to Linear Regression
Latest features Frankfurt, 7 June 2016
Presentation transcript:

Statistical Software “SAMMIF” Okayama Univ. of Science Graduate School, Okayama Univ. Kurashiki Univ. of Science and the Arts Okayama Univ. Yuichi MORI Yoshiro YAMAMOTO Shingo WATADANI Yoshimasa ODAKA Tomoyuki TARUMI Yutaka TANAKA SAMMIF Project Sensitivity Analysis in Multivariate Methods based on Influence Functions

KJCS-97 , Outline of our presentation About SAMMIF –Motivation and process of developments. –Ideas in SAMMIF. –General procedure and flow of “SAMMIF”. Demonstration of “SAMMIF” –Exploratory factor analysis. open/closed book data (Mardia et al, 1979) Discussion and Future plans

KJCS-97 , Motivation for development Sensitivity analysis procedures using influence functions and their analogues; SAM ( Tarumi and Tanaka, 1986 ), SAF/B ( Odaka, Watadani and Tanaka, 1991 ), SAF/S ( Inoue, Odaka and Tanaka, 1991 ), SAM II ( Mori and Tarumi, 1993 ), SACS ( Watadani and Tanaka, 1994 ), etc. –developed for a specified family of multivariate methods separately, –most of them run on MS-DOS (BASIC) platform. Necessary to develop a new program; –unifying any multivariate method where influence functions or their analogues are available, –reinforcing GUI under the Windows environment.

KJCS-97 , Sensitivity analysis Data Multivariate method Influence of the i-th observation ? Results θ ^ Data the i-th obs. Results ( i ) θ ( i ) ^ Multivariate method i = 1,..., n

KJCS-97 , Empirical Influence Function Introduce perturbation to the cdf F,, where is the cdf of a unit point mass at x. Empirical Influence Function ( EIF ),.

KJCS-97 , Empirical Influence Function Influence of k observations –approximation of,. Influence of one observation Influences of subsets, influential individually, similar influence patterns. Canonical Variate Analysis Cook’s Local Influence

KJCS-97 , Sensitivity analysis based on influence functions Data n obs. Data n obs. EIF Vectors, More than one Vectors, More than one Parameters Observable measures Observable measures Summarizing them into scalar measures Summarizing them into scalar measures Influence of multiple obs. Influence of multiple obs. Applying CVA or PCA to EIF Applying CVA or PCA to EIF

KJCS-97 , Flow of SAMMIF Data entry Prior analysis Single-case diagnostics Multiple-case diagnostics Posterior analysis Comparison (Pre / Post) 1) Data 2) Pre Estimate ordinarily using all observations. 3) Diagnostics 3-1) SD Compute EIFs and summarize them to influence measures. 3-2) MD Apply CVA or PCA to EIFs. 4) Post Estimate without observations of interest. 5) Comp Compare results in 4) with in 2).

KJCS-97 , Flow of SAMMIF Data entry Prior analysis Single-case diagnostics Multiple-case diagnostics Posterior analysis Comparison (Pre / Post) 1) Data 2) Pre Estimate ordinarily using all observations. 3) Diagnostics 3-1) SD Compute EIFs and summarize them to influence measures. 3-2) MD Apply CVA or PCA to EIFs. 4) Post Estimate without observations of interest. 5) Comp Compare results in 4) with in 2). Influence measures 1) Influence on the estimate ; (a) Euclidean norm, (b) Generalized Cook's distance. 2) Influence on the precision; (c) COVRATIO-like measure. 3) Influence on the goodness-of-fit; (d) Change of X 2, (e) Change of goodness-of-fit index (GFI), (f) Change of root mean square residual (RMR). Influence measures 1) Influence on the estimate ; (a) Euclidean norm, (b) Generalized Cook's distance. 2) Influence on the precision; (c) COVRATIO-like measure. 3) Influence on the goodness-of-fit; (d) Change of X 2, (e) Change of goodness-of-fit index (GFI), (f) Change of root mean square residual (RMR).

KJCS-97 , Features of SAMMIF (Version0.8) Developing tool: MS-Visual Basic Language: English / Japanese Included method: Factor analysis –Sensitivity analysis in Exploratory FA –Sensitivity analysis in Confirmatory FA Features (other than Windows functions) : –Clicablemap-type flowchart, –Graphical displays, –Brief tutorial, Hint options, –Standard / detail outputs, –Intermediate reports, savable outputs.

KJCS-97 , Example (Demonstration) Factor analysis –Basic model, f :common factor , e:unique factor , : mean vector :Factor loading matrix –Covariance matrix of x, Estimated parameters Example –Method: Exploratory factor analysis –Data: “open/closed book data (Mardia et al, 1979)”, –88 observations, 5 variables.

KJCS-97 , Demonstration of “SAMMIF” Demonstration of “SAMMIF” Demonstration of “SAMMIF” Demonstration of “SAMMIF” Method: Exploratory factor analysis Data: “open/closed book data (Mardia et al, 1979)” 88 observations, 5 variables.

KJCS-97 , Discussion and Future plans SAMMIF –provides important information easily, –for diagnostic checking of the data. Future plans –Include other multivariate methods: LISREL type covariance structure analysis, PCA, Canonical correlation analysis, etc. –Other type of perturbation scheme –Improve convenience: for both beginners and experts, various options, Interface with other software, enrichment of helps and consultation.

KJCS-97 , SAMMIF Sensitivity Analysis in Multivariate Methods based on Influence Functions Contact to Yuichi MORI Okayama University of Science Dept. Socio-Information Distribution Exe files Brief tutorial file Sample data Manual (in HTML) English / Japanese Publication On the Web okayama-u.ac.jp ous.ac.jp kusa.ac.jp January, 1998