Advanced statistics for master students Correspondence analysis.

Slides:



Advertisements
Similar presentations
Introduction to Stats Honors Analysis. Data Analysis Individuals: Objects described by a set of data. (Ex: People, animals, things) Variable: Any characteristic.
Advertisements

Mutidimensional Data Analysis Growth of big databases requires important data processing.  Need for having methods allowing to extract this information.
Univariate Statistics 9/18/2012. Readings Chapter 2 Measuring and Describing Variables (Pollock) (pp.32-33) Chapter 2 Descriptive Statistics (Pollock.
SPSS Session 1: Levels of Measurement and Frequency Distributions
Correspondence Analysis
Contingency tables and Correspondence analysis
Data Analysis Statistics. OVERVIEW Getting Ready for Data Collection Getting Ready for Data Collection The Data Collection Process The Data Collection.
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
Dr. Michael R. Hyman, NMSU Statistics for a Single Measure (Univariate)
Introduction to Educational Statistics
Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New.
Contingency tables and Correspondence analysis Contingency table Pearson’s chi-squared test for association Correspondence analysis using SVD Plots References.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
Crosstabs and Chi Squares Computer Applications in Psychology.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Statistics for Marketing & Consumer Research Copyright © Mario Mazzocchi 1 Correspondence Analysis Chapter 14.
Exploring Marketing Research William G. Zikmund Chapter 20: Basic Data Analysis.
CHAPTER 8 Basic Data Analysis for Quantitative Research ESSENTIALS OF MARKETING RESEARCH Hair/Wolfinbarger/Ortinau/Bush.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Statistical Methods For Health Research. History Blaise Pascl: tossing ……probability William Gossett: standard error of mean “ how large the sample should.
Range, Variance, and Standard Deviation in SPSS. Get the Frequency first! Step 1. Frequency Distribution  After reviewing the data  Start with the “Analyze”
Week 10 Chapter 10 - Hypothesis Testing III : The Analysis of Variance
The introduction to SPSS Ⅱ.Tables and Graphs for one variable ---Descriptive Statistics & Graphs.
Analyzing and Interpreting Quantitative Data
Sociological metodology Quantification Petr Soukup.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
BUSINESS MARKET RESEARCH
Advanced statistics for master students Loglinear models.
CADA Final Review Assessment –Continuous assessment (10%) –Mini-project (20%) –Mid-test (20%) –Final Examination (50%) 40% from Part 1 & 2 60% from Part.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
MK346 – Undergraduate Dissertation Preparation Part II - Data Analysis and Significance Testing.
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
Correspondence Analysis Ahmed Rebai Center of Biotechnology of Sfax.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Active Learning Lecture Slides For use with Classroom Response Systems Chapter 5 Association between Categorical.
SPSS Instructions for Introduction to Biostatistics Larry Winner Department of Statistics University of Florida.
Notes 9.6 – Statistics and Data - Graphically. I. Variables A.) Def: characteristics of individuals being identified or measured. 1.) CATEGORICAL – Class.
Reasoning in Psychology Using Statistics Psychology
Fundamental Statistics in Applied Linguistics Research Spring 2010 Weekend MA Program on Applied English Dr. Da-Fu Huang.
SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.
Statistical Analysis using SPSS Dr.Shaikh Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
Basic Data Analysis for Quantitative Research Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Advanced statistics for master students Loglinear models II The best model selection and models for ordinal variables.
Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
1 Frequency Distributions. 2 After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get.
Introduction to statistics I Sophia King Rm. P24 HWB
Multidimensional Scaling and Correspondence Analysis © 2007 Prentice Hall21-1.
1. State the null and alternative hypotheses. 2. Select a random sample and record observed frequency f i for the i th category ( k categories) Compute.
Multidimensional Scaling
Elementary Analysis Richard LeGates URBS 492. Univariate Analysis Distributions –SPSS Command Statistics | Summarize | Frequencies Presents label, total.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Test of Independence Tests the claim that the two variables related. For example: each sample (incident) was classified by the type of crime and the victim.
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
Introduction to SPSS SOCI 301 Lab session.
Statistics in SPSS Lecture 2
Chapter 3 Describing Data Using Numerical Measures
Data Analysis & Report Writing
Analyzing and Interpreting Quantitative Data
Statistics in SPSS Lecture 4
Statistics in SPSS Lecture 9
Computing A Variable Mean
 .
Correspondence Analysis
Presentation transcript:

Advanced statistics for master students Correspondence analysis

zz zzz z Correspondence analysis Literature: Hebák a kol., 3. díl SPSS Categories 8.0 Chicago, SPSS Inc. Introduction to SPSS Optimal Scaling Procedures for Categorical Data, pp. 1-20, Corresponce Analysis pp Jobson, J.D.: Applied Multivariate Data AnalysisVolume II: Categorical and Multivariate Methods, Springer-Verlag, Berlin, 1992 Aplication in Czech: Mareš, P.: Češi: Zaměstnání a práce-jak jsou Češi spokojeni ve své práci, Sociální studia 6: , 2001 Vlachová, K.: Stranická identifikace v České republice, Sociologické texty č.5, 2000, SoÚ AV ČR Matějů, P., Vlachová, K. a kol.: Nerovnost, spravedlnost, politika, Slon, Praha, str , 2000

zz zzz z Corresponce Analysis Historical note: Metod originated in France (70 ties, Jean-Paul Benzérci), from the middle of 80-ties in English Contingency table-base for this technique Well-known facts: Chi-square test and adjusted residuals (sign scheme)

zz zzz z Correspondence analysis The goal – to find deviations from independency in contingency table (resp. in correspondence table) in chart The result –chart with categories of row and column variables Interpretation – distances between categories of row and column variables (electorate, brands) variables-nominal or ordinal

zz zzz z Correspondence analysis characteristics of technique – quite pretty, nice charts, used by academics as well as in business correspondence table – divide absolute frequencies by the number of respondents (sometimes in % as total percentages) Example - correspondence table of accused in EXCEL – file corresp.xls

zz zzz z Correspondence analysis -logic Row and column profiles – looking for departures from independence in correspondence table (we use row and column profiles – simple analogy to row and column percentages) Graphical presentation – see again corresp.xls (interpretation) SPSS – file corresp.sps) Three plots – row profile plot, column profile plot and biplot - Different information in individual plots - Biplot –the most common output

zz zzz z Correspondence analysis Example on data –affiliation of Czech parties electorate Vlachová, K.. Stranická identifikace v České republice. Sociologické texty č Menu SPSS Analyze-Data Reduction-Correspondence Analysis Number of dimensions –selection (2 possibilities) =max. nr. of rows or columns (min of them) minus 1 - 1) According to inertia - „analogy" to explained variance in factor analysis - 2) based on our task Most common is to use 2 dimensions (simple to plot and interpret)

zz zzz z Distance measure Chi-square- for classical contingency table Normalization -symmetrical- for relations between categories of both variables -row principal – for relations between categories of row variable -column principal - for relations between categories of row variable -principal – combination of row and colum but no possibility to draw biplot -custom – from -1 thru 1 from column (-1) thru row principal (1), 0=symmetrical;

zz zzz z Standardization -remove row and column means – centering of variables – the best for most purposes Outputs (Plots) - Biplot - row points - column points

zz zzz z Outputs –interpretation -inertia and its interpretation - Row and column scores tables, interpretation of dimensions, explanation of individual points - Biplot, row and column plots - Changes of plots with row and column normalization

zz zzz z Outputs in SPSS -Plot in EXCEL use Score in Dimension from tables row and column scores -example !!!the same scaling on x and y axes is necessary!!!! Other possibilities in correspondence analysis - supplemental categories- picture based on categories not including supplementak, supplemental are used for this space- příklad-next party (SPR-RSČ) to common space of 5 parties – menu and command

HW Try to use correspondence analysis on your data (at least two variables with 5 categories)