By Wendiann Sethi Spring 2011.  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.

Slides:



Advertisements
Similar presentations
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Advertisements

CHOOSING A STATISTICAL TEST © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Statistical Tests Karen H. Hagglund, M.S.
Basic Data Analysis for Quantitative Research
WENDIANN SETHI SPRING 2011 SPSS ADVANCED ANALYSIS.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
QUANTITATIVE DATA ANALYSIS
MSc Applied Psychology PYM403 Research Methods Quantitative Methods I.
QM Spring 2002 Business Statistics SPSS: A Summary & Review.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chapter 19 Data Analysis Overview
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 18-1 Chapter 18 Data Analysis Overview Statistics for Managers using Microsoft Excel.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Statistical Analysis KSE966/986 Seminar Uichin Lee Oct. 19, 2012.
Inferential Statistics
Leedy and Ormrod Ch. 11 Gray Ch. 14
Understanding Research Results
Statistical analyses. SPSS  Statistical analysis program  It is an analytical software recognized by the scientific world (e.g.: the Microsoft Excel.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Inferential Statistics: SPSS
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
Research Methods in Human-Computer Interaction
Hypothesis of Association: Correlation
2 Categorical Variables (frequencies) Testing mean differences of a continuous variable between groups (categorical variable) 2 Continuous Variables 2.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
What is SPSS  SPSS is a program software used for statistical analysis.  Statistical Package for Social Sciences.
DESCRIPTIVE STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Recap of data analysis and procedures Food Security Indicators Training Bangkok January 2009.
12: Basic Data Analysis for Quantitative Research.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
ANALYSIS PLAN: STATISTICAL PROCEDURES
Psychology 820 Correlation Regression & Prediction.
Angela Hebel Department of Natural Sciences
Statistical Analysis. Z-scores A z-score = how many standard deviations a score is from the mean (-/+) Z-scores thus allow us to transform the mean to.
SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.
Chap 18-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 18-1 Chapter 18 A Roadmap for Analyzing Data Basic Business Statistics.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
PART 2 SPSS (the Statistical Package for the Social Sciences)
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Nonparametric Statistics
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Chapter 15 Analyzing Quantitative Data. Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Chapter 18 Data Analysis Overview Yandell – Econ 216 Chap 18-1.
Introduction to Marketing Research
Nonparametric Statistics
Dr. Siti Nor Binti Yaacob
CHOOSING A STATISTICAL TEST
Dr. Siti Nor Binti Yaacob
Social Research Methods
Understanding Research Results: Description and Correlation
Nonparametric Statistics
Introduction to Statistics
Basic Statistical Terms
Chapter Fourteen Data Preparation
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Hypothesis Testing Part 2: Categorical variables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Parametric versus Nonparametric (Chi-square)
Presentation transcript:

By Wendiann Sethi Spring 2011

 The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods of data exploration using crosstabulations, inferences on the mean, regression and ANOVA. Students are encouraged to bring data that they are analyzing in class or projects to discuss what methods would be best to use.

 Descriptive statistics  Identifying outliers  Missing data – by variable, by respondent  Manipulating data – ◦ reversing the scale ◦ Collapsing a continuous variable into groups  Choosing the right statistic  Data Analysis: ◦ Cross tabulations ◦ Correlations and regression ◦ Tests about mean and proportions ◦ ANOVA  Multiple Responses

 Measures of central tendency  Measures of spread  Analyze> Descriptive Statistics What do you use for each type of variable and why?

 Look at the mean, median, st.dev and skewness to determine if there might be outliers  Could also use a box plot to see

Two potential problems with missing data: 1. Large amount of missing data – number of valid cases decreases – drops the statistical power 2. Nonrandom missing data – either related to respondent characteristics and/or to respondent attitudes – may create a bias

Examine missing data By variable By respondent By analysis If no problem found, go directly to your analysis If a problem is found: Delete the cases with missing data Try to estimate the value of the missing data

 Use Analyze > Descriptive Statistics > Frequencies  Look at the frequency tables to see how much missing  If the amount is more than 5%, there is too much. Need analyze further.

1. Use transform>count 2. Create NMISS in the target variable 3. Pick a set of variables that have more missing data 4. Click on define values 5. Click on system- or user-missing 6. Click add 7. Click continue and then ok 8. Use the frequency table to show you the results of NMISS

 Use Analyze>descriptive statistics>crosstabs  Look to see if there is a correlation between NMISS (row) and another variable (column)  Use column percents to view the % of missing for the value of the variable

 Proceed anyway  Estimate (impute) the missing data with substituting the mean or median value

 Recoding  Calculating  When to create a new variable versus creating a new one.

 What do you want to explore?  What do you need in your data to do that exploration?

 Analyze>Descriptive Statistics>Crosstab  Good for categorical data to see the relationship between two or more variables  Statistics: correlation, Chi Square, association  Cells: Percentages – row or column  Cluster bar charts

 Finding the relationship between two scale or ordinal variables.  Analyze > Correlate > bivariate  Analyze > regression > linear

 Aim:find out whether a relationship exists and determining its magnitude and direction  Two correlation coefficients: ◦ Pearson product moment correlation coefficient –r- interval or ratio scale variables ◦ Spearman rank order correlation coefficient –rho- ordered or ranked data  Assumptions: ◦ Related pairs of scores ◦ Relationship of the variables is linear ◦ Variables are measured at least at the ordinal level ◦ Homoscedasticity – variability of y variable should remain constant at all values of x variable

 Aim:Use after finding there is a correlation to find an appropriate Linear model to predict the results of the DV based on one or more IV’s  Assumptions: ◦ Related pairs of scores ◦ Relationship of the variables is linear ◦ Variables are measured at least at the ordinal level ◦ Homoscedasticity – variability of y variable should remain constant at all values of x variable  Procedure:  Linear Regression ◦ One IV to one DV ◦ ANALYZE>REGRESSION>LINEAR ◦ After placing the appropriate DV and IV, click STATISTICS ◦ Click CONTINUE and then OK to run the analysis

 Comparing the means of a scale (or ordinal) when grouped by a category  Analyze > Compare means ◦ Means – simplest form DV – scale to be compared given the IV – categories ◦ One-sample t-test : test the mean of the variable against a set value. ◦ Independent samples t-test: looking at the difference of two means of the variable given a grouping variable (two- groups only) ◦ Paired-samples t-test: looking at the difference of the means when there is paired data (pre-test vs post-test) ◦ One-way ANOVA: comparing the means of dependent variables (scale or ordinal) given a factor (one IV-category)

 Aim: Testing the differences between the means of two independent samples or groups  Requirements: ◦ Only one independent (grouping) variable IV (ex. Gender) ◦ Only two levels for that IV (ex. Male or Female) ◦ Only one dependent variable (DV)  Assumptions: ◦ Sampling distribution of the difference between the means is normally distributed ◦ Homogeneity of variances – Tested by Levene’s Test for Equality of Variances  Procedure: ◦ ANALYZE>COMPARE MEANS>INDEPENDENT SAMPLES T-TEST ◦ Test variable – DV ◦ Grouping variable – IV ◦ DEFINE GROUPS (need to remember your coding of the IV) ◦ Can also divide a range by using a cut point

 Aim:used in repeated measures or correlated groups designs, each subject is tested twice on the same variable, also matched pairs  Requirements: ◦ Looking at two sets of data – (ex. pre-test vs. post-test) ◦ Two sets of data must be obtained from the same subjects or from two matched groups of subjects  Assumptions: ◦ Sampling distribution of the means is normally distributed ◦ Sampling distribution of the difference scores should be normally distributed  Procedure: ◦ ANALYZE>COMPARE MEANS>PAIRED SAMPLES T-TEST

 Aim: looks at the means from several independent groups, extension of the independent sample t-test  Requirements: ◦ Only one IV ◦ More than two levels for that IV ◦ Only one DV  Assumptions: ◦ The populations that the sample are drawn are normally distributed ◦ Homogeneity of variances ◦ Observations are all independent of one another  Procedure: ANALYZE>COMPARE MEANS>One-Way ANOVA  Dependent List – DV  Factor – IV

 How to deal with questions were the participant can choose several choices.  ANALYZE>MULTIPLE RESPONSE ◦ Define sets ◦ Frequencies ◦ Crosstabs  Example data: survey_sample.sav ◦ Eth1, 2, 3 – multiple response method ◦ News 1, 2, 3 – multiple dichotomy method

Wendiann Sethi AS 202C or SC 128