PSY6010: Statistics, Psychometrics and Research Design Professor Leora Lawton Spring 2007 Wednesdays 7-10 PM Room 204.

Slides:



Advertisements
Similar presentations
Selecting a Data Analysis Technique: The First Steps
Advertisements

Bivariate Analyses Categorical Variables Examining Relationship between two variables.
Sociology 680 Multivariate Analysis Logistic Regression.
Tools of the Trade: An Introduction to SPSS Presenter: Michael Duggan, Suffolk University
Introduction to Research Design Statlab Workshop, Fall 2010 Jeremy Green Nancy Hite.
Bivariate Analysis Cross-tabulation and chi-square.
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
The World’s Fastest Crash Course in Statistics Or, What You Need to Know to Answer Your Research Question 13 November 2006.
By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
1 An Introduction to IBM SPSS PSY450 Experimental Psychology Dr. Dwight Hennessy.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Data analysis Incorporating slides from IS208 (© Yale Braunstein) to show you how 208 and 214 are telling you many of the the same things; and how to use.
A Simple Guide to Using SPSS© for Windows
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Introduction to SPSS Descriptive Statistics. Introduction to SPSS Statistics Program for the Social Sciences (SPSS) Commonly used statistical software.
Multiple Regression – Assumptions and Outliers
15a.Accessing Data: Frequencies in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Multiple Regression – Basic Relationships
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Data Management: Quantifying Data & Planning Your Analysis
Crosstabs. When to Use Crosstabs as a Bivariate Data Analysis Technique For examining the relationship of two CATEGORIC variables  For example, do men.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
SW388R7 Data Analysis & Computers II Slide 1 Analyzing Missing Data Introduction Problems Using Scripts.
Leedy and Ormrod Ch. 11 Gray Ch. 14
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Selecting the Correct Statistical Test
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
Questionnaire Development: SPSS and Reliability Personality Lab October 8, 2010.
Interactions POL 242 Renan Levine March 13/15, 2007.
Sociology 680 Multivariate Analysis: Analysis of Variance.
 Muhamad Jantan & T. Ramayah School of Management, Universiti Sains Malaysia Data Analysis Using SPSS.
Recap of data analysis and procedures Food Security Indicators Training Bangkok January 2009.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
A Simple Guide to Using SPSS ( Statistical Package for the Social Sciences) for Windows.
Review. POL 242 – Strong Correlation. Positive or Negative?
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
» So, I’ve got all this data…what now? » Data screening – important to check for errors, assumptions, and outliers. » What’s the most important? ˃Depends.
One-Way Analysis of Covariance (ANCOVA)
CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH (CSSCR) UNIVERSITY OF WASHINGTON SPRING 2013 CONSULTANT: SHIN HAENG LEE Introduction to SPSS.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
1. Tables, Charts, and Graphs Microsoft Word & Excel 2003.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
1 PSY6010: Statistics, Psychometrics and Research Design Professor Leora Lawton Spring 2007 Wednesdays 7-10 PM Room 204.
Data Management Research Methods Professional Development Institute December 4, 2015.
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
SOC 305, Southeastern Louisiana University Prof. Robert Martin.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
PSY 325 AID Education Expert/psy325aid.com FOR MORE CLASSES VISIT
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Data Entry, Coding & Cleaning SPSS Training Thomas Joshua, MS July, 2008.
Lecture note on statistics, data analysis planning – week 14 Elspeth Slayter, M.S.W., Ph.D.
Introduction to SPSS July 28, :00-4:00 pm 112A Stright Hall
BINARY LOGISTIC REGRESSION
Bivariate & Multivariate Regression Analysis
Dr. Siti Nor Binti Yaacob
Just the basics: Learning about the essential steps to do some simple things in SPSS Larkin Lamarche.
Microsoft Office Illustrated
Dr. Siti Nor Binti Yaacob
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
Hypothesis Testing Part 2: Categorical variables
Multiple Regression – Split Sample Validation
Individual Assignment 6
Presentation transcript:

PSY6010: Statistics, Psychometrics and Research Design Professor Leora Lawton Spring 2007 Wednesdays 7-10 PM Room 204

PSY6010 Intro Purpose –Homework is due every week. Cut&Paste SPSS output into the word document. –We will cover Correlative and predictive statistics Group membership predictors and descriptors. –Implications: you must know your research question and therefore the kind of answer necessary. –Anyone can play around with SPSS – a trained researcher will not produce garbage. –To use quantitative data to test your hypotheses in the best possible way available, given your constraints of time and money. Select the correct method Use it correctly Interpret the results

Working with Data Finalize your research question and study objectives: –What are your ‘success criteria’? –Then, to use quantitative data to test your hypotheses in the best possible way available, given your constraints of time and money, you: Select the correct method Use it correctly Interpret the results

Working with Data Steps to follow –Identify your dependent variable for operationalizing your outcome variable –Identify your potential independent variables –Run frequency with descriptives –Test relationship between DV and IVs with bivariate tests. –Try out the multivariate method, if relevant. OLS regression Binomial, polynomial or ordinal logistic regression Discriminant Anova/Manova Factor analysis Cluster analysis Etc.

Frequencies – what to look for Distribution – Is it skewed? Which direction? What is the impact on your analysis choice? Are there outliers? Is the current coding useful? What are the means and measures of variance? How many missings are there? How do these help you understand the data?

Example Suppose you’ve been asked by a family social services agency to understand how to best handle the suggested legislation for banning spanking in California. –What’s your research question? –Look in GSS93.sav …what’s a good DV for this study? –What Independent variables make sense for this study? –Would other variables help? Is that an insurmountable problem?

Frequencies 33.5% missing 73.4% agree or strongly agree With no ‘neutral’ position or midpoint, everyone takes a stand The bigger the skew, the more it’s lumped at the bottom. 0 is no skew, - values are lumped at the top In this data, missing values have already been defined

Bivariate tests - crosstabs Make the ‘row’ your DV and the ‘column’ the IV. Then select column cell statistics Note that cell counts get very small in some subgroups Chi-Square option in statistics shows that the distribution is not random.

Bivariate tests – compare means Note that lower value = stronger agreement. Would be good to reverse code for logic. Difference is statistically significant: Blacks support spanking more so than do whites and others.

Bivariate tests – Correlations The more children one has, the less likely they are to disagree (or the more likely they are to think spanking is okay) And this is why theory matters: What does ‘number of children’ operationalize? What else, that’s related to number of children, could also affect this attitude?

Preliminary multivariate model The model does not predict much as is as evidenced by the very small R 2 But of those predictors, race and number of children are significant, in the ways expected by the bivariate results.

Working with data sets When using quantitative data, you need to be able to prepare the data to be usable. –Load data set into spss –Label spss file if no syntax file is provided. –Run frequencies to investigate missings, outliers. –Define missing values to be excluded from analysis (either always or just for some specific analyses) –Recode variables from alpha to numeric. –Recode variables from categorical to dichotomous. –Recode missings to mean value. –Recode values to real midpoint values. –Special formats

Working with data sets Uploading data from excel. –File, Open Data. Set Files of Type to.xls. Locate file in folder, Make sure you select the correct worksheet (older versions only read one worksheet). Click on Open. –If you need to, add the labels. This can make it easier for others to work with a data set. –Check the correct variable type (string, numeric or special). –Add values by clicking on the Values tab, and enter the numeric value in Value, and the label in Value Label. Click Add after each value/value label added, then when finished, click OK.

Cleaning up 1 Recode string to numeric (very helpful when there are many many values). Transform – Autorecode – Give it a variable name, Click on Okay. Recode to reverse values. If 5 = poor and 1 = excellent, it’s too hard to think about. Transform – Autorecode. Again, give it a variable name, and then click on Recode from Highest value. Oops, but now the 6 (no answer) is given the low value. So first set 6 as missing, then do the auto-recode. Set as missing.. Click on Missing. Add the discrete value. Click on okay. Now try the previous autorecode again. Another way to recode missing: Recode – Into Same value – Old & New Values. Put ‘3’ in old value and ‘system missing’ for new value. Click on Add, then Continue, then okay. For recode a categorical to dichotomous (necessary for OLS regression), use Transform - Recode into Different Variable. Give it a new name and a new label. Click on Old & New Values. Old value for focal category is set to 1, all others to 0. Add new values, click on Continue, then OK. You can select cases so it’s based correctly. In this example, exclude those under 18 years old by clicking on If… (optional case selection) and then Include if case satisfied condition, and then click on the variable, and identify the value or values you want to select for this new variable).

Cleaning up 2 Identify and transform outliers. (go to other.xls). Recode values to meaningful values. Income, age ranges, are common variables requiring this transformation. –Recode into different value. Recode each value to the midpoint value of the range. For the low range, select 10% under, for the high range, it depends on how much skew there is, but 10-20% is appropriate. –On the original variable, calculate the mean. You can either reset missings to the recoded value closest to it, or impute a value. –Transforming to deal with heteroscedasticity. Logging income is a standard one.

Character of Data Look for linearity, curvilinearity, multicollinearity, singularity. –Conduct bivariate analyses between your DV and your IVs. –If the DV is a continuous, or at least nominal variable, then you can compare means and look at the t-test or, anova. If it’s dichotomous, do crosstabs and look at the chi-square. –Curvilinear relationships will require a transformation of the IV to something more usable. –A correlation analysis will help you identify collinearity. Multicollinearity requires either dropping variables, or transforming the variables into an index, or factor analysis.