Introductory Workshop SPSS CSU Fresno March 12, 2010.

Slides:



Advertisements
Similar presentations
Data Analysis using SPSS By Dr. Shaik Shaffi Ahamed Ph. D
Advertisements

Tools of the Trade: An Introduction to SPSS Presenter: Michael Duggan, Suffolk University
Introductory Workshop SPSS CSU Stanislaus February 21, 2014 Ed Nelson – CSU Fresno 1.
Advance to next slide1 Interactive Introduction to SPSS Statistical Software Elizabeth Bigham, Ph.D. California State University San Marcos May
©2004, 2006, 2008 UIW Department of Instructional Technology Meat and Potatoes SPSS Presented by Terence Peak.
Intermediate Workshop SPSS CSU Stanislaus May 2, 2014 Ed Nelson – CSU Fresno 1.
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
SPSS Introductory Workshop Humboldt State University May 6, /6/2011www.ssric.org.
Roper Center for Public Opinion Research Social Science Research and Instructional Council April,
Managing Grades with Excel Viewing Help To view Help 1.Open Excel on your computer. 2.In the top right hand corner of the Excel Screen type in the.
1 An Introduction to IBM SPSS PSY450 Experimental Psychology Dr. Dwight Hennessy.
LSP 121 Week 2 Intro to Statistics and SPSS/PASW.
1 SPSS Recently it has gone through a name change so your icon on your computer may be under a different name (i.e. PASW- Predictive Analytics SoftWare).
LSP 121 Intro to Statistics and SPSS. Statistics One of many definitions: The mathematics of collecting and analyzing data to draw conclusions and make.
A Simple Guide to Using SPSS© for Windows
Introduction to SPSS (For SPSS Version 16.0)
Introduction to SPSS Descriptive Statistics. Introduction to SPSS Statistics Program for the Social Sciences (SPSS) Commonly used statistical software.
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
Introduction to SPSS Short Courses Last created (Feb, 2008) Kentaka Aruga.
Problem 1: Relationship between Two Variables-1 (1)
FEBRUARY, 2013 BY: ABDUL-RAUF A TRAINING WORKSHOP ON STATISTICAL AND PRESENTATIONAL SYSTEM SOFTWARE (SPSS) 18.0 WINDOWS.
Introduction to SPSS (For SPSS Version 16.0)
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
PY550 Research and Statistics Dr. Mary Alberici Central Methodist University.
The Field (California) Poll. What is the Field Poll? The Field Poll was established in 1947 by Mervin Field. An independent non-partisan survey of California.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 9: Quantitative.
9/18/2015Slide 1 The homework problems on comparing central tendency and variability extend the focus central tendency and variability to a comparison.
Introduction to SPSS Edward A. Greenberg, PhD
9/23/2015Slide 1 Published reports of research usually contain a section which describes key characteristics of the sample included in the study. The “key”
Creating a Web Site to Gather Data and Conduct Research.
Advance to next slide1 Set Up Module Section 1. Advance to next slide2 Interactive Introduction to SPSS Statistical Software Elizabeth Bigham, Ph.D. California.
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
Intro to Statistics and SPSS. Mean (average) Median – the middle score (even number of scores or odd number of scores) Percent Rank (percentile) – calculates.
Using SPSS for Windows Part II Jie Chen Ph.D. Phone: /6/20151.
Roper Center for Public Opinion Research Social Science Research and Instructional Council June, 2015.
Inter-University Consortium for Political and Social Research Social Science Research and Instructional Council June, 2015.
Social Science Data Bases CSU Fresno October 30, 2009.
Introductory Workshop SPSS CSU Bakersfield December 9, 2005.
SW318 Social Work Statistics Slide 1 Compare Central Tendency & Variability Group comparison of central tendency? Measurement Level? Badly Skewed? MedianMeanMedian.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Using the SDA on the Web Ed Nelson, CSU Fresno Social Science Research and Instructional Council.
What is SPSS  SPSS is a program software used for statistical analysis.  Statistical Package for Social Sciences.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
WINKS 7 Tutorial 7 – Advanced Topic: Labels and Formats Permission granted for use for instruction and for personal use. © Alan C. Elliott,
Introduction to SPSS Prof. Ramez Bedwani. Outcomes By the end of this lecture, the student will be able to Know definition, uses and types of statistics.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
A Simple Guide to Using SPSS ( Statistical Package for the Social Sciences) for Windows.
Analyses using SPSS version 19
Dr. Engr. Sami ur Rahman Research Methods in Computer Science Lecture: Data Analysis (Introduction to SPSS)
Perform Descriptive Statistics Section 6. Descriptive Statistics Descriptive statistics describe the status of variables. How you describe the status.
June 21, Objectives  Enable the Data Analysis Add-In  Quickly calculate descriptive statistics using the Data Analysis Add-In  Create a histogram.
SW318 Social Work Statistics Slide 1 Percentile Practice Problem (1) This question asks you to use percentile for the variable [marital]. Recall that the.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
1. Tables, Charts, and Graphs Microsoft Word & Excel 2003.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
PSY6010: Statistics, Psychometrics and Research Design Professor Leora Lawton Spring 2007 Wednesdays 7-10 PM Room 204.
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
Intermediate Workshop SPSS CSU Stanislaus May 13, 2016 Ed Nelson – CSU Fresno 1.
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
SOC 305, Southeastern Louisiana University Prof. Robert Martin.
IENG-385 Statistical Methods for Engineers SPSS (Statistical package for social science) LAB # 1 (An Introduction to SPSS)
Introduction to SPSS July 28, :00-4:00 pm 112A Stright Hall
Survey Documentation and Analysis (SDA)
Introduction to SPSS.
By Dr. Madhukar H. Dalvi Nagindas Khandwala college
Using a set-up file to read ASCII data into SPSS
DEPARTMENT OF COMPUTER SCIENCE
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
ICPSR: Resources for Instructors Finding and Analyzing Data 9/26/2012
By A.Arul Xavier Department of mathematics
Presentation transcript:

Introductory Workshop SPSS CSU Fresno March 12, 2010

Social Science Research and Instructional Council (SSRIC) Discipline council for the social sciences made up of representatives from each campus in the CSU. List of campus representatives can be found at Promotes use of data analysis in research and teaching Website is at

Social Science Data Bases The SSRIC helps maintain and promote the use of the social science data bases in the CSU Data bases include: –Inter-university Consortium for Political and Social Research (ICPSR) –The Field Institute –The Roper Center for Public Opinion Research

Agenda for the Introductory SPSS Workshop Overview of SPSS A brief tour Creating you’re your own SPSS data file or opening a data file you got somewhere else Transforming data –Recode –Compute –Select If Univariate analysis –Frequencies –Descriptives –Explore A look ahead at the intermediate workshop – March 15 from 9:00 am to noon

Overview of SPSS SPSS is a statistical package for beginning, intermediate, and advanced data analysis Other statistical packages include SAS and Stata Online statistical packages that don’t require site licenses include SDA

Text – SPSS for Windows Version 16 A Basic Tutorial Authors: Linda Fiddler (Bakersfield), Laura Hecht (Bakersfield), Ed Nelson (Fresno), Elizabeth Nelson (Fresno), Jim Ross (Bakersfield) Available from McGraw-Hill Custom Publishing. Call to order. Request ISBN Available on the web at The data set for this workshop can be downloaded at this site

SPSS Files and Extensions Portable file --.por Data file --.sav Output file --.spo Syntax file --.sps

Opening SPSS Go to start and find SPSS for Windows Click on SPSS 16.0 to open You’ll need to update your SPSS license every year (or your school technician will do it for you)

A Brief Tour of SPSS (see ch. 1 in text) Frequencies -- Analyze/Descriptive Statistics/Frequencies –Select ABANY and move it to the big box and click on OK Crosstabs – Analyze/Descriptive Statistics/Crosstabs –Move ABANY to the “Row” box –Move SEX to the “Column” box –Click on “Cells” and select “Column” percents –Click on OK

A Brief Tour Continued Comparing means – Analyze/Compare Means/Means –Move AGEKDBRN and EDUC in the “Dependent List” box –Move SEX to the “Independent List” box –Click on OK

A Brief Tour Continued Correlations –Analyze/Correlate/Bivariate –Move EDUC, MAEDUC, and PAEDUC into the “Variables” box –Click on OK

A Brief Tour Continued Scatterplots –Graphs/Legacy Dialogs/Scatter/Dot –Click on “Simple Scatter” and then on “Define” –Move EDUC into the “Y axis” box –Move PAEDUC into the “X Axis” box –Click on OK

Creating Your Own SPSS Data File (see ch. 2 in text) Involves creating: –Variable names –Variable labels –Value labels –Missing values

Creating a Data File in SPSS Questions (see p. 11) –Age –Sex –Religious preference –Political views –Type of marriage preferred –Opinion on abortion (7 different questions)

Basic Steps in Creating a Data File Assign identification number to each case Assign each variable a variable name and an extended variable label Each variable will have a set of values. Assign each value an extended value label If a variable has missing information, decide which values will be used as the missing values

Variable Names Traditionally variable names had to be 8 characters or less, start with a letter, and contain no embedded blanks Now they can be longer than 8 characters, but we’ll stick with names of 8 or fewer characters Names can contain some special characters, but not all such characters. So we only use hyphens (-) as special characters in names

Variable Names Age is named AGE Sex is named SEX Religious preference is named REL Political orientation is named C-L Preferred marriage is named MG There are seven abortion variables and they are named ABD, ABN, ABH, ABP, ABR, ABS, ABA

Entering the Information for a Data File You already have SPSS open Click on File/New/Data You should see a blank data screen that looks like a spreadsheet At the bottom are two tabs called “Data View” and “Variable View”. Click on “Variable View”

Defining the Variables Enter the variable names in the “Names” columns in the order you want them Enter the variable labels in the “Label” column Enter the value labels in the “Values” column. To do this you will need to click in the appropriate cell and then click in the little gray box on the right Enter the missing values in the “Missing” column. To do this you will need to click in the appropriate cell and then click in the little gray box on the right

Adding in the Data Now that you have defined the variables, click on the tab at the bottom called “Data View” and enter the data into the appropriate cells. The data are on p. 18 of the text Once you have entered the data, go back and check to make sure you didn’t make any data entry errors Congratulations!! – you created a SPSS data file. You could also enter the data using a spreadsheet like Excel

Saving the Data File Now you want to save your data file Click on “Save as”. The default is to save it as a SPSS data file with.sav as the extension Give it a file name and indicate where you want to save it on your hard drive or on your flashdrive

Opening an Existing File You Got Somewhere Else Often you will want to open a data set that you got from someplace else such as: –ICPSR –Field Institute –Roper Center These files will usually be in the form of a: –SPSS portable file (.por) –SPSS data file (.sav) –Raw data file with a SPSS syntax file (.sps) –Raw data file without a syntax file

Opening a Portable file Click on the open yellow folder to open a new file Change file type to.por Browse to where the portable file you want to open is located and double click on that file

Opening an SPSS Data File Click on the open yellow folder to open a new file Change file type to.sav Browse to where the data file you want to open is located and double click on that file We’re going to use the data set that comes with the text – gss06a.sav. You can download it from the web site that has the text -- Look for the text – “Right click here to download GSS06A.”

Opening a Raw Data File with a SPSS Syntax File Sometimes you will need to open a raw data file (ASCII or text) and there will be an accompanying SPSS syntax file You will need to modify the “File Handle” and “Save Outfile” commands See and xml for more information xml You may need help doing this. Feel free to contact me for help

Opening a Raw Data File Without a SPSS Syntax File If you don’t have a SPSS syntax file you will have to use the codebook that came with the data and create your own syntax file You may need help doing this. Feel free to contact me for help

Choosing Options in SPSS Click on “Edit” and “Options.” General tab -- under “Variable Lists,” check “Display Names” and “Alphabetical.” Output Labels tab -- select “Names and Labels” in the first box, and “Values and Labels” in the second.

What’s Next? Now you know how to create a SPSS data file and how to open an existing SPSS portable or data file Next we’ll learn how to transform variables

Transforming Data (see ch. 3 in text) We can transform variables by recoding which means to combine categories on an existing variable into fewer categories We can transform variables by creating new variables out of existing variables We can select particular cases and analyze only these cases We can do other things like weighting cases that we’re not going to talk about in this workshop.

Recoding Variables Recoding into different variables Recoding into the same variable We recommend recoding into different variables and not using the into same variable option

Recoding into Different Variables Click on “Transform” and then on “Recode” and then on “into different variables” Select the variable you want to recode Start by giving the new variable a new name and assigning a variable label to the new variable. Click on “Change”

Recoding AGE into AGE1 Recode AGE into four categories and give it the name of AGE1 –Click on “Old and New Values” Use “Range” (fourth option down) to recode as follows. Remember to click on “Add” after entering each recode –18 to 29 = 1 –30 to 49 = 2 –50 to 69 = 3 –70 to 89 = 4

Recoding Options When you click on “Old and New Values” there will be seven options For most recoding you will only have to use two of these options –The first option from the top allows you to recode a single value into a new value –The fourth option from the top allows you to recode a range of values from X to Y into a new value

Assign Value Labels to the Four Categories of AGE1 Go into “Variable View” Find the variable AGE1 (should be at the bottom of the list of variables) Click in the “Values” column and then click on the small gray box Enter the value labels Click on OK

Exercises for Recoding INCOME06 is total family income. Do a frequency distribution to see what it looks like before recoding Recode into 4 categories and call this new variable INCOME1. Use the following categories: under $20K, $20K to under $40K, $40K to under $60K, and $60K and over Add the value labels Run a frequency distribution for INCOME1 and check to make sure that you recoded it correctly by comparing the unrecoded and recoded frequency distributions

More Exercises for Recoding Now recode INCOME06 again and call the new variable INCOME2 This time use 8 categories: under $10K, $10K to under $20K, $20K to under $30K, $30K to under $40K, $40K to under $50K, $50K to under $60K, $60K to under $75K, and $75K and over Add the value labels Run a frequency distribution for INCOME2 and check to make sure that you recoded it correctly by comparing the unrecoded and recoded frequency distributions

Creating a New Variable with Compute Let’s create a new variable and call it ABORTION which is the sum of the seven abortion variables Click on “Transform” and then on “Compute” Enter the new variable name (ABORTION) into the target variable box Enter the formula for this new variable into the “Numeric Expression” box Click on OK

Dealing with Missing Data If there is missing data for any of these variables (ABANY to ABSINGLE), the new variable ABORTION will be assigned a system missing value What do we do if we want to allow no more than two missing values? Let’s compute the mean value and divide the sum of the abortion values by the number of cases with valid information But let’s allow only two variables with missing values

Dealing with Missing Data Continued Click on “Reset” to erase what is currently in the “Compute Variable” box Click on “Statistical” in the “Function Group” box Then double click on “Mean” in the “Function and Special Variables” box In the “Target Variable” box, enter the name of the new variable. Let’s call it ABORMEAN In the “Numeric Expression” box, you should see “MEAN(?,?)”

Dealing with Missing Data Continued Replace the “?,?” with the variables you want to include so it reads “MEAN (abany,abdefect,abhlth,abnomore,abpoor, abrape,absingle)” Insert.5 following MEAN so it reads “Mean.5”. This indicates that you want to have at least five variables with valid information Click on OK

Exercises for Compute There are five variables that measure tolerance for letting someone speak in your community who may have different views than your own: SPKATH, SPKCOM, SPKHOMO, SPKMIL, and SPKRAC For each of these variables, 1 means they would allow such a person to speak and 2 means they would not allow it

Exercises for Compute Continued Create a new variable (call it SPEAK) which is the sum of these five variables Run a frequency distribution for SPEAK What do the values in this new variable tell us?

More Exercises for Compute Now let’s create a variable called SPKMEAN which allows for one of the five variables (SPKATH to SPKRAC) to be missing What happens if there is more than one variable with a missing value? How does SPSS calculate the new variable if there is only one variable with a missing value?

Using Select Cases to Select Specific Cases for Analysis Let’s select only Protestants for further analysis Click on “Data” and then on “Select Cases” Click on “If condition is satisfied” and then on the “If” button below it Select the variable RELIG and move it into the box on the right In this box, enter the expression “relig = 1” Click on “Continue” and on OK

Using Select Cases Continued Now lets select Protestants who are under 35 years age old Enter the expression “relig = 1” as you did before. Use & for and. Enter “age < 35” so the expression reads “relig = 1 & age < 35” Click on OK

Exercises for Select If Select all males (1 on the variable SEX) and do a frequency distribution for the variable FEAR (afraid to walk alone at night in the neighborhood) Now select all females (2 on the variable SEX) and fun a frequency distribution for FEAR Are males or females more fearful of walking alone at night?

More Exercises for Select If Now let’s select males under age 35 and run a frequency distribution for FEAR Do the same thing for females under 35 Are males or females under 35 more fearful of walking alone at night?

Important Note on Using Select Cases When you are finished using “Select Cases” and want to revert to using all the cases be sure to click on Data/Select Cases and select “All cases”. Then click on OK If you don’t do this, you will continue to use only those cases you last selected

Univariate Analysis Now that we know how to open existing files and transform variables, we’re ready to begin analyzing data Univariate analysis refers to analyzing variables one-at-a-time

Types of Univariate Analysis Procedures (see ch. 4 in text) Frequencies Descriptives Explore

Frequencies Go to Analyze/Descriptive Statistics/Frequencies Select ABANY and AGE and click on OK

Bar Charts Bar charts – click on Analyze/Descriptive Statistics/Frequencies Click on “Charts” Select “Bar Charts” and click on “Continue” and then on OK Do you think bar charts are appropriate for both ABANY and AGE?

Histograms Click on click on Analyze/Descriptive Statistics/Frequencies Click on “Charts” Select “Histograms” and click on “Continue” and then on OK Do you think histograms are appropriate for both ABANY and AGE? Which do you think is the most appropriate chart (bar chart or histogram) for ABANY and for AGE?

Statistics Click on Analyze/Descriptive Statistics/Frequencies Click on “Statistics” Select the statistics you want and click on “Continue” and then on OK

Exercises for Frequencies There are seven variables dealing with abortion: ABANY, ABDEFECT, ABHLTH ABNOMORE, ABPOOR, ABRAPE, and ABSINGLE Run a frequency distribution for each variable Get a bar chart for each variable Compare and contrast how people answered these seven questions

More Exercises for Frequencies Run the frequency distribution for AGE Get a histogram for AGE Compute the following statistics for AGE: –Mean –Median –Standard deviation –Percentiles – 25 th, 50 th, and 75 th

Descriptives Click on Analyze/Descriptive Statistics/Descriptives Select AGE and EDUC Click on “Options” and select the statistics you want and then click on “Continue” and OK

Exercises for Descriptives Use Descriptives to compute the following statistics for AGE –Mean –Standard deviation –Variance –Skewness –Kurtosis

More Exercises for Descriptives Use Descriptives to compute the mean for EDUC, MAEDUC, PAEDUC Who has the most education – respondents or their parents? Who has the most education – mothers or fathers?

Explore Click on Analyze/Descriptive Statistics/Explore Select EDUC and put it in the “Dependent List” In the Display box on the lower left, click on “Both” Click on OK

Selecting Statistics for Explore Click on Analyze/Descriptive Statistics/Explore Click on “Statistics” and select the statistics you want Click on “Continue” and then OK

Selecting Plots for Explore Click on “Plots” Select the plots you want Click on “Continue” and then OK

Exercises for Explore Using Explore to get the following statistics and plots for the variables EDUC, PAEDUC, and MAEDUC –Descriptives –Outliers –Stem-and-leaf plot –Histogram –Boxplot First select “Factor levels together” and run it Then select “Dependents together” and run it again What’s the difference?

Intermediate Workshop for SPSS In the next workshop we’ll look at different types of statistical analysis you can do in SPSS –Cross tabulations (ch. 5) –Comparing means (ch. 6) –Correlation and regression (ch. 7) –Multivariate analysis (ch. 8) Cross tabulations Multiple regression –Presenting your data – charts and tables (ch. 9)