17a.Accessing Data: Manipulating Variables in SPSS ®

Slides:



Advertisements
Similar presentations
4. NLTS2 Data Sources: Parent and Youth Surveys. 4. Sources: Parent and Youth Surveys Prerequisites Recommended modules to complete before viewing this.
Advertisements

14a. Accessing Data Files in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module 1. Introduction to the NLTS2 Training.
Prerequisites Recommended modules to complete before viewing this module 1. Introduction to the NLTS2 Training Modules 2. NLTS2 Study Overview 3. NLTS2.
12. NLTS2 Documentation: Quick References. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
13.Analysis Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data.
11. NLTS2 Documentation: Data Dictionaries. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2.
10. NLTS2 Documentation Overview. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training Modules.
16b. Accessing Data: Means in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
SW388R7 Data Analysis & Computers II Slide 1 Solving Problems in SPSS The data sets Options for variable lists in statistical procedures Options for variable.
9. Weighting and Weighted Standard Errors. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
©2004, 2006, 2008 UIW Department of Instructional Technology Meat and Potatoes SPSS Presented by Terence Peak.
19.Multivariate Analysis Using NLTS2 Data. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Bivariate Analysis Cross-tabulation and chi-square.
7.Implications for Analysis: Parent/Youth Survey Data.
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
Detecting univariate outliers Detecting multivariate outliers
A Simple Guide to Using SPSS© for Windows
Introduction to SPSS Descriptive Statistics. Introduction to SPSS Statistics Program for the Social Sciences (SPSS) Commonly used statistical software.
15a.Accessing Data: Frequencies in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
15b. Accessing Data: Frequencies in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
Generating Random Samples SAS, EXCEL, JMP, SPSS. Population of Data  Sample Data should be in a dataset where each row represents an individual unit,
SW388R7 Data Analysis & Computers II Slide 1 Analyzing Missing Data Introduction Problems Using Scripts.
Sociology 690 SPSS Introduction. Using SPSS The Statistical Package for the Social Sciences (SPSS) started at Stanford University in the late 1960’s.
Introduction to SPSS (For SPSS Version 16.0)
EASY TEAM MANAGER By Dave Abineri EASYWARE: PO Box 231, Milford, OHIO (Cincinnati) Phone: (513) Use UP arrow to move to the NEXT slide Use.
Data Entry Data Management Basic Descriptive Statistics Jamie Lynn Marincic Leanne Hicks Survey, Statistics, and Psychometrics Core Facility (SSP) July.
Advanced SAGE Formative Adding Your Own Resources Using Common Assessments Creating Educator Groups.
Using the Frequencies Procedure in SPSS 9.0 for Windows © by Julia Hartman © Copyright 2000, Julia Hartman.
6. Implications for Analysis: Data Content. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2.
Data Analysis Using SPSS
8.Implications for Analysis: School Survey, Student Assessment, and Transcript Data.
U-Tab™ Tutorial - Creating New Variables Overview © 2004 Weeks Computing Services. All Rights Reserved. If the variables included in your U-Tab file are.
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
2. NLTS2 Study Overview. 1 Prerequisites Recommended module to complete before viewing this module  1. Introduction to the NLTS2 Training Modules.
Using SPSS for Windows Part II Jie Chen Ph.D. Phone: /6/20151.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
1.Introduction to the NLTS2 Training Modules Jose Blackorby Renee Cameto Camille Marder Christopher Sanford Kathryn Valdes James Van Campen SRI International.
Chapter One An Introduction to Visual Basic 2010 Programming with Microsoft Visual Basic th Edition.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Entering Data Manually PowerPoint Prepared by.
Example SPSS Basic Medical Statistics Course October 2010 Wilma Heemsbergen.
Chapter 17 Creating a Database.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Recoding Variables PowerPoint Prepared by Alfred.
WINKS 7 Tutorial 7 – Advanced Topic: Labels and Formats Permission granted for use for instruction and for personal use. © Alan C. Elliott,
Developed By Information Technology Services University Of Saskatchewan.
Reports and Learning Resources Module 5 1. SLMS Primary Administrator Training Module 5: Reports and Learning Resources 2.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
SW318 Social Work Statistics Slide 1 Frequency: Nominal Variable Practice Problem This question asks the frequency of widowed respondents of the survey.
Dr. Engr. Sami ur Rahman Research Methods in Computer Science Lecture: Data Analysis (Introduction to SPSS)
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
SPSS- Tutorial The following power-point slides show you how to use some of the features in SPSS. A survey of 20 randomly selected companies asked them.
SW318 Social Work Statistics Slide 1 Percentile Practice Problem (1) This question asks you to use percentile for the variable [marital]. Recall that the.
CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH (CSSCR) UNIVERSITY OF WASHINGTON SPRING 2013 CONSULTANT: SHIN HAENG LEE Introduction to SPSS.
SW388R7 Data Analysis & Computers II Slide 1 Detecting Outliers Detecting univariate outliers Detecting multivariate outliers.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
PSY6010: Statistics, Psychometrics and Research Design Professor Leora Lawton Spring 2007 Wednesdays 7-10 PM Room 204.
14b. Accessing Data Files in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
SW388R6 Data Analysis and Computers I Slide 1 Comparing Central Tendency and Variability across Groups Impact of Missing Data on Group Comparisons Sample.
16a. Accessing Data: Means in SPSS ®. 16a. Accessing Data: Means in SSPS ® 1 Prerequisites Recommended modules to complete before viewing this module.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Selecting Cases PowerPoint Prepared by Alfred.
Using SPSS Next. An Introduction SPSS (the Statistical Package for the Social Sciences)
Sociology 680 SPSS Introduction. Using SPSS The Statistical Package for the Social Sciences (SPSS) started at Stanford University in the late 1960’s.
17b.Accessing Data: Manipulating Variables in SAS ®
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
Entering Data in SPSS Open SPSS. Select the radio button beside ‘type in data’. Click OK. At the bottom of the SPSS spreadsheet, select variable view.
SOC 305, Southeastern Louisiana University Prof. Robert Martin.
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
TRAINING OF FOCAL POINTS ON THE CountrySTAT/FENIX SYSTEM
Presentation transcript:

17a.Accessing Data: Manipulating Variables in SPSS ®

17a. Accessing Data: Manipulating Variables in SSPS ® 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training Modules  2. NLTS2 Study Overview  3. NLTS2 Study Design and Sampling  NLTS2 Data Sources, either 4. Parent and Youth Surveys or 5. School Surveys, Student Assessments, and Transcripts  NLTS2 Documentation 10. Overview 11. Data Dictionaries 12. Quick References

17a. Accessing Data: Manipulating Variables in SSPS ® 2 Prerequisites Recommended modules to complete before viewing this module (cont’d)  13. Analysis Example: Descriptive/Comparative Using Longitudinal Data  Accessing Data 14a. Files in SPSS 15a. Frequencies in SPSS

17a. Accessing Data: Manipulating Variables in SSPS ® 3 Overview  Purpose  Modifying existing variables  Creating new variables  Summary  Closing  Important information

17a. Accessing Data: Manipulating Variables in SSPS ® 4 NLTS2 restricted-use data NLTS2 data are restricted. Data used in these presentations are from a randomly selected subset of the restricted-use NLTS2 data. Results in these presentations cannot be replicated with the NLTS2 data licensed by NCES.

17a. Accessing Data: Manipulating Variables in SSPS ® 5 Purpose Learn to  Modify an existing variable  Create a new variable  Join/combine data from different sources

17a. Accessing Data: Manipulating Variables in SSPS ® 6 Modifying existing variables How to modify a variable. It is necessary to create a new variable in SPSS to  Collapse categories  Break a continuous variable into categories  Recode a variable. Note about created variables in the NLTS2 database  Our analyses were done in SAS, and this recoding step is usually not necessary in SAS because of the external formats feature.  Collapsed or recategorized variables do not necessarily exist in SAS or SPSS files even if these items appear in published tables.  There are many created variables in the NLTS2 database, but most of them are not simply collapsed versions of an existing variable. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 7 Modifying existing variables Syntax to recode into collapsed categories RECODE np1B2a (MISSING=SYSMIS) (Lowest thru 1=1) (2 thru 5=2) (6 thru 10=3) (11 thru Highest=4) INTO np1B2a_Cat. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 8 Modifying existing variables Syntax to assign a variable label to the new variable *assign variable label to new categorical variable. VARIABLE LABELS np1B2a_Cat '(np1B2a_cat) Age of youth when diagnosed categorized'. EXECUTE. Syntax to assign value labels * assign value labels to new categorical variable. VALUE LABELS np1B2a_Cat 1 "(1) 1 or younger" 2 "(2) 2 to 5 years of age" 3 "(3) 6 to 10 years of age" 4 "(4) 11 or older". These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 9 Modifying existing variables Menu  Transform: Recode into Different Variables  Select the variable to be recoded from the list and click the right-facing arrow.  Give the new variable a name in the box under “Output Variable.”  Assign a label to the new variable in the “Label” box under “Output Variable.”  Click “Change.”  Click on the box marked “Old and New Values,” and a new box pops up.  In the new box, under “Old Values” click the radio button “System or User-missing,” click “System Missing” under “New Values,” and click “Add” next to “Old -- >New.” These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 10 Modifying existing variables Menu (cont’d)  For each old to new value(s) Under “Old Values,” click a radio button by an actual value or range of values box. Designate what the old values are, either actual or range of values, in the appropriate box. Assign a new code under “New Values” and click “Add.”  When finished with values, click “Continue” to return to the first box.  In the original box, click “OK” or “Paste” to generate code. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 11 Modifying existing variables Look at results. New variable should appear at bottom of “Variable View.” Specify formats so values are meaningful.  In variable view, click on the cell in the “Values” column to bring up a new box.  Enter a value in the “Value” box, a label for that value in the “Label” box, and click “Add.”  Do this for every value. Look at frequency distribution.  Useful to look at a crosstab of the original by the new variable. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 12 Modifying existing variables These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 13 Modifying existing variables These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 14 Modifying existing variables: Example Modifying a variable  Open Wave 3 parent/youth interview file.  Collapse np3NbrProbs into new variable  Remember to Label variable Add value formats Account for missing values Paste your code. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 15 Modifying existing variables: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 16 Modifying existing variables: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 17 Creating new variables How to create a new variable. The values in the new variable can be the results of calculations, assignments, or logic. A new variable can be created from an existing variable or from multiple variables, including variables from other sources and/or waves.  Variables from other sources/waves must be added to the active data file before the new variable is created. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 18 Creating new variables Be aware of any coding differences between the variables when combining values. Decide what to do with missing values. Example: Create a variable using parent interview data from Waves 1, 2, and 3.  Has a student been suspended and/or expelled in any wave? These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 19 Creating new variables Syntax IF (np1D7h=0 and np2D5d=0 and np3D5d=0 and np4D5d=0) np4D5d_ever=0. IF (np1D7h=1 or np2D5d=1 or np3D5d=1 or np4D5d=1) np4D5d_ever = 1. IF (np1D7h=1 and np2D5d=1 and np3D5d=1 and np4D5d=1) np4D5d_ever = 2. IF (MISSING(np1D7h) or MISSING(np2D5d) or MISSING(np3D5d) or MISSING(np4D5d)) np4D5d_ever = EXECUTE. This code will result in a variable that  Requires a value for every wave  Is 0 if never suspended/expelled  Is 1 if suspended/expelled in any wave  Is 2 if suspend/expelled in all three waves. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 20 Creating new variables These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 21 Creating new variables Menu  Transform: Compute  Enter a variable name under “Target Variable.”  Click “Type & Label” and assign a label.  If applicable, find and select the source variable(s) and click the right-facing arrow to move the variable name into the “Numeric Expression” box.  Enter functions/operations from the keypad boxes or select from the list of functions.  For logical conditions, click “If…” and build the condition in the pop-up box.  Click “OK” or “Paste.”  For multiple conditions (i.e., if-then-else), repeat all steps. Specify conditions in order of overriding conditions. If true, each subsequent condition will override the previous condition. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 22 Creating new variables: Example Creating a new variable  Open the Wave 4 parent/youth interview file.  Bring in np1F7 from Wave 1, np2P8_J4 from Wave 2, and np3P8_J4 from Wave 3 interview files.  Create a new variable np4P8_J4_ever (ever done volunteer or community service).  Initialize value to “0” if any value in np1F7, np2P8_J4, np3P8_J4, or np4P8_J4 is “0.”  Reassign to “1” if any value in np1F7, np2P8_J4, np3P8_J4, or np4P8_J4 is “1.” These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 23 Creating new variables: Example Creating a new variable (cont’d)  Assign variable label and value labels.  Run a frequency of np4P8_J4_ever.  Run a crosstabulation of np4P8_J4_ever by np4P8_J4. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 24 Creating new variables: Example Code for example IF (np1F7=0 or np2P8_J4 = 0 or np3P8_J4=0 or np4P8_J4=0) np4P8_J4_ever = 0. IF (np1F7=1 or np2P8_J4=1 or np3P8_J4=1 or np4P8_J4=1) np4P8_J4_ever = 1. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 25 Creating new variables: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 26 Creating new variables: Example These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 27 Summary Be aware of differences in coding between similar variables when building composite variables. Missing values must be considered.  Know how missing values are being coded, particularly when using more than one variable to create another.  Joined data are more likely to have missing values. Weights  Generally, the analysis weight should be the weight from the smallest sample when combining data.  When filling in values for a variable in an active file with values from another, it is OK to use the weight in the active file. Strongly recommended: Paste your code when creating variables. These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 28 Summary Know the values, mind the missing, and watch your weights! These results cannot be replicated with full dataset; all output in modules generated with a random subset of the full data.

17a. Accessing Data: Manipulating Variables in SSPS ® 29 Closing Topics discussed in this module  Modifying existing variables  Creating new variables  Summary Next module  18a. Complex Samples Procedures in SPSS

17a. Accessing Data: Manipulating Variables in SSPS ® 30 Important information  NLTS2 website contains reports, data tables, and other project-related information  Information about obtaining the NLTS2 database and documentation can be found on the NCES website  General information about restricted data licenses can be found on the NCES website  address: