Data Liberation Training 2001 Complex Files: Pasting and Cutting with SPSS Université de Montréal Wendy Watkins April 24, 2001.

Slides:



Advertisements
Similar presentations
Aggregate Data and Statistics
Advertisements

10. NLTS2 Documentation Overview. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training Modules.
1 Data Mining Classification Techniques: Decision Trees (BUSINESS INTELLIGENCE) Slides prepared by Elizabeth Anglo, DISCS ADMU.
Variables 9/10/2013. Readings Chapter 3 Proposing Explanations, Framing Hypotheses, and Making Comparisons (Pollock) (pp.48-58) Chapter 1 Introduction.
19.Multivariate Analysis Using NLTS2 Data. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Recap of basic SPSS and statistics 5 th - 9 th December 2011, Rome.
SPSS Session 1: Levels of Measurement and Frequency Distributions
17a.Accessing Data: Manipulating Variables in SPSS ®
Process Description and Control Chapter 3. Major Requirements of an Operating System Interleave the execution of several processes to maximize processor.
QM Spring 2002 Business Statistics SPSS: A Summary & Review.
1 Economics 240A Power One. 2 Outline w Course Organization w Course Overview w Resources for Studying.
Introduction to SPSS Descriptive Statistics. Introduction to SPSS Statistics Program for the Social Sciences (SPSS) Commonly used statistical software.
15a.Accessing Data: Frequencies in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Data Management: Quantifying Data & Planning Your Analysis
SPSS 1: An Introduction to the Statistical Package SPSS Suzie Cro MRC Clinical Trials Unit.
SPSS 202: Data Management by SPSS (Workshop) Dr. Daisy Dai Department of Medical Research 1.
Basic Concept of Data Coding Codes, Variables, and File Structures.
Clustering analysis workshop Clustering analysis workshop CITM, Lab 3 18, Oct 2014 Facilitator: Hosam Al-Samarraie, PhD.
Canadian Travel Survey, 1998 Throughout 1998, Statistics Canada interviewed approximately 180,000 Canadians across the country about their trips in Canada,
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
A Survey of Probability Concepts Chapter 5. 2 GOALS 1. Define probability. 2. Explain the terms experiment, event, outcome, permutations, and combinations.
Chapter 1: The What and the Why of Statistics
Disclosure Avoidance: An Overview Irene Wong ACCOLEDS/DLI Training December 8, 2003.
Chi-Square Test of Independence Practice Problem – 1
Multi-Part Requests/ Parent & Child Service Items.
Part 4 Syntax or point-and-click?. British Social Attitudes 1986 Q.114, page 43b.
Plotting Data Exploring Computer Science Lesson 5-7.
The What and the Why of Statistics The Research Process Asking a Research Question The Role of Theory Formulating the Hypotheses –Independent & Dependent.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Splitting Files PowerPoint Prepared by Alfred.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Chapter 1: The What and the Why of Statistics  The Research Process  Asking a Research Question  The Role of Theory  Formulating the Hypotheses  Independent.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 23 Getting Started with Access Essentials 1 Morrison / Wells / Ruffolo.
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
Time Management Personal and Project. Why is it important Time management is directly relevant to Project Management If we cannot manage our own time.
Types of Data (Measurement) AMA Collegiate Marketing Research Certificate Program.
DATABASE SYSTEMS. DATABASE u A filing system for holding data u Contains a set of similar files –Each file contains similar records Each record contains.
Data and Construction of Economic Table Washington Child Support Group December 2007 Session I.
Quantitative research – variables, measurement levels, samples, populations HEM 4112 – Research methods I Martina Vukasovic.
A Survey of Probability Concepts
Foundations of Sociological Inquiry Statistical Analysis.
SPSS- Tutorial The following power-point slides show you how to use some of the features in SPSS. A survey of 20 randomly selected companies asked them.
Use of ICT in Data Management AS Applied ICT. Back to Contents Back to Contents.
1.Introduction to SPSS By: MHM. Nafas At HARDY ATI For HNDT Agriculture.
Analytical Example Using NHIS Data Files John R. Pleis.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
Using SPSS Next. An Introduction SPSS (the Statistical Package for the Social Sciences)
Stretching Your Data Management Skills Chuck Humphrey University of Alberta Atlantic DLI Workshop 2003.
Probability Concepts. 2 GOALS 1. Define probability. 2. Explain the terms experiment, event, outcome, permutations, and combinations. 3. Define the terms.
Virtual Memory Pranav Shah CS147 - Sin Min Lee. Concept of Virtual Memory Purpose of Virtual Memory - to use hard disk as an extension of RAM. Personal.
What statistical tests have we learned so far? Descriptive statistics (chp. 12) –Mean, median, mode –Frequency of each response (frequencies), range, standard.
Scientific data storage: How are computers involved in the following?
Chapter 4 Selected Nonparemetric Techniques: PARAMETRIC VS. NONPARAMETRIC.
Making Every Child Visible: Triangulating Data to Reveal Systems Level Challenges Catherine Callow-Heusser, Ph.D. EndVision Research and Evaluation.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
BUS 362 Marketing Research SPSS Exam Prep Fall’16
PROCESSING DATA.
Session 15 Merging Data in SPSS
Active Learning Lecture Slides
Introduction to SPSS.
Reasoning in Psychology Using Statistics
Process Description and Control
SPSS Propensity Score Matching: An overview
Computing A Variable Mean
CONCEPT TO BE INCLUDED Variables Values Recoding Ratio.
Text Categorization Berlin Chen 2003 Reference:
 .
Presentation transcript:

Data Liberation Training 2001 Complex Files: Pasting and Cutting with SPSS Université de Montréal Wendy Watkins April 24, 2001

Objectives u To be able to recognize types of complex files u To understand the process of matching and adding files u To have enough information to warn users about how to handle complex files

Outline: Concepts u Complex Files u Longitudinal Files u Hierarchical Files u Separate Files u Combined Files u “Split” Files

Outline: Tasks u Pasting and Cutting with SPSS u Pasting u Adding variables u Adding cases u Cutting u Selecting Flag Variables u Weighting

Complex Files Concepts

Longitudinal Files u eg. Kids, NPHS and SLID surveys u Same respondents u Different variables or variable names u Data collected on a regular schedule u Provide a look at what happens over time

Longitudinal Files u Have a common linking variable u Usually an ID number u Are combined through a matching process

Separate Hierarchical Files u eg. GSS10 - Family u Same respondents u Different units of analysis u Allow matching of individuals with attributes u Based on data structure

Separate Hierarchical Files: Structure u GSS 10 - Family u Main file u Respondent 1(R1) u Respondent 2 (R2) …. u Respondent n (Rn) u Child file u Kid 1 (R1) u Kid 2 (R1) u Kid 3 (R3)…. u Kid N (Rn)

Separate Hierarchical Files u Must be certain to put the right child/children with the right respondent u Each respondent has a unique identifier (id number) u Each child has a matching identifier

Combined Hierarchical Files u eg. GSS 3 - Vicimization u Same respondents u Different units of analysis u Everything in one file u Based on data structure

Combined Hierarchical Files: Structure u GSS 3 - Victimization u Respondent 1(R1) u Incident 1 (I1-R1) u Incident 2 (I2-R1) u Respondent 2 (R2) …. u Incident 1 (I1-R2) u Respondent 3 (R3) u Respondent n (Rn) u Incident 1 (I1-Rn) u Incident 2 (I2-Rn) u Incident 3 (I3-Rn)

Combined Hierarchical Files u Must be certain to put the right incident with the right respondent u Also need to be able to separate the units of analyses (individuals and incidents)

Combined Hierarchical Files u Each unit of analysis has a flag and weight u Individuals u Person flag/Person weight u Incidents u Incident flag/Incident weight

“Split” Files u Different respondents u Same variables u Same unit of analysis u Files literally in pieces u Monthly files - Travel Survey u Regional files - HIFE u Based on data-management

“Split” Files u eg. Travel Survey u January file + u February file + …. + u December file = u Annual file u Combine by simply adding u No matching necessary

Complex Files Tasks: Pasting and Cutting with SPSS

Complex Files u NOT like word-processing u Either paste u Add cases u Add variables u Or cut u Select flags and weights

Pasting with SPSS u Longitudinal files u Adding variables u Same respondents u Different variables u Same units of analysis

Pasting with SPSS u Longitudinal files u Must ensure the files are in the same order u Each individual has a unique ID number u Files must be sorted by this ID, before they are matched

Pasting with SPSS u Longitudinal files u Step 1: Sort all files by matching variable and save results

Pasting with SPSS u Longitudinal files u Step 2: Merge sorted files by adding variables.

Pasting with SPSS u Longitudinal files u Step 3: Match files by matching variable and save

Pasting with SPSS u Separate Hierarchical Files u Similar to longitudinal files u Must ensure the files are in the same order u Each record has a unique identifier used for matching

Pasting with SPSS u Separate Hierarchical Files u Must match all attributes to individual u One respondent may have none, one or many u eg. parent / child(ren)

Pasting with SPSS u Separate Hierarchical Files u Sort files by matching variable and save results u Match files by adding variables u main respondent is in TABLE u attributes are in FILE

Pasting with SPSS u Separate Hierarchical Files u Main respondent=keyed table

Pasting with SPSS u “Split” Files u Add cases u Different respondents u Same variables u Same units of analysis u No need to match or sort

Pasting with SPSS u “Split” Files u One-step process; no sorting required

Cutting with SPSS u Combined Hierarchical Files u Same cases u Different units of analysis u Files are already matched u Want to analyze one unit of analysis u Must use: u Flag Variables u Appropriate Weights

Cutting with SPSS u Combined Hierarchical Files u Step 1: Select unit of analysis (eg. person) u Step 2: Select appropriate flag u Step 3: Apply appropriate weight

Cutting with SPSS u Combined Hierarchical Files u Steps 1 and 2

Cutting with SPSS u Combined Hierarchical Files u Step 3

In a Nutshell Pasting u Longitudinal files u Sort and match with FILE u Separate hierarchical files u Sort and match with TABLE u Split files u Add cases Cutting u Combined hierarchical files u SELECT and WEIGHT

A Quick Review from 2000: Levels of Measurement and SPSS Procedures u Nominal variables u Ordinal variables u Frequencies u Crosstabs u Interval variables u Descriptives u Compare means

Levels of Measurement u Categorical Variables u Numbers Denote Categories u Have No Intrinsic Meaning u Nominal u Are unordered u Ordinal u Have an order

Categorical Variables u Nominal Variables u Numbers stand for names u Can’t order them u eg. Marital Status u 1=Single u 2=Married or Common Law u 3=Separated/Divorced/Widowed u Can’t use arithmetic to add, etc.

Categorical Variables u Ordinal Variables u Numbers can be ordered u Spaces between numbers can’t be measured u eg. How well do you like Harris? u 1=Not at all u 2=Less still u 3=Even less than that u Can’t use arithmetic to add, etc.

Continuous Variables u Interval Variables u Numbers stand for what they are u Spaces between numbers are equal u eg. How many children do you have? u Can use arithmetic u eg. What is the average number of children in a family?

Levels of Information u Interval Variables = most information u Ordinal Variables = less information u Nominal Variables = least information

Using Crosstabs u How does ‘x’ relate to ‘y’? u Use with nominal and ordinal measures u eg. Are men or women more likely to use computers at work?

Using Means u Compares the average (mean) between groups u Use when one variable is interval and the other is ordinal or nominal u eg. Who has worked longer at their job, men or women?

Time for a Break!