Mannheim Research Institute for the Economics of Aging www.mea.uni-mannheim.de SHARE data versions & IDs Stephanie Stuck MEA Antwerpen February 2008.

Slides:



Advertisements
Similar presentations
ARK is a resource dedicated to making social and political information on Northern Ireland available to all.
Advertisements

Week 2 DUE This Week: Safety Form and Model Release DUE Next Week: Project Timelines and Website Notebooks Lab Access SharePoint Usage Subversion Software.
Multiple Indicator Cluster Surveys MICS3 Regional Training Workshop Household Information Panel.
MICS4 Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Data Entry Editing.
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Household Questionnaire: Household Information Panel and Household.
MICS4 Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop MICS Dictionary and Forms.
The English Longitudinal Study of Ageing (ELSA) Data & Documentation 2008 Jibby Medina NatCen.
The English Longitudinal Study of Ageing (ELSA) 2009 Update Natasha Wood.
CTR Form B: Testing Information. Overview: Form B collects information about the testing and counseling services provided to a client. It is intended.
Inability to Create a Separate Application WHEN DENIED AND TOLD SEPARATE APPLICATION NEEDED.
Multiple Indicator Cluster Surveys Data Processing Workshop CAPI Menu System MICS Data Processing Workshop.
CASE STUDY: IMPLEMENTING SDMX EXCHANGE WITH MEMBER COUNTRIES IN SHORT-TERM ECONOMIC STATISTICS (STES)
1 Formatting Your Survey. What should a format look like? For any questionnaire, whether small or big, the important things are: a.Skip patterns b.Options.
 Fundamentals of Web Design.  Describe the history and theory of XHTML  Understand the rules for creating valid XHTML documents  Apply a DTD to an.
The English Longitudinal Study of Ageing (ELSA) Sample design & response Shaun Scholes NatCen.
Multiple Indicator Cluster Surveys Data Processing Workshop MICS Dictionary and Forms MICS Data Processing Workshop.
M. Fall, JP. Lorgnet et alii 26/02/2010 Individual Dynamics of Poverty, a study tackling changes in poverty in France via the SILC survey.
Chapter 12 Sample Surveys
FileSecure Implementation Training Patch Management Version 1.1.
Release shipping from today! Christian Hunkler Agnes Orban Stephanie Stuck with support by Martina Brandt Dimitris Christelis Danilo Cavapozzi Giuseppe.
What Are File Maintenance Techniques and Validation Techniques?
Chapter Sixteen Starting the Data Analysis Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Copyright © 2005, SAS Institute Inc. All rights reserved. SAS ABM Survey Kit A Sample Usage Scenario: Collecting Group Assignment Values.
Universal Education Foundation. Methodology For Conducting The Voice of Children Survey By Chief Executive Officer Alpha International For Research Polling.
SHARE data cleaning meeting Frankfurt – December, 6 Some suggestions from the Italian experience Paccagnella Omar Omar Paccagnella Data cleaning meeting.
Data management in the field Ari Haukijärvi 2nd EHES training seminar.
February 1, 2011 Workshop: Persistent Identifiers for the Social Sciences 1 SOEP and DOI Requirements and Challenges Jan Goebel.
Mannheim Research Institute for the Economics of Aging SHARE IDs Stephanie Stuck MEA Frankfurt December 6 th.
Mannheim Research Institute for the Economics of Aging Data Cleaning Process Patrick Bartels MEA Frankfurt, December 6 th.
PALS: Pregnancy & Lifestyle Study Dr Judy Ford (now UniSA) & Team (at QEH)
2006 ICE meeting Using Linked Data to Examine Injury and Disability Beth Rasch and Chris Cox National Center for Health Statistics.
Copyright 2010, The World Bank Group. All Rights Reserved. Data Processing and Tabulation, Part I.
Key Data Management Tasks in Stata
Harmonisation across countries in SHARE Workshop on Harmonisation of Social Survey Data for Cross-National Comparison Prague 19.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
Laura Crespo SHARE Meeting on Data Cleaning The Analysis of Interviewers’ Remarks Laura Crespo Spanish Team CEMFI Frankfurt December 6, 2007.
Data cleaning workshop Berlin, 8-10 June 2009 The Analysis of Interviewers‘ remarks Laura Crespo Spanish team CEMFI.
Jibby Medina & Kelly Ward CLEANING THE ELSA LIFE HISTORY INTERVIEW.
AADAPT Workshop South Asia Goa, December 17-21, 2009 Maria Isabel Beltran 1.
1 Data Linkage for Educational Research Royal Statistical Society March 19th 2007 Andrew Jenkins and Rosalind Levačić Institute of Education, University.
Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan.
ELSA ELSA datasets and documentation available from the archive or by special arrangement Kate Cox National Centre for Social.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Data structure for a discrete-time event history analysis Jane E. Miller, PhD.
Mannheim Research Institute for the Economics of Aging SHARE Data Cleaning Stephanie Stuck MEA Vienna November 5/6 th.
Essex Dependent Interviewing Workshop 17/09/2004 British Household Panel Survey.
A Simple Guide to Using SPSS ( Statistical Package for the Social Sciences) for Windows.
Working with EU-SILC: data files, variables and data management Practical computing session I – Part 1 Heike Wirth GESIS – Leibniz Institut für Sozialwissenschaften.
Project? Microdata? Say what? TRY Conference May 5, 2008 Suzette Giles, Ryerson University Laine Ruus, University of Toronto.
Gateway to Global Aging Data September 17 th, 2014 APRU Data Workshop Drystan Phillips.
Peter Granda Archival Assistant Director / Data Archives and Data Producers: A Cooperative Partnership.
MICS Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Entry Using Tablets / Laptops.
Administrators on Springerlink Petra Schuster, Global Manager Customer Service eProducts.
Data Cleaning in Financial Modules Workshop in Frankfurt Mario Schnalzenberger.
TIMOTHY SERVINSKY PROJECT MANAGER CENTER FOR SURVEY RESEARCH Data Preparation: An Introduction to Getting Data Ready for Analysis.
MICS Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Entry Using Tablets / Laptops.
SHARELIFE Meeting Vienna – November, 5-6 The Italian experience in SHARE data cleaning Paccagnella Omar Omar Paccagnella SHARELIFE meeting November 6,
13-Jul-07 State of the art of the ISCO-08 implementation.
Multiple Indicator Cluster Surveys Data Processing Workshop Overview of SPSS structural check programs and frequencies MICS Data Processing Workshop.
2015 Afghanistan Demographic and Health Survey (AfDHS) Key Indicators Report.
Component D: Activity D.3: Surveys Department EU Twinning Project.
Mannheim Research Institute for the Economics of Aging SHARE Data Cleaning General rules and procedures Stephanie Stuck MEA Antwerp.
DLHE Longitudinal Survey Catherine Benfield HESA.
Use of CAPI for agricultural surveys
ADE EDIS READ & Optimizer TRAINING Colorado Department of Education
Working with Preregistration.
Panel care in the Austrian EU-SILC
Andrew Jenkins and Rosalind Levačić
3 EGR Identification Service
3 EGR Identification Service
Presentation transcript:

Mannheim Research Institute for the Economics of Aging SHARE data versions & IDs Stephanie Stuck MEA Antwerpen February 2008

2 Data versions and ID-variables Data cleaningPublications who gets the idinternal versionpublic version Household ID all householdssampidsampid2 (scrambled version of sampid) Person ids all household members (in CV), that means: non-eligible persons get a cvid, too, e.g. children, other people living in the household cvid should be used to merge modules within waves all eligible household members (in CV), that means: all household members that should be interviewed, e.g. respondents and partners even if partners areyounger than 50 years respid should be used to merge waves

3 Data versions and ID-variables Data cleaningPublications internal versionpublic version Household IDsampidsampid2 (scrambled version of sampid) Download site country specific versions Raw version (updates during fieldwork and ‘shortly’ after fieldwork) CentERdata site Corrected versions during cleaning process new internal SHARE site (not yet available) all countries public website & internal website (data for working groups) Available forrespective country team, CentERdata, MEAworking groups, external users

4 sampid rules (old)  Digits 1-2: country code (e.g. 23 for Belgium French speaking)  Digits 3-5: wave indicator (042 for wave 1 and 062 for wave 2 main survey)  Digits 6-11: household ID  Digits 12-13: longitudinal household split indicator 00 by default, if respondent moves out based on respid, e.g. if ‘moving out respondent’ has respid 01 it is changed to 01 Examples : Austria, starting in wave 1 (longitudinal sample) : Belgium (French), starting in wave 2 (refresher)  One needs to combine sampid with the respondent ID (respid) to identify and merge cases on the respondent level  Merging problems esp. for split households / ‘moving’ respondents across waves

5 Therefore...  We will change the system and  have unique person ids, that can be used to merge modules and waves  person id will not change across waves, even if a household splits  have string country codes instead of numeric ones  We will divide sampid into different parts:  household id (fixed part and split indicator if needed)  new wave indictor variable ‘wi’ indicates when a household first entered the sample

6 New household identifier hhid1 (internal) & hhid (public)  Digits 1-2: country code in letters. e.g. AT for Austria, Bf for Belgium French speaking (internal)  Digits 3-8: fixed household ID This part will not change across waves if household splits off  Digit 9: one digit added to the fixed household id to identify whether it is an ‘additional’ household that resulted from a split,  A for all ‘original’ household (all in wave 1, refresher in wave 2)  B used only if a household has split. A is than still used for the ‘first’ part of the household and B for the ‘splitting part’ (the one that is interviewed second, normally the one that moved out)  C is used for very rare case of split off household when original household in wave 1 consisted of 3 eligible sisters for example and split in 3 parts. Examples for new household id AT100100A: Austria, ‘original’ household AT100100B: Austria, split off household Bf140103A: Belgium French speaking household (internal)

7 New person identifier: person1  Digits 1-2: country code (CC) in letters e.g. AT for Austria, Bf for Belgium French speaking  Digits 3-8: fixed household ID this part will not change across waves.  Digit 9-10: respondent id, e.g if respid is 1 it will be 01 Respondent identifier oldnew Sampid & respidperson AT AT Bf

8 Old and new ids internal versionpublic version (scrambled) oldnewoldnew Household ID & wave indicator sampidhhid1 & wi sampid2hhid & wi Person idsampid & respid person1 sampid & respid personid

9 In addition:  A dataset will be generated that shows to which households a respondent belonged during her or his ‘SHARE history’, e.g.: person1hhid1_w1hhid1_w2hhid1_w3 AT AT100100A AT AT100100A AT100100B Bf Bf140103A  A compatibility file will be made for internal use to merge the old sampid respid files with the new ids  We will have an additional person id (uuid) to insure uniqueness, but it will be used in the background only for technical reasons

10 Data cleaning  always use the unscrambled version that includes sampid for data cleaning  use sampid and respid to identify respondents  generate/compute sampid_original, respid_original and cvid_original before you change ids