Presentation is loading. Please wait.

Presentation is loading. Please wait.

IPUMS-International Methods Matt Sobek Minnesota Population Center

Similar presentations


Presentation on theme: "IPUMS-International Methods Matt Sobek Minnesota Population Center"— Presentation transcript:

1 IPUMS-International Methods Matt Sobek Minnesota Population Center sobek@pop.umn.edu

2 IPUMS-International Development Process 1. Inventory 2. Metadata Preparation 3. Data Preparation 4. Harmonization 5. Data Enhancements 6. Dissemination

3 IPUMS-International Development Process 1. Inventory a) Data b) Data dictionary c) Census questionnaire and instructions d) Sample design

4 IPUMS-International Development Process 2. Metadata Preparation English translation English translation

5 IPUMS-International Development Process 2. Metadata Preparation English translation English translation Data dictionaries Data dictionaries

6 Original Data Dictionary (Kenya 1989)

7 Original Data Dictionary (Romania 1992)

8 Original Data Dictionary (China 1982)

9 Original Data Dictionary (Mexico 1990)

10 Variable Labels File – IPUMS Metadata (Costa Rica 2000)

11 IPUMS-International Development Process 2. Metadata Preparation English translation English translation Data dictionaries Data dictionaries Questionnaires and instructions Questionnaires and instructions

12 Census Questionnaire (Mexico 2000) WaterAccess

13 Text of Census Questionnaire (Mexico 2000)

14 XML-Tagged Census Questionnaire (Mexico 2000) Source variable MX00A016 MX00A017 MX00A018 (water access)

15 Source variable MX00A018 XML-Tagged Census Instructions (Mexico 2000)

16 IPUMS-International Development Process 3. Data Preparation Data reformatting Data reformatting

17 geographyhousing person (head) person (child) geographyhousingperson (head) geographyhousingperson (child) geographyhousingperson (child) geographyhousingperson (head) geographyhousingperson (spouse) geographyhousingperson (child) geographyhousingperson (child) geographyhousing person (head) person (spouse) person (child) (Brazil 1980) (Person records only; household data duplicated on person records) Reformat Rectangular Sample

18 dwelling household person (head) person (spouse) person (child) household person (head) person (child) person (head) person (spouse) dwelling household dwellinghousehold person (head) person (spouse) person (child) dwellinghousehold person (head) person (child) dwellinghousehold person (head) person (spouse) (Chile 1992) (Separate dwelling and household records) Reformat Dwelling-Household-Person Sample

19 serial 001head serial 001spouse serial 002head serial 002child serial 003head serial 001geog & housing serial 002geog & housing serial 003geog & housing serial 001household serial 001head serial 001spouse serial 003household serial 002household serial 002head serial 002child serial 003head Household File Person File (Brazil 2000) Merge Separate Household and Person Files

20 IPUMS-International Development Process 3. Data Preparation Data reformatting Data reformatting Draw samples Draw samples Confidentiality measures Confidentiality measures Convert source variables to input Convert source variables to input

21 Original Source Variable IPUMSI Input Variable Input Variables – Data

22 Input Variables – Description Assigned by computer Developed by researchers Assembled by computer from XML markups

23 IPUMS-International Development Process 4. Harmonization Data Data Correspondence tables Correspondence tables

24 Correspondence Table – Marital Status China1982Colombia1973Kenya1989Mexico1970U.S.A.1990

25 General Codes

26 IPUMS-International Development Process 4. Harmonization Data Data Correspondence tables Correspondence tables Supplemental programming Supplemental programming

27 Supplementary Variable Programming (INCTOT)

28 IPUMS-International Development Process 4. Harmonization Data Data Correspondence tables Correspondence tables Supplemental programming Supplemental programming Documentation Documentation Integration Integration Mark-up for web delivery Mark-up for web delivery

29 XML-Tagged Variable Text (Literacy) VariableName Description GeneralComparability ComparabilityBrazil ComparabilityChina

30 Variable Description on Website (Literacy)

31 IPUMS-International Development Process 5. Data Enhancements Data editing Data editing Consistency edits Consistency edits Hot-deck imputation Hot-deck imputation

32 Missing Data Allocation Script (Occupation variable, USA) 5 dimensional table 324 cells

33 IPUMS-International Development Process 5. Data Enhancements Data editing Data editing Consistency edits Consistency edits Hot-deck imputation Hot-deck imputation Family interrelationship “pointers” Family interrelationship “pointers”

34 PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a Spouse’s Mother’sFather’s IPUMS “Pointer” Variables Location 2 1 0 0 0 0 0 0 00 0 0 21 1 1 2 2 (Simple household)

35 PernumRelationshipAgeSexMarstChborn 1head53femaleseparated6 2child28malesinglen/a 3child22malesinglen/a 4child21malesinglen/a 5child25femalemarried2 6child-in-law28malemarriedn/a 7grandchild3malesinglen/a 8grandchild1malesinglen/a 9non-relative32femaleseparated2 10non-relative10malesinglen/a 11non-relative5femalesinglen/a Location 0 0 0 0 0 6 5 0 0 0 0 0 0 1 1 1 1 0 5 5 0 9 9 0 0 0 6 6 0 0 0 0 0 Spouse’sFather’sMother’s IPUMS “Pointer” Variables (Complex household)

36 IPUMS-International Development Process 6. Dissemination Documentation system Documentation system Preferences and dynamic content delivery Preferences and dynamic content delivery

37 IPUMS-International Development Process 6. Dissemination Documentation system Documentation system Preferences and dynamic content delivery Preferences and dynamic content delivery Data extraction system Data extraction system Sample, variable, and case selection Sample, variable, and case selection General and detailed variables General and detailed variables Advanced extract features Advanced extract features

38 End sobek@pop.umn.edu


Download ppt "IPUMS-International Methods Matt Sobek Minnesota Population Center"

Similar presentations


Ads by Google