Background Data harmonization Data output Web: Variable documentation system Web: Data extract system IPUMS Dissemination System
Variable Harmonization China 1982 Colombia 1973 Kenya 1989 Mexico 1970 U.S.A (Marital status)
IPUMS Microdata Home Ownership Relation to Head Age Marital Status Occupation Data extract
3. Submit extract Pooled Data Extracts samplewatersexeducation Argentina million Chile million Cuba million Extract Engine Argentina 2001 Chile 2002 Cuba 2002 Water supply Sex Education 1. Select samples 2. Select variables 1 dataset 3 censuses 4 variables 6.2 million records Harmonized codes
Q: How can we give researchers the information they need without overwhelming them? Q: How can we best encourage comparative research? A: Organize information by variable, not sample A: Ability to filter out unnecessary information A: Access to full detail when that is desired Variable Documentation System
1. Exploring the Database
Variables Page
159 samples
Sample Filtering
Variables Page – Filtered
2. Variables – Codes
Variable Codes (Marital status)
Variable Codes (Marital status)
Variable Codes (Marital status)
3. Variable Descriptions
Variable Description (Marital status)
Comparability Discussion (Marital status)
4. Variables – Deep Documentation
Enumeration Text
(Marital status, Cambodia)
Variable Description (Unharmonized source variables)
Unharmonized Variables (Source data for marital status)
Make it easy to get only the variables and samples that a user needs. Pool the data across time and countries. Provide tools to help users manage the size of the data. Provide advanced features to empower researchers to do new kinds of research. Data Extract System
Extract – Select Samples
Extract – Select Variables
1. Case selection 2. Customized sample size 3. Attached characteristics 4. Extract revisions Advanced Extract Features
Case Selection
Customize Sample Sizes
PernumRelationshipAgeSexMarstChborn 1head53femaleseparated6 2child28malesinglen/a 3child22malesinglen/a 4child21malesinglen/a 5child25femalemarried2 6child-in-law28malemarriedn/a 7grandchild3malesinglen/a 8grandchild1malesinglen/a 9non-relative32femaleseparated2 10non-relative10malesinglen/a 11non-relative5femalesinglen/a Location Spouse’sFather’sMother’s Constructed “Pointer” Variables Attached Characteristics
Age of spouse Employment status of father Occupation of father Attached Characteristics
Download or Revise Extract
END Matt Sobek Minnesota Population Center