Integrated Public Use Microdata Series (IPUMS)
www.ipums.org
IPUMS Overview
  1. What is the IPUMS
  2. Harmonization
  3. Additional Data Enhancements
  4. Access
  5. Strengths and Limitations
  6. Dissemination

1. What is the IPUMS
Census Samples in IPUMS-USA
Planned
Datasets in IPUMS-International
IPUMS-International Census Sample Holdings and Release Dates
Datasets in IPUMS-CPS
What Are Microdata?
  Individual-level data
    every record represents a separate person
    all of their individual characteristics are recorded
    users must manipulate the data themselves
  Different from aggregate/summary/tabular data
    a disability table from an occupation table from a published census volume from the library
1930 Census Population Schedule
Raw Census Microdata from IPUMS
IPUMS Data Structure
  Household record (shaded) followed by a person record for each member of the household
  Labeled columns: Relationship, Age, Sex, Race, Birthplace, Mother’s birthplace, Occupation
  For each type of record, columns correspond to specific variables
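The layout described on this slide, a household record followed by its person records, can be read with a short parser. This is a minimal sketch only: the record-type flag in the first column ("H" or "P"), the column positions, and the sample lines are hypothetical and do not reproduce any actual IPUMS codebook.

```python
# Hypothetical fixed-width layout (not an actual IPUMS codebook):
# column 0 holds the record type; the other columns depend on that type.
HOUSEHOLD_COLS = {"serial": (1, 5), "state": (5, 7)}
PERSON_COLS = {"pernum": (1, 3), "age": (3, 6), "sex": (6, 7)}

def slice_fields(line, cols):
    """Cut one fixed-width line into named, stripped string fields."""
    return {name: line[start:end].strip() for name, (start, end) in cols.items()}

def read_hierarchical(lines):
    """Group each household record with the person records that follow it."""
    households = []
    for line in lines:
        if line.startswith("H"):
            hh = slice_fields(line, HOUSEHOLD_COLS)
            hh["persons"] = []
            households.append(hh)
        elif line.startswith("P"):
            if not households:
                raise ValueError("person record before any household record")
            households[-1]["persons"].append(slice_fields(line, PERSON_COLS))
    return households

# Made-up sample: one household record followed by two person records.
sample = [
    "H000127",
    "P01046M",
    "P02044F",
]
households = read_hierarchical(sample)
```

Keeping the person records nested under their household preserves the link between co-resident people, which is what later makes household-level analysis possible.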
The Advantages of Microdata
  Combination of all of a person’s characteristics
  Characteristics of everyone with whom a person lived
  Freedom to make any table you need
  Freedom to make models examining multivariate relationships
  Basically, you are only limited by the questions asked in the particular census
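The "freedom to make any table you need" can be illustrated with a small cross-tabulation. This is a sketch under stated assumptions: the person records, field names, and values are made up for illustration and are not drawn from any IPUMS sample.

```python
from collections import Counter

# Hypothetical person records; the values are made up for illustration.
persons = [
    {"sex": "female", "marst": "married", "occ": "teacher"},
    {"sex": "male", "marst": "married", "occ": "farmer"},
    {"sex": "female", "marst": "single", "occ": "teacher"},
    {"sex": "female", "marst": "married", "occ": "farmer"},
]

def crosstab(records, row_var, col_var):
    """Count records for every observed (row, column) combination."""
    return Counter((r[row_var], r[col_var]) for r in records)

# Any pair of recorded characteristics can be tabulated, not only the
# pairs that a published volume happened to print.
occ_by_marst = crosstab(persons, "occ", "marst")
sex_by_occ = crosstab(persons, "sex", "occ")
```

Because every record carries all of a person's characteristics, the same records answer both questions; a published table would fix the row and column variables in advance.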
2. Harmonization
Translation Table – Marital Status (IPUMS-International)
  Samples: China 1982, Colombia 1973, Kenya 1989, Mexico 1970, U.S.A. 1990
Translation Table – Marital Status General Codes
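A translation table of this kind can be thought of as a lookup from each sample's own codes to one shared set of general codes. The sketch below assumes hypothetical sample names, source codes, and general codes; none of them are the actual IPUMS codes for marital status.

```python
# Hypothetical general codes, for illustration only.
GENERAL_LABELS = {1: "single", 2: "married", 3: "separated/divorced", 4: "widowed"}

# Hypothetical translation table: one mapping per sample, from that
# sample's original code to the general code.
TRANSLATION = {
    "sample_a": {1: 2, 2: 1, 3: 4, 4: 3},
    "sample_b": {"S": 1, "M": 2, "D": 3, "W": 4},
}

def harmonize(sample, source_code):
    """Translate one sample-specific code to the general code, or None if unmapped."""
    return TRANSLATION[sample].get(source_code)

# The same general code now means the same thing in both samples.
label_a = GENERAL_LABELS[harmonize("sample_a", 3)]
label_b = GENERAL_LABELS[harmonize("sample_b", "W")]
```

Returning None for an unmapped code, rather than guessing, keeps gaps in the translation table visible to the analyst.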
Variable Description: Farm Status (USA)
Variable Description: Literacy (International)
3. Additional Data Enhancements
IPUMS “Pointer” Variables (Simple household)

Pernum  Relate  Age  Sex     Marst    Chborn
1       head    46   male    married  n/a
2       spouse  44   female  married  3
3       aunt    77   female  widow    7
4       child   15   female  single   0
5       child   13   female  single   n/a
6       child   11   male    single   n/a

Location: Spouse’s, Mother’s, Father’s
IPUMS “Pointer” Variables (Complex household)

Pernum  Relationship  Age  Sex     Marst      Chborn
1       head          53   female  separated  6
2       child         28   male    single     n/a
3       child         22   male    single     n/a
4       child         21   male    single     n/a
5       child         25   female  married    2
6       child-in-law  28   male    married    n/a
7       grandchild    3    male    single     n/a
8       grandchild    1    male    single     n/a
9       non-relative  32   female  separated  2
10      non-relative  10   male    single     n/a
11      non-relative  5    female  single     n/a

Location: Spouse’s, Father’s, Mother’s
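The idea behind pointer variables, recording the person number of each person's spouse, mother, and father, can be sketched with a deliberately naive rule applied to the simple household. This is not the IPUMS linking procedure: the field names `sploc`, `momloc`, and `poploc`, the use of 0 for "nobody identified", and the rule itself (link only head, spouse, and children of the head) are assumptions made for illustration.

```python
# Simple household from the slide (pernum, relate, sex).
household = [
    {"pernum": 1, "relate": "head", "sex": "male"},
    {"pernum": 2, "relate": "spouse", "sex": "female"},
    {"pernum": 3, "relate": "aunt", "sex": "female"},
    {"pernum": 4, "relate": "child", "sex": "female"},
    {"pernum": 5, "relate": "child", "sex": "female"},
    {"pernum": 6, "relate": "child", "sex": "male"},
]

def add_pointers(persons):
    """Attach spouse/mother/father locations using a deliberately naive rule.

    Assumption: only head, spouse, and child of head are linked; 0 means
    "no such person identified in the household".
    """
    head = next((p for p in persons if p["relate"] == "head"), None)
    spouse = next((p for p in persons if p["relate"] == "spouse"), None)
    parents = [p for p in (head, spouse) if p is not None]
    mother = next((p["pernum"] for p in parents if p["sex"] == "female"), 0)
    father = next((p["pernum"] for p in parents if p["sex"] == "male"), 0)
    result = []
    for p in persons:
        row = dict(p, sploc=0, momloc=0, poploc=0)
        if p is head and spouse is not None:
            row["sploc"] = spouse["pernum"]
        elif p is spouse and head is not None:
            row["sploc"] = head["pernum"]
        elif p["relate"] == "child":
            row["momloc"], row["poploc"] = mother, father
        result.append(row)
    return result

linked = add_pointers(household)
```

The complex household shows why such a rule is not enough: grandchildren, a child-in-law, and non-relatives cannot be linked from the relationship to the head alone, so any real procedure needs further evidence such as age, marital status, and position in the household.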
Additional Improvements to the U.S. PUMS
  Additional documentation, including all enumeration forms and instructions
  Consistent occupation/industry classifications
  Consistent metropolitan classifications
  Constructed family variables
  Missing data allocation
  International – pointers for some samples; occupation and industry; missing data in the future
4. Access
IPUMS Access
  USA – unrestricted, automated registration
  CPS – unrestricted, automated registration
  International – restricted access
    Scholarly and educational purposes
    Conditions of use: key is not to redistribute
    Serious vetting
IPUMS Users
  Economics (36%)
  Sociology (16%)
  Demography (12%)
  Other Academic (19%)
  Other Non-academic (15%)
  46% students; 23% faculty
Other IPUMS-USA Data Sources
  Querylogic
  PDQ
  Fathom
5. Strengths and Limitations
4 Key Strengths of the Census Microdata Samples
  National in scope
    Results not subject to local peculiarities
    Provide context for local studies
  Large
    More cases than any comparable datasets
    Enable study of relatively small populations
  Temporal depth
    Provide historical perspective
  Microdata
    Can make your own tabulations
    Apply multivariate techniques
Limitations of the Microdata Samples
  Geographic detail
    Confidentiality restrictions
  Not annual
    Any historical analysis will have gaps
  Cross-sectional data
    Not longitudinal
  Need knowledge of a statistical package
  Samples
    Too small to answer some questions
Limitations of the Different IPUMS Data Series
  IPUMS-USA
    Geography 1940-present
  IPUMS-International
    Varying geography
    User burden: need to read documentation, information overload
  IPUMS-CPS
    Sample size (60 to 200K)
Some Characteristics of Good IPUMS Topics
  Studies that do not need to identify small geographic areas
    100,000+ population for USA 1940-present
    Varies for International: as low as 20,000+
  Subjects that are likely to deal with 10,000+ people
    Varies by sample density
  Topics where the key census questions were asked in comparable ways across samples
  Topics that take advantage of the hierarchical structure of the data: co-resident persons
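Why the 10,000+ guideline "varies by sample density" follows from simple arithmetic: the expected number of sample records is the population of interest multiplied by the sampling fraction. The densities below are illustrative assumptions, not the densities of particular IPUMS samples.

```python
def expected_sample_cases(population, sample_density):
    """Expected number of sample records for a group of the given size."""
    return population * sample_density

# Illustrative densities only; actual densities vary by sample.
for density in (0.01, 0.05, 0.10):
    cases = expected_sample_cases(10_000, density)
    print(f"{density:.0%} sample -> about {cases:.0f} cases")
```

A group of 10,000 people yields roughly 100 records in a 1% sample, which is little to work with once the records are split by age, sex, or region; a denser sample supports finer breakdowns of the same group.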
IPUMS-International Research Topics
  Child labor outside the household in Mexico and Colombia
  Effect of NAFTA on educational attainment and school enrollment by region within Mexico
  Concentration of mortality within families in Kenya
  Life course patterns of co-residence among Mexicans in Mexico, Mexicans in the U.S., and Mexican Americans
  Brain drain from developing countries
  How language diversity is affected by migration and economic factors
Married Female Labor Force Participation in Latin America (age 18 to 65)
  [Chart: percent in labor force; series: Mexico, Costa Rica, Ecuador, Chile, Venezuela, Colombia, Brazil]
Married Female Labor Force Participation: Latin America and U.S. (age 18 to 65)
  [Chart: percent in labor force; series: Latin America, United States]
Married Female Labor Force Participation: Latin America and U.S. (age 18 to 65)
  [Chart: percent in labor force; series: United States, Mexico, Costa Rica, Ecuador, Chile, Venezuela, Colombia, Brazil]
  Compare Latin America to U.S. 40 years ago
Married Female Labor Force Participation: Mexican-born Women
  [Chart: percent in labor force; series: Mexican-born women in United States, women in Mexico]
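A "percent in labor force" figure of the kind charted in these slides could be computed from person records roughly as follows. This is a sketch under stated assumptions: the field names, the person weights, and the sample records are hypothetical, and the slides do not say how the charted values were actually produced.

```python
def participation_rate(persons, min_age=18, max_age=65):
    """Weighted percent of married women in the age range who are in the labor force."""
    in_universe = [
        p for p in persons
        if p["sex"] == "female" and p["marst"] == "married"
        and min_age <= p["age"] <= max_age
    ]
    total_weight = sum(p["weight"] for p in in_universe)
    if total_weight == 0:
        return None  # no one in the universe, so the rate is undefined
    active_weight = sum(p["weight"] for p in in_universe if p["in_labor_force"])
    return 100.0 * active_weight / total_weight

# Made-up records; field names and weights are hypothetical.
sample = [
    {"sex": "female", "marst": "married", "age": 30, "in_labor_force": True, "weight": 100},
    {"sex": "female", "marst": "married", "age": 40, "in_labor_force": False, "weight": 300},
    {"sex": "female", "marst": "single", "age": 25, "in_labor_force": True, "weight": 100},
    {"sex": "male", "marst": "married", "age": 45, "in_labor_force": True, "weight": 100},
    {"sex": "female", "marst": "married", "age": 70, "in_labor_force": False, "weight": 100},
]
rate = participation_rate(sample)
```

Applying one function to every country and year is only meaningful when sex, marital status, age, and labor force status are coded comparably across samples, which is the purpose of harmonization.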
6. Dissemination