Integrated Public Use Microdata Series IPUMSwww.ipums.org.

Slides:



Advertisements
Similar presentations
The Samples of Anonymised Records: Understanding Individual differences Mark Brown.
Advertisements

The Census Area Statistics Myles Gould Understanding area-level inequality & change.
Studying internal migrations with census microdata.
A comparison of the characteristics of childless women and mothers in the ONS Longitudinal Study Simon Whitworth Martina Portanti Office for National Statistics.
Dissemination of U.S. Census Data and Results: The role of ICPSR First Conference of Al-Khawarezmi Committee on Statistics Doha, Qatar 6-8 December 2010.
Demystifying Data Reference Helping non-specialists make sense of data.
Hist.umn.edu/~rmccaa/ipums-europe1 Population Activities Unit 1990 census round harmonization project: focused on Aging » Begun 1992: PAU/UNECE, UNFPA,
Aggregate data Also called summary data, tabular data Counts of things for places (e.g. counties) or entities Examples: –census volumes –HSUS –ICPSR files.
Mady Biaye, Advisor UNFPA CST Harare by IPUMS - WORKSHOP PARIS,10th June 2006 by Deric Zanera Demography & Social Statistics National Statistical Office.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
6. Managing access to IPUMS integrated census microdata “extracts” (13 slides)
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
5. Integration of Microdata and Metadata (9 slides)
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
Hist.umn.edu/~rmccaa/ipums-europe1 From IPUMS-USA (1989-) & PAU-Aging (1992-) From IPUMS-USA (1989-) & PAU-Aging (1992-) to IPUMS-International (1999-)
Users and Uses of IPUMS International Data Presented by Dr. Miriam King.
Original dataOriginal data. (various) Reformat dataReformat data: structural issues draw sample confidentiality (general tools) Data dictionary. (txt/pdf)
Census Processing Procedures Matt Sobek Funded by the National Science Foundation Minnesota Population Center.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota.
Raw Census Microdata from IPUMS IPUMS Data Structure Household record (shaded) followed by a person record for each member of the household Relationship.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Census Bureau – Fernando Casimiro, Coordinator Lisboa IPUMS - Portugal Country Report.
1 Commuting and Migration Data Products from the American Community Survey Journey-to-Work and Migration Statistics Branch U.S. Census Bureau State Data.
U.S. Census Bureau Demographic Census 2000 July 8, 2003.
Harmonizing the World’s Census Microdata: The IPUMS Project Matt Sobek Minnesota Population Center
RATIONALE The storage in a smart phone would cost (in 2011 dollars) $7,571 in 2001 $212,040 in 1991 $3,796,800 in 1981 $56,168,800 in 1971 $1,233,179,000.
United Nations Demographic Yearbook Data Collection System Adriana Skenderi United Nations Statistics Division Third Regional Workshop on Production and.
2014 SDC and CIC Annual Training Conference: Accessing ACS PUMS Data Tim Gilbert U.S. Census Bureau April 2, 2014.
Making Graphs. The Basics … Graphical Displays Should: induce the viewer to think about the substance rather than about the methodology, graphic design,
U.S. Decennial Census Finding and Accessing Data Summer Durrant October 20, 2014 Data & Geographical Information Librarian Research Data Services
Father Involvement and Child Well-Being: 2006 Survey of Income and Program Participation (SIPP) Child Well-Being Topical Module 1 By Jane Lawler Dye Fertility.
Population Census Topics included in the 2011 Population and Housing Census for Jamaica Presented by: Valerie Nam Director, 2011 Population and Housing.
Design and Use of the IPUMS-International Data Series
Roomers and Boarders: Melissa Scopilliti, University of Maryland, Maryland Population Research Center; Population Division, U.S. Census Bureau.
Statistical Coherence: Census Hub Hypercubes and IPUMS Microdata UNECE Expert Group on Population and Housing Censuses Geneva, September 2014 Lara.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
IPUMS-International Steven Ruggles Minnesota Population Center.
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Integrating ACS with the World’s Census Data: ACS Microdata and the IPUMS Presented at the Pre-ALAP ACS/IPUMS Workshop November 16, 2010 Trent Alexander.
TerraPop Vision An organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing.
1 Sources of gender statistics Angela Me UNECE Statistics Division.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center
IPUMS-International Methods Matt Sobek Minnesota Population Center
Data on the Foreign Born in 2010: Accessing Information on Immigrants and Immigration from the U.S. Census Bureau’s American Community Survey Thomas A.
Using Census Data to Understand Things ​ OpenGovChicago March 26, 2014.
IPUMS Microdata Relation to head Marital status Literacy Occupation.
American Community Survey (ACS) Product Types: Tables and Maps Samples Revised
 Background Data harmonization Data output  Web: Variable documentation system  Web: Data extract system IPUMS Dissemination System.
Integrated Public Use Microdata Series IPUMSwww.ipums.org Matt Sobek Minnesota Population Center
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
The Integrated Public Use Microdata Series database IPUMSwww.ipums.org Lab 1 Background on the IPUMS and SPSS.
Using Census Data for Monitoring Millennium Development Goals Session 2 Subregional Workshop on Dissemination and Use of Population and Housing Census.
IPUMS-International Process Matt Sobek Minnesota Population Center
1 Understanding how the Trinidad and Tobago 2011 Census Data can inform National Development Presented by A. Noguera- Ramkissoon, UNFPA, OIC, SALISES Forum,
United Nations Workshop on Evaluation and Analysis of Census Data, 1-12 December 2014, Nay Pyi Taw, Myanmar DATA VALIDATION-II Consistency check.
American Community Survey (ACS) Using Census Data by Block Group January 21, 2016 Presentation at the National Community Development Association Winter.
Challenges of Census Data Harmonization: IPUMS-International Matt Sobek Minnesota Population Center
Click “Browse and Select Data”:  to view integrated metadata  and to get microdata (make an “extract”) Note: the data are “pooled” into a single file–
Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center
Samples of Anonymised Records from the U.K. Census 1991 and 2001 Integrating Census Microdata Workshop Barcelona th July 2005 Dr. Ed Fieldhouse Cathie.
Census 2010: Accessing Census Data THURSDAY, July 21, :30am.
Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.
ASDC Annual Meeting November 10, 2011 Kathleen Gabler Socioeconomic Research Associate Center for Business and Economic Research Culverhouse College of.
Organised by Minnesota Population Centre Held in Lisbon, Portugal, 22 – 26 August 2007 IPUMS Global Workshop “Integrating Global Census Microdata” in conjunction.
Matt Sobek Minnesota Population Center
Census Bureau – Fernando Casimiro, Coordinator
IPUMS-International Integration Process
and the Future of Historical Family Demography
Presentation transcript:

Integrated Public Use Microdata Series IPUMSwww.ipums.org

IPUMS Overview 1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination 1. What is the IPUMS

Census Samples in IPUMS-USA

Planned

Datasets in IPUMS-International

IPUMS-International Census Sample Holdings and Release Dates

Datasets in IPUMS-CPS

What Are Microdata? Individual-level data every record represents a separate person all of their individual characteristics are recorded users must manipulate the data themselves Different from aggregate/summary/tabular data a disability table from an occupation table from a published census volume from the library

1930 Census Population Schedule

Raw Census Microdata from IPUMS

IPUMS Data Structure Household record (shaded) followed by a person record for each member of the household Relationship Age Sex Race Birthplace Mother’s birthplace Occupation For each type of record, columns correspond to specific variables

The Advantages of Microdata  Combination of all of a person’s characteristics  Characteristics of everyone with whom a person lived  Freedom to make any table you need  Freedom to make models examining multivariate relationships  Basically, you are only limited by the questions asked in the particular census

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

Translation Table – Marital Status China1982Colombia1973Kenya1989Mexico1970U.S.A.1990 (IPUMS-International)

Translation Table – Marital Status General Codes

Variable Description: Farm Status (USA)

Variable Description: Literacy (International)

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a Spouse’s Mother’sFather’s IPUMS “Pointer” Variables Location (Simple household)

PernumRelationshipAgeSexMarstChborn 1head53femaleseparated6 2child28malesinglen/a 3child22malesinglen/a 4child21malesinglen/a 5child25femalemarried2 6child-in-law28malemarriedn/a 7grandchild3malesinglen/a 8grandchild1malesinglen/a 9non-relative32femaleseparated2 10non-relative10malesinglen/a 11non-relative5femalesinglen/a Location Spouse’sFather’sMother’s IPUMS “Pointer” Variables (Complex household)

Additional Improvements to the U.S. PUMS  Additional documentation, including all enumeration forms and instructions   Consistent occupation/industry classifications  Consistent metropolitan classifications  Constructed family variables  Missing data allocation   International – pointers for some samples; occupation and industry; missing data in the future

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

USA – unrestricted, automated registration IPUMS Access CPS – unrestricted, automated registration International – restricted access Scholarly and educational purposes Conditions of use: key is not to redistribute Serious vetting

Economics (36%) Sociology (16%) Demography (12%) Other Academic (19%) Other Non-academic (15%) IPUMS Users 46% students; 23% faculty

Other IPUMS-USA Data Sources Querylogic ( PDQ ( Fathom (

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

4 Key Strengths of the Census Microdata Samples National in scope Results not subject to local peculiarities Provide context for local studies More cases than any comparable datasets Enable study of relatively small populations Large Temporal depth Provide historical perspective Microdata Can make your own tabulations Apply multivariate techniques

Limitations of the Microdata Samples Geographic detail Confidentiality restrictions Not annual Any historical analysis will have gaps Cross-sectional data Not longitudinal Need knowledge of a statistical package Samples Too small to answer some questions

Limitations of the Different IPUMS Data Series IPUMS-USA Geography 1940-present IPUMS-International Varying geography User burden: need to read documentation, information overload IPUMS-CPS Sample size (60 to 200K)

Studies that do not need to identify small geographic areas 100,000+ population for USA 1940-present Varies for International: as low as 20,000+ Subjects that are likely to deal with 10,000+ people Varies by sample density Topics where the key census questions were asked in comparable ways across samples Topics that take advantage of the hierarchical structure of the data: co-resident persons Some Characteristics of Good IPUMS Topics

IPUMS-International Research Topics Child labor outside the household in Mexico and Colombia Effect of NAFTA on educational attainment and school enrollment by region within Mexico Concentration of mortality within families in Kenya Life course patterns of co-residence among Mexicans in Mexico, Mexicans in the U.S., and Mexican Americans Brain drain from developing countries How language diversity is affected by migration and economic factors

Percent in Labor Force Mexico Costa Rica Ecuador Chile Venezuela Colombia Brazil Married Female Labor Force Participation in Latin America (age 18 to 65)

Percent in Labor Force Latin America United States Married Female Labor Force Participation: Latin America and U.S. (age 18 to 65)

Percent in Labor Force United States Mexico Costa Rica Ecuador Chile Venezuela Colombia Brazil Married Female Labor Force Participation: Latin America and U.S. (age 18 to 65) Compare Latin America to U.S. 40 years ago

Married Female Labor Force Participation: Mexican-born Women, Percent in Labor Force Mexican-born Women in United States Women in Mexico

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Users and Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview