Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center

Slides:



Advertisements
Similar presentations
How IPUMS Harmonizes Microdata Data Sources and Bibliography Data Sources: Original census data are contributed to the IPUMS- International project by.
Advertisements

IPUMS workshop * * * Robert McCaa, Professor of Population History University of Minnesota additional information.
Hist.umn.edu/~rmccaa/ipums-europe1 Population Activities Unit 1990 census round harmonization project: focused on Aging » Begun 1992: PAU/UNECE, UNFPA,
Aggregate data Also called summary data, tabular data Counts of things for places (e.g. counties) or entities Examples: –census volumes –HSUS –ICPSR files.
Hist.umn.edu/~rmccaa/ipums-europe1 IPUMS i integration principles IPUMS i integration principles » 1. Respect absolute anonymity and confidentiality »
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Hist.umn.edu/~rmccaa/ipums-europe1 Sister-project: IPUMS-Latin America: 17 countries, ~500 million pop., 5 census rounds 80+ samples, 100+ million person.
WORKSHOP ON INTEGRATING GLOBAL CENSUS MICRO DATA Paris, June 7 – 10, 2006 UGANDA COUNTRY REPORT by Andrew Mukulu.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
Building Data-rich Web Sites: The Integration Projects of the Minnesota Population Center William C. Block IASSIST 2006 Ann Arbor, Michigan, USA 24 May.
Proposed IPUMS-International Secure Data Enclave Patricia Kelly Hall
The IPUMS-International dynamic metadata system * * * Robert McCaa, Professor of Population History University of Minnesota.
Hist.umn.edu/~rmccaa/ipums-europe1 From IPUMS-USA (1989-) & PAU-Aging (1992-) From IPUMS-USA (1989-) & PAU-Aging (1992-) to IPUMS-International (1999-)
Original dataOriginal data. (various) Reformat dataReformat data: structural issues draw sample confidentiality (general tools) Data dictionary. (txt/pdf)
Census Processing Procedures Matt Sobek Funded by the National Science Foundation Minnesota Population Center.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota.
Raw Census Microdata from IPUMS IPUMS Data Structure Household record (shaded) followed by a person record for each member of the household Relationship.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
IPUMS-International: August * * * Robert McCaa, Professor of Population History University of Minnesota
Census Bureau – Fernando Casimiro, Coordinator Lisboa IPUMS - Portugal Country Report.
Census.ac.uk Census Area Statistics and Casweb David Rawnsley Census Dissemination Unit (CDU) Mimas University of Manchester.
Harmonizing the World’s Census Microdata: The IPUMS Project Matt Sobek Minnesota Population Center
Country Paper on: Census Data Accessibility, Confidentiality and Copyright Policy: Ethiopia’s Experience Seminar United Nations Regional Seminar on Census.
Hist.umn.edu/~rmccaa/ipums-europe1 IPUMS-Europe, : Restricted-access, anonymized microdata for scientific and policy research * * * Robert McCaa,
11 The American Community Survey Steve Murdock, Ph.D. Director, Hobby Center for the Study of Texas Rice University.
Issues Related to Data Dissemination in Official Statistics Presented at the European Conference On Quality in Official Statistics Helsinki, Finland May.
Saadia GreenbergElena Fazio Office of Performance and Evaluation Administration on Aging US Department.
U.S. Decennial Census Finding and Accessing Data Summer Durrant October 20, 2014 Data & Geographical Information Librarian Research Data Services
Design and Use of the IPUMS-International Data Series
Roomers and Boarders: Melissa Scopilliti, University of Maryland, Maryland Population Research Center; Population Division, U.S. Census Bureau.
Statistical Coherence: Census Hub Hypercubes and IPUMS Microdata UNECE Expert Group on Population and Housing Censuses Geneva, September 2014 Lara.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
IPUMS-International Steven Ruggles Minnesota Population Center.
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Integrating ACS with the World’s Census Data: ACS Microdata and the IPUMS Presented at the Pre-ALAP ACS/IPUMS Workshop November 16, 2010 Trent Alexander.
Plans for Access to UK Microdata from 2011 Census Emma White Office for National Statistics 24 May 2012.
TerraPop Vision An organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center
The Minnesota Data Harmonization Projects Bill & Melinda Gates Foundation Seattle, Washington May 21, 2014 Elizabeth Boyle, Miriam King, Matthew Sobek.
IPUMS-International Methods Matt Sobek Minnesota Population Center
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
Data Projects at the Minnesota Population Center Resources for Comparative Population and Health Research Seattle, Washington May 22, 2014 Elizabeth Boyle,
IPUMS Microdata Relation to head Marital status Literacy Occupation.
 Background Data harmonization Data output  Web: Variable documentation system  Web: Data extract system IPUMS Dissemination System.
Sociological Research Methods. The Research Process Sociologists answer questions about society through empirical research (observation and experiments)
Integrated Public Use Microdata Series IPUMSwww.ipums.org Matt Sobek Minnesota Population Center
OVERVIEW OF ARCHIVING OF MICRODATA SILAS M. MULWA Kenya National Bureau of Statistics United Nations Regional Seminar on Census Data Archiving for Africa.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
The Integrated Public Use Microdata Series database IPUMSwww.ipums.org Lab 1 Background on the IPUMS and SPSS.
IPUMS-International Process Matt Sobek Minnesota Population Center
Challenges of Census Data Harmonization: IPUMS-International Matt Sobek Minnesota Population Center
Census Office Fernando Casimiro Geneva, July 2010 Portugal – Census results tailored to user needs «
Click “Browse and Select Data”:  to view integrated metadata  and to get microdata (make an “extract”) Note: the data are “pooled” into a single file–
Integrated Public Use Microdata Series IPUMSwww.ipums.org.
Census 2010: Accessing Census Data THURSDAY, July 21, :30am.
Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.
ASDC Annual Meeting November 10, 2011 Kathleen Gabler Socioeconomic Research Associate Center for Business and Economic Research Culverhouse College of.
Matt Sobek Minnesota Population Center
IPUMS-International Schedule
Census Bureau – Fernando Casimiro, Coordinator
Introduction to IPUMS NYTS and IPUMS YRBSS
IPUMS-International Integration Process
and the Future of Historical Family Demography
2. Applying for Access (10 slides)
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
Introduction to IPUMS NYTS and IPUMS YRBSS
Danilo Dolenc Statistical Office of the Republic of Slovenia
The IPUMS-International Dissemination System
Presentation transcript:

Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center

IPUMS Overview 1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination 1. What is the IPUMS

IPUMS-USA Steve Ruggles All existing samples of US census All existing samples of US census Data extraction system 1998 Data extraction system 1998 Bob McCaa IPUMS-International IPUMS-Latin America 2004 IPUMS-Latin America 2005 IPUMS-Europe 2005 IPUMS-Europe 2005 NSF Expansion 2005 NSF Expansion World’s largest collection of census data 200 million records and growing 200 million records and growing 70 countries have agreed to join the project 70 countries have agreed to join the project Brief History

Datasets in IPUMS

May 2008 Data Release

Sample Sizes

African Datasets in IPUMS Archive Further agreements: Ethiopia, Lesotho, Tanzania

Khartoum, CBS-Sudan

Dhaka, Bangladesh Bureau of Statistics

Non-African Countries in IPUMS Archive

IPUMS Global Coverage

Selected Variable Availability -- PERSON

Selected Variable Availability -- HOUSEHOLD

What Are Microdata? Individual-level data every record represents a separate person all of their individual characteristics are recorded “raw” data that must be analyzed Different from aggregate/summary/tabular data a count of persons by municipality an employment status table by sex from a published census volume

Kenya 1999 Census Questionnaire

Raw Census Microdata from IPUMS

IPUMS Data Structure Household record (shaded) followed by a person record for each member of the household Relationship Age Sex Race Birthplace Mother’s birthplace Occupation

The Advantages of Microdata  Combination of all of a person’s characteristics  Characteristics of everyone with whom a person lived  Freedom to make any table you need  Freedom to make models examining multivariate relationships  Basically, you are only limited by the questions asked in the particular census

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

Translation Table – Marital Status China1982Colombia1973Kenya1989Mexico1970U.S.A.1990

General Codes

Variable Description: Literacy

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a Spouse’s Mother’sFather’s IPUMS “Pointer” Variables Location (Simple household)

PernumRelationshipAgeSexMarstChborn 1head53femaleseparated6 2child28malesinglen/a 3child22malesinglen/a 4child21malesinglen/a 5child25femalemarried2 6child-in-law28malemarriedn/a 7grandchild3malesinglen/a 8grandchild1malesinglen/a 9non-relative32femaleseparated2 10non-relative10malesinglen/a 11non-relative5femalesinglen/a Location Spouse’sFather’sMother’s IPUMS “Pointer” Variables (Complex household)

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations 6. Dissemination IPUMS Overview

IPUMS Access Restricted access Scholarly and educational purposes Conditions of use: key is not to redistribute Serious vetting

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Access 5. Strengths and Limitations IPUMS Overview 6. Dissemination

4 Key Strengths of the Census Samples National in scope Results not subject to local peculiarities Provide context for local studies More cases than any comparable datasets Enable study of relatively small populations Large Temporal depth Provide historical perspective Microdata Can make your own tabulations Apply multivariate techniques

Limitations of the Census Samples Confidentiality Geography 20,000 population or larger Sensitive variables, swapping, etc Samples Too small to answer some questions

Other Issues and Limitations Not annual Any temporal analysis will have gaps Cross-sectional data Not longitudinal Need knowledge of a statistical package User burden Information overload; culturally specific knowledge Very large extracts

1. What is the IPUMS 2. Harmonization 3. Additional Data Enhancements 4. Users and Access 5. Strengths and Limitations IPUMS Overview 6. Dissemination

Web Dissemination System