Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota.

Slides:



Advertisements
Similar presentations
National Database Templates for the Biosafety Clearing-House Application (NDT-nBCH) Overview of the US nBCH Applications.
Advertisements

How IPUMS Harmonizes Microdata Data Sources and Bibliography Data Sources: Original census data are contributed to the IPUMS- International project by.
Using synthetic data to improve the accessibility of the SLS Susan Carsley, SLS Project Manager.
Variance Estimation: Drawing Statistical Inferences from IPUMS-International Census Data Lara L. Cleveland IPUMS-International November 14, 2010 Havana,
Welcome IPUMS/IECM-Europe Workshop: Accomplishments, plans and challenges * * * Robert McCaa, Professor of.
IPUMS workshop * * * Robert McCaa, Professor of Population History University of Minnesota additional information.
Census 2000 symposium, session 4 paper 261 Archiving Census Documentation and Microdata: Preserving Memory, Increasing Stakeholders * * * Wendy L. Thomas.
Using a restricted-access web-site of anonymized, integrated census microdata (for 1, 2, 3, 4,
Hist.umn.edu/~rmccaa/ipums-europe1 IPUMS i integration principles IPUMS i integration principles » 1. Respect absolute anonymity and confidentiality »
6. Managing access to IPUMS integrated census microdata “extracts” (13 slides)
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Hist.umn.edu/~rmccaa/ipums-europe1 Sister-project: IPUMS-Latin America: 17 countries, ~500 million pop., 5 census rounds 80+ samples, 100+ million person.
Bridging the Gaps: Dealing with Major Survey Changes in Data Set Harmonization Joint Statistical Meetings Minneapolis, MN August 9, 2005 Presented by:
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
Building Data-rich Web Sites: The Integration Projects of the Minnesota Population Center William C. Block IASSIST 2006 Ann Arbor, Michigan, USA 24 May.
Statistical confidentiality and privacy. 2. Case study: IPUMS-International * * * Robert McCaa Minnesota Population Center.
The IPUMS-International dynamic metadata system * * * Robert McCaa, Professor of Population History University of Minnesota.
IPUMS-EurAsia, : Changing Patterns of Microdata Use * * * Robert McCaa, Professor of Population History University.
The IECM project: Integrating the European Census Microdata IECM team* *A. Cabré, A. Esteve, J.Garcia, T. López, M. Valls PROJECT.
IPUMS-International: August * * * Robert McCaa, Professor of Population History University of Minnesota
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
KIPO’s progress on ST.96. Contents II. Projects I. I. Progress on XML Standards III. Future plans.
Multiple Indicator Cluster Surveys Data Dissemination and Further Analysis Workshop Data Archiving MICS4 Data Dissemination and Further Analysis Workshop.
Harmonizing the World’s Census Microdata: The IPUMS Project Matt Sobek Minnesota Population Center
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
Promoting I/UCRC Best Practices NSF IUCRC Annual Directors Meeting Awarded August 2004 Awarded August 2004
U.S. Decennial Census Finding and Accessing Data Summer Durrant October 20, 2014 Data & Geographical Information Librarian Research Data Services
Statistics Canada’s Real Time Remote Access Solution 2011 MSIS Meeting – Karen Doherty May 2011.
Dissemination to support Research & Analysis John Cornish.
Statistical Coherence: Census Hub Hypercubes and IPUMS Microdata UNECE Expert Group on Population and Housing Censuses Geneva, September 2014 Lara.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
IPUMS-International Steven Ruggles Minnesota Population Center.
Integrating ACS with the World’s Census Data: ACS Microdata and the IPUMS Presented at the Pre-ALAP ACS/IPUMS Workshop November 16, 2010 Trent Alexander.
1 Canadian Century Research Infrastructure CCRI An Interdisciplinary Census Database Project.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center
The Minnesota Data Harmonization Projects Bill & Melinda Gates Foundation Seattle, Washington May 21, 2014 Elizabeth Boyle, Miriam King, Matthew Sobek.
Teaching with IPUMS (ready access for students) Press to continue tutorial Click here to: set-up a classroom account set-up a classroom account Share syllabi.
IPUMS-International Methods Matt Sobek Minnesota Population Center
How to get data for small areas: Example: Regency of Bangli in the province of Bali, from the 2010 and 2000 census samples of Indonesia 1.Login 2.Browse.
* IPUMS-International * Using Integrated unit records for demographic and health research: Local, regional, national, and international * * * Robert McCaa,
IPUMS-International Free census samples (microdata) for researchers and policy makers: * * * Robert McCaa, Minnesota Population.
Data Projects at the Minnesota Population Center Resources for Comparative Population and Health Research Seattle, Washington May 22, 2014 Elizabeth Boyle,
Innovations in Data Dissemination Thomas L. Mesenbourg, Jr. Acting Director U.S. Census Bureau United Nations Seminar on Innovations in Official Statistics.
Drinking Water Infrastructure Needs Survey and Assessment 2007 Website.
Multi-modal of data collection for the 2010 Population and Housing Census National Statistical Office, Thailand (Daejeon, Republic of Korea, April.
Trans-Border access to Census Microdata: The IPUMS-IECM partnership * * * Robert McCaa and Albert Esteve Palós “You have to.
IPUMS Microdata Relation to head Marital status Literacy Occupation.
How to Make an extract of Puerto Ricans censused abroad : 1.Login 2.Select samples (default is all) 3.Select variables (include BPLCTRY) 4.Select cases:
 Background Data harmonization Data output  Web: Variable documentation system  Web: Data extract system IPUMS Dissemination System.
Integrated Public Use Microdata Series IPUMSwww.ipums.org Matt Sobek Minnesota Population Center
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
The Integrated Public Use Microdata Series database IPUMSwww.ipums.org Lab 1 Background on the IPUMS and SPSS.
Introduction Hist3797: History of world populations » 1. Course theme: how human populations grow (and decrease) » 2. Some population theorists: Malthus,
Challenges of Census Data Harmonization: IPUMS-International Matt Sobek Minnesota Population Center
Census Office Fernando Casimiro Geneva, July 2010 Portugal – Census results tailored to user needs «
Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center
Integrated Public Use Microdata Series IPUMSwww.ipums.org.
1. Introduction 2. Background 3. Funding framework 4. EU participation 5. Timetable 6. Progress report 7. Future plans I ntegrating the E uropean C ensus.
Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.
Teaching with IPUMS (ready access for students)
Matt Sobek Minnesota Population Center
CENSUS & IPUMS DATA RETRIEVAL
Integrating the European Census Microdata
Welcome IPUMS/IECM-Europe Workshop: Accomplishments, plans and challenges * * * Robert McCaa, Professor.
Building A Web-based University Archive
IPUMS-International Integration Process
Survey Documentation and Analysis (SDA)
The role of metadata in census data dissemination
Presentation transcript:

Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota Population Center

How to get data (once approved) (also SAS, STATA) 1. Access web-site study documentation 2. Make and submit extract 3. Get extract ready 3. Get extract ready 4. Retrieve extract 5. Decompress extract 6. Analyze using stat. package

Outline History of Public Use Census Microdata IPUMS IPUMS-International NAPP Differences among the projects –Data format –Harmonization –Administration, work processes, and legal constraints

History of U.S. Public Use Census Microdata The 1960 One-In-One-Thousand Public Use Sample The 1970 Public Use Samples DUALabs, Beresford, and the harmonized and expanded 1960 sample The new historical samples: Preston, Winsborough, Ruggles The 1980, 1990, and 2000 PUMS: incompatible

Table 1. Census files incorporated in the original version of IPUMS 1991: eight census years, four investigators, six performance sites, seven record layouts

IPUMS : SHRL Common format FORTRAN programs –Limitations: lost information, false cognates, poor documentation, expensive custom datasets IPUMS was an attempt to do it right –Single harmonized database, comprehensive integrated documentation, no lost information –Beta release 1993, full public release 1995 Internet dissemination –ftp in 1993, web-based interactive extraction in 1995

Table 2. Current and Planned IPUMS-USA Data Files

IPUMS-International After 1960, most censuses around the world were tabulated by computer McCaa decided that IPUMS model should be applied to other countries Began with a project for Colombia, then in 1999 NSF Infrastructure grant to add six more countries : three major new grants to increase database to 50+ countries

IPUMS-International Tasks Inventory and preservation of data and documentation Processing (standardizing format, correcting format errors, drawing samples, adding confidentiality protections, harmonizing codes, etc.) Documentation (especially comparability) Dissemination—obtain licenses that allow us to disseminate data for educational and scholarly usae, and set up secure web-based dissemination system

Table 3. Current IPUMS-International Samples

IPUMS-International, August 2005 dark green = disseminating medium green = harmonizing light green = negotiating Mollenweide projection

Table 4. Status of IPUMS-International Countries

North Atlantic Population Project IMAG 1999: LDS data for Britain, Canada, U.S. Minneapolis 2000: meetings to define scope of a harmonization project –Added Norway and Iceland –Adopted decentralized structure with coding work carried out at seven sites, coordination and programming at Minneapolis : preliminary datasets for all countries released : planned expansions (funding pending)

Table 5a. Phase I NAPP datasets

Table 5b. Phase II NAPP datasets

Differences Data Format Problems Harmonization Project administration and work process Ownership and dissemination restrictions

Merging the databases Current compatibility and incompatibilities Two formats Integration of web access tools

Thank you