Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.

Slides:



Advertisements
Similar presentations
Presentation on Civil Registration and Vital Statistics Systems in Namibia __________________________________ Workshop on Improvement of Civil Registration.
Advertisements

How IPUMS Harmonizes Microdata Data Sources and Bibliography Data Sources: Original census data are contributed to the IPUMS- International project by.
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
IPUMS workshop * * * Robert McCaa, Professor of Population History University of Minnesota additional information.
REPUBLIC OF RWANDA National Institute of Statistics Prepared by Emmanuel GATERA National Institute of Statistics of Rwanda Management Information Systems.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
St. Lucia Country Report By Edwin St Catherine Director, Central Statistical Office Presented to IPUMS Workshop August 24 th, 2007.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
The process of [social research theory/model/framework conceptual relationships hypotheses working hypotheses and measurement research design data collection.
MONGOLIA COUNTRY REPORT National Statistical Office IPUMS-Global Workshop, Lisbon, Portugal, August 22-26, 2007.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota.
IPUMS-International: August * * * Robert McCaa, Professor of Population History University of Minnesota
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
Census Bureau – Fernando Casimiro, Coordinator Lisboa IPUMS - Portugal Country Report.
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
Census.ac.uk Census Area Statistics and Casweb David Rawnsley Census Dissemination Unit (CDU) Mimas University of Manchester.
Harmonizing the World’s Census Microdata: The IPUMS Project Matt Sobek Minnesota Population Center
RATIONALE The storage in a smart phone would cost (in 2011 dollars) $7,571 in 2001 $212,040 in 1991 $3,796,800 in 1981 $56,168,800 in 1971 $1,233,179,000.
Integrated household based agricultural survey methodology applied in Ethiopia, new developments and comments on the Integrated survey frame work.
Census Census of Population, Housing,Buildings,Establishments and Agriculture Huda Ebrahim Al Shrooqi Central Informatics Organization.
Statistical Coherence: Census Hub Hypercubes and IPUMS Microdata UNECE Expert Group on Population and Housing Censuses Geneva, September 2014 Lara.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Integrating ACS with the World’s Census Data: ACS Microdata and the IPUMS Presented at the Pre-ALAP ACS/IPUMS Workshop November 16, 2010 Trent Alexander.
TerraPop Vision An organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing.
1 Sources of gender statistics Angela Me UNECE Statistics Division.
United Nations Economic Commission for Europe Statistical Division Sources of gender statistics Angela Me UNECE Statistics Division.
September, 2013 Best Practices in Data Dissemination Public Use Data files (PUF) Mohammed Sahmoud Programmer & Web Developer Dissemination & Documentation.
Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center
The Minnesota Data Harmonization Projects Bill & Melinda Gates Foundation Seattle, Washington May 21, 2014 Elizabeth Boyle, Miriam King, Matthew Sobek.
Census Mapping A Case of Zambia UN Workshop on Census Cartography and Management, Lusaka, 8-12 th October 2007.
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
* IPUMS-International * Using Integrated unit records for demographic and health research: Local, regional, national, and international * * * Robert McCaa,
IPUMS-International Free census samples (microdata) for researchers and policy makers: * * * Robert McCaa, Minnesota Population.
Data Projects at the Minnesota Population Center Resources for Comparative Population and Health Research Seattle, Washington May 22, 2014 Elizabeth Boyle,
Presentation by David Yenukwa Kombat Ghana Statistical Service 25 March, 2014.
For Needs Assessment Conference on Census Analysis in Asia and the Pacific, November 2011, Bali, Indonesia The Experience of Census Data Analysis.
Increasing the Effectiveness of Household Surveys Eric Swanson, Program Manager for Global Monitoring Development Data Group World Bank.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis Amman - Jordan 16 – 19 May 2011 Determination of the scope and form of.
IPUMS Microdata Relation to head Marital status Literacy Occupation.
 Background Data harmonization Data output  Web: Variable documentation system  Web: Data extract system IPUMS Dissemination System.
1 The United Nations Demographic Yearbook and the Work Programme for Social Statistics Expert Group Meeting to Review the United Nations Demographic Yearbook.
Integrated Public Use Microdata Series IPUMSwww.ipums.org Matt Sobek Minnesota Population Center
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
TerraPop Mission Enabling research, learning, and policy analysis by providing integrated spatiotemporal data describing people and their environment.
African Centre for Statistics United Nations Economic Commission for Africa Report on 2010 Round of Population and Housing Census in Africa Statistical.
Workshop on Collection and Dissemination of Socio-economic Data from Population and Housing Censuses New Delhi, India, May 2012 United Nations Demographic.
IPUMS-International Process Matt Sobek Minnesota Population Center
Methods of Statistical Analysis and Dissemination of Census Results in Guyana MORGAN CLITUS DIAS SENIOR CARTOGRAPHER BUREAU OF STATISTICS GEORGEOWN,GUYANA.
Census Office Fernando Casimiro Geneva, July 2010 Portugal – Census results tailored to user needs «
Palestinian Central Bureau of Statistics Best Practices in Data Dissemination Public Use Data files (PUF) Ola Awad, Acting President March, 2010.
Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center
Meeting of the Group of Experts on Population and Housing Censuses Geneva, 23 – 26 September 2014 Principles and Recommendations for Population and Housing.
United Nations Workshop on Revision 3 of Principles and recommendations for Population and Housing Censuses and Census Evaluation Amman, Jordan, 19 – 23.
Integrated Public Use Microdata Series IPUMSwww.ipums.org.
Lessons Learned from the production of Gridded Population of the World Version 4 (GPW4) Columbia University, CIESIN, USA EFGS October 2014.
CENSUS MICRODATA : THAILAND NATIONAL STATISTICAL OFFICE by PAKAMAS RATTANALANGKARN Thailand National Statistical Office.
Data Processing Hollerith 1921
Collaboration and Outreach
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
Census Bureau – Fernando Casimiro, Coordinator
CyberGIS: Reston, VA, September 22, 2018
Central Statistics Organization
Terra Populus Data Domains
IPUMS-International Integration Process
Sub-regional workshop on integration of administrative data, big data
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
The role of metadata in census data dissemination
Recommended Tabulations of the Principles and Recommendations for Population and Housing Censuses, Rev. 2 Session 4 United Nations Statistics Division.
Presentation transcript:

Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and international experiences with microdata dissemination and public use New York 11 April 2016

 World’s largest archive of population data  Individual-level microdata describing ~3 billion persons enumerated in 100 countries  IPUMS model:  Harmonize variables codes across data collection  Integrate documentation  Web dissemination  Free to the research and policy community IPUMS Data Integration Projects

IPUMS Data Dissemination, Gigabytes per week

Outline  International IPUMS projects  IPUMS-International  Integrated DHS  Terra Populus  Spatial data integration  Implications for SDGs

IPUMS-International Coverage Over 100 Collaborating National Statistical Agencies

 82 countries  277 censuses  614 million person records  Two-thirds of samples from developing countries Microdata Currently Disseminated

IPUMS Samples per Country Argentina5Fiji5Malawi3Senegal2 Armenia1France7Malaysia4Sierra Leone1 Austria4Germany4Mali2Slovenia1 Bangladesh3Ghana2Mexico7South Africa3 Belarus1Greece4Mongolia2South Sudan1 Bolivia3Guinea2Morocco3Spain3 Brazil6Haiti3Nepal1Sudan1 Burkina Faso3Hungary4Netherlands3Switzerland4 Cambodia2India5Nicaragua3Tanzania2 Cameroon3Indonesia9Nigeria5Thailand4 Canada4Iran1Pakistan3Turkey3 Chile5Iraq1Palestine2Uganda2 China2Ireland9Panama6Ukraine1 Colombia4Israel1Peru2UK2 Costa Rica4Italy1Philippines3USA7 Cuba1Jamaica3Portugal3Uruguay6 Dominican Republic5Jordan1Puerto Rico5Venezuela4 Ecuador6Kenya5Romania3Vietnam3 Egypt2Kyrgyz Republic2Rwanda2Zambia3 El Salvador2Liberia2Saint Lucia2

IPUMS Samples per Country Argentina5Fiji5Malawi3Senegal2 Armenia1France7Malaysia4Sierra Leone1 Austria4Germany4Mali2Slovenia1 Bangladesh3Ghana2Mexico7South Africa3 Belarus1Greece4Mongolia2South Sudan1 Bolivia3Guinea2Morocco3Spain3 Brazil6Haiti3Nepal1Sudan1 Burkina Faso3Hungary4Netherlands3Switzerland4 Cambodia2India5Nicaragua3Tanzania2 Cameroon3Indonesia9Nigeria5Thailand4 Canada4Iran1Pakistan3Turkey3 Chile5Iraq1Palestine2Uganda2 China2Ireland9Panama6Ukraine1 Colombia4Israel1Peru2UK2 Costa Rica4Italy1Philippines3USA7 Cuba1Jamaica3Portugal3Uruguay6 Dominican Republic5Jordan1Puerto Rico5Venezuela4 Ecuador6Kenya5Romania3Vietnam3 Egypt2Kyrgyz Republic2Rwanda2Zambia3 El Salvador2Liberia2Saint Lucia2

 geographic location (places of 20,000+ persons in most samples)  assets and utilities: water supply, sewage, toilet, electricity, mobile telephones, Internet  building materials—floor, roof, etc.  educational attainment, literacy, school enrollment  economic activities, unemployment, disabilities  fertility history and child mortality Many IPUMS variables are relevant to SDGs Microdata: custom analyses tailored to local conditions

Dissemination

Select Samples

Select Variables

 12,000 approved users  70,000 custom data extracts IPUMS Usage

IPUMS Users – Region of Residence

IPUMS Samples Extracted, by Region

Web Portal: UNECA, Addis Ababa

IPUMS On-line Tabulator

3.1 million cases in one second

Physical Data Recovery: Old Media

Darfur 1973

Bangladesh 1981

Metadata Preservation

Collaboration of IPUMS, DHS Program, USAID Integrated Demographic and Health Surveys

Motivation: DHS is incredibly valuable, but it’s hard to capitalize on its full potential. Problems:  Data discovery  Dispersed documentation  Data management  Variable changes over time and between countries Why an Integrated DHS?

Integrated DHS Coverage

Three Source Data Formats Microdata Characteristics of individuals and households Small-area data Characteristics of places defined by administrative boundaries Raster data Values tied to spatial coordinates

Location-Based Integration Microdata Area-level dataRasters Mix and match variables originating in any of the data structures Obtain output in the data structure most useful to you

Location-Based Integration Attach land cover and climate data to individuals Microdata Area-level data: Summarize rasters Rasters: Environmental data

 To analyze change over time within countries  To combine data sources at the subnational level Spatial Data Integration

Spatial Integration – Digitize Maps

Spatial data integration Buenos Aires province, Argentina

Spatial data integration Some units must be merged or split to make footprint consistent over time

Global harmonized boundaries Harmonized 1st-level boundaries for all countries released 2014

Global harmonized boundaries Harmonized 2nd-level boundaries, in process

Ratio of literate women to men, years old Source: Cuesta and Lovatón (2014) 1990 Census round Millennium Development Goals

Census 1993 Census 2005 Colombia: Adolescent Birth Rate

Millennium Development Goals Source: Cuesta and Lovatón (2014) Data Source: IPUMS-International, Minnesota Population Center Census 1993 Census 2005 Colombia: Adolescent Birth Rate

Source: IPUMS-International, Minnesota Population Center Percentage of the urban population living in slums Mexico, 2000 Sustainable Development Goals

Source: IPUMS-International, Minnesota Population Center Percentage of the urban population living in slums Mexico, 2010

Sustainable Development Goals Men Percentage of the population that owns of mobile phone by sex, Ghana 2010 Women

Conclusions Data integration is expensive, but it saves money in the long run, reduces the potential for error, and simplifies replication Administrative and survey data to assess development goals should be centrally integrated at the individual level wherever possible We need consistent geographic units over time to make sub-national estimates of change Sub-national estimates of change are essential for identifying places where progress has stalled and more resources are needed

Thank You! Matt Sobek

United States 1960