Www.ipums.org/international1 IPUMS-Eurasia, 2003-2007: Preserving Eurasian census microdata, making them useful, and promoting their use * * * Robert McCaa,

Presentation transcript:

IPUMS-Eurasia, 2003-2007: Preserving Eurasian census microdata, making them useful, and promoting their use
Robert McCaa, Steven Ruggles, Matthew Sobek, Deborah Levison and Miriam King
University of Minnesota Population Center

IPUMS-Eurasia before Europe? If so, the following needs to be done now:
» Official:
  » Formalize agreement
  » Release 1989 & 1994 samples for project development
» Unofficial, agree upon:
  » Sample density: entire long-form preferred; 10% OK
  » License fee: $$$ proportional to sample density
  » Division of tasks (provisional): equitable
  » Calendar (provisional): begin in 2003
  » 1989 sample: OK? Or will a new one be drawn?
  » 1979 and 1970: do any microdata tapes still exist?

"…official statistics that meet the test of practical utility are to be compiled and made available on an impartial basis by official statistical agencies to honor citizens’ entitlement to public information." -- UN Statistical Commission, 1994
Widespread Internet Technology diffusion is “a pre-requisite for the development of civil society based on free access to information through the global Internet.” -- President Putin, March 6,

INTERNATIONAL IPUMS
Imagine a new statistical product: scientifically anonymized, integrated census microdata samples made up of unidentifiable individuals...
» Easy-to-use web-interface
» Highest scientific standards
» Proven, powerful integration
» A quantum leap in usage
» 1998: 1 country signed
» 1999: 3 countries
» 2000: 9
» 2001: 15
» 2002: 32; first release, 6 countries

IPUMS-EURASIA: Before Europe?
Eurasia Phase: advantages of a Eurasia phase, before Europe
» Statistical coherence of 1989/2000 censuses
» Readily organizable
» 12 countries, not 40
» One linguistic standard: Russian
Progress on negotiating agreements
» Technical OKs: Belarus, Moldova Republic
» Negotiating: Armenia, Azerbaijan Republic, Georgia, Kazakhstan, Kyrgyz Republic, Russia, Tajikistan, Turkmenistan, Ukraine, Uzbekistan
» Not participating: none, as yet.

BENEFITS IPUMSi
» Researchers, world-wide:
  » free, high quality data
  » harmonized, comprehensive
» National Statistics Institutes:
  » increased usage
  » enhanced cost-benefit ratio
  » payment for license fees, expertise
» People:
  » who we are
  » what the future may bring
  » how policies might improve

Integrated Public Use Microdata Series - International
IPUMS-International, a global collaboratory of National Statistical/Research Institutes:
» 1. Inventories the world’s census microdata
» 2. Preserves endangered microdata and documentation
» 3. Integrates datasets of selected countries using UNSD, Eurostat and other standards
» 4. Anonymizes census microdata to preserve statistical confidentiality, using highest standards
» 5. Disseminates customized extracts free of charge (with complete copies on CDs to all partners)

PARTNERS IPUMSi
Phase 1:
Brazil: 1960, 1970, 1980, 1991, 2001
Colombia: 1964, 1973, 1985, 1993, 2003
Mexico: 1960, 1970, 1980, 1990, 2000
France: 1962, 1968, 1975, 1982, 1990
Hungary: 1970, 1980, 1990, 2000
Spain: 1981, 1991, 2001
Kenya: 1989, 1999
Ghana: 1984, 2000
China: 1982, 1990, 2000
Vietnam: 1989, 1999
USA , ,

IPUMS-Latin America, : 16 countries, ~500m. people
» Scope: Latin American census microdata, 1960-present
» Work Plan (funded by National Institutes of Health)
  » : Sign licensing agreements with official agencies
  » 2002: Obtain funding from U.S. NIH
  » 2003: Develop/translate microdata & metadata
  » 2004: Country expert teams design national integrations
  » 2005: MPC/expert teams design regional integration
  » 2006: MPC integrates microdata and metadata
  » 2007: MPC disseminates to bona fide researchers who sign non-disclosure license. National census/research institutes via CDs/web.

PARTNERS IPUMS-EUROPE
Europe Phase:
Phase 1 European partners:
» INSEE-France: 1962, 1968, 1975, 1982, 1990
» CSO-Hungary: 1970, 1980, 1990, 2000
» INE-Spain: 1981, 1991, 2001
Phase 2, :
» 10 OK: Austria, Bulgaria, Czech Republic, Germany, Ireland, Lithuania, Poland, Romania, Slovenia, UK
» 5 Approval pending: Finland, Iceland, Israel, Norway, Portugal
» 11 Negotiating: Belgium, Denmark, Greece, Italy, Latvia, Netherlands, Russia, Sweden, Switzerland, Turkey, Yugoslavia
» 2 Not participating: Estonia, Slovakia

PRESERVES IPUMSi
UN Demographic Center for Latin America (CELADE, Santiago, Chile): ~3000 microdata tapes preserved, and metadata (documentation)

Census microdata of the late 20th century: Who will preserve them? Who will make them useful? Census microdata: Public goods should be democratized. Censuses are costly. Where microdata are available, they are used.

SAMPLES IPUMSi

PAYS IPUMSi
National experts are paid to:
» Assemble microdata and documentation
» Develop samples
  » to minimize confidentiality risks
  » and to maximize robustness
» Design national/regional integration plan
  » census-by-census
  » concept-by-concept
  » code-by-code
» Write integrated documentation
National Statistical Institutes are paid a non-exclusive license fee for integrated data

INTEGRATES IPUMSi
Standard: UN/Eurostat Principles & Recs...
Photos from Colombia integration project, February-March, 2000: 4 experts from DANE (census office) + 7 academics (3 universities)
Census documentation compiled for Colombian microdata

IPUMSi integration principles
» 1. Respect absolute anonymity and confidentiality
» 2. Preserve all original data, except adjustments to ensure privacy (top codes, blurrings, masking, reordering, etc.)
» 3. Harmonize codes using international standards
  » occupation: ISCO, HISCO (detailed, general)
  » education: ISCED (detailed, general)
  » family: IPUMS, etc. (detailed, general)
» 4. Enhance with constructed variables
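Principles 2 and 3 can be illustrated with a small sketch of a composite code: the leading digit carries a general category that is comparable across samples, and the trailing digits keep whatever national detail a census recorded. The sample names, national codes and composite values below are hypothetical, chosen only for illustration; they are not the project's actual coding scheme.

```python
# Hypothetical crosswalk: (sample, national code) -> composite harmonized code.
# First digit  = general category, comparable across all samples.
# Other digits = national detail, kept only where the census recorded it.
MARITAL_CROSSWALK = {
    ("sample_a", 1): 100,  # single, never married
    ("sample_a", 2): 200,  # married (this census records no further detail)
    ("sample_a", 3): 300,  # widowed
    ("sample_b", 1): 100,  # single, never married
    ("sample_b", 2): 210,  # married, civil
    ("sample_b", 3): 220,  # married, consensual union
    ("sample_b", 4): 300,  # widowed
}
UNKNOWN = 999  # assumed residual code for values missing from the crosswalk


def harmonize(sample, national_code):
    """Translate a national code into the composite code."""
    return MARITAL_CROSSWALK.get((sample, national_code), UNKNOWN)


def general_category(composite_code):
    """Collapse a composite code to its cross-nationally comparable first digit."""
    return composite_code // 100
```

A researcher who needs comparability would work with `general_category(...)`, while one studying a single country could keep the full composite code and lose no original detail.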

Variable availability, preliminary release

Composite coding scheme example: marital status

Occupation: the ISCO standard
» preliminary release: 1 digit
» final: 2-3 or 4 digits, depending upon country

ANONYMIZES IPUMSi
» Suppress geographical detail
» Blur/aggregate sensitive codes
» Convert dates to ages (blur key vars.)
» Swap cases between districts
» Scramble records
Using the highest standards available: administrative (license), legal, and technical (US Census Bureau, Eurostat, & others)
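Three of the measures listed on this slide (suppressing geographical detail, converting dates to ages, and top coding) can be sketched in a few lines. The field names, the age cap of 95 and the census date used in the example are assumptions made for illustration, not the project's actual rules.

```python
from datetime import date

# Hypothetical names of fine-grained geographic fields to drop entirely.
SUPPRESSED_FIELDS = {"division", "location", "sublocation", "enumeration_area"}


def age_at(birth, census_day):
    """Completed years of age on census day."""
    years = census_day.year - birth.year
    if (census_day.month, census_day.day) < (birth.month, birth.day):
        years -= 1  # birthday not yet reached in the census year
    return years


def top_code(value, cap):
    """Collapse all values above the cap into the cap itself."""
    return min(value, cap)


def anonymize(record, census_day, age_cap=95):
    """Return a copy with geography suppressed and birth date replaced by age."""
    out = {
        key: value
        for key, value in record.items()
        if key not in SUPPRESSED_FIELDS and key != "birth_date"
    }
    out["age"] = top_code(age_at(record["birth_date"], census_day), age_cap)
    return out
```

For example, `anonymize({"birth_date": date(1950, 3, 1), "sex": 2, "enumeration_area": "0412"}, date(1989, 1, 15))` keeps `sex`, drops the enumeration area and the birth date, and adds an `age` field. Swapping cases between districts and scrambling record order are not shown here.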

‘statistical confidentiality’ shall mean the protection of data related to single statistical units which are obtained directly for statistical purposes or indirectly from administrative or other sources against any breach of the right to confidentiality. It implies the prevention of non-statistical utilization of the data obtained and unlawful disclosure. --COUNCIL REGULATION (EC) No 322/97 of 17 February 1997

Anonymization plan: Kenya, 1989
Kenya: Anonymization Based on Unique Characteristics Threshold (100,000 for geographic variables; 10,000 for other variables)
Type | Procedure | Variable Name
Key | Suppressed | Division, Location, Sublocation, Enumeration area
Key | Aggregated (100,000 minimum) | Province, District of Residence, Birth and Past Residence
Key | None | Sex, Marital Status, Relationship to Head
Sensitive | Aggregated (10,000/1,000 minimum) | Tribe/Ethnicity, Occupation, Employment Status
Transitory (information is considered too changeable to be used to identify individuals from microdata) | None | Age, Urban/Rural Residence, Literacy, Educational Status, Educational Level, Labor Activity, Children Everborn/Alive/Dead, Last Birth Year, Mortality variables
Note: For greater detail and a reproduction of the 1989 enumeration form, see Appendix 3.
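The threshold rule in this plan can be sketched as follows: a category whose estimated population falls below the minimum is not released under its own code. This sketch assumes that small categories are folded into a single residual code and that each record carries a sampling weight; both are assumptions made for illustration, and the actual plan may instead merge small units with neighbouring ones.

```python
from collections import Counter


def aggregate_small(values, weights, threshold, residual="other"):
    """Recode categories whose weighted population is below the threshold.

    values    -- one category code per record
    weights   -- number of people each record represents (sampling weight)
    threshold -- minimum population a category needs to be released as-is
    """
    totals = Counter()
    for value, weight in zip(values, weights):
        totals[value] += weight
    return [value if totals[value] >= threshold else residual for value in values]
```

With the thresholds quoted above, a geographic variable would be passed with `threshold=100_000` and a sensitive variable such as occupation with `threshold=10_000`.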

EUROSTAT statistical anonymity standards (Thorogood, 1999) -- all used by IPUMS-International
» 1. small sample size
» 2. limited geographical detail
» 3. top and bottom coding of unique categories
» 4. signed non-disclosure agreement
» 5. prohibit redistribution of datasets to third parties
» 6. prohibit attempts to identify individuals or making any claim to that effect
» 7. require users to provide copies of publications

EUROSTAT statistical anonymity standards (Thorogood, 1999) -- all used by IPUMSi, and more
» 8. Age (constructed, where necessary)
» 9. Never identify date of birth
» 10. Never identify place of birth
» 11. Migration: timing and place not identified in detail
» 12. Place of residence identified by major civil division (pop>60k, 120k, 250k, 1 million--national rule)
» 13. Sensitivity analysis of variables by national experts
» 14. Confidentiality assessment by national experts

International Monetary Fund’s General Data Dissemination System: 52 countries with uniform standards
» All embrace strict standards of statistical confidentiality
» All prohibit disclosure of information which may identify individuals or entities
» And 37 of 52 countries distribute census microdata samples
» Why not Russia, Armenia, Azerbaijan Republic, Belarus, Georgia, Kazakhstan, Kyrgyz Republic, Moldova Republic, Tajikistan, Turkmenistan, Ukraine, or Uzbekistan?

DISSEMINATES IPUMSi
Legally-binding license agreement
» protects privacy and confidentiality
» assures proper use
» new sanction: loss of employment
Web-based extraction system: researcher selects
» Countries,
» Censuses,
» Cases/sub-populations,
» Variables, and
» Sample densities
-- makes chronological &/or cross-national research possible
Open architecture software and mirror sites
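The five choices a researcher makes when requesting an extract can be sketched as a single filtering function. This is only an illustration of the idea, not the web system's interface: the function name, its parameters, the record layout and the use of a simple every-nth-case rule for sample density are all assumptions.

```python
def make_extract(records, countries, years, variables, keep_every=1, case_filter=None):
    """Build a customized extract from a list of person records (dicts).

    countries, years -- which samples to include
    variables        -- which fields to keep in each output record
    keep_every       -- density control: keep every nth selected case
    case_filter      -- optional predicate selecting a sub-population
    """
    selected = [
        record
        for record in records
        if record["country"] in countries
        and record["year"] in years
        and (case_filter is None or case_filter(record))
    ]
    thinned = selected[::keep_every]  # systematic thinning, first case always kept
    return [{name: record.get(name) for name in variables} for record in thinned]
```

Pooling several countries and census years in one call is what makes chronological and cross-national comparisons possible from a single extract.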

IPUMS-Eurasia, 2003-2007: 12 countries, >280 m. people
» Scope: Eurasia census microdata, 1989-present
» Work Plan (contingent upon funding):
  » Jan 2003: Sign licensing agreements with official agencies
  » Nov 2003: Obtain funding from US NIH
  » 2004: Pay licenses/sign contracts to develop/translate microdata & metadata
  » 2005: Country expert teams design national integrations
  » 2006: MPC/expert teams design Eurasia integration
  » 2007: MPC integrates microdata and metadata
  » 2008 and beyond: MPC disseminates to bona fide researchers who sign non-disclosure license. National census/research institutes disseminate via CDs/web.

On a millennial scale, censuses and census microdata survive for only a short, but significant period

IPUMS-Eurasia, 2003-2007: What needs to be done now?
» Official:
  » Formalize agreement
  » Release 1989 & 1994 samples for project development
» Unofficial, agree upon:
  » Sample density: entire long-form preferred; 10% OK
  » License fee: $$$ proportional to sample density
  » Division of tasks (provisional): equitable
  » Calendar (provisional): begin in 2003
  » 1989 sample: OK? Or will a new one be drawn?
  » 1979 and 1970: do any microdata tapes still exist?

additional information at: contact: * * * * * Thank you