* * * Robert McCaa and Albert Esteve Palos www.ipums.org/international www.iecm-project.org IPUMS-International and Integrated European Census Microdata.

Slides:



Advertisements
Similar presentations
Programme: 145 sessions & social events
Advertisements

Disseminating census microdata: the IPUMS and IECM experiences, (and plans for beyond) * * * Robert McCaa and Albert Esteve Minnesota Population.
Albert Esteve and the IECM-project team Centre d’Estudis Demogràfics Universitat Autònoma de Barcelona T HE I NTEGRATED E UROPEAN.
Controlled shuffling experiment: detailed 10% sample of 2011 census of Ireland - Risk, confidentiality and utility Presenter: Robert McCaa,
Magician or Math-a-magician?. Math Magic Math Magic – Trick #1 Pick a number… any number! (keep it a secret though) Add 1 to that number Multiply by.
How IPUMS Harmonizes Microdata Data Sources and Bibliography Data Sources: Original census data are contributed to the IPUMS- International project by.
UNIVERSITY OF JYVÄSKYLÄ INTERNATIONAL COOPERATION.
Cordell Golden United States Tenth Meeting of the Washington Group on Disability Statistics 3-5 November 2010 Luxembourg Summary of Annual Activities Related.
Welcome IPUMS/IECM-Europe Workshop: Accomplishments, plans and challenges * * * Robert McCaa, Professor of.
IPUMS workshop * * * Robert McCaa, Professor of Population History University of Minnesota additional information.
Census 2000 symposium, session 4 paper 261 Archiving Census Documentation and Microdata: Preserving Memory, Increasing Stakeholders * * * Wendy L. Thomas.
Using a restricted-access web-site of anonymized, integrated census microdata (for 1, 2, 3, 4,
1 Assortative Mating Patterns in the Developing World Albert Esteve* and Robert McCaa** Presented by: Sula Sarkar** * Centre d ’ Estudis Demogr à fics.
A proposal to preserve, integrate and manage access to anonymized census samples of the Official Statistical Agencies of the Arab States in cooperation.
6. Managing access to IPUMS integrated census microdata “extracts” (13 slides)
Hist.umn.edu/~rmccaa/ipums-europe1 Sister-project: IPUMS-Latin America: 17 countries, ~500 million pop., 5 census rounds 80+ samples, 100+ million person.
54th ISI, Berlin IPUMS-International: A Restricted Access Web-Site Providing Anonymized, Integrated Census Microdata.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
Statistical confidentiality and privacy. 2. Case study: IPUMS-International * * * Robert McCaa Minnesota Population Center.
The IPUMS-International dynamic metadata system * * * Robert McCaa, Professor of Population History University of Minnesota.
IPUMS-EurAsia, : Changing Patterns of Microdata Use * * * Robert McCaa, Professor of Population History University.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota.
The IECM project: Integrating the European Census Microdata IECM team* *A. Cabré, A. Esteve, J.Garcia, T. López, M. Valls PROJECT.
IPUMS-International: August * * * Robert McCaa, Professor of Population History University of Minnesota
Indigenous peoples, ethnicity and identities in contemporary censuses: A global perspective source: *
Delegations III KAM, Bratislava 4th to 8th September 2013.
United Nations CensusInfo User Application Training Workshop, Cairo, Egypt, October World Population and Housing Census Programme United.
Harmonizing the World’s Census Microdata: The IPUMS Project Matt Sobek Minnesota Population Center
Do Now 12/5/14 1.Open Binder 2.On a Fresh sheet of paper at the TOP write: 3.DO NOW and the Date LIST as many countries as you can in that make up the.
Reichstag, 1945 Frankfurter Allee, 1945 A Climate for Radical Change:
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
Delegations IV KAM Prague 3rd to 7th September 2014.
Hist.umn.edu/~rmccaa/ipums-europe1 IPUMS-Europe, : Restricted-access, anonymized microdata for scientific and policy research * * * Robert McCaa,
Where it all starts - RESEARCH LXIV International Council Meeting Opatija, Croatia October 28 th - November 3 rd 2013.
Institutional Visits ICM Cluj Napoca, 19 th to 26 th April 2015 Patrick Zischeck, Assistant for IV and SV.
Entrusting census microdata and metadata for timely integration and dissemination via the IPUMS-EurAsia and IECM initiatives, * * * Robert McCaa,
IS Studies Accreditation: Problems and Challenges Janice C. Sipior, Ph.D. Professor of MIS Department of Accountancy & IS Villanova School of Business.
Area Definition III KAM,Bratislava. The European Law Students’ Association Albania ˙ Austria ˙ Azerbaijan ˙ Belgium ˙ Bosnia and Herzegovina ˙ Bulgaria.
OECD Review of Russian Statistics Peer Review Mission to Russia April 2012 Tim Davis Head, Global Relations, Statistics Directorate.
(R14) (R14) European Culture: Language(s). Today’s Standard describecultural characteristicsEurope SS6G11 The student will describe the cultural characteristics.
ELSA Law Schools ICM Cluj-Napoca, 21st April 2015.
Statistical Coherence: Census Hub Hypercubes and IPUMS Microdata UNECE Expert Group on Population and Housing Censuses Geneva, September 2014 Lara.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
* IPUMS-International * Using Integrated unit records for demographic and health research: Local, regional, national, and international * * * Robert McCaa,
IPUMS-International Free census samples (microdata) for researchers and policy makers: * * * Robert McCaa, Minnesota Population.
Make it Smart&Creative ICM Cluj-Napoca, 21st April 2015.
Trans-Border access to Census Microdata: The IPUMS-IECM partnership * * * Robert McCaa and Albert Esteve Palós “You have to.
Doing Business in Europe Bay Area CITD Seminar Series Tuesday, September 21st, 2004 Kemarra Inc. - Key Marketing Resources & Associates San Francisco USA.
Integrated census microdata: a valuable, virgin source for statistical analysis of internal and international migration See handouts: 1. Card for list.
IPUMS Microdata Relation to head Marital status Literacy Occupation.
Integrated Public Use Microdata Series IPUMSwww.ipums.org Matt Sobek Minnesota Population Center
EXTREME MAKEOVER Members’ Magazine LXIV International Council Meeting Opatija, Croatia October 28 th - November 3 rd 2013.
Computer Class – Summer 20092/21/2016 3:45 AM European Countries Albania Andorra Austria Belarus Belgium Bosnia and Herzegovina Bulgaria Croatia Czech.
Geography Review On Map 1, please identify: -Spain -France -England -Russia -Ottoman empire -Persia -China -Mughal India -Songhai Empire.
The European Law Students’ Association Albania ˙ Austria ˙ Azerbaijan ˙ Belgium ˙ Bosnia and Herzegovina ˙ Bulgaria ˙ Croatia ˙ Cyprus ˙ Czech Republic.
1. Introduction 2. Background 3. Funding framework 4. EU participation 5. Timetable 6. Progress report 7. Future plans I ntegrating the E uropean C ensus.
Robert McCaa Antonio López Gay Representing IPUMS – International Project Minnesota Population Center / University of.
The EFGS project a common voluntary cooperation Vilni Verner Holst Bloch President of European Forum for Geostatistics Statistics Norway EFGS 2014 conference.
France Ireland Norway Sweden Finland Estonia Latvia Spain Portugal Belgium Netherlands Germany Switzerland Italy Czech Rep Slovakia Austria Poland Ukraine.
Best Sustainable Development Practices for Food Security UV-B radiation: A Specific Regulator of Plant Growth and Food Quality in a Changing Climate The.
SAP Digital Business Services June 2016
Integrating the European Census Microdata
Welcome IPUMS/IECM-Europe Workshop: Accomplishments, plans and challenges * * * Robert McCaa, Professor.
Press <F5> key to start presentation
2. Applying for Access (10 slides)
“Integrating Microbial Knowledge into Human Life”
Major causes of stress Global GfK survey November 2015.
LAMAS Working Group June 2018
Presentation transcript:

* * * Robert McCaa and Albert Esteve Palos IPUMS-International and Integrated European Census Microdata Projects Reduce Risks of Managing Trans-border Access and Add Significant Value * * * Robert McCaa and Albert Esteve Palos Minnesota Population Center and Centre d’Estudis Demografics--Barcelona

“ Dissemination [means] opening up the value inherent in our data.” -- Walter Radermacher and Pieter Everaers Seminar on Emerging Trends in Data Communication and Statistics, UNSC, New York, Feb. 19, 2010 *

Trans-Border access is essential in 21 st Century. Many researchers (e.g., demographers, members of IUSSP) reside outside their country of birth New Zealanders60% reside outside country of birth New Zealanders60% reside outside country of birth Dutch 40% Dutch 40% Germans 38% Germans 38% Danes34% Danes34% Chinese30% Chinese30% Belgians31% Belgians31% British25% British25% Australians22% Australians22% Canadians, Finns, French, Japanese, Swiss, etc. ~20% Canadians, Finns, French, Japanese, Swiss, etc. ~20% Limiting access to in-country is old-fashioned, inefficient, costly, & unfair. Encourages violations, brain drain.

IPUMS-International dark green = anonymized, harmonized and disseminating (69 countries, 212 censuses, 480 millon person records) medium green = to be integrated (29 countries, 75 censuses, ~100 mpr) Mollweide projection IPUMS-International: 2012 (weighted by population size) 2012 launch: El Salvador (2) Indonesia (9) Mexico (2010) Morocco (3) Nicaragua (3) Turkey (3) Uruguay (5) Work began in By 2020 we hope to integrate census microdata of 100 countries, including 2010 round censuses.

IPUMS-International dark green = anonymized, harmonized and disseminating (17 countries, 56 censuses, 93 millon person records) medium green = to be integrated (2 countries, 6 censuses, ~5 mpr) Mollweide projection IECM/ IPUMS-Europe: 2012 (weighted by population size) Countries not yet participating are invited to consider doing so: Albania, Belgium, Bosnia-H, Croatia, Denmark, Estonia, Finland, Iceland, Latvia, Lithuania, Moldova R., Norway, Russia, Serbia, Slovak R., Sweden, etc.

NSOs that disseminate microdata by “going it alone” incur significant risks, substantial costs, & much user dissatisfaction NSOs that disseminate microdata by “going it alone” incur significant risks, substantial costs, & much user dissatisfaction I. IPUMS & IECM offer a “one-stop” comprehensive solution to managing access to census microdata II. Statistical Confidentiality and Security III. Integration IV. Manage trans-border access V. Conclusion: Invitation to cooperate, entrust 2010 round census microdata as soon as feasible. Outline: IPUMS-International & IECM Outline: IPUMS-International & IECM Reduce Risks of Managing Trans-border Access and Add Significant Value

I. One-stop, comprehensive solution to disseminating census microdata & metadata… of Europe and the world 1. OrganizeUniform agreement with each NSO 2. AdministerWe manage approval/denial of user access 3. AnonymizeWe are responsible for data anonymization 4. IntegrateWe do the work Metadata Official language and integrated in English MicrodataIntegrated globally & optimized for Europe 5. DisseminateExtracts, custom-tailored to each request 6. ShareWe share: results, comprehensive electronic bibliography No longer enough to prepare a CD or post a dataset on a web-site

II. Statistical Confidentiality and Security A. Microdata security and confidentiality protections Employees face fines, job loss, and possible imprisonment for violations Employees face fines, job loss, and possible imprisonment for violations Security: “best practice” – Dennis Trewin, ex Aus. Stat. Security: “best practice” – Dennis Trewin, ex Aus. Stat. B. Statistical disclosure control protections: Suppression of records using sub-sampling, names, low- level geography, unique variates, Suppression of records using sub-sampling, names, low- level geography, unique variates, Paired swapping of geographical identifiers of households to create uncertainty Paired swapping of geographical identifiers of households to create uncertainty Top/bottom coding, global recodes, deletion of digits, etc. Top/bottom coding, global recodes, deletion of digits, etc. C. Managing restricted access to microdata (next slide)

II. Statistical Confidentiality and Security (cont’d.) A. Microdata security and confidentiality protections B. Statistical disclosure control protections: C. Managing restricted access to microdata Detailed registration form to establish bona-fides Detailed registration form to establish bona-fides 4/5ths of viewers do not complete the form! --automatic denial 4/5ths of viewers do not complete the form! --automatic denial Conditions of use bind researcher & institution; violations penalize every researcher at institution Conditions of use bind researcher & institution; violations penalize every researcher at institution Custom-tailored extracts encourage researchers to jealously guard their downloads. Custom-tailored extracts encourage researchers to jealously guard their downloads. More than 5,000 researchers approved for access More than 5,000 researchers approved for access

III. Integration: Metadata & Microdata D. Comprehensive source metadata in official language(s) Questionnaires, instructions, manuals, etc. Questionnaires, instructions, manuals, etc. E. Integrated, DDI compatible metadata: definitions, concepts, variable names, value labels, codes--all link back to sources Descriptions of censuses and samples, Descriptions of censuses and samples, Variables defined, comparability discussions, Variables defined, comparability discussions, Example: educational attainment (next slide) Example: educational attainment (next slide) F. Integrated, pooled microdata: multiple censuses in a single file G. Integrated boundary files (GIS) linked to microdata H. IPUMS value added variables

Example of composite coding: Educational attainment

III. Integration: Metadata & Microdata (cont’d.) D. Comprehensive source metadata in official language(s) E. Integrated, DDI compatible metadata: definitions, concepts, variable names, value labels, codes--all link back to sources F. Integrated, pooled microdata: many censuses in single file G. Integrated boundary files (GIS) linked to microdata H. IPUMS value added variables: Technical variables: weights, identifiers Technical variables: weights, identifiers Family, household info: summary indicators Family, household info: summary indicators Person variables: Locations of mother, father, spouse and rules for linking (momloc, poploc, sploc) Person variables: Locations of mother, father, spouse and rules for linking (momloc, poploc, sploc)

IV. Managing Trans-border Access I. Trans-border access: uniform experience for access to all countries, regardless of nationality J. Custom-tailored extracts: user selects country(ies), censuses, variables, sub-populations Extract engine fulfills request, generates custom-tailored microdata and metadata Extract engine fulfills request, generates custom-tailored microdata and metadata 3 unique IPUMS extract tools: 3 unique IPUMS extract tools: 1. Select cases 2. Attach characteristics 3. Customize sample size K. Usage: 8,048 extracts in 2011; 40,142 samples. See next page.

Disclosure Controls for Trans-Border access to Census Microdata via a Single License, Access Point: The IPUMS-IECM partnership * * * Robert McCaa and Albert Esteve Palos “You have to do due diligence, something to assure yourself that the people you’re giving your data to can be trusted.” -- Disclosure Controls for Trans-Border access to Census Microdata via a Single License, Access Point: The IPUMS-IECM partnership * * * Robert McCaa and Albert Esteve Palos Minnesota Population Center and Centre d’Estudis Demografics--Barcelona “You have to do due diligence, something to assure yourself that the people you’re giving your data to can be trusted.” IPUMS-International Google Analytics: 2011 Trans-Border Access: 169 countries/territories 3,033 cities, 45,000 page views. Up 4X from 2010

Table 2. Rank of the Top Five and all European Countries plus Canada and the USA by Number of Extracts for the 2000 round census (statistics for calendar year 2011) RankCountry Sample %* Variables (n)*Years of census samplesExtracts 1Brazil , 70, 80, 91, Mexico p, 70, 90, 95, 2000, United States , 70, 80, 90, 2000, Colombia p, 72, 85, 93, South Africa , 2001, Canada p, 81p, 91p, 2001p409 9France , 68, 75, 82, 90, 99, Spain , 91, Greece , 81, 91, Austria , 81, 91, Italy Portugal , 91, Romania , 92, Switzerland , 80, 90, United Kingdom , 2001p263 38Hungary , 80, 90, The Netherlands p, 71p, 2001p211 45Slovenia Belarus Total samples extracted for 55 countries (162 samples) available from January 1, ,048 *2000 round census; refers to all integrated variables, including IPUMS constructed variables. “p” = person sample; all other samples are of households 15

IECM value-added (in beta test): Password protected, trans-border on-line tabulator

Substantial returns to NSOs; no cost: economies of scale, low risk. Substantial returns to NSOs; no cost: economies of scale, low risk. 96 NSOs are participating 96 NSOs are participating If yours is not, let’s discuss how to resolve the obstacles: If yours is not, let’s discuss how to resolve the obstacles:  Amend legislation,  Revise regulations,  Advocate statistical transparency, etc. Entrust 2011 census microdata, as soon as feasible Entrust 2011 census microdata, as soon as feasible Provide boundary files at low-level geography for each census possible Provide boundary files at low-level geography for each census possible Reflections

IPUMS at the 59 th ISI (Hong Kong, Aug 24-30, 2013) » IPUMS Workshop » Microdata session » IPUMS Funding for delegates from developing countries » IPUMS booth

Thank you If your NSO is not participating yet, please contact: When processing of your 2011 census microdata is completed, please contact: