Access to European microdata for scientific purposes

Presentation transcript:

Access to European microdata for scientific purposes
DIME, 23 February 2018, Item 13
Fabian Bach, Aleksandra Bujnowska, ESTAT.B.1, Eurostat

Outline
1. Introduction
2. Access to European microdata for scientific purposes: recent activities
3. Microdata access: plans for 2018+
4. Confidentiality on the fly

1. Introduction: Need for modernisation in microdata access
- 2002: access to microdata regulated by EU law
- 2013: regulation replaced; procedures changed (new types of research organisations, enlarged scope of the European microdata sets, new access modes allowed)
- 2013-2018: ~750 eligibility agreements signed all over the world; ~1500 applications for access received (growing yearly); 5 new datasets have become available for scientific use
- Two access modes: secure use files and scientific use files
- Continuous modernisation (to handle the volume and make improvements)

2. Recent activities: Alignment to the personal data protection framework
- European Data Protection Supervisor (EDPS): assessment of microdata access procedures
- EDPS recommendations mostly concerned the treatment of applications from entities located outside the EU/EEA
- Information to respondents and data users about rights and obligations related to the personal data protection framework

2. Recent activities: On-line system for microdata access
- Research proposal application webform

2. Recent activities: On-line system for microdata access
- New: Eurostat workflow tool

2. Recent activities: Public use files: EU-LFS & EU-SILC
- 2015: anonymisation methodology developed by the Centre of Excellence on statistical disclosure control; data concerned: EU Labour Force Survey (EU-LFS) and EU Statistics on Income and Living Conditions (EU-SILC)
- 2016: validation of the method by the WG on Methodology
- 2017: application of the methodology to all countries' data (by Eurostat)
- EU-SILC: consultation with countries, 20 agreements
- EU-LFS: anonymisation ongoing, consultations to be launched soon

2. Recent activities: On-line microdata transmission (implementation ongoing)
- Currently 12 microdata sets (LFS, SILC, HBS, …)
- Research Project 1 … Research Project X

3. Microdata access: plans for 2018+
- New datasets
- New access modes
- Confidentiality on the fly (next slides)

4. Confidentiality on the fly: Cell suppression and 'c' flags
Traditional method to address disclosure risks: cell suppression ('c' flags)
- Generic drawbacks: secondary suppression; consistency issues
- Eurostat issues: 'c' flags sometimes misused / misunderstood; EU totals sometimes not published; human intervention
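The need for secondary suppression can be illustrated with a small sketch. It is not taken from the slides: the threshold of 3, the region names and the function `primary_suppress` are assumptions made for the example. It shows that a single suppressed cell can be recovered by subtraction whenever the total is published, which is why further cells (or the total itself) must also be suppressed.

```python
def primary_suppress(counts, threshold=3):
    """Replace small non-zero counts with the 'c' flag.

    The threshold is a hypothetical value chosen for illustration.
    """
    return {k: ("c" if 0 < v < threshold else v) for k, v in counts.items()}


# Hypothetical frequency table with a published total.
counts = {"North": 120, "South": 2, "East": 45}
total = sum(counts.values())

published = primary_suppress(counts)

# An intruder subtracts the visible cells from the published total
# and recovers the suppressed value exactly.
visible = sum(v for v in published.values() if v != "c")
recovered = total - visible

print(published)
print("Recovered suppressed cell:", recovered)
```

Protecting the small cell therefore requires suppressing at least one more cell in the same row or column, or withholding the total, which is one way totals can end up unpublished.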

4. Confidentiality on the fly: Eurostat issues with 'c' flags / suppression
Eurostat plans:
- Raise awareness of 'c' flags: intended function; correct application
- Investigate modernisation: automated methods; recent developments using random noise (ABS etc.)

4. Confidentiality on the fly: Random noise & cell key method
Microdata -> table builder + cell key method (additivity module) -> safe tables
- Cell key module: ensures consistency; same for all statistics
  - assign fixed record keys
  - query records in cell
  - calculate cell key from record keys
- Noise module: methodological part; adapts to statistics
  - val_out = noise_function(val_in, cell key)
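The steps listed on this slide can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the ESSnet or SAS implementation: the key space, the seed, the helper names (`assign_record_keys`, `cell_key`, `safe_count`) and above all the toy noise rule (-1, 0 or +1) are hypothetical. A real noise module would use a methodologically designed perturbation scheme.

```python
import random

KEY_SPACE = 2**32  # assumed size of the integer record-key space


def assign_record_keys(record_ids, seed=2018):
    """Cell key module, step 1: give each record a fixed pseudo-random key."""
    rng = random.Random(seed)
    return {rid: rng.randrange(KEY_SPACE) for rid in record_ids}


def cell_key(record_keys, cell_records):
    """Cell key module, step 3: combine record keys into a value in [0, 1).

    Integer addition is order-independent, so the same set of records
    always yields the same cell key.
    """
    total = sum(record_keys[rid] for rid in cell_records)
    return (total % KEY_SPACE) / KEY_SPACE


def noise_function(val_in, key):
    """Noise module: toy rule mapping the cell key to -1, 0 or +1."""
    if val_in == 0:
        return 0  # keep empty cells empty in this sketch
    if key < 0.25:
        noise = -1
    elif key < 0.75:
        noise = 0
    else:
        noise = 1
    return max(val_in + noise, 0)


def safe_count(records, record_keys, predicate):
    """Step 2: query the records in the cell, then perturb the count."""
    cell = [rid for rid, rec in records.items() if predicate(rec)]
    return noise_function(len(cell), cell_key(record_keys, cell))


# Hypothetical microdata.
records = {
    1: {"sex": "F", "region": "North"},
    2: {"sex": "M", "region": "North"},
    3: {"sex": "F", "region": "North"},
    4: {"sex": "F", "region": "South"},
    5: {"sex": "M", "region": "South"},
}
record_keys = assign_record_keys(records)

# The same cell requested through two differently phrased queries.
table_a = safe_count(records, record_keys,
                     lambda r: r["sex"] == "F" and r["region"] == "North")
table_b = safe_count(records, record_keys,
                     lambda r: r["region"] == "North" and r["sex"] == "F")
print(table_a, table_b)
```

Because the noise depends only on the records in the cell, the two queries return the same perturbed value, which is the consistency property attributed to the cell key module. Perturbed cells need not sum to perturbed totals, which is presumably what the additivity module on the slide addresses.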

4. Confidentiality on the fly: Data pilots: 2021 EU Census
- The ESSnet census protection project recommends the Cell Key Method for the 2021 census (agenda item 3.2): additive noise for unweighted (census) frequency tables; SAS implementation provided
- Possible added value for "confidentiality on the fly": planned code translation to open source (e.g. R); existing methodology, straightforward to extend to other statistics (weighted samples, magnitudes)

4. Confidentiality on the fly: Data pilots: LFS ad-hoc tables
- Eurostat service: extract ad-hoc tables from LFS data
- Old approach: combination of 'c' + 'a' flags; suppression depends on user
- No longer supported by ~10 Member States -> service currently suspended
- Mitigation proposal: no 'c' flag; if needed, random noise (RN)
  - Data groups: 10 MS; other MS; EU28, EA19
  - Approved users: all cells + RN; all cells
  - Other users: + suppress 'a'

4. Confidentiality on the fly: Goal: table builder tool
"Confidentiality on the fly" in short: develop a tool for microdata -> BLACK BOX -> safe tables
- Live example: ABS TableBuilder
- User-friendly + powerful front-end
- Flexible microdata back-end interface
- Graded user access: basic -> research -> internal
- Portability into other production environments