Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, 17-19 December 2007 Dealing with Confidentiality in Dissemination: The.

Slides:



Advertisements
Similar presentations
Characterization and Management of Multiple Components of Cost and Risk in Disclosure Protection for Establishment Surveys Discussion of Advances in Disclosure.
Advertisements

Output Consultation Plans and Statistical Disclosure Control Strategy developments Angele Storey and Jane Longhurst ONS.
ONS Research Data Access Strategy AGENDA Background and context Confidentiality The Strategy.
Introduction to basic principles of Regulation (EC) 45/2001 Sophie Louveaux María Verónica Pérez Asinari.
Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
National Statistical System and Data Dissemination Sok Kosal, Deputy Director General,National Institute of Statistics Ministry of Planning, Cambodia International.
Implementation of the CoP in SLOVENIA Cooperation with data users Genovefa RUŽIĆ Deputy Director-General.
© Statistisches Bundesamt, IIA - Mathematisch Statistische Methoden Summary of Topic ii (Tabular Data Protection) Frequency Tables Magnitude Tables Web.
Chapter 2 - Legal foundation and institutional arrangements 8 th Meeting Baku, Azerbaijan September 2013 Drafting process of the Energy Statistics.
Eurostat Statistical Disclosure Control. Presented by Peter-Paul de Wolf, Statistics Netherlands (CBS)
Measuring Ethno-Cultural Characteristics in Population Censuses United Nations Economic Commission for Europe Statistical Division Regional Training Workshop.
The quality framework of European statistics by the ESCB Quality Conference Vienna, 3 June 2014 Aurel Schubert 1) European Central Bank 1) This presentation.
Access routes to 2001 UK Census Microdata: Issues and Solutions Jo Wathan SARs support Unit, CCSR University of Manchester, UK
Eurostat M ODES OF ACCESS TO EU MICRODATA IN THE NEW LEGAL FRAMEWORK A LEKSANDRA BUJNOWSKA E UROSTAT S TATISTICAL OFFICE OF THE E UROPEAN U NION.
Decentralised and Remote Access to Confidential Data in the ESS (ESSnet DARA) Overview and State of the Art Maurice Brandt Destatis FIRST EUROPEAN DATA.
Metadata driven application for aggregation and tabular protection Andreja Smukavec SURS.
Basque Statistics Office Confidentiality Project: Final stages Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality Tarragona, Spain,
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
1 The system aspect of statistical quality Q2014 european conference on quality in official statistics Special session: Consistency of Concepts and Applied.
Confidentiality Issues with “Small Cell” Data Michael C. Samuel, DrPH STD Control Branch California Department of Public Health 2008 National STD Prevention.
CES Task Force on Confidentiality and Microdata Tiina Luige UNECE Statistical Division Conference of European Statisticians UN Economic Commission for.
WP. 46 Providing access to data and making microdata safe, experiences of the ONS Jane Longhurst Paul Jackson ONS.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Population Census carried out in Armenia in 2011 as an example of the Generic Statistical Business Process Model Anahit Safyan Member of the State Council.
Transition from traditional census to sample survey? (Experience from Population and Housing Census 2011) Group of Experts on Population and Housing Censuses,
Luisa Franconi Integration, Quality, Research and Production Networks Development Department Unit on microdata access ISTAT Essnet on Common Tools and.
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
Copyright 2010, The World Bank Group. All Rights Reserved. Part 2 Labor Market Information Produced in Collaboration between World Bank Institute and the.
1 New Implementations of Noise for Tabular Magnitude Data, Synthetic Tabular Frequency and Microdata, and a Remote Microdata Analysis System Laura Zayatz.
1 Assessing the Impact of SDC Methods on Census Frequency Tables Natalie Shlomo Southampton Statistical Sciences Research Institute University of Southampton.
Innovations in Data Dissemination Thomas L. Mesenbourg, Jr. Acting Director U.S. Census Bureau United Nations Seminar on Innovations in Official Statistics.
Recommended Practices for Editing and Imputation in the European Statistical System: the EDIMBUS Project* Orietta Luzi (Istat, Italy) Ton De Waal (Statistics.
Methodological questions of migration and ethnocultural diversity in Population Censuses StatCapCA Training Workshop No 3 Dushanbe, March 2007 Werner Haug.
The use of protected microdata in tabulation: case of SDC-methods microaggregation and PRAM Researcher Janika Konnu Manchester, United Kingdom December.
Module 5b: Measuring Household ICT Ms Sheridan Roberts, Consultant Information Society Statistics Tuesday 10 March 2009.
Introduction 1. Purpose of the Chapter 2. Institutional arrangements Country Practices 3. Legal framework Country Practices 4. Preliminary conclusions.
WP 19 Assessment of Statistical Disclosure Control Methods for the 2001 UK Census Natalie Shlomo University of Southampton Office for National Statistics.
Disclosure Avoidance at Statistics Canada INFO747 Session on Confidentiality Protection April 19, 2007 Jean-Louis Tambay, Statistics Canada
Legal and institutional foundation of economic statistics Overview of international experience Regional Workshop for African Countries on Compilation of.
Statistical data confidentiality and micro data in Albania
© Federal Statistical Office, Research Data Centre, Maurice Brandt Folie 1 ESSnet Projects “Decentralised Access to EU microdata” Maurice Brandt Research.
Access to microdata in Statistics Estonia First DwB European Data Access Forum Luxembourg, 28th March 2012 Tuulikki Sillajõe.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 2 Quality management Produced in Collaboration between.
Joint UNECE/Eurostat Work Session on Migration Statistics, April 2010,Geneva Monthly Demographic Now Cast. Monthly estimates of migration flows in.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
The views expressed herein are those of the author and should not necessarily be attributed to the IMF, its Executive Board, or its management Data Confidentiality,
FORUM GUIDE TO SUPPORTING DATA ACCESS FOR RESEARCHERS A STATE EDUCATION AGENCY PERSPECTIVE Kathy Gosa, Kansas State Department of Education.
Access to microdata in the Netherlands: from a cold war to co-operation projects Eric Schulte Nordholt Senior researcher and project leader of the Census.
Joint Eurostat Unece Worksession on Statistical Data Confidentiality 2011, Tarragona Initial analyses on comparable dissemination from the Essnet project.
Державна служба статистики України Statistical confidentiality assurance framework in State Statistics Service of Ukraine Anton Tovchenko head of mathematical.
1 European Statistics Code of Practice. I.Institutional Environment Principle II.Statistical processes Principle III.Statistical Output Principle.
Data disclosure control Nordic Forum for Geography and Statistics Stockholm, 10 th September 2015.
11 Measuring Disclosure Risk and Data Utility for Flexible Table Generators Natalie Shlomo, Laszlo Antal, Mark Elliot University of Manchester
Statistical Business Register Enterprise Groups in Latvia Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical.
Mirjana Boshnjak Skopje, 20 to 22 September 2017
Quality assurance in official statistics
Confidentiality in Published Statistical Tables
Dissemination Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May 2008,
WORKSHOP GROUP ON QUALITY IN STATISTICS
Goals and objectives of Work package 2 of the ESSnet on Consistency of concepts and applied methods of business and trade-related statistics Norbert Rainer,
Data from statistical modeling (e. g
TG EHIS January 2012 Item 3.2 of the agenda EHIS wave 1 anonymised data Bart De Norre, Eurostat.
Exchange and Sharing of Economic Data
Protecting Confidential Data
Albania 2021 Population and Housing Census - Plans
High level working group on statistical confidentiality
Dealing with confidential data Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION.
Treatment of statistical confidentiality Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE.
Presentation transcript:

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 Dealing with Confidentiality in Dissemination: The experience of the Basque Statistical Office Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA Outline 1.Objectives 2.Legal framework and previous work on statistical confidentiality 3.Current situation at EUSTAT  Confidentiality Board  Establishing confidentiality criteria in dissemination  Checking confidentiality criteria 4.Future tasks 1.Towards a safe-standard microdata structure 2.On-site access facility 5.Conclusions

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December Objectives

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA One of the main goals of a statistical agency is to maintain and provide statistical confidentiality for its respondents. Confidentiality should be preserved in all the stages of statistical production and especially in the dissemination phase. 1. Objectives Principle 5 of the European Statistics Code of Practice “[…] Instructions and guidelines are provided on the protection of statistical confidentiality in the production and dissemination processes. These guidelines are spelled out in writing and made known to the public […]”

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 A comprehensive confidentiality policy has been developed at Basque Statistics Office (EUSTAT) that includes:  standard protection criteria for dissemination products (tables, microdata)  constitution of an expert group  commitment to future tasks: public-use microfiles and on-site access 1. Objectives In order to fulfil this Principle…

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December Legal framework and previous work on statistical confidentiality

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA Basque Statistics Law (23rd April, Law 4/1986). Chapter IV. Organic Law of Personal Data Protection (13th December, Law 15/1999) defines the concepts of Personal Data and Specially Protected Data: European Directive 95/46/EC on the protection of individuals with regard to the processing of personal data defines the concept of identifiable person: 2.1 Legal framework “ […] the duty to keep statistical secret protects any identifiable data as belonging to an specific person […]” “Personal Data: any information related to an identified or identifiable person ” “[…] an identifiable person is one who can be identified, directly or indirectly, in particular by reference to an identification number or to one or more factors specific to his physical, physiological, mental, economic, cultural or social identity.[…]

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 2.2 Previous work on statistical confidentiality PeriodActionOutput Research fellowship on data protection techniques and statistical confidentiality Technical notebook on “Statistical Data Protection Techniques” edited by EUSTAT. April 2000International Seminar on “Confidentiality and statistical data protection techniques” organized by EUSTAT. Lecturer: L.H. Cox Publication: “Confidentiality and statistical data protection techniques” L.H. Cox edited by EUSTAT. September 2000Security Analysis of Census Tables Internal report about sensitive crosses and dissemination proposal

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December Previous work on statistical confidentiality PeriodActionOutput 2001Participation in The Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality (Skopje, Macedonia, March) Article: “A comparative test for several threshold values in frequency tables: A Tau-Argus performance example.” 2002Tabular Data protection of preliminary results of the Census 2001, using Tau- Argus (optimal method). Publication of suppression patterns for frequency tables with fine geographical levels CASC project pursuit.Testing of Argus software. June 2004Attendance of PSD (Privacy in Statistical Databases) Conference. (Barcelona, Spain, 6-9 June)

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December Previous work on statistical confidentiality PeriodActionOutput 2005Staff training on disclosure control and protection software. Internal Workshop on SDC techniques and ARGUS. 2006Work on standard safety criteria.Internal report about analysis of sources and internal situation. December 2006Attendance of PSD Conference. (Rome, December) Feedback and contacts.

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December Current situation at EUSTAT

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA Why? Who? 3.1 Confidentiality Board Need for an expert group to make decisions about data protection and to give advice on confidentiality matters. The highest representatives of each thematic area and the General Direction of EUSTAT

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA Main Duties 3.1 Confidentiality Board  To establish rules and criteria about confidentiality issues  To make decisions concerning sensitive topics and sensitive variables  To discuss and approve public-use microdata structure  To decide about on-site access conditions  To solve specific queries (research-use microdata, etc.)  To advise other statistical agents from the Basque statistics system on confidentiality matters and data protection procedures  To keep a coherent and updated system

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 3.2 Establishing confidentiality criteria in dissemination Research of sources and other experiences - 1st Phase Objective: Compilation of external experiences from other statistical offices regarding their policies on reporting and implementing confidentiality. Sources consulted:  National Statistical Institute – INE (Spain)  Statistical Office of the European Communities – EUROSTAT (UE)  Office of National Statistics – ONS (United Kingdom)  Bureau of the Census – (United States)  Central Bureau of Statistics – CBS (The Netherlands)  Satistics Canada – StatCan (Canada)

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 3.2 Establishing confidentiality criteria in dissemination Research of sources and other experiences - 1st Phase Results:  Legal framework is available to all sources consulted and it is considered essential as a starting point.  It is less common to find information about the sensitivity rules applied and the values for the parameters of such rules.  Disclosure control methods are applied to tables and microdata in most cases.  Geographical thresholds are applied in many cases with diverse values and mainly in microdata releases.  Almost all the sources provide microdata products (for research use and/or public use)

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 3.2 Establishing confidentiality criteria in dissemination Research of sources and other experiences – 2nd Phase Objective: Compilation of data protection practices commonly applied by EUSTAT in dissemination products. Sources consulted: Representatives of the production areas in EUSTAT. Results:  Business statistics - Low frequencies in tables are avoided by means of recodification and manual suppressions - No concentration rules are applied to magnitude tables  Social and population statistics - Ad-hoc protection for each particular case

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 3.2 Establishing confidentiality criteria in dissemination Microdata protection rules  Microdata files released should not include, in any case, either direct identifiers or personal data.  In general, microdata files will not include geographical indicators referring to areas under a fixed threshold (10,000 inhabitants).  Aggregation level for other variables included in the file will depend on geographical level released and sensitivity of the variable itself.  As an additional protection, disclosure control techniques (perturbation methods, record swapping, noise addition, etc.) could be applied to microdata, always preserving the statistical properties of data.

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 3.2 Establishing confidentiality criteria in dissemination Tabular data protection rules  Low values should be avoided in frequency tables with multiple crossings, where at least one of the variables is sensitive and the geographical indicator refers to an area under a fixed threshold (10,000 inhabitants).  Dominant contributions should be avoided in magnitude tables in order to prevent accurate estimation of sensitive data belonging to a contributor in a cell. Concentration rules will be applied to detect sensitive cells.  Appropriate protection techniques for tabular data (recoding of variables, primary and secondary cell suppression, etc.) will be applied in order to protect sensitive cells from disclosure.

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 3.2 Establishing confidentiality criteria in dissemination Checking confidentiality criteria Objective: To check the fulfillment of safety criteria in the dissemination products of EUSTAT Sources checked: Statistical tables and data bank ( Checking points: Geographical scope and sensitivity of variables Results:

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December Future tasks

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 4.1 Towards a safe-standard microdata structure Objective: To develop standard microdata files which will be available for the general public but with all the guarantees of confidentiality Aspects to consider:  Type of statistics (census or sampling survey)  Hierarchical structure of the data (i.e.: families and individuals)  Geographical indicators included  Identifying variables and possible combinations (identifying keys)  Level of detail (number of categories) of the variables included  Sensitive variables included (if any)  Risk indicator  Disclosure control methods to be applied  Information loss measure (Utility measure)

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 4.2 On-site access facility Objective: To provide users (mainly researchers) with an ‘in-situ’ workstation where microdata could be accessed under specific conditions. Aspects to consider:  Type of user (general public, researcher,…)  Purpose of the data access  Conditions of data access (confidentiality agreements, limited access, passwords,…)  Physical aspects (location, number of computers,…)

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 EUSKAL ESTATISTIKA ERAKUNDEA INSTITUTO VASCO DE ESTADÍSTICA 5. Conclusions  EUSTAT has been working on the development of standard criteria to protect statistical products: microdata and tabular data  The establishment of these confidentiality rules has been a hard process, and the discussion about what should be considered as identifiable or sensitive is still ongoing  These criteria should be updated continuously, reflecting changes in the legal framework, in the technological environment and in social reality

Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 More information in… Thanks for your attention! Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007