Eurostat Statistical Disclosure Control. Presented by Peter-Paul de Wolf, Statistics Netherlands (CBS)

Slides:



Advertisements
Similar presentations
Alessandra Capobianchi and Luisa Franconi Istat - Division for Information Technology and Methodology - Italy Ntts 2009 Brussel Febbruary 2009 Cell.
Advertisements

Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Confidentiality risks of releasing measures of data quality Jerry Reiter Department of Statistical Science Duke University
© Statistisches Bundesamt, IIA - Mathematisch Statistische Methoden Summary of Topic ii (Tabular Data Protection) Frequency Tables Magnitude Tables Web.
17 September SME Statistics OECD Workshop SME data and methodologies in the EU - item 5 Paul Feuvrier / Eurostat.
IMPROVING CONFIDENTIALITY WITH tau-ARGUS BY FOCUSSING ON CLEVER USAGE OF MICRODATA Roland van der Meijden MSc. ± 10 minutes.
In a Virtual Data Centre Protecting Confidentiality COMPUTATIONAL INFORMATICS Christine O’Keefe, Mark Westcott, Adrien Ickowicz, Maree O’Sullivan, CSIRO.
The Dutch Censuses of 1960, 1971 and 2001 Producing public use files in the IPUMS project Wijnand Advokaat Statistics Netherlands Division Social and Spatial.
Security in Databases. 2 Outline review of databases reliability & integrity protection of sensitive data protection against inference multi-level security.
Metadata driven application for aggregation and tabular protection Andreja Smukavec SURS.
Overview of 2002 CIPSEA: Methods to Protect Confidential Tabular Data Amrut Champaneri, Ph.D. U.S. Department of Transportation Bureau of Transportation.
The Application of the Concept of Uniqueness for Creating Public Use Microdata Files Jay J. Kim, U.S. National Center for Health Statistics Dong M. Jeong,
Confidentiality Issues with “Small Cell” Data Michael C. Samuel, DrPH STD Control Branch California Department of Public Health 2008 National STD Prevention.
G-Confid: Turning the tables on disclosure risk Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality Ottawa, Canada 30 October 2013 Peter.
Albania Statistical Training Prosecution / Courts Session 4, January 31, – Overview of the Criminal Justice System and Statistics – Recording.
Disclosure Avoidance: An Overview Irene Wong ACCOLEDS/DLI Training December 8, 2003.
Luisa Franconi Integration, Quality, Research and Production Networks Development Department Unit on microdata access ISTAT Essnet on Common Tools and.
StatLine 4 metadata implementation Edwin de Jonge Statistics Netherlands.
Discussion of “ Statistical Disclosure Limitation: Releasing Useful Data for Statistical Analysis” Nancy J. Kirkendall Energy Information Administration.
Dissemination and interpretation of time use data Social and Housing Statistics Section United Nations Statistics Division Time Use Statistics workshop.
1 New Implementations of Noise for Tabular Magnitude Data, Synthetic Tabular Frequency and Microdata, and a Remote Microdata Analysis System Laura Zayatz.
1 Assessing the Impact of SDC Methods on Census Frequency Tables Natalie Shlomo Southampton Statistical Sciences Research Institute University of Southampton.
The Dutch Virtual Census based on registers and already existing surveys Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics.
Confidentiality issues in the EU Population and Housing Censuses of 2011 Risks and Criteria.
Disclosure Avoidance at Statistics Canada INFO747 Session on Confidentiality Protection April 19, 2007 Jean-Louis Tambay, Statistics Canada
1 Using Fixed Intervals to Protect Sensitive Cells Instead of Cell Suppression By Steve Cohen and Bogong Li U.S. Bureau of Labor Statistics UNECE/Work.
The availability of Dutch census microdata Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands Division Social.
Protection of frequency tables – current work at Statistics Sweden Karin Andersson Ingegerd Jansson Karin Kraft Joint UNECE/Eurostat.
European Conference on Quality in Official Statistics, Rome, July 2008 Community Innovation Survey: a Flexible Approach to the Dissemination of Microdata.
Differential Privacy Some contents are borrowed from Adam Smith’s slides.
1 1 Confidentiality protection of large frequency data cubes UNECE Workshop on Statistical Confidentiality Ottawa October 2013 Johan Heldal and Svetlana.
Access to microdata in the Netherlands: from a cold war to co-operation projects Eric Schulte Nordholt Senior researcher and project leader of the Census.
Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 Dealing with Confidentiality in Dissemination: The.
Watech.wa.gov Records Management In a nutshell. watech.wa.gov What’s a record? A record is anything you create in the course of doing your work – Everything.
The Review of the Dissemination of Health Statistics Carole Abrahams Office for National Statistics.
United Nations Statistics Division Dissemination of IIP data.
Report on the work of the TF on SME Data item 6 of the agenda Structural Business Statistics Working Group 14 April 2015, Luxembourg Tatiana Mrlianová.
G. Merola Winton Capital Management 1 UN/ECE Work Session On Statistical Data Confidentiality (Geneva, 9-11 November 2005) WP30: Safety rules in statistical.
Census 2011 – A Question of Confidentiality Statistical Disclosure control for the 2011 Census Carole Abrahams ONS Methodology BSPS – York, September 2011.
Data disclosure control Nordic Forum for Geography and Statistics Stockholm, 10 th September 2015.
Information Governance Jo Wall South East Public Health Intelligence Analyst Training Day 2, Session 5 11 th February 2016.
Administrative Data and Official Statistics Administrative Data and Official Statistics Principles and good practices Quality in Statistics: Administrative.
11 Measuring Disclosure Risk and Data Utility for Flexible Table Generators Natalie Shlomo, Laszlo Antal, Mark Elliot University of Manchester
ESTP course, SBS module 13 March 2013 Structural Business Statistics Data reporting to Eurostat, transmission format and tools.
Author Name Team Name Date Organisation
Data Confidentiality and the Common Good.
Author Name Team Name Date Organisation
Progress towards a table builder with in-built disclosure control for 2021 Census Keith Spicer UNECE, 22 September 2017.
Confidentiality in Published Statistical Tables
Establishing an Automated Confidentiality Service in Stats NZ
Dissemination Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May 2008,
Access to European microdata for scientific purposes
Treatment of statistical confidentiality Table protection using Excel and tau-Argus Practical course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER.
Treatment of statistical confidentiality Table protection using Excel and tau-Argus Practical course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER.
Structural Business Statistics Data reporting to Eurostat, transmission format and tools ESTP course, SBS module 13 March 2013.
Data from statistical modeling (e. g
New Zealand Business Demography Statistics: Noise for Counts and Magnitudes (NCM) confidentiality method Mathew Page September 2018.
Roundtable on Business Survey Frames 17-21/10/2005
Disclosure Avoidance: An Overview
Dealing with confidential data Introductory course Part 2: Tables Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH.
Federal Statistical Office Germany Research Data Centre
Urban Statistics – Methodological work
Structural Business Statistics
Perturbative methods for ESS census tables
Confidentiality on the Fly
Treatment of statistical confidentiality Part 3: Generalised Output SDC Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK.
Item 5 Wim Kloek, Eurostat
Anco Hundepool Sarah Giessing
Item 2.2 Scientific Use Files for the Time Use Survey
Presentation transcript:

Eurostat Statistical Disclosure Control

Presented by Peter-Paul de Wolf, Statistics Netherlands (CBS)

Content Introduction What’s the problem? –Specific for business statistics Formalising the problem What to do? –Methods –Software Summary

Introduction General definition of confidential data: Data can not be published “as is” »By law (e.g. statistical law) »Sensitive data (what’s sensitive?) »Respondent considers it confidential »…

Introduction Physical protection –Entrance –Network Legal protection –Oath Statistical Disclosure Control –Protection of statistical output

What’s the problem? Statistical output Microdata –Not often in case of business data –Obvious: each record represents a single respondent Tabular data –In business data often magnitude tables –Sometimes frequency tables –But: aggregated data?!?!?!?

Cell value itself not sensitive: –All contributions are equal (1) Spanning variables –Indentifying, e.g. NACE, Region –Sensitive, e.g. “environmental offence” (illegal dumping of waste, illegal fishing, oil spills, …) What’s the problem (frequency table)

Example: number of ship-owners Environmental offence RegionYes No Total … A

What’s the problem (frequency table) Example: number of ship-owners Environmental offence RegionYes No Total … B

What’s the problem (frequency table) Example: number of ship-owners Environmental offence RegionYes No Total … C

What’s the problem (magnitude table) Turnover (10 6 €) of instrument producing companies Region A B C Total Harps Organs Pianos Other Total

What’s the problem (magnitude table) Turnover (10 6 €) of instrument producing companies Region A B C Total Harps Organs Pianos Other Total ?

Formalising the problem Suppose cell (Piano, A) consists of Company X: 8110 6 € Company Y: 510 6 € Other three: 210 6 € each Total : 9210 6 € 92 – 5 = 87 is within 7.4%!

Formalising the problem General, objective rules needed Threshold rule Dominance rule or (n,k)-rule p%-rule p%-rule is favoured over (n,k)-rule and implies minimum of 3 contributors

What to do? Redesign table –Combine rows/columns –Define different categories Rounding Add noise Cell suppression

Region A B C D Total Harps Organs Pianos Other Total

Cell suppression Region A B C D Total Harps Organs Pianos Other Total X X X

Cell suppression Region A B C D Total Harps Organs Pianos Other Total X X X X XX

Cell suppression Region A B C D Total Harps Organs Pianos Other Total X X X XX X X X X

Cell suppression Region A B C D Total Harps Organs Pianos Other Total X X X XX X X X X

Software Latest version can be found on New Open Source version available end 2014

Contact/info Glossary, handbook, project info – Wiley book