Confidentiality in Published Statistical Tables

Slides:



Advertisements
Similar presentations
Output Consultation Plans and Statistical Disclosure Control Strategy developments Angele Storey and Jane Longhurst ONS.
Advertisements

Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
The introduction of new classifications of economic activities and products in Ukraine Workshop on International Classification Chisinau March 2013.
17 September SME Statistics OECD Workshop SME data and methodologies in the EU - item 5 Paul Feuvrier / Eurostat.
Developing a Statistical Disclosure Standard for Europe Tanvi Desai LSE Research Laboratory Data Manager Research Laboratory IASSIST 2010: Cornell.
Modernisation of Statistical Processing at SURS Andreja Smukavec, SURS Rudi Seljak, SURS Workshop on Modernisation of Statistical Production Geneva, 15–17.
Evaluation of the Role of Audit to Detect Corruption in Thailand Prepared by Dr. Sutthi Suntharanurak Office of the Auditor General of Thailand.
POLICIES AND PROCEDURES FOR ARCHIVING DATA IN BURUNDI.
United Nations Statistics Division/DESA International Recommendations for the Index of Industrial Production (IIP)
Metadata driven application for aggregation and tabular protection Andreja Smukavec SURS.
ISO 9000 & TOTAL QUALITY ISO 9000 refers to a group of quality assurance standards established by the International Organization for Standardization.This.
Version 1.1 Tau-Argus and SuperCROSS A practical example using the UK Business Register Unit data Andrea Staggemeier Philip Lowthian Grant Lee.
Confidentiality Issues with “Small Cell” Data Michael C. Samuel, DrPH STD Control Branch California Department of Public Health 2008 National STD Prevention.
Marina Signore Head of Service “Audit for Quality Istat Assessing Quality through Auditing and Self-Assessment Signore M., Carbini R., D’Orazio M., Brancato.
Transparency and Open Data: GSS Response Iain Bell HoP MoJ.
Luisa Franconi Integration, Quality, Research and Production Networks Development Department Unit on microdata access ISTAT Essnet on Common Tools and.
Chapter 10 – Dissemination Ilaria Dimatteo United Nations Statistics Division The 4 th meeting of the Oslo Group on energy statistics Ottawa, Canada, 2-6.
Daniel Beckler United States Department of Agriculture National Agricultural Statistics Service Timothy Mulcahy NORC at the University of Chicago Topic.
Disclosure detection & control in research environments Felix Ritchie.
Copyright 2010, The World Bank Group. All Rights Reserved. Part 2 Labor Market Information Produced in Collaboration between World Bank Institute and the.
1 Assessing the Impact of SDC Methods on Census Frequency Tables Natalie Shlomo Southampton Statistical Sciences Research Institute University of Southampton.
1 Standard Student Identification Method Jeanne Saunders Session 16.
Software Architecture Evaluation Methodologies Presented By: Anthony Register.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
The Application for Statistical Processing at SURS Andreja Smukavec, SURS Rudi Seljak, SURS UNECE Statistical Data Confidentiality Work Session Helsinki,
The views expressed herein are those of the author and should not necessarily be attributed to the IMF, its Executive Board, or its management Data Confidentiality,
Joint UNECE/Eurostat work session on statistical data confidentiality Manchester, December 2007 Dealing with Confidentiality in Dissemination: The.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
1 The Process of Practicing Statistical Disclosure Control in Tabular Data at Statistics Sweden Q2010 Helsinki, May 4-6 Ingegerd Jansson, Michael Carlson,
United Nations Statistics Division Dissemination of IIP data.
Державна служба статистики України Statistical confidentiality assurance framework in State Statistics Service of Ukraine Anton Tovchenko head of mathematical.
An Introduction to UML COMS 103 section 4 11 January, 1999.
National Statistics - access and disclosure issues for Vital Events data Allan Baker Office for National Statistics.
11 Measuring Disclosure Risk and Data Utility for Flexible Table Generators Natalie Shlomo, Laszlo Antal, Mark Elliot University of Manchester
Improving researcher access to USDA’s Agricultural Resource Management Survey Charles Towe and Mitch Morehart Economic Research Service, USDA.
Juvenile Legislative Update 2013 Confidentiality of Records and Interagency Sharing of Educational Records.
Investment Intentions Survey 2016
Summary of component 9: Statistical Business Register
New Guidelines on Protection of Tabular Data at Statistics Finland
CCG/LA User guide Useful Contacts -ImmForm Helpdesk:
the System of State Statistics in Ukraine
Towards connecting geospatial information and statistical standards in statistical production: two cases from Statistics Finland Workshop on Integrating.
Model Governance Industry Evolution Beyond Model Accuracy
Investment Intentions Survey 2016
Dissemination Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May 2008,
WORKSHOP GROUP ON QUALITY IN STATISTICS
Governance Assistant for Office365
Access to European microdata for scientific purposes
Harmonisation process of anonymisation of microdata
Ethical questions on the use of big data in official statistics
Problem DC 10-2, Page 547 What is K? The confidence factor
Treatment of statistical confidentiality Table protection using Excel and tau-Argus Practical course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER.
Dissemination guidelines at INE
Structural Business Statistics Data reporting to Eurostat, transmission format and tools ESTP course, SBS module 13 March 2013.
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Data from statistical modeling (e. g
Cyber security Policy development and implementation
Roundtable on Business Survey Frames 17-21/10/2005
Point 6. Eurostat plans for Time Use Survey data processing and dissemination Working Group on Time Use Surveys 10 April 2013.
« LFS series breaks with the adoption of the IESS FR How is Statistics Portugal planning to tackle the issue? 13th Workshop on Labour Force Survey Methodology.
Item 2.2 Use of waivers in business statistics
Structural Business Statistics
Transformation of the National Statistical System: Experience
The role of metadata in census data dissemination
Prodcom Working Group Item 03.5 – Confidentiality & dissemination of PRODCOM statistics Prodcom Working Group 18th -19th September 2014.
Treatment of statistical confidentiality Part 3: Generalised Output SDC Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK.
PRODCOM WORKING GROUP TF proposal on sub-contracted operations and a better coverage of services Prodcom Working Group 18th -19th September 2014 Item.
Dealing with confidential data Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION.
Treatment of statistical confidentiality Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE.
Secondary confidentiality in European business statistics
Presentation transcript:

Confidentiality in Published Statistical Tables Annu Cabrera 22.9.2015 Study visit of the State Statistical Service of Ukraine (SSSU)

Statistics Act (280/2004), Section 11 “Statistics shall be compiled so that those whom they concern are not directly or indirectly identifiable from them, unless the data concerning identification are public by virtue of this Act. “ Statistics Act + theory & methodology → Guidelines How to apply theory in practice? Which methods to use? → Practice 22 September 2015 Annu Cabrera

Guidelines

Guidelines on the protection of tabulated data I Internal guidelines Tabulated business data Tabulated personal data Guidelines renewed in 2013 Old guidelines from 2000 & 2002 and since then legislation had changed data protection methods and tools had developed statistical disclosure control (SDC) practices at different departments of the agency had developed and adopted their own standards… → Need for consistency in practices 22 September 2015 Annu Cabrera

Guidelines on the protection of tabulated data II Guidelines describe protection methods and practices in general Departments are required to write down more specific SDC instructions for every statistics they publish These instructions need to be available and easily found inside Statistics Finland → Comparison of protection methods and practices between different statistics is possible → Information on methods specific for certain statistics doesn’t get lost even if production team changes 22 September 2015 Annu Cabrera

Recommendations for business data Default threshold rule: information on less than 3 units (business/enterprise/corporation group) cannot be disseminated If data are recent and their disclosure could have an impact on the market situation or the activity of an individual enterprise a dominance rule should be used alongside the threshold rule If protection can be made by not disseminating the identity and number of data suppliers, this is recommended Estimates based on sample data No information on which units belong to the sample Confidential data can be released if the data supplier gives consent to their publication 22 September 2015 Annu Cabrera

Recommendations for personal data Assessment of the sensitivity of the information contained in a certain statistical output Default threshold rule: information on small number of units (persons/households) or small classes/groups of units should not be disseminated. In tabulation cell frequency is too small if it’s less than 3 class/group frequency is too small if it’s less than 10 22 September 2015 Annu Cabrera

Practice

Data protection in practice Usually the same people who compile and prepare statistics for dissemination also implement the necessary data protection methods SDC expert from Standards and Methods department can assist with implementation if needed If new practices or methods are needed to apply in statistics production it’s normally done in co-operation with statistics production team and SDC expert 22 September 2015 Annu Cabrera

Protection methods Active data protection methods are needed to apply if data on individual units are at risk of being disclosed from a certain statistics i.e. threshold rule or dominance rule are violated Used protection methods at Statistics Finland: Re-designing the table / changing the classification Cell suppression Primary suppressions (disclosive cells) Secondary suppressions (non-disclosive cells) Additional cells need to be suppressed to prevent re-calculation of disclosive cells from table marginals / totals 22 September 2015 Annu Cabrera

Tools for cell suppression ”Manually” choosing secondary suppressions Works only with simple tables, i.e. tables with only few explanatory variables SAS SAS codes specific for certain statistics Not flexible solutions if new explanatory variables or classifications are used τ-Argus Can handle more complex tables (hierarchical structure, linked tables) Integrations with other production tools and software not easy Used only in few business statistics 22 September 2015 Annu Cabrera

Annu Cabrera annu.cabrera@stat.fi