Confidentiality issues in the EU Population and Housing Censuses of 2011: Risks and Criteria

Contents
– Different kinds of tables:
   – Magnitude tables
   – Frequency tables
– Stating the problem(s)
– Sensitive categories
– Group disclosure
– Possible criteria
– Hierarchical tables
– Discussion issues for the 2011 Census Round

Magnitude tables
– Magnitude table: each cell value is the sum of the scores of the respondents that fall into that cell

Frequency tables (I)
– Frequency table: each cell value is the number of respondents that fall into that cell
– Example: Dutch population, 1/1/2006, by region (rows: North, East, West, South, Total) and gender (columns: Male, Female, Total)
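The difference between the two kinds of table can be sketched in a few lines of Python. This is only an illustration: the records, the `score` field and the function names are invented here and are not part of the presentation.

```python
from collections import defaultdict

# Hypothetical microdata: (region, gender, score). All values are invented.
records = [
    ("North", "Male", 10.0),
    ("North", "Female", 5.0),
    ("North", "Male", 2.5),
    ("South", "Female", 7.0),
]

def frequency_table(rows):
    """Count the respondents that fall into each (region, gender) cell."""
    counts = defaultdict(int)
    for region, gender, _score in rows:
        counts[(region, gender)] += 1
    return dict(counts)

def magnitude_table(rows):
    """Sum the scores of the respondents in each (region, gender) cell."""
    sums = defaultdict(float)
    for region, gender, score in rows:
        sums[(region, gender)] += score
    return dict(sums)

print(frequency_table(records))
print(magnitude_table(records))
```

Both tables are spanned by the same variables; only the quantity stored in each cell differs.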

Frequency tables (II)
– The cell value itself is not sensitive
– Spanning variables can be:
   – identifying (region, gender, type of business, …)
   – sensitive (sexual behaviour, criminal offence, …)

Frequency tables (III)
– (Spanning) variables: one sensitive, the remaining identifying
– Example: number of shipowners

                Environmental offence
   Region       Yes    No    Total
   …
   A

Frequency tables (IV)
– Group disclosure: all shipowners in region A committed an environmental offence
– Conclusion: not all respondents should score on a sensitive category

Frequency tables (V)
– Example, continued: number of shipowners

                Environmental offence
   Region       Yes    No    Total
   …
   B

Frequency tables (VI)
– Still: the non-offending shipowners can be fairly sure that all other shipowners in region B committed an environmental offence
– Conclusion: there should not be too many respondents that score on a sensitive category

Frequency tables (VII)
– Possible criterion: the fraction of respondents that score on a sensitive category should be less than p%, to increase the uncertainty
– E.g., p = 40
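The p% criterion, with group disclosure as its extreme case, could be checked per table row roughly as follows. This is a minimal sketch: the function names and the example counts are hypothetical, and only the threshold p = 40 comes from the slide.

```python
def is_group_disclosure(sensitive, total):
    """True if every respondent in the row scores on the sensitive category."""
    return total > 0 and sensitive == total

def violates_p_rule(sensitive, total, p=40):
    """True if the sensitive share is NOT strictly below p percent."""
    if total == 0:
        return False  # empty row: nothing to disclose
    # Integer comparison avoids floating-point rounding at the threshold.
    return 100 * sensitive >= p * total

# Hypothetical rows: (shipowners with an offence, all shipowners).
rows = {"row 1": (8, 8), "row 2": (18, 20), "row 3": (3, 20)}
for name, (yes, total) in rows.items():
    print(name, is_group_disclosure(yes, total), violates_p_rule(yes, total))
```

A row where everyone scores on the sensitive category always fails the rule for any p up to 100, so group disclosure needs no separate check once the p% criterion is applied.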

Frequency tables (VIII)
– Example, continued: number of shipowners

                Environmental offence
   Region       Yes    No    Total
   …
   C

Frequency tables (IX)
– Still: the non-offending shipowner knows that the other one committed an environmental offence
– Possible criterion: if respondents score on a sensitive category, at least n respondents should score on non-sensitive categories
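The second criterion can be sketched in the same way. The function name, the default n = 3 and the example counts are assumptions made for illustration; the presentation does not fix a value for n.

```python
def violates_n_rule(sensitive, non_sensitive, n=3):
    """True if someone scores on the sensitive category while fewer than
    n respondents score on the non-sensitive categories."""
    return sensitive > 0 and non_sensitive < n

# Hypothetical rows: (offenders, non-offenders).
print(violates_n_rule(1, 1))  # one offender, one non-offender
print(violates_n_rule(1, 2))  # share is 1/3, below 40%, yet too few non-offenders
print(violates_n_rule(1, 3))  # enough non-offenders for n = 3
```

The second call shows why this rule is a separate check: a row can satisfy the p% criterion and still leave too few respondents in the non-sensitive categories.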

Frequency tables (X)
– Example, continued: number of shipowners

                Environmental offence
   Region       Yes    No    Total
   …
   D

– Non-offenders now do not know which other shipowner committed the offence

Hierarchical tables (I)
– Hierarchical tables: a special case of linked tables
– One or more of the spanning variables are hierarchical, i.e. their categories contain several (sub)totals
– E.g.:
   – region (state/county/district/municipality)
   – business classification (NACE)

Hierarchical tables (II)

   Groningen          21
   Friesland           x
   Drenthe            23
   Overijssel         27
   Gelderland         41
   Flevoland           x
   Utrecht            32
   Noord-Holland      54
   Zuid-Holland       67
   Zeeland            38
   Noord-Brabant      44
   Limburg            39
   The Netherlands   417

Hierarchical tables (III)

   Groningen          21
   Friesland           x
   Drenthe            23
   Overijssel         27
   Gelderland         41
   Flevoland           x
   Utrecht            32
   Noord-Holland      54
   Zuid-Holland       67
   Zeeland            38
   Noord-Brabant      44
   Limburg            39
   The Netherlands   417

   North              63
   East               80
   South              83
   West              191
   The Netherlands   417

Hierarchical tables (IV)

   Groningen          21
   Friesland          19
   Drenthe            23
   Overijssel         27
   Gelderland         41
   Flevoland           x
   Utrecht            32
   Noord-Holland      54
   Zuid-Holland       67
   Zeeland            38
   Noord-Brabant      44
   Limburg            39
   The Netherlands   417

   North              63
   East               80
   South              83
   West              191
   The Netherlands   417

Hierarchical tables (V)

   Groningen          21
   Friesland          19
   Drenthe            23
   Overijssel         27
   Gelderland         41
   Flevoland          12
   Utrecht            32
   Noord-Holland      54
   Zuid-Holland       67
   Zeeland            38
   Noord-Brabant      44
   Limburg            39
   The Netherlands   417

   North              63
   East               80
   South              83
   West              191
   The Netherlands   417
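The recalculation shown in slides (II) to (V) can be reproduced with a short Python sketch. The cell values are those on the slides. The assignment of provinces to the four regions is an assumption (it is consistent with the published subtotals), and the function name is hypothetical.

```python
# Province counts from the slides; None marks a suppressed cell ("x").
provinces = {
    "Groningen": 21, "Friesland": None, "Drenthe": 23,
    "Overijssel": 27, "Gelderland": 41, "Flevoland": None,
    "Utrecht": 32, "Noord-Holland": 54, "Zuid-Holland": 67,
    "Zeeland": 38, "Noord-Brabant": 44, "Limburg": 39,
}

# Published totals with their member cells (grouping assumed, see lead-in).
national = {"The Netherlands": (417, list(provinces))}
regional = {
    "North": (63, ["Groningen", "Friesland", "Drenthe"]),
    "East": (80, ["Overijssel", "Gelderland", "Flevoland"]),
    "South": (83, ["Noord-Brabant", "Limburg"]),
    "West": (191, ["Utrecht", "Noord-Holland", "Zuid-Holland", "Zeeland"]),
}

def recalculate(cells, groups):
    """Recover every suppressed cell that is the only unknown in its group."""
    recovered = {}
    for total, members in groups.values():
        unknown = [m for m in members if cells[m] is None]
        if len(unknown) == 1:
            known = sum(cells[m] for m in members if cells[m] is not None)
            recovered[unknown[0]] = total - known
    return recovered

print(recalculate(provinces, national))  # two unknowns in one total: nothing recovered
print(recalculate(provinces, regional))  # each regional subtotal has one unknown
```

With only the national total, the two suppressed cells are known to sum to 417 - 386 = 31 but cannot be separated. Once the regional subtotals are published, each suppressed cell is the single unknown in its region and can be recomputed exactly.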

Discussion issues for the 2011 Census Round (I)
– Regulation No 763/2008 on Population and Housing Censuses
– Detailed output in high-dimensional tables (so-called hypercubes)
– How well are Census tables with these variables filled?

Discussion issues for the 2011 Census Round (II)

Discussion issues for the 2011 Census Round (III)

Discussion issues for the 2011 Census Round (IV)
– How to prevent disclosure of individual sensitive information?
– Which (categories of) variables are sensitive?
– What protection measures are feasible (preferably co-ordinated Europe-wide)?
– To prevent recalculation from marginal totals: secondary suppressions (across a hypercube and between hypercubes) with Tau-ARGUS
– Aggregation problems when cells are suppressed
These issues have to be worked out further in the TF!
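The idea behind secondary suppression can be illustrated with a deliberately naive Python sketch for a single row with a published total. This is not how Tau-ARGUS works. The function name and the rule "also suppress the smallest remaining cell" are assumptions chosen for illustration, and the sketch ignores links to other tables and hypercubes, which is exactly where the real difficulty lies.

```python
def secondary_suppress(cells, primary):
    """Return the set of cells to suppress in one row with a published total.

    If exactly one cell is suppressed, it could be recomputed as
    total minus the other cells, so the smallest remaining cell is
    suppressed as well (naive heuristic).
    """
    suppressed = set(primary)
    if len(suppressed) == 1:
        candidates = [name for name in cells if name not in suppressed]
        if candidates:
            suppressed.add(min(candidates, key=lambda name: cells[name]))
    return suppressed

# Cells of the region North from the hierarchical example (total 63).
north = {"Groningen": 21, "Friesland": 19, "Drenthe": 23}
print(sorted(secondary_suppress(north, {"Friesland"})))
```

In a hierarchical or linked setting the additionally suppressed cell may itself be recoverable from another total, so suppressions have to be co-ordinated across all tables at once.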