Progress towards a table builder with in-built disclosure control for 2021 Census Keith Spicer UNECE, 22 September 2017.

Presentation transcript:

Contents
- Context: 2011 Census, 2021 Census
- Targeted Record Swapping
- Outputs Package (Table Builder)
- Cell Key Method
- Perturbing Zeros

2011 Census
Targeted record swapping
- Targeted to "risky" records
Table redesign
- Criteria of % of 1s that are "real" and attribute disclosures that are "real"
- Sparsity gives higher chance of disclosure
- Sparsity also gives perception of disclosure

2011 Census
- Every table had to be checked for disclosure
- Timing was affected

...and sometimes...
[Figure: "I wanted this table" versus "They gave me this table"]

2021 Census
UK Parliament discussed:
- Aim for 2021 to be the last traditional census in England and Wales
- Look to use administrative and other sources to replace the traditional census
- Parallel running of traditional and admin censuses in the 2021 census round

2021 Census
User concern from 2011 in three areas:
- Flexibility
- Accessibility
- Timeliness

Targeted Record Swapping I

Targeted Record Swapping II

Targeted Record Swapping III

2021 Census: Outputs Package
Aim to produce an outputs package:
- Targeted record swapping: swaps obviously identifiable people / households
- Cell key method: protects all by uncertainty, and against differencing
- User-defined tables from a table builder: allows tables quickly

Cell Key Method
1. Assign each record a random number (its record key, rkey).
2. For each cell, sum the rkeys of the records in the cell and apply a function to get a cell key (ckey), e.g. take the last two digits.
3. Use a look-up table to get the perturbation value (pvalue).
4. Apply the pvalue to the cell.
Example (Age by sex table): the cell in the 16-24 row with a count of 4 contains records r2 (rkey 4), r4 (rkey 61), r56 (rkey 7) and r72 (rkey 90). Sum = 162, so taking the last two digits gives ckey = 62. The look-up table entry for cell value 4 and cell key 62 gives +1, so the cell is published as 5.
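The four steps can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the ONS implementation: the function names, the modulus of 100 (standing in for "take the last two digits") and the single-entry look-up table are hypothetical. Only the record keys, the sum of 162 and the cell key of 62 come from the slide.

```python
def cell_key(record_ids, rkeys, modulus=100):
    """Sum the record keys in a cell and keep the last two digits.

    The modulus is an assumption standing in for "take last two digits".
    """
    return sum(rkeys[r] for r in record_ids) % modulus


def perturb(count, ckey, ptable):
    """Look up the perturbation for (cell value, cell key) and apply it.

    Combinations missing from the hypothetical table are left unperturbed.
    """
    return count + ptable.get((count, ckey), 0)


# Record keys from the slide example (step 1 would assign these at random).
rkeys = {"r2": 4, "r4": 61, "r56": 7, "r72": 90}
cell = ["r2", "r4", "r56", "r72"]

# Hypothetical look-up table: only the entry needed for the example.
ptable = {(4, 62): +1}

ckey = cell_key(cell, rkeys)                  # 162 -> last two digits = 62
published = perturb(len(cell), ckey, ptable)  # 4 + 1 = 5

# The same set of records always yields the same cell key, which is why
# the same cell receives the same perturbation in every table.
same_cell_elsewhere = cell_key(list(reversed(cell)), rkeys)
```

Because the cell key depends only on which records are in the cell, a cell that appears in two different user-defined tables is perturbed identically in both.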

Notes for Cell Key Method
- Adapted from the "ABS method"
- Method primarily for protecting against differencing
- We are looking at a light touch (record swapping is still the primary approach)
- Considering the need to retain 1s and 2s in outputs
- Introduces another layer of uncertainty for an intruder
- Consistency in the same cell across tables
- Some inconsistencies in breakdowns

Perturbing Zeros I
Work in progress
- An additional n 1s are perturbed to 0s
- Can balance by perturbing n 0s to 1s
- Counts are therefore unbiased
- Increases protection through uncertainty
- Means that a 1 does not necessarily represent a record in the microdata
- Must ensure that "structural zeros" are not changed

Perturbing Zeros II
Methods for this are being assessed. Example:
- Zero cells need a "cell key", but they contain no records and so no record keys
- Each variable has a set of category keys
- The combination (sum) of the category keys gives the cell key (for any structural zero, the cell key is set to 0)
- The n zero cells with the highest cell keys are perturbed from 0 to 1
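The example approach for zero cells might be sketched as follows. This is a hedged illustration only, since the slide says the methods are still being assessed. The variables, category keys, set of structural zeros and tie-breaking rule are all made up for the sketch. Structural zeros are also excluded from selection explicitly, which is an added safeguard not stated on the slide.

```python
def zero_cell_key(cell, variables, category_keys, structural_zeros):
    """Cell key for an empty cell: the sum of its category keys.

    Structural zeros get a cell key of 0, as described on the slide.
    """
    if cell in structural_zeros:
        return 0
    return sum(category_keys[v][c] for v, c in zip(variables, cell))


def perturb_zeros(zero_cells, n, variables, category_keys, structural_zeros):
    """Set the n zero cells with the highest cell keys to 1.

    Assumption: structural zeros are never candidates, and ties are
    broken by cell label so the result is deterministic.
    """
    candidates = [c for c in zero_cells if c not in structural_zeros]
    ranked = sorted(
        candidates,
        key=lambda c: (
            zero_cell_key(c, variables, category_keys, structural_zeros),
            c,
        ),
        reverse=True,
    )
    chosen = set(ranked[:n])
    return {c: (1 if c in chosen else 0) for c in zero_cells}


# Hypothetical variables and category keys, for illustration only.
variables = ("age", "marital")
category_keys = {
    "age": {"0-15": 17, "16-24": 42, "25-34": 8},
    "marital": {"single": 30, "married": 55},
}
structural_zeros = {("0-15", "married")}
zero_cells = [
    ("0-15", "married"),   # structural zero, cell key 0
    ("16-24", "married"),  # 42 + 55 = 97
    ("25-34", "single"),   # 8 + 30 = 38
    ("25-34", "married"),  # 8 + 55 = 63
]

result = perturb_zeros(zero_cells, 2, variables, category_keys, structural_zeros)
```

With n = 2, the two cells with keys 97 and 63 become 1s, while the structural zero and the lowest-keyed cell stay at 0.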

Key Points
- Aim to have a Table Builder for users
- Targeted Record Swapping
- Cell Key Method
- Try to retain small cell counts
- Benefits of this approach to other collections
- Work continuing
- Other areas: microdata products, origin-destination tables, admin census

Questions and Discussion