Microdata access in practice Felix Ritchie. Overview Concerns Conceptual and practical concerns International practice UK experience Key lessons.

Presentation transcript:

Microdata access in practice Felix Ritchie

Overview
Concerns
Conceptual and practical concerns
International practice
UK experience
Key lessons

Conceptual concerns
Flexibility
Convenience
Confidentiality
Practicality
Scalability
Cost

Practical considerations
Location
– on-site laboratories
– distributed centres
– local access
Data management
– distributed vs centralised
Processing facility
– fat vs thin clients
Remote job submission

International practice – social data
Characteristics
– easy to anonymise usefully
– unlinkable
– dominate microdata research
Accessed through
– anonymised files with almost unrestricted release
– scientific use, CURF, etc
identifiable data with limited release (eg special license, on-site lab, remote access, remote job submission)
– released with identifying variables for NSI-work only
– easily identified observations typically not useful statistically

International practice – business data
almost always restricted/zero access
– identifying characteristics often useful ones
– data typically identifiable, even in scientific use files
– no access is the international norm
where access is provided:
– on-site labs and special licenses dominate
– moves towards centralised thin-client systems (UK, Denmark, Sweden, Netherlands, Slovenia, US)
local access in Scandinavia
Four main areas of development
– making useful anonymous files (Canada, Germany)
– synthetic data (US)
– remote job submission (Australia, NZ, US)
– remote access (non-NSI sites) through thin client systems

International practice – health and Census data
share characteristics with business and social data
– identifying characteristics often useful ones
– Census presents special problems because of inclusion probability
large variations on confidentiality within and across countries
often not collected by NSI
in general treated like business data

UK experience – the strategy
[Diagram: scale running from "More confidential, more secure" to "Less confidential, easier access"]
Access routes: No release; Virtual microdata laboratory; Special licence; WebUKDA; [Remote job submission]
Data: Business data, Census data; Not anonymised Census, health data, OGD access to business data; GHS; LFS; Aggregate data

UK experience – the VML
Limited lab experience
Thin clients used to simulate on-site laboratory
– cost
– security
– flexibility
– ease of management
Strict technical regime to ensure confidentiality
Practicality of servicing researchers through
– training
– shifting of responsibility
– limited support

Lessons learned
Use the law intelligently
– challenge unhelpful interpretations
– use laws actively to support procedures
Demonstrate benefits soon, clearly, continuously
Running a lab:
– Practising researchers design and manage lab
– Sort out rules in advance especially confidentiality
actively involve users
Continual development in operations and principles

Felix Ritchie