UK Data Access Practices Felix Ritchie. Overview The legislative model The data model The security model Developments Current key concerns.

Slides:



Advertisements
Similar presentations
Microdata access in practice Felix Ritchie. Overview Concerns Conceptual and practical concerns International practice UK experience Key lessons.
Advertisements

ONS data – improving access Richard Laux National Statistics and International Division, ONS.
The Statistics Act and Research Access to Data Paul J Jackson Legal Services ONS.
Output Consultation Plans and Statistical Disclosure Control Strategy developments Angele Storey and Jane Longhurst ONS.
ONS Research Data Access Strategy AGENDA Background and context Confidentiality The Strategy.
Eurostat T HE E UROPEAN PROCESS OF ENHANCING ACCESS TO E UROSTAT DATA A LEKSANDRA B UJNOWSKA E UROSTAT.
Presentation Brought to you by: New Thinking. Are your Managers at Risk ? Are Fleet costs to high ?
Secure Data Service: an improved access to disclosive data Reza Afkhami, Melanie Wright Secure Data service UKDA University of Essex IASSIST 2010, Ithaca,
Data-Sharing and Governance Consultation ANALYSIS OF RESPONSES.
EGM – Population & Housing Censuses Eurostat / UNECE - Geneva - 24/25 May 2012 Beyond 2011 The future of population statistics (England & Wales) Alistair.
Operationalising ‘safe statistics’ the case of linear regression Felix Ritchie Bristol Business School, University of the West of England, Bristol.
Improvement Service / Scottish Centre for Regeneration Project: Embedding an Outcomes Approach in Community Regeneration & Tackling Poverty Effectively.
Access routes to 2001 UK Census Microdata: Issues and Solutions Jo Wathan SARs support Unit, CCSR University of Manchester, UK
The UK Statistics and Registrations Services Act Tanvi Desai Data Manager LSE Research Laboratory Research Laboratory IASSIST Tampere.
International data sharing via standards Felix Ritchie.
Developing a Statistical Disclosure Standard for Europe Tanvi Desai LSE Research Laboratory Data Manager Research Laboratory IASSIST 2010: Cornell.
Strengthening Data Security Dr. Sharon Bolton Dr. Matthew Woollard.
Synthetic Data within the Risk – Utility Framework Keith Spicer Office for National Statistics.
Chapter © 2012 Pearson Education, Inc. Publishing as Prentice Hall.
Developing a Partner Reward Strategy – to build competitive advantage Peter Scott Consulting
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
Statistics Canada’s Real Time Remote Access Solution 2011 MSIS Meeting – Karen Doherty May 2011.
Dissemination to support Research & Analysis John Cornish.
WP. 46 Providing access to data and making microdata safe, experiences of the ONS Jane Longhurst Paul Jackson ONS.
1 Statistical Disclosure Control for Communal Establishments in the UK 2011 Census Joe Frend Office for National Statistics.
Plans for Access to UK Microdata from 2011 Census Emma White Office for National Statistics 24 May 2012.
User-focused Threat Identification For Anonymised Microdata Hans-Peter Hafner HTW Saar – Saarland University of Applied Sciences
Census/NeSS Roadshows March 2003 Better Information Initiatives.
GEOG3025 Confidentiality and social implications.
Improving Integration of Learning and Management Systems Paul Shoesmith Director of Technical Strategy Becta.
FLEET MANAGEMENT – A CONTINUING CHALLENGE Nigel Trotman, Business Relationship Manager, Whitbread Plc.
Access to sensitive data in the UK: a principles-based approach Felix Ritchie.
RIA: Communication – building credibility Aleš Pecka Department of Regulatory Reform and Public Administration Quality Ministry of Interior, Czech Republic.
Access to Microdata Felix Ritchie Business Data Linking.
Frameworks for the Access and Use of Administrative Data, With the Example of Current Practice in the UK Steven Vale Office for National Statistics UK.
Census Quality: another dimension! Paper for Q2008 conference, Rome Louisa Blackwell Quality Assurance Manager, 2011 Census.
26 August 2011 Future of access to EU confidential data for scientific purposes Jean-Marc Museux Eurostat – 58th ISI conference,
Modernisation of Statistics Production Stockholm November 2009 Summary and Conclusions New York 24 February 2010 Mats Wadman Deputy Director General Statistics.
Incentive compatibility in data security Felix Ritchie, ONS (Richard Welpton, Secure Data Service)
Statistics Netherlands’ modernization programme: the use of administrative data, lessons learned and the way ahead. Geert Bruinooge Assistant Director.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
Data for secondary analysis: the experience of the UK Data Archive Hilary Beedham UK Data Archive.
Development of UK Virtual Microdata Laboratory Felix Ritchie Shanghai, March 2010.
Developing the prototype Longitudinal Business Database: New Zealand’s Experience Julia Gretton IAOS Conference Shanghai, China, October 2008
Data Dissemination Conditions in the European Statistical System (ESS) UNECE, Warschau May 2009.
Chapter © 2012 Pearson Education, Inc. Publishing as Prentice Hall.
Joint UNECE/Eurostat work session on statistical data confidentiality October 2015 Helsinki, Finland Circle of trust Maurice Brandt DESTATIS.
Census 2011 – A Question of Confidentiality Statistical Disclosure control for the 2011 Census Carole Abrahams ONS Methodology BSPS – York, September 2011.
Researchers’ Usage of Microdata The example of Statistics Finland Advanced presentation – Some additional details Consultation Mission on Promoting the.
Beyond 2011 Voluntary Sector Statistics User Event Minda Phillips Amelia Ash.
Expanding the Role of Synthetic Data at the U.S. Census Bureau 59 th ISI World Statistics Congress August 28 th, 2013 By Ron S. Jarmin U.S. Census Bureau.
Development of UK Virtual Microdata Laboratory
Access to business data: Is the balance of risks right?
UK Data Service Secure Lab
Legal, political and methodological issues in confidentiality in the ESS Maria João Santos, Jean-Marc Museux Eurostat.
Sabrina Iavarone Senior User Services Officer
Ethical questions on the use of big data in official statistics
Education and Training Statistics Working Group – June 2014
Open data: who needs it? Presentation by Felix Ritchie
Treatment of statistical confidentiality Part 5 Summary & reflection: rules versus principles Introductory course Trainer: Felix Ritchie CONTRACTOR IS.
Enhancing statistical practices to improve data sharing
Nicolás J. I. Rodríguez & Arild Mellesdal
BETTER AND PROPER ACCESS TO PACIFIC MICRODATA
Modernisation of Statistics Production Stockholm November 2009
Item 2.2 of the Agenda Remote access to confidential data for researchers: possible actions under the 7th Framework Programme Pascal JACQUES Unit B 5 15.
Access to business data: Is the balance of risks right?
Treatment of statistical confidentiality Part 5: Rules versus principles Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK.
Dealing with confidential data Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION.
Treatment of statistical confidentiality Introductory course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE.
Presentation transcript:

UK Data Access Practices Felix Ritchie

Overview The legislative model The data model The security model Developments Current key concerns

The legislative model (1) Mixture of statutes and common law until… Statistics and Registration Services Act 2007 –Didn’t abolish existing gateways for research –Created a new gateway – ‘Approved Researchers’ –Allowed for cross-govt data sharing… –…but not for research purposes unless specifically agreed –Clarified limits of European data sharing –ONS given a statutory duty to support research

The legislative model (2) No theoretical limits on who can have access to enormous range of govt data –both within govt and in academia …but not a free-for-all ONS has a duty to protect confidentiality –even for Approved Researchers –data release has to be consistent with need → the data model

The data model (1) ‘Spectrum’ of access points balancing –value of data –ease of use –disclosure risk for a given level of confidentiality, maximise data use and convenience no ‘one-size-fits-all’ solution –no absolute prohibitions –trade-off is made explicit –users determine appropriate level of access

Type of access NoneVML ONS sites VML Govt sites Secure data service Special licences Licensed data archive Internet Anonymi- sation LittleComplete SDC of inputs NoneComplete Restric- tions on users ManyNone SDC of outputs CompleteNone Examples: Census data Original data Data for ONS linking ONS contractor Anon. CD-ROM Web tables Enterprise data Original data Identified data for ONS linking Identifiable data for analysis Govt. users only Web tables RDCs Use of confidential data: the access spectrum

The data model (2) Options should cover most cases –Can’t be perfect in every case –But the jump from one solution to another reflects data utility and patterns of research use Pretty efficient –Fairly transparent –Users balance their own costs/benefits –Economies of scale delivering mass solutions –eg UKDA, VML How do we define/describe access points? → the security model

The security model (VML version) valid statistical purpose trusted researchers anonymisation of data technical controls around data disclosure control of results safe projects + safe people + safe data + safe outputs  safe use + safe setting

Safety criterion VMLSDS (provisional) One-off cases “Special Licence” UK Data Archive Internet People*ARs/ Civil Servants ARs? UK academicsAnyone ProjectsScrutiny by MRP Academic projects None Data (in theory) Data (in practice) Any Unidentified Unlinkable? N/A Anonymised, low risk of identification Anonymised, almost no risk of identification Anonymised, no risk of identification SettingsSecure thin client ?Use on restricted IT systems Use by academics only None OutputsONS staff checked SDS staff checked, ONS guidelines ?Researchers agree to follow ONS guidelines No checking Use of confidential data: the access spectrum for ONS data at present *AR = Approved Researcher

Access: a summary No theoretical restrictions wide ranging and flexible legal basis

Remote access in the UK: the VML (1) Probably the most important research data resource in the UK after the UK Data Archive (and the internet) Expanding access from other govt depts. Data acquisitions: –internal ONS versions of social datasets –Other government dept data –Administrative data –Census 2011 detailed microdata?

Remote access in the UK: the VML (2) Highly theorised –Particularly in disclosure control Strong researcher relationship –compulsory training gives initial investment in researcher buy-in Next stage: full cost-benefit analysis –Planning model in context of new alternatives –CBA to include purpose of RDC

Developments in remote access VML clones being set up in academia –Possibly elsewhere in govt too –No possibility of VML being accessible over internet in near future –Likely to develop into a two-tier system VML practices and models adopted –for increasing range of data –across wider range of operations

Current key concerns IT –lack of resource –still some basic operational issues unresolved Delays in increasing access points –partly money, partly IT, partly culture Demand growth –30%-50% each year –Likely to be higher

Current potential concerns Potential in Statistics Act –possibility for ONS’ policies to be challenged –surprising (unwelcome) demands for information? social data in VML partially a pre-emptive response New data types bringing new rules Fragmentation of RDC practice in UK

Background concern: fear of the new Relative risk still poorly understood –Example VML temporarily closed for potential security flaw One data area returned to old non-VML solution: letting external visitors log on using ONS staff usernames VML was re-opened after a week for ONS staff and only three weeks later for external visitors But the flaw could only be exploited by ONS staff… Resistance to virtual solutions in favour of familiar –remote access always seen as a limitation despite much better data quality –‘distributed access’ no substitute for ‘distributed data’

Not current key concerns Staff resources –Fast training time –Supportive researcher base researcher buy-in => very lean processes Methodological issues –RDC-specific SDC methods proving robust Legal issues –Statistics law so far proving flexible enough to provide reasonable responses to all needs “reasonable”=ONS and researchers happy that balance between access and confidentiality is fair

Summary Clear legislative model and strong theoretical basis –policy decisions relatively easy Main difficulty for ONS is managing expansion of demand –meeting ONS internal needs (just, for now) –long way off meeting external demands

Contact Felix Ritchie Microdata Analysis and User Support

VML resources G6 G7 EO AO/AA SEO HEO/RO G7 HEO/RO Operations and analysisStrategic resources HEO/RO Strategic management Operational management Operations Support Target June 09 Minimum Current