microdata.no Instant Access to Microdata

Slides:



Advertisements
Similar presentations
Eurostat T HE E UROPEAN PROCESS OF ENHANCING ACCESS TO E UROSTAT DATA A LEKSANDRA B UJNOWSKA E UROSTAT.
Advertisements

Issues in Designing a Confidentiality Preserving Model Server by Philip M Steel & Arnold Reznek.
/// MELSEC Safety /// QS001CPU /// QS0J61BT12 /// QS0J65BTB2-12DT /// MELSEC Safety /// Mitsubishi Electric - MELSEC Safety - Training Documentation -
1 L U N D U N I V E R S I T Y a home grown, bespoke institutional Federated Search tool JIBS Conference at The John Rylands University Library,
Implementation of the CoP in SLOVENIA Cooperation with data users Genovefa RUŽIĆ Deputy Director-General.
The Social Statistics Database: Invaluable source of micro-data for socio-economic statistics Johan van Rooijen.
RESEARCHERS‘ ACCESS TO HEALTH DATA – FACTS AND CHALLENGES Metka Zaletel National Institute of Public Health 24 March 2015.
Plannes security for items, variables and applications NEPS User Rights Management.
Mogens Grosen Nielsen Statistics Denmark
Implementation of GSBPM, DDI and SDMX reference metadata at Statistics Denmark UNECE workshop 5-7 May 2015 Mogens Grosen Nielsen
Electrical Engineering Department Software Systems Lab TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY Meeting recorder Application based on Software Agents.
Robofest 2001 Online Management System Jim Needham MCS 4833/01 Senior Project Dr. Chan-Jin Chung, Ph.D.
Version 4 for Windows NEX T. Welcome to SphinxSurvey Version 4,4, the integrated solution for all your survey needs... Question list Questionnaire Design.
Proposing the Model for Croatian Remote Access (RA) Safe Centre for Statistical Microdata Martina Poljičak Central Bureau of Statistics
Statistical file systems and archive statistics – Experiences from Statistics Sweden Contribution to the Nordbotten Seminar at the Nordic Statistical Meeting.
Mara Cammarrota Italian National Institute of Statistics Development of Information System and Corporate Products, Information Management and Quality Assessment.
User-focused Threat Identification For Anonymised Microdata Hans-Peter Hafner HTW Saar – Saarland University of Applied Sciences
1 REMOTE ACCESS INFRASTRUCTURE FOR REGISTER DATA / 1 RAIRD Remote Access Infrastructure for Register Data - metadata aspects Ørnulf Risnes,
Metadata driven application for data processing – from local toward global solution Rudi Seljak Statistical Office of the Republic of Slovenia.
InSPIRe Australian initiatives for standardising statistical processes and metadata Simon Wall Australian Bureau of Statistics December
Access to microdata in Statistics Estonia First DwB European Data Access Forum Luxembourg, 28th March 2012 Tuulikki Sillajõe.
1 REMOTE ACCESS INFRASTRUCTURE FOR REGISTER DATA / 1 RAIRD Remote Access Infrastructure for Register Data Johan Heldal *, Elin Monstad **,
The availability of Dutch census microdata Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands Division Social.
1 1 Anonymised Integrated Event History Datasets for Researchers Johan Heldal Statistics Norway.
JOINT UN-ECE/EUROSTAT MEETING ON POPULATION AND HOUSING CENSUSES GENEVA, MAY 2009 DETERMINING USER NEEDS FOR THE 2011 UK CENSUS IAN WHITE, Office.
State Statistical Institute Berlin-Brandenburg Jörg Höhne / Julia HöningerResearch Data Centre Morpheus – Remote Data Access with a Quality Measure Joint.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Michelle Simard, Thérèse Lalor Statistics Canada CSPA Project Manager UNECE Work Session on Statistical Data Confidentiality Helsinki, October 2015 Confidentialized.
1 1 Confidentiality protection of large frequency data cubes UNECE Workshop on Statistical Confidentiality Ottawa October 2013 Johan Heldal and Svetlana.
The use of GSIM in Statistics Norway Jenny Linnerud Senior Adviser Department of IT Statistics Norway 10th June 2014, Nizhny Novgorod.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
Generic Statistical Information Model (GSIM) Jenny Linnerud
Saturday, January 23, 2016 Towards an easy use of CIRCABC Communication and Information Resource Centre for Administrations, Businesses and Citizens By.
GSIM in practice in Norway Jenny Linnerud – Ørnulf Risnes – Arofan Gregory -
Remote Analysis Server for Tabulation and Analysis of Data Tarragonia, October 2011 James Chipperfield and Frank Yu (presenter)
Researchers’ Usage of Microdata The example of Statistics Finland Advanced presentation – Some additional details Consultation Mission on Promoting the.
The Aim of Help Direct To support all adults across Lancashire to access practical support, guidance and/or information around a range of issues affecting.
 1- Definition  2- Helpdesk  3- Asset management  4- Analytics  5- Tools.
Advanced Data Planning Tool Rajiv Ranjan Technical Programme Officer PARIS21.
Current Research Information SysTem In Norway
CRIStin, reporting and rewarding research
Supportive Services Demonstration
Biruta Sloka (University of Latvia)
101 Introduction aba.com BANKERS.
Census developments in the Netherlands
Appendix A Barb Ericson Georgia Institute of Technology May 2006
Reauthorization of the Workforce Investment Act of 1998: Listening Session for Disability Stakeholders October 1, :00 - 5:00PM (EST)
The Healthy Workplaces Summit 2017,
GSIM Implementation at Statistics Finland Session 1: ModernStats World - Where to begin with standards based modernisation? UNECE ModernStats World Workshop.
Workshop and Training Overview
Sabrina Iavarone Senior User Services Officer
Civil Registration Process: Place, Time, Cost, Late Registration
1 What is EGR? ESTP course on EGR 6-7 September 2016.
Brisbane Accord Group SESSION 15. APPLICATION AND UTILIZATION OF CIVIL REGISTRATION AND VITAL STATISTICS INFORMATION Civil Registration Process: Place,
Anja Burghardt, Institute for Employment Research (IAB)
Introduction of KNS55 Platform
5 November, 2018 Nuku’alofa, Tonga
Towards an easy use of CIRCABC Interest group leader training
Nicolás J. I. Rodríguez & Arild Mellesdal
Noumea, New Caledonia, 3 to 7 December, 2018
Exercise 2 students completed a higher education in Norway in 2004/05
Item 2.2 of the Agenda Remote access to confidential data for researchers: possible actions under the 7th Framework Programme Pascal JACQUES Unit B 5 15.
Federal Statistical Office Germany Research Data Centre
JCPS Final Corrective Action Plan
Item 4.3 Confidentiality on the fly
Transformation of the National Statistical System: Experience
5/24/ :22 AM © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Perturbative methods for ESS census tables
microdata.no Instant Access to Microdata
ESTP course on EuroGroups Register
Presentation transcript:

microdata.no Instant Access to Microdata CESS 2018 / Conference of European Statistics Stakeholders Bamberg 18-19 October 2018

Thank you for inviting us Dr. Johan Heldal Senior Adviser | Division for Methods Department for Digitalisation and shared business services | Statistics Norway johan.heldal@ssb.no Svein Johansen Senior Adviser | Division for Access to Microdata Department of Social Statistics | Statistics Norway svein.johansen@ssb.no Ørnulf Risnes Head of Digital Development Norwegian Centre for Research Data (NSD) ornulf.risnes@nsd.no

- No application for data - Self service - Negligible bill For researchers, PhD and master students in approved institutions - Fast access - No application for data - Self service - Negligible bill Microdata.no is a new (March 2018) scientific service where researchers get access to a defined database with sharp register microdata without any form for application. They work online with instant access to whichever populations and variables they need, presently costless, in the future at a substantial lower cost than today. Microdata.no was launched in March this year. By now some 20 institutions (among them the main universities and research institutes) with some 150 users are connected to the system. Present content: 124 variables on population, education, labour market, income, and welfare benefits. Developed 2012-2018 in co-operation with NSD – Norwegian Centre for Research data, and funded by The Research Council of Norway with some 4 mill Euro 

microdata.no Launched in March 2018. 20 institutions with some 150 users are connected to the system. Present content: 124 register variables on population, education, labour market, income, and welfare benefits. Event history data on 10,2 million individuals. Developed 2012–2018 in co-operation with NSD – Norwegian Centre for Research data, and funded by The Research Council of Norway.

Research is about results, not records New and different approach: Metadata driven Metadata open for anybody. Sharp data Individual records are invisible Data base stored in Statistics Norway Safe login GSIM (General Statistical Information Model) Results: confidentially safe on the fly Data base stored at Statistics Norway. Data hidden from the user. Only anonymous results downloadable The clue: Anonymous = no need for applications and allowances. The egg that WON’T stand: anonymous data. Expensive, reducing value for researchers More realistic: make the output, the results anonymous. But then the individual records must be invisible. Solution: Metadata driven. Sharp data stored at Statistics Norway. No information hidden.

This is the front page. It is only in Norwegian so far This is the front page. It is only in Norwegian so far. By clicking the square «Variabler» we come to the metadata page.

Focus: Rich metadata Integration of data & metadata General structures for data GSIM Metadata are open for anybody But so far only in Norwegian

Infrastructure layout – metadatadriven, remote access DataStore Statistics Data are stored in microdata.no DataStore. The Data are stored in Statistics Norway. Users import interesting variables to User workspace and give them User Workspace

Confidentially safe output Statistics Based on the variables imported to the User Workspace, users can subset populations, recode variables, create new variables and do analyses. The user defines the names of the variables in his or her workspace. User Workspace

Temporality structure, an example. 4 classes of Temporality: Event (start-stop), Single Time Point, Unchangeable, Accumulated

Data store variable window Command interface Work space variable window Click for metadata Data store variable window Click for metadata Command interface. This shows the meaning of metadata driven approach. When importing variables from Data Store the user defines his or her own names in the work space.

Results of analysis Command interface Change to Script mode Chat for assistance Command mode Results of analysis

You can run all commands an ensemble Results in the right pane mode Script mode interface. You can run all commands an ensemble Script in the left pane Results in the right pane Script mode interface Run

Metadata play several roles Informative: Open access at microdata.no definitions, data types, temporality, code lists Tech function. Data access only through metadata. Supportive function. Interactive assistance in User Interface.

Statistical Disclosure Control (SDC) Minimum size of populations to be analysed: At least 1000 Constant unbiased restricted maximum entropy noise addition on counts (max ±5) Inspired by the «Australian Bureau of Statistics method». No perturbations to negative counts and no counts 1-4. Magnitude totals adjusted proportionally to pertubed counts Automatic winsorization of all magnitude variables when subsetting hides extreme values All activity on microdata.no is logged Aims at preventing realistic disclosure scenarios

Next on the Agenda Include variables from new sources E.g health registers More statistical methods available Give access to new groups, e.g. public administration. International access to researchers. Strengthened SDC methods. Applications for new grants from the Norwegian Research Council are in (a week ago). We consider the metadata driven approach to be very useful, and have a strong wish to use it widely, both externally and internally in Statistics Norway.

Appreciating the value of register data for research and the value of research, we want to create the best possible solution for register based research data Thank You