microdata.no Instant Access to Microdata


microdata.no: Instant Access to Microdata. NTTS (New Techniques and Technologies for Statistics), Brussels, 14 March 2019.

Thank you for inviting us.
Dr. Johan Heldal, Senior Adviser, Division for Methods, Department for Digitalisation and shared business services, Statistics Norway, johan.heldal@ssb.no
Svein Johansen, Senior Adviser, Division for Access to Microdata, Department of Social Statistics, Statistics Norway, svein.johansen@ssb.no
Ørnulf Risnes, Head of Digital Development, Norwegian Centre for Research Data (NSD), ornulf.risnes@nsd.no

Fast access. No application for data. Web interface with self-service. Microdata invisible. For researchers, PhD students and master's students at approved institutions. Microdata.no opened in March 2018 as a scientific service where researchers get access to a defined database of sharp register microdata without any form of application. They work online with instant access to whichever populations and variables they need, at present free of charge and in the future at a substantially lower cost than today. By now some 30 institutions (among them the main universities and research institutes) with some 200 users are connected to the system. Present content: 124 variables on population, education, labour market, income, and welfare benefits. Developed 2012-2018 in co-operation with NSD (Norwegian Centre for Research Data) and funded by The Research Council of Norway with some 4 million euro.

microdata.no: Launched in March 2018. 30 institutions with some 200 users are connected to the system. Present content: 124 register variables on population, education, labour market, income, and welfare benefits. Event history data on 10.2 million individuals, covering all national IDs ever issued in Norway. Developed 2012-2018 jointly by Statistics Norway and NSD (Norwegian Centre for Research Data). Funded by The Research Council of Norway. Written in Python, using the pandas data analysis library.

Research is about results, not records. New and different approach: metadata driven. Metadata are open to everybody. Microdata records are invisible. Database stored at Statistics Norway. Safe login with three-factor authentication. GSIM (Generic Statistical Information Model). Results are made confidentiality-safe on the fly. Data are hidden from the user, and only anonymous results can be downloaded. The key point: anonymous = no need for applications and permissions. The egg that WON'T stand: anonymous data, which are expensive to produce and reduce the value for researchers. More realistic: make the output, the results, anonymous. But then the individual records must be invisible. Solution: metadata driven. Sharp data stored at Statistics Norway. No information hidden.

This is the front page. It is only in Norwegian so far, and to log on you need a Norwegian national ID. An English version of the page texts will come in the near future. The user interface and metadata are in Norwegian. Developing an English version and enabling international login is part of the 4-year plan. Clicking the square «Variabler» takes us to the metadata page.

Focus: Rich metadata. Integration of data and metadata. General structures for data. GSIM. Metadata are open to anybody, but so far only in Norwegian.

Infrastructure layout: metadata-driven remote access. Diagram labels: DataStore, User workspace. Data are stored in the microdata.no DataStore at Statistics Norway. Users import the variables they are interested in to the User workspace and give them customized names if desired.

Confidentiality-safe output. Based on the variables imported to the User workspace, users can subset populations, recode variables, create new variables and do analyses. The user defines the names of the variables in his or her workspace. Diagram labels: Subsetting, User workspace.

Temporality structure, an example. 4 classes of Temporality: Event (start-stop), Single Time Point, Unchangeable, Accumulated
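The four temporality classes can be illustrated with a small Python sketch. This is not the microdata.no data model: the `Spell` record, the `value_at` helper and the convention that the stop date is exclusive are assumptions made for illustration. Only the four class names come from the slide.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum
from typing import List, Optional


class Temporality(Enum):
    # The four classes named on the slide.
    EVENT = "event (start-stop)"
    SINGLE_TIME_POINT = "single time point"
    UNCHANGEABLE = "unchangeable"
    ACCUMULATED = "accumulated"


@dataclass(frozen=True)
class Spell:
    """One start-stop record of an EVENT variable (hypothetical layout)."""
    unit_id: int
    value: str
    start: date
    stop: Optional[date]  # None means the spell is still ongoing


def value_at(spells: List[Spell], unit_id: int, when: date) -> Optional[str]:
    """Return the value valid for unit_id on the date `when`, or None.

    Assumption: a spell covers start <= when < stop (stop is exclusive).
    """
    for spell in spells:
        if spell.unit_id != unit_id:
            continue
        if spell.start <= when and (spell.stop is None or when < spell.stop):
            return spell.value
    return None


# Hypothetical event history: one person moving between two municipalities.
history = [
    Spell(1, "Oslo", date(2010, 1, 1), date(2015, 1, 1)),
    Spell(1, "Bergen", date(2015, 1, 1), None),
]
```

With start-stop records like these, a cross-section at any date can be derived from the event history, for example `value_at(history, 1, date(2012, 6, 1))`.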

Screen labels: Data store variable window (click for metadata), Command interface, Work space variable window (click for metadata). This shows what the metadata-driven approach means in practice. When importing variables from the Data Store, the user defines his or her own names in the work space.
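The import step described above can be sketched in Python as follows. This is a toy model, not the microdata.no command language or its implementation: the `Workspace` class, the `import_variable` method and the catalogue entries are all hypothetical. The point it illustrates is that the user only ever handles variable names and metadata, never the records themselves.

```python
# Hypothetical metadata catalogue: data store variable name -> metadata.
CATALOGUE = {
    "POPULATION_SEX": {"data_type": "code", "temporality": "unchangeable"},
    "INCOME_WAGES": {"data_type": "number", "temporality": "accumulated"},
}


class Workspace:
    """Toy user workspace that maps user-chosen names to data store variables."""

    def __init__(self, catalogue):
        self._catalogue = catalogue
        self.variables = {}  # user-chosen name -> data store variable name

    def import_variable(self, store_name, alias=None):
        """Register a data store variable under a user-chosen name.

        Returns the variable's metadata; no microdata are exposed.
        """
        if store_name not in self._catalogue:
            raise KeyError(f"unknown data store variable: {store_name}")
        name = alias or store_name
        if name in self.variables:
            raise ValueError(f"name already used in workspace: {name}")
        self.variables[name] = store_name
        return self._catalogue[store_name]


workspace = Workspace(CATALOGUE)
wages_metadata = workspace.import_variable("INCOME_WAGES", alias="wages")
```

Here `workspace.variables` maps the user's name `wages` to the data store variable, and every later command would refer to `wages` only.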

Command mode. Screen labels: Command interface, Results of analysis, Change to Script mode, Chat for assistance.

Script mode interface. The script is in the left pane and the results are in the right pane. You can run all commands together as one script. Screen labels: Run, To command mode.

Metadata play several roles. Informative function: open access at microdata.no to definitions, data types, temporality, and code lists. Technical function: data access only through metadata. Supportive function: interactive assistance in the user interface.

Statistical Disclosure Control (SDC). Minimum size of populations to be analysed: at least 1 000. Constant, unbiased, restricted maximum-entropy noise addition on counts (max ±5), inspired by the «Australian Bureau of Statistics method». Perturbation never yields negative counts or counts of 1-4. Magnitude totals are adjusted proportionally to the perturbed counts. Automatic winsorization of all numerical variables when subsetting and importing, which hides extreme values. All activity on microdata.no is logged. The aim is to prevent realistic disclosure scenarios.
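The disclosure-control rules listed above can be sketched in Python. This is a minimal illustration, not the microdata.no algorithm. The uniform noise distribution, the hash-based seeding that keeps the noise constant for a given cell, the rounding of small counts to 0 or 5, and the 1st/99th percentile cut-offs are all assumptions. Only the threshold of 1 000, the ±5 bound, the ban on negative counts and counts of 1-4, and the proportional adjustment of totals come from the slide.

```python
import hashlib
import random

MIN_POPULATION = 1000  # from the slide: at least 1 000 units
MAX_NOISE = 5          # from the slide: perturbation of at most +/-5


def check_population(n_units):
    """Refuse analysis of populations below the minimum size."""
    if n_units < MIN_POPULATION:
        raise ValueError(f"population of {n_units} is below {MIN_POPULATION}")
    return n_units


def perturb_count(true_count, cell_key):
    """Add bounded integer noise that is constant for a given cell key.

    Assumption: noise is uniform on -5..5 and seeded by a hash of cell_key,
    so repeating the same query returns the same perturbed count.
    """
    digest = hashlib.sha256(cell_key.encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    noisy = true_count + rng.randint(-MAX_NOISE, MAX_NOISE)
    # Assumption: results below 5 are moved to 0 or 5, so that no negative
    # counts and no counts of 1-4 are ever released.
    if noisy < 3:
        return 0
    if noisy < 5:
        return 5
    return noisy


def adjust_total(true_total, true_count, noisy_count):
    """Scale a magnitude total in proportion to the perturbed count."""
    if true_count == 0:
        return 0.0
    return true_total * noisy_count / true_count


def winsorize(values, lower_q=0.01, upper_q=0.99):
    """Clamp values to the given quantiles (the cut-offs are assumptions)."""
    if not values:
        return []
    ordered = sorted(values)
    last = len(ordered) - 1
    low = ordered[round(lower_q * last)]
    high = ordered[round(upper_q * last)]
    return [min(max(v, low), high) for v in values]
```

Seeding the noise from the cell key is one way to make it constant: averaging repeated runs of the same query then reveals nothing about the true count.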

Access and reproducibility. Everybody with access to microdata.no has access to all data in the system. This guarantees that peers can reproduce all research done with its data.

Next on the agenda: Include variables from new sources, e.g. health registers. Make more statistical methods available. Give access to new groups, e.g. public administration. International access for researchers. Strengthened SDC methods. Applications for new grants from The Research Council of Norway are pending. We consider the metadata-driven approach to be very useful, and have a strong wish to use it widely, both externally and internally in Statistics Norway.

Appreciating the value of register data for research and the value of research, we want to create the best possible solution for register-based research data. Thank you.