Michelle Simard, Thérèse Lalor Statistics Canada CSPA Project Manager UNECE Work Session on Statistical Data Confidentiality Helsinki, October 2015 Confidentialized.

Slides:



Advertisements
Similar presentations
Presentation to the HLG By Gary Dunnet (Statistical Network Chair)
Advertisements

HLG, November 2013 By Jonathan Challener INTERNATIONAL COLLABORATION USE CASE: THE OECD’S STATISTICAL INFORMATION SYSTEM COLLABORATION COMMUNITY.
OECD Expert Group for International Collaboration on Microdata Access Mariarosa Lunati, OECD Statistics Directorate Luxembourg, 28 March 2012.
GSIM, CSPA, and Related Activities of the High-Level Group
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
GSBPM and GSIM as the basis for the Common Statistical Production Architecture Steven Vale UNECE
International Collaboration to Modernise Official Statistics
Statcom 2013 New York 11 High Level Group for the Modernization of Statistical Products and Services Big Data: Big Opportunity! Gosse van der Veen, Statistics.
Products and Sources: Issues and Challenges Steven Vale on behalf of the Modernisation Committee on Products and Sources.
ABS Tablebuilder and DataAnalyser Session 7 UNECE Work Session on Statistical Data Confidentiality October 2013 Daniel Elazar
Business Architecture model within an official statistical context Nadia Mignolli Giulio Barcaroli, Piero Demetrio Falorsi Alessandra Fasano Italian National.
European Conference on Quality in Official Statistics (Q2010) 4-6 May 2010, Helsinki, Finland Brancato G., Carbini R., Murgia M., Simeoni G. Istat, Italian.
Generic Statistical Information Model (GSIM) Thérèse Lalor and Steven Vale United Nations Economic Commission for Europe (UNECE)
Statistics Canada’s Real Time Remote Access Solution 2011 MSIS Meeting – Karen Doherty May 2011.
Workshop on the modernisation of statistical production and services Annual report of the UNECE High Level Group on Modernisation of Statistical Production.
Luisa Franconi Integration, Quality, Research and Production Networks Development Department Unit on microdata access ISTAT Essnet on Common Tools and.
United Nations Economic Commission for Europe Statistical Division Introducing the GSBPM Steven Vale UNECE
Innovations in Data Dissemination Thomas L. Mesenbourg, Jr. Acting Director U.S. Census Bureau United Nations Seminar on Innovations in Official Statistics.
United Nations Economic Commission for Europe Statistical Division Standards and Statistical Production Architectures Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division GSBPM Workshop Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Introduction to Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division International Collaboration to Modernise Official Statistics Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division The Common Statistical Production Architecture: An Important New Tool for Process Standardisation.
Michelle Simard Statistics Canada UNECE Worksessions on Statistical Disclosure Control Methods Helsinki, October 2015 Development of rules from administrative.
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
1 High-Level Group for the Modernisation of Statistical Production and Services Annual Workshop Gosse van der Veen, Statistics Netherlands 2013 Geneva.
1 Sharing Advisory Board Report from the Sharing Advisory Board www1.unece.org/stat/platform/display/msis or GOOGLE “MSIS wiki” Steven Vale Marton Vucsan.
Modernization Committee on Products and Sources: Report th High -Level Group Workshop on the modernization of Production and Services, Den Haag.
Multinational Enterprise Project Roundtable on Business Survey Frames Beijing, October 17-22, 2004.
Modernisation Activities DIME-ITDG – February 2015 Item 7.
United Nations Economic Commission for Europe Statistical Division Data collection and the modernisation of official statistics Steven Vale UNECE
The use of GSIM in Statistics Norway Jenny Linnerud Senior Adviser Department of IT Statistics Norway 10th June 2014, Nizhny Novgorod.
United Nations Economic Commission for Europe Statistical Division International Collaboration to Modernise Official Statistics Steven Vale UNECE
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
Generic Statistical Information Model (GSIM) Jenny Linnerud
Information about the HLG work Some considerations on HLG and related work from an NSI point of view Rune Gløersen, Statistics Norway.
GSBPM and GAMSO Steven Vale UNECE
Generic Statistical Data Editing Models (GSDEMs) Workshop on the Modernisation of Official Statistics The Hague, 24 November 2015.
GSIM, DDI & Standards- based Modernisation of Official Statistics Workshop – DDI Lifecycle: Looking Forward October 2012.
Modernization Committee on Products and Sources: Future Work 5 th High -Level Group Workshop on the modernization of Production and Services, Den Haag.
The future of Statistical Production CSPA. This webinar on CSPA (common statistical production architecture) is part of a series of lectures on the main.
Wesley Yung and Claude Poirier 2015 World Statistics Congress CSPA from a Methodologist’s Point of View.
Remote Analysis Server for Tabulation and Analysis of Data Tarragonia, October 2011 James Chipperfield and Frank Yu (presenter)
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Advancing statistics for development Marko Javorsek ESCAP Statistics Division Modernization Working Group on Production, Methods, and Standards (MWG) First.
. Steering Group on Climate Change-Related Statistics Purpose and objective Expert Forum for producers and users of climate change-related statistics 2-3.
United Nations Economic Commission for Europe Statistical Division CSPA: The Future of Statistical Production Steven Vale UNECE
Data Integration in Official Statistics 2017 Project Proposal
Generic Statistical Data Editing Models (GSDEMs)
The HLG and the “New World Order”
Thérèse Lalor Statistical Management and Modernisation Unit
Modernization Maturity Model and Roadmap
Modernization Maturity Model
Strategic vision of the HLG-BAS High-Level Group on Strategic Developments in Business Architecture in Statistics Steven Vale UNECE
Access to European microdata for scientific purposes
Generic Statistical Business Process Model (GSBPM)
GSBPM, GSIM, and CSPA.
Modernising Official Statistics
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Multinational Enterprise Project
CSPA: The Future of Statistical Production
Introducing the GSBPM Steven Vale UNECE
Item 2.2 of the Agenda Remote access to confidential data for researchers: possible actions under the 7th Framework Programme Pascal JACQUES Unit B 5 15.
The future of Statistical Production
ESTP Training Course “Enterprise Architecture and the different EA layers, application to the ESS context ” Rome, 16 – 19 October 2017.
Introduction to the Common Statistical Production Architecture Alice Kovarikova High-Level Workshop on Modernization of Official Statistics, Nizhny Novgorod,
CSPA Common Statistical Production Architecture Motivations: definition and benefit of CSPA and service oriented architectures Carlo Vaccari Istat
CSPA Common Statistical Production Architecture Motivations: definition and benefit of CSPA and service oriented architectures Carlo Vaccari Istat
High-Level Group for the Modernisation of Official Statistics
Presentation transcript:

Michelle Simard, Thérèse Lalor Statistics Canada CSPA Project Manager UNECE Work Session on Statistical Data Confidentiality Helsinki, October 2015 Confidentialized Analysis of Microdata CSPA project 1

Outline  Background  High-Level Group (HLG) Structure  Confidentiality groups  From ABS to StatCan  Next Steps 2

Background  Generalized systems traditionally developed by each National Statistics Organisation (NSO)  Ad hoc and minimal sharing between countries  With recent financial constraint No longer sustainable to work all by ourselves  NSO need to help each other more comprehensively 3

The UNECE High-Level Group for the Modernisation of Official Statistics (HLG-MOS)  Set up by the Bureau of the Conference of European Statisticians (CES) in 2010  Coordinates international work  Promotes standards-based modernisation of statistical production and services  The missions Overseeing development of frameworks, and sharing of information, tools and methods Improving the efficiency of the statistical production process 4

Background  Key: using the same concepts/ languages  Developed common frameworks  Some results: GSBPM (Generic Statistical Business Process Model)  Decomposition of all processes in a statistical organisation CSPA (Common Statistical Production Architecture)  Decomposition of common IT components GSIM (Generic Statistical Information Model)  Decomposition of common information / metadata GAMSO (Generic Activity Model for Statistical Organisations) 5

HLG Structure 6

 Participating members: UNECE, Eurostat, OECD, Ireland, Australia, Canada, Italy, Netherlands, New Zealand, Republic of Korea, Slovenia, UK, Sweden, Finland, Norway, Mexico...  Annual meeting, teleconferences and wikis, sprints and sandbox Note: As of 2015, the Statistical Network activities are incorporated into the HLG HLG Structure 7

HLG Projects – CSPA Implementation  Standardisation of the infrastructure - common architecture (CSPA work) Implemented on NSO common generalised tools such as editing, sampling, coding, linkage, confidentiality, etc...  Creation of a common “language” (wrapping)  The idea  Make it CSPA-compliant  Then share with other countries One country may have developed a tool, another one will be the “wrapper”, others will use the wrapped tool 8

Confidentiality  Started in 2014 Statistical Network Innovation in Dissemination (SNID)  Australia, Norway, Italy, UK, Canada  Exchanges on systems and methods for confidentialized output tool  Lead: Australian Bureau of Statistics (ABS) 9

Confidentialized Analysis of Microdata  Through CSPA Partnership ABS (builder) – StatCan(wrapper)  ABS DataAnalyzer was imported and made functional in StatCan environment  Architect built a ‘Confidentialized Analysis of Microdata’ CSPA service Removed the ABS “outside layers” and connections Kept the engine – statistics and confidentiality Wrap a CSPA compliant architecture around the “engine” for easy recycling to other countries Then for StatCan use, they developed a internal GUI prototype (not web-based) 10

ABS DataAnalyzer  Online product with an Interface (ABS GUI)  Web-based  Explore (tabulate), manipulate, and analyse microdata Linear Regression Model Generalized Linear Model Multivariate Model  Confidentiality of outputs (diagnostics and model parameters)  Privacy of individuals’ data kept 11

ABS DataAnalyzer  All outputs (tabular or graphics) are confidentialized  Perturbation is the main method of protection; adding random noise to any estimates  Perturbation is used in Tables (counts and means) and Regressions (coefficient estimation)  Other methods used: Sparsity, Field Exclusion Rules, Range Restrictions, Dropping Units, Suppression of Small Counts, X-only Variables, Leverage Protection  2011 UNECE worksession, Ottawa 12

 Canada, New Zealand, Australian and Finland are implementing the Confidentialized Analysis of Microdata CSPA service  Canada – How does it work: User submits code through GUI/ functions/options (prototype) - StatCan codes Code of the “engine” parsed, validated executed - ABS codes Results are parsed, validated and returned back to the user – StatCan codes  Early evaluation and assessment of the tool Many years of development and resources saved by the organisation Remaining issues  IT architecture and methodological Implementation 13

Canada  Discussion and coordination about the potential integration of the tool within StatCan Generalised system, remote access, common tool, internal tool for economists, analysts, others... StatCan RTRA is a “remote submission” using a FTP Ideally build a web-based application Evaluate the informatics infrastructure and how it fits with StatCan architecture 14

 Evaluate methods for calculating statistics and model diagnostics If applicable, propose modifications, testing and implementation  Evaluate coherence between ABS SDC methods and StatCan SDC methods (including graphics)  Validation and Approval processes for methods (statistical procedures and disclosure controls) Canada – Next steps 15

Concluding remarks  Very exciting and promising  StatCan definitely will find some use  More information: Confidentialized Analysis of Microdata CSPA Service on UNECE wiki 16