DDI and GSIM – Impacts, Context, and Future Possibilities

Slides:



Advertisements
Similar presentations
Status on the Mapping of Metadata Standards
Advertisements

GSIM, CSPA, and Related Activities of the High-Level Group
DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,
SDMX and DDI: How Do They Fit Together in Practical Terms? Arofan Gregory The Open Data Foundation European DDI User’s Group 2011 Gothenburg, Sweden.
The Data Cube Vocabulary: Statistics in the Web of Linked Data Arofan Gregory Open Data Foundation WICS, Geneva, 5-7 May 2015.
Modernizing the Data Documentation Initiative (DDI-4) Dan Gillman, Bureau of Labor Statistics Arofan Gregory, Open Data Foundation WICS, 5-7 May 2015.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
ESCWA SDMX Workshop Session: Role in the Statistical Lifecycle and Relationship with DDI (Data Documentation Initiative)
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
Generic Statistical Information Model (GSIM) Thérèse Lalor and Steven Vale United Nations Economic Commission for Europe (UNECE)
3 rd Annual European DDI Users Group Meeting, 5-6 December 2011 The Ongoing Work for a Technical Vocabulary of DDI and SDMX Terms Marco Pellegrino Eurostat.
Standardisation Informal summary of ABS Perspective.
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
SDMX Standards Relationships to ISO/IEC 11179/CMR Arofan Gregory Chris Nelson Joint UNECE/Eurostat/OECD workshop on statistical metadata (METIS): Geneva.
Technical Overview of SDMX and DDI : Describing Microdata Arofan Gregory Metadata Technology.
Describing Statistical registers in SDMX and DDI: A Comparison Arofan Gregory Metadata Technology Eurostat, June 4-6, 2013 Luxembourg.
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
United Nations Economic Commission for Europe Statistical Division Introduction to Steven Vale UNECE
DDI Discovery: An Overview of Current RDF Vocabularies Arofan Gregory Metadata Technologies NA Joachim Wackerow GESIS.
The future of Statistical Production CSPA. 50 task team members 7 task teams CSPA 2015 project.
Survey Data Management and the Combined Use of DDI and SDMX Arofan Gregory Chris Nelson Metadata Technology Eurostat, June
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
SDMX IT Tools Introduction
GSIM Mapping to SDMX and DDI: Preliminary Findings and Status Arofan Gregory Metadata Technology METIS, May , Geneva.
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
DDI Lifecycle and Qualitative Data: Development of a Formal Model Arofan Gregory Joachim Wackerow.
1 Joint UNECE/EUROSTAT/OECD METIS Work Session (Geneva, March 2010) The On-Going Review of the SDMX Technical Specifications Marco Pellegrino, Håkan.
Presented By Margaret Hellen Atiro Uganda Bureau of Statistics at the United Nations Regional Seminar on Census Data Archiving 20 – 23 Sep 2011, Addis.
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Tips for Building an Effective Model for DDI Dan Gillman US Bureau of Labor Statistics.
United Nations Economic Commission for Europe Statistical Division CSPA: The Future of Statistical Production Steven Vale UNECE
Implementing ModernStats Standards Linked Open Metadata
End-to-End Management of the Statistical Process An Initiative by ABS
Contents Introducing the GSBPM Links to other standards
Streamlining the Statistical Production in TurkStat Metadata Studies in TURKSTAT High Level Seminar for Eastern Europe, Caucasus and Central Asia Countries.
Using SDMX structures to facilitate data reporting
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
GSBPM, GSIM, and CSPA.
Logical information model LIM Geneva june
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
The Generic Statistical Information Model
Modernising Official Statistics
The problem we are trying to solve
3rd WGM Meeting 3 May 2018 Item 2.3 Possible standards for ESS Validation.
The Generic Statistical Business Process Model
Roxane Silberman, Réseau Quételet
CSPA: The Future of Statistical Production
Introducing the GSBPM Steven Vale UNECE
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Contents Introducing the GSBPM Links to other standards
Presentation to SISAI Luxembourg, 12 June 2012
Generic Statistical Information Model (GSIM)
The future of Statistical Production
Introducing the Data Documentation Initiative
ESTP Training Course “Enterprise Architecture and the different EA layers, application to the ESS context ” Rome, 16 – 19 October 2017.
DDI and GSIM – Impacts, Context, and Future Possibilities
CSPA Templates for sharing services
ESS Enterprise Architecture
The Role of Metadata in Census Data Dissemination
CSPA Templates for sharing services
Palestinian Central Bureau of Statistics
SDMX training Francesco Rizzo June 2018
GSIM overview Mauro Scanu ISTAT
Presentation transcript:

DDI and GSIM – Impacts, Context, and Future Possibilities Arofan Gregory Metadata Technology

Overview The general situation for GSIM – DDI Implementing GSIM with DDI Detailed view of some GSIM areas and overlap/gaps with DDI Describing data Describing questionnaires Describing codelists, categories, and concepts Describing events and processing Looking forward

GSIM and DDI GSIM is a creation of the HLG-BAS group under UN/ECE DDI is a creation of the DDI Alliance There is no immediate formal relationship between these organizations However, both organizations have made statements that they will work together to make DDI a good implementation vehicle for GSIM

GSIM, DDI, and Official Statistics GSIM is a key standard for official statistics organizations Some official statistical organizations already use DDI or are planning to do so IHSN Metadata tools (developing world) DDI-Lifecycle (ABS, Stats NZ, INSEE, Eurostat) GSIM is a potential vehicle for the widespead adoption of DDI among official statistical organizations

Models at Different Levels GSIM is a Conceptual Model It is technology and implementation-neutral DDI is an Implementation Model It is cross-platform and application-neutral It is implementated in XML (and soon, RDF), but isn’t technology-neutral Specific applications have their own, internal models These are bound to specific technologies and platforms

Implementing GSIM at a Technical Level To allow re-use of applications and services, agreements must exist on many levels Conceptual models must match (GSIM) Implementation models must match (DDI) Application models must match (TBD – web services? Others?) There is still a lot of work around mapping DDI to GSIM, and then agreeing on how DDI XML will be used within applications before we have reusable, interoperable GSIM-based services and applications

What is the Usefulness of GSIM? To make applications work together on all levels, we will need to map existing application models to each other On the basis of DDI On the basis of GSIM From a technical perspective, this can be very difficult Having an agreed base model at the conceptual and implementation level makes it easier/possible

Describing Data DDI describes two kinds of data Both exist in GSIM Microdata sets Aggregate (“dimensional”) data sets – Ncubes Both exist in GSIM In DDI microdata, each case/unit has a set of variables, at least one of which is the case identifier Others hold observations or derived or supporting values (such as weights) Ncube structures use variables as dimensions, observations, and attributes to describe the matrix structure of tables

Other GSIM Constructs GSIM does make a distinction between “unit data structures” and “dimensional data structures” GSIM supports hierarchical relationships in data sets Both are based on the core data model you have seem

Unit Data Set

Dimensional Data Set

Describing Questionnaires DDI – Lifecycle has a very complete description of a questionnaire/instrument Includes the mode and specifics of the instrument Includes the questions, statements, and instructions used Includes the flow logic of the questionnaire Can have multiple-question “blocks” GSIM does the same With less detail Largely based on DDI

GSIM Survey Instrument

Classifications, Codelists, Categories and Concepts DDI has codelists which take their meaning from categories. DDI has concepts associated with variables and questions. GSIM has all of this, and more! GSIM is “concept-rich” GSIM also has a pure classification model, which is not as complete in DDI-Lifecycle (a bit in 3.2)

Nodes and Node-Sets

Events and Processing DDI provides us with several ways to describe events and processing Lifecycle Events Collection Events “Coding” Elements Generation Instructions General instructions GSIM gives us much, much more! Some of this is very specific to statistical agencies

Looking Forward DDI and GSIM have some very strong alignments There are also some gaps DDI may need to add support for some functionality But maybe not everything – maybe SDMX can fill some gaps This is a two-way alignment GSIM may need to adjust to better fit DDI implementation

Looking Forward (cont.) As we look to the next major re-design of DDI, we will be working proactively with GSIM Representative from GSIM were invited to the first working session this year at Schloss Dagstuhl DDI will continue to attend events around GSIM sponsored by HLG-BAS Like the Geneva meeting this past November Possibility for proactive engagement at a technical level SDMX-DDI Dialogue DDI Working Groups? Others? GSIM may also provide a strong basis for other types of work within the DDI Community, less focused on official statistics Like the “Generic Longitudinal Process Model”, which was based on GSBPM

Looking Forward Some external projects involve both archives and statistical agencies Data without Boundaries (DwB) is a prime example DwB is using a DDI-based metadata model May lead to production implementations in future If archives and statistical agencies use the same metadata… Archiving of official data becomes much easier Both communities can leverage the same tools. Approaches, and resources (where appropriate) Microdata access is an obvious point of stnergy

Questions?