Linked Data for SDG Reporting

Slides:



Advertisements
Similar presentations
A multi-level metadata approach for a Public Sector Information data infrastructure Nikos Houssos 1,2, Brigitte Jörg 1,3, Brian Matthews 4 1 euroCRIS 2.
Advertisements

Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
(1) Standardizing for Open Data Ivan Herman, W3C Open Data Week Marseille, France, June Slides at:
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
The Data Cube Vocabulary: Statistics in the Web of Linked Data Arofan Gregory Open Data Foundation WICS, Geneva, 5-7 May 2015.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Data on the Web Life Cycle Bernadette Farias Lóscio March, 2014.
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
Nationally Significant Databases and Collections Providers’ Group Emma Kelly Environmental Information Advisor Environmental Monitoring and Reporting Team.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
W HAT IS I NTEROPERABILITY ? ( AND HOW DO WE MEASURE IT ?) INSPIRE Conference 2011 Edinburgh, UK.
Hampshire Hub Data Platform Progress update 1 October Bill Roberts Swirrl.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
WIGOS Data model – standards introduction.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
SDMX IT Tools Introduction
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
® Using (testing?) the HY_Features model, 95th OGC Technical Committee Boulder, Colorado USA Rob Atkinson 3 June 2015 Copyright © 2015 Open Geospatial.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Session: Towards systematically curating and integrating
The Semantic Web By: Maulik Parikh.
Linked Data Web that can be processed by machines
Constructing a National Reporting Platform for SDGs: lessons form the MDGs. Enrique Ordaz INEGI Geneva, April 2017.
Cloud based linked data platform for Structural Engineering Experiment
Building the Semantic Web
Nader KEYROUZ-Advisor SDG preparedness workshop
Flanders Marine Institute (VLIZ)
Middleware independent Information Service
Lifting Data Portals to the Web of Data
Country use cases: Cambodia, and Tunisia
Scalable Policy-awarE Linked Data arChitecture for prIvacy, trAnsparency and compLiance H2020-ICT Big Data PPP: privacy-preserving Big Data technologies.
Interoperable data formats: SDMX
Numbers, places, decisions
MSDs and combined metadata reporting
SDMX for SDGs What it means for you
SDMX: A brief introduction
11. The future of SDMX Introducing the SDMX Roadmap 2020
A platform for Linked Data publishing
How can DDI make the most of RDF?
2. An overview of SDMX (What is SDMX? Part I)
The Data Cube Vocabulary: Deploying SDMX as RDF from Existing Systems
2. An overview of SDMX (What is SDMX? Part I)
Accommodating local cataloguing traditions in a global context
PRESENTATION OF SHORT-TERM ECONOMIC STATISTICS
Interoperability and standards for statistical data exchange
Session 2: Metadata and Catalogues
SDMX Information Model: An Introduction
LOD reference architecture
United Nations Statistics Division
Statistical Information Technology
Expert Group Meeting on SDG Economic Indicators in Africa
W3C Recommendation 17 December 2013 徐江
Linked Data Ryan McAlister.
Australian and New Zealand Metadata Working Group
Classifications and Linked Open Data Formalizing the structure and content of statistical classifications Item 9.1 Standards Working Group Luxembourg,
Palestinian Central Bureau of Statistics
Presentation transcript:

Linked Data for SDG Reporting Bill Roberts Swirrl 23 January 2018

What is Linked Data?

“Data you can link to” Use the mechanisms of the web to give fine grained access to data You can link to a file on the web, but only at the level of the whole file If you can link to individual things within the data – specific countries or regions, specific indicators, specific data points then can be more precise and more selective and it gives a mechanism for attaching all kind of metadata, and making relationships between the topics of interest, for example combining complex geospatial data with statistical data.

Needs for SDG National Reporting Platforms Different presentation for different groups of users (analysts, ministers/managers, public) Easy to find – available on the web API access to data Need to automate data preparation and publishing processes Interoperable data One size does not fit all – individual country needs and constraints

What does ‘interoperable’ mean? We need to agree on what data means, and how to get it: Shared identifiers Data transfer protocol Data format Data models Data models and systems of identifiers still need to be agreed and that’s a hard organisational problem – but once you’ve done that, Linked Data gives you a mechanism for systematically encoding that

Use URIs as names for things Use HTTP URIs so that people can look up those names When someone looks up a URI, provide useful information, using the standards (RDF and SPARQL) Include links to other URIs, so that they can discover more things Directly exploit the plumbing of the web as a way of making data available Globally unique identifiers for things of interest A mechanism for looking up information about those things Standards-based machine-readable way of representing that information With a way of describing relationships between things Identifiers Protocol Format and data models Connections Berners-Lee, 2006

Resource Description Framework

What is RDF? Property Subject Object “graph” representation of data – social graph, eg Facebook, LinkedIn All kinds of enterprise databases Strength is its flexibility in dealing with very diverse data, and to highlight the connections between things

What is RDF? Is a United Kingdom Country “graph” representation of data – social graph, eg Facebook, LinkedIn

What is RDF? 4.29 Death rate United Kingdom refArea refPeriod 2015 Observation123 unit Number per 1000 Can represent statistical data in RDF Age range 0-5 years indicator 3.2.1 Under-5 mortality

What is RDF? 4.29 Death rate United Kingdom refArea refPeriod 2015 Observation123 unit Number per 1000 Can represent any data in RDF – as the ’schema’ is part of the data Which means it’s a flexible system for combining statistical data with other contextual data Age range 0-5 years indicator 3.2.1 Under-5 mortality

<http://statistics. gov <http://statistics.gov.scot/data/population-estimates-current-geographic-boundaries/year/2016/S92000003/age/all/sex/all/people/count> a <http://purl.org/linked-data/cube#Observation> ; sg-measure:count 5404700 ; sdmx-dimension:refArea <http://statistics.gov.scot/id/statistical-geography/S92000003> ; sdmx-attribute:unitMeasure <http://statistics.gov.scot/def/concept/measure-units/people> ; sdmx-dimension:refPeriod <http://reference.data.gov.uk/id/year/2016> ; qb:measureType sg-measure:count ; qb:dataSet <http://statistics.gov.scot/data/population-estimates-current-geographic-boundaries> ; sdmx-dimension:age <http://statistics.gov.scot/def/concept/age/all> ; sdmx-dimension:sex <http://statistics.gov.scot/def/concept/sex/all> . You can link directly to this observation – and get it in various machine readable ways You can add data markers Or annotations Or say this observation has been revised and replaced by some other observation

What does linking enable? Connect datasets, indicators, features of interest, data points to: Other data, other features Definitions Context Provenance Annotations/feedback

SPARQL Standardised query language for RDF Flexible, powerful – can be complex Use it directly, or build simpler APIs or user interfaces on top of it

Sustainable Development Goals Where are we now? What is most urgent? What should we do about it? Is it working? Answering these questions needs a lot of information about context Where are we now? For one indicator for one country, how does it look: Compared to the target Compared to other countries Compared to other ‘similar’ countries – similar perhaps in terms of size, income, demographic profile, economic activities, climate… Compared to other indicators Compared to previous years

Challenges Data sources Dealing with the big challenges of today Changing demographics – people living longer, but also more years of unhealthy life Changing world of work – different kinds of jobs Climate change (the real challenge for a smart city) Limited environmental resources – balancing the economy, health and the natural environment That feeds into crucial business as usual for government: where to allocate limited resources for the biggest societal benefit Local government strategies – what are the special constraints and opportunities where you live and work – how to coordinate and balance all the aspects of the community. All of these problems are complex; all depend on the interaction of many different aspects of society, and many different strands of government policy. All of them need diverse sources of data. Importance of understanding context to be able to decide how to act on a particular piece of information

Linked Data works behind the scenes Strength is for underlying data representation and integration Automatic import of data from CSV, XML, JSON, Shapefile… Select, filter, export data as CSV, XML, JSON, Shapefile... W3C ‘Tabular Data on the Web’ standards: CSV plus JSON metadata

Building on existing standard data models for multidimensional statistical data: RDF Data Cube Vocabulary – Linked data version of SDMX for metadata: Dublin Core, DCAT for provenance: PROV for annotation: Web annotation ontology for data quality: Data Quality Vocabulary

Making it work for SDGs Choose or create agreed identifiers and definitions for: the SDG indicators dimensions, measures and units concept schemes for dimension values

Gartner hype cycle Swirrl operates production systems for Scottish Govt, MHCLG, NHS Working with ONS on applying this technique for sharing of data across UK official stats publishers Eurostat is quite active

@billroberts http://www.swirrl.com