Linked Environment Data and how we are implementing SEIS Søren Roug.

Slides:



Advertisements
Similar presentations
European Topic Centre on Biological Diversity Reporting Tool workshop March 2012.
Advertisements

Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
RDF triple store Ontology Curator Harvester Departmental Web sites Research grants databases Query system Web interface Harvester.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
02-Oct-2008 European Forum for GeoStatistics 2008 in Bled Concept for an Integrated Web Solution / an Infrastructure for Geostatistics (Subproject 3)
European Environment Agency EEA –State of informatics for biodiversity Content Vs Functionalities? Rania Spyropoulou, EEA Biodiversity and Ecosystems group.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Achille Felicetti, Emanuele Bellini, Cinzia Luddi Fondazione Rinascimento.
1 gStore: Answering SPARQL Queries Via Subgraph Matching Presented by Guan Wang Kent State University October 24, 2011.
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
A BRIEF INTRO TO THE PROV DATA MODEL Simon Miles The entire W3C Provenance Working Group.
OneGeology-Europe - the first step to the European Geological SDI INSPIRE Conference 2010, Session Thematic Communities: Geology Krakow, June 24 th 2010.
Information and Business Work
WISE European System for Water Information WISE – part of Eionet (European Environment Information and Observation Network) http//
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
1 Introducing Reportnet Miruna Badescu. 2 A linear view of Reportnet process.
Data Sets, Vocabularies and Tools Pablo N. Mendes Freie Universität Berlin 1st year review Luxembourg, December /02/11.
Exchange formats and APIs Questions – how and when to access metadata? – lifecycle/status – how to access? can things disappear? – is CSV enough? – is.
Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
WP 5 Data management & analysis Michel Bohms and Philomena M. Bluyssen – TNO Isabella Annesi-Maesano - UMPC Paris 06 Aileen Yang and Alena Bartonova –
Reportnet standards and next steps Søren Roug, Information and Data Services (IDS)
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Michalis Vafopoulos NTUA, GFOSS & The transformers GREEN CITY HACKATHON.
Scotland's Environment Web Data Journey Dave Watson, Duncan Taylor.
CountryData Development Improving the collation, availability and dissemination of development indicators (including the MDGs) Nairobi, 27 November 2013.
1 Country report 2014 – Statistics Norway PC-Axis Reference Group meeting
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
Eurostat Unit B3 – IT and standards for data and metadata exchange SDMX Basics Training – 2012 IT architectures for data exchange SDMX-RI and the Hub approach.
Status of INSPIRE implementation in EU Member States. INSPIRE conference – Lisbon, 26/05/2015 Paul Hasenohr, European Environment Agency.
Artur Gsella European Environment Agency Transition to e-Reporting and the new process flows at the EEA 18th EIONET Workshop on Air Quality Assessment.
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
Miruna Badescu Eau de Web Biodiversity Action Plans data reporting and publishing.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
DDI Discovery: An Overview of Current RDF Vocabularies Arofan Gregory Metadata Technologies NA Joachim Wackerow GESIS.
Introduction to the Aggregation Database Søren Roug, IT Project manager.
Eurostat 6. SDMX: A non-technical overview of the SDMX architecture and IT tools 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services”
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Serving society Stimulating innovation Supporting legislation Workshop on the INSPIRE registry and registers Søren Roug European Environment.
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Renovation of Eurostat dissemination chain
Automated Access to Statistical Facts via Statline4 Web Services Olav ten Bosch Statistics Netherlands UN-ECE conference, Bratislava April.
Reportnet – progress and next steps Søren Roug European Environment Agency.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
Eurostat May 2016 Eurostat, Unit B3 – IT solutions for statistical production Test Client Jean-Francois LEBLANC Christian SEBASTIAN.
Extended Metadata Registries and Semantics (Part 2: Implementation) Karlo Berket Ecoterm IV Environmental Terminology Workshop April 18, 2007 Diplomatic.
1 Mashup Workflow. 2 What We Have 3 Challenges with REST APIs * Only ask what its built to answer * No standard - must relearn each time * Opaque - no.
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge R&D Statistics Information System: An Interoperability Tail between CERIF and SDMX Dimitris Karaiskos Dimitrios.
IAEA International Atomic Energy Agency Implementing SDMX for Energy Domain: From Discussion to Actual Implementation and Testing Andrii Gritsevskyi Oslo.
BISE EU Biodiversity Strategy to 2020 Franz Daffner, EEA.
SENSE project – towards a next (second) part Stefan Jensen - head - SEIS and SDI group.
1 RDF Storage and Retrieval Systems Jan Pettersen Nytun, UiA.
Artur Gsella European Environment Agency Transition to e-Reporting and the new process flows at the EEA 18th EIONET Workshop on Air Quality Assessment.
The Eurostat Metadata Handler Götzfried Eurostat (Head of Unit B6)
Cloud based linked data platform for Structural Engineering Experiment
The BARTOC story: from blog to basic to full terminology registry
Lifting Data Portals to the Web of Data
WISE and the future of WFD reporting
SDMX: Enabling World Bank to automate data ingestion
11. The future of SDMX Introducing the SDMX Roadmap 2020
Industrial Emissions Reporting Information System – Version 2
Semantic Annotation service
LOD reference architecture
SDMX: an Overview Abdulla Gozalov UNSD.
SDMX Tools Overview and architecture
Reportnet 3.0 Database Feasibility Study – Approach
9. Practical use case 3: Pesticides Use Project
Jean-Francois LEBLANC Christian SEBASTIAN
State-of-play of current Water Directive integration into WISE
Presentation transcript:

Linked Environment Data and how we are implementing SEIS Søren Roug

The current situation Find dataset

The current situation Find dataset Download it

The current situation Find dataset Download it Import it

The current situation Find dataset Download it Import it Clean it

The current situation Find dataset Download it Import it Clean it Create chart

Vision statement Too much manual work We want to eliminate all steps but the last!...And we’re going to use Linked Data technology to do it

Solution to the data format problem In addition to the HTML for human eyes we’re asking for a new format called RDF that machines can understand It is a modernisation of CSV, Excel and all the other data dump formats This is all we ask a producer to provide... and some metadata No Web Services – just files

No more searching on foreign sites The remote nodes provide lists of their datasets Called manifests or semantic sitemaps Also in RDF format Controlled vocabulary URLs in metadata Use any identifier, we create equivalence links between them

How to create equivalence links We set up correspondance tables between the URLs. This is called an ontology = Some RDF databases handle ontologies transparently. When you use one, you get the data for the other too

Remember this?

Now we can make the join

Downloading made easy! Click on the title to see if it is in the database

Downloading made easy Seconds later...

Status EEA has deployed two triple stores called Content Registry and Semantic Data Service that import all lists and all data Content Registry is for Reportnet deliveries Semantic Data Service is for published datasets We have created RDF of several data sets: Reportnet, GEMET, EUNIS, ITIS, NUTS, NACE etc. We can also load Eurostat SDMX data via the LATC project

SDS and CR’s Role ITIS Reportnet PRTR Harvesting Content Registry EUNISOther... SPARQL JSON RDF Querying RDF XML OtherVisualisationEUNISReportnet QA system

Queries

Comparing data: Where do EUNIS and ITIS not agree on naming? PREFIX e: PREFIX itis: PREFIX dwc: SELECT ?eunisname ?eunisauthor ?itisname ?itisauthor ?usage WHERE { ?eunisurl e:validName 1; e:sameSynonym ?itisurl; e:binomialName ?eunisname; dwc:scientificNameAuthorship ?eunisauthor. ?itisurl itis:nameUsage "invalid",?usage; itis:completename ?itisname; itis:hasAuthor ?auurl. ?auurl itis:shortAuthor ?itisauthor }

Results eunisnameeunisauthoritisnameitisauthorusage Chondrocladia alaskensis Lambe,1900Chondrocladia alaskensis Lambe 1895invalid Myxilla parasitica(Lambe,1900)Myxilla parasiticaLambe 1893invalid Hymedesmia primitiva Lundbeck,1910Hymedesmia primitiva Lundbeck 1910invalid Asbestopluma lycopodium (Levinsen,1886)Asbestopluma lycopodium Levinsen 1886invalid Esperiopsis rigidaLambe,1900Esperiopsis rigidaLambe 1893invalid Cordylophora lacustris Allman, 1844Cordylophora lacustris Allman 1844invalid

Example of SPARQL query Future prospects for the European otter (From Reportnet) PREFIX art17: PREFIX eea: SELECT ?country ?region ?future WHERE { [] art17:forSpecies ; art17:hasRegionalReport ?report. ?report art17:conclusion_future ?future; art17:forCountry ?curl; art17:region ?bgregion. ?bgregion eea:name ?region. ?curl eea:name ?country } ORDER BY ?country ?region

Result: Future of the European otter countryregionfuture AustriaAlpineInadequate (U1) AustriaContinentalInadequate (U1) BelgiumAtlanticBad (U2) BelgiumContinentalBad but improving (U2+) Czech RepublicContinentalFavourable (FV) Czech RepublicPannonianFavourable (FV) EstoniaBorealFavourable (FV)

Queries on EUNIS

Visualisations

Water use per NUTS level 2 in 2007 Top 20 Combination of two Eurostat SDMX datasets Combination of two Eurostat SDMX datasets

Linked Data in map views

GHG per capita

Søren Roug European Environment Agency