Presentation is loading. Please wait.

Presentation is loading. Please wait.

Linked Environment Data and how we are implementing SEIS Søren Roug.

Similar presentations


Presentation on theme: "Linked Environment Data and how we are implementing SEIS Søren Roug."— Presentation transcript:

1 Linked Environment Data and how we are implementing SEIS Søren Roug

2 The current situation Find dataset

3 The current situation Find dataset Download it

4 The current situation Find dataset Download it Import it

5 The current situation Find dataset Download it Import it Clean it

6 The current situation Find dataset Download it Import it Clean it Create chart

7 Vision statement Too much manual work We want to eliminate all steps but the last!...And we’re going to use Linked Data technology to do it

8 Solution to the data format problem In addition to the HTML for human eyes we’re asking for a new format called RDF that machines can understand It is a modernisation of CSV, Excel and all the other data dump formats This is all we ask a producer to provide... and some metadata No Web Services – just files

9 No more searching on foreign sites The remote nodes provide lists of their datasets Called manifests or semantic sitemaps Also in RDF format Controlled vocabulary URLs in metadata Use any identifier, we create equivalence links between them

10 How to create equivalence links We set up correspondance tables between the URLs. This is called an ontology http://eurostat.europa.eu/countries#UK = http://eea.europa.eu/countries.rdf#GB Some RDF databases handle ontologies transparently. When you use one, you get the data for the other too

11 Remember this?

12 Now we can make the join

13 Downloading made easy! Click on the title to see if it is in the database

14 Downloading made easy Seconds later...

15 Status EEA has deployed two triple stores called Content Registry and Semantic Data Service that import all lists and all data Content Registry is for Reportnet deliveries Semantic Data Service is for published datasets We have created RDF of several data sets: Reportnet, GEMET, EUNIS, ITIS, NUTS, NACE etc. We can also load Eurostat SDMX data via the LATC project

16 SDS and CR’s Role ITIS Reportnet PRTR Harvesting Content Registry EUNISOther... SPARQL JSON RDF Querying RDF XML OtherVisualisationEUNISReportnet QA system

17 Queries

18 Comparing data: Where do EUNIS and ITIS not agree on naming? PREFIX e: PREFIX itis: PREFIX dwc: SELECT ?eunisname ?eunisauthor ?itisname ?itisauthor ?usage WHERE { ?eunisurl e:validName 1; e:sameSynonym ?itisurl; e:binomialName ?eunisname; dwc:scientificNameAuthorship ?eunisauthor. ?itisurl itis:nameUsage "invalid",?usage; itis:completename ?itisname; itis:hasAuthor ?auurl. ?auurl itis:shortAuthor ?itisauthor }

19 Results eunisnameeunisauthoritisnameitisauthorusage Chondrocladia alaskensis Lambe,1900Chondrocladia alaskensis Lambe 1895invalid Myxilla parasitica(Lambe,1900)Myxilla parasiticaLambe 1893invalid Hymedesmia primitiva Lundbeck,1910Hymedesmia primitiva Lundbeck 1910invalid Asbestopluma lycopodium (Levinsen,1886)Asbestopluma lycopodium Levinsen 1886invalid Esperiopsis rigidaLambe,1900Esperiopsis rigidaLambe 1893invalid Cordylophora lacustris Allman, 1844Cordylophora lacustris Allman 1844invalid

20 Example of SPARQL query Future prospects for the European otter (From Reportnet) PREFIX art17: PREFIX eea: SELECT ?country ?region ?future WHERE { [] art17:forSpecies ; art17:hasRegionalReport ?report. ?report art17:conclusion_future ?future; art17:forCountry ?curl; art17:region ?bgregion. ?bgregion eea:name ?region. ?curl eea:name ?country } ORDER BY ?country ?region

21 Result: Future of the European otter countryregionfuture AustriaAlpineInadequate (U1) AustriaContinentalInadequate (U1) BelgiumAtlanticBad (U2) BelgiumContinentalBad but improving (U2+) Czech RepublicContinentalFavourable (FV) Czech RepublicPannonianFavourable (FV) EstoniaBorealFavourable (FV)

22 Queries on EUNIS

23 Visualisations

24 Water use per NUTS level 2 in 2007 Top 20 Combination of two Eurostat SDMX datasets Combination of two Eurostat SDMX datasets

25 Linked Data in map views

26 GHG per capita 1990-2009

27 Søren Roug European Environment Agency Soren.Roug@eea.europa.eu


Download ppt "Linked Environment Data and how we are implementing SEIS Søren Roug."

Similar presentations


Ads by Google