Feasibility study on data harvesting using INSPIRE infrastructure

Slides:



Advertisements
Similar presentations
Proposed update of Technical Guidance for INSPIRE Download services based on SOS Matthes Rieke, Dr. Albert Remke (m.rieke, 52°North.
Advertisements

INSPIRE - how to use? Spatial data from diverse data sources INSPIRE and reporting data flows Example: spatial data sets related to MSFD, water and INSPIRE.
1 Introducing Reportnet Miruna Badescu. 2 A linear view of Reportnet process.
1 INSPIRE Data Specifications Area management / restriction / regulation zones and reporting units (Draft data specification ver.2.0) INSPIRE Thematic.
Update on INSPIRE: INSPIRE maintenance and implementation and INSPIRE related EEA activities on biodiversity CDDA/European protected areas technical meeting.
Project “European CDDA and INSPIRE”: scope, transformation workflow and mapping rules INSPIRE Conference 2014 Workshop: Implementing Existing European.
Co-funded by the European Community eContentplus programme Data flow in Natura 2000: the past and the future Current and future data management Diederik.
Framework service contract Lot 4 Project: CDDA in conformity with INSPIRE Project status and overview of main issues Meeting Darja Lihteneger.
Freshwater Eionet Workshop, Agenda item #3b Fernanda Nery WISE spatial data collection.
Serving society Stimulating innovation Supporting legislation Workshop on the INSPIRE registry and registers Martin Tuchyňa, Tomáš.
1 How to make the GEOSS Data CORE a reality Part 1 Max Craglia Presented on behalf of Max by: Alan Edwards, EC Stefano Nativi, CNR.
Serving society Stimulating innovation Supporting legislation Workshop INSPIRE and Reporting under environmental acquis JRC, Ispra.
Update on INSPIRE CDDA/European protected areas technical meeting 2015 Darja Lihteneger Project manager - Data centres and INSPIRE implementation IDM -
Serving society Stimulating innovation Supporting legislation Benefits and challenges of INSPIRE implementation in the field of statistics.
Darja Lihteneger, November 2015, Copenhagen, Denmark INSPIRE Data Modelling for Reporting Data Flows – WISE SoE Monitoring Stations Eionet NRC.
INSPIRE Data validation: the eENVplus experience --- Extension of the INSPIRE DS: the GeoSmartCity experience NRC EIS meeting Copenhaghen Nov 26 th, 2015.
Making the CDDA data model INSPIRE compliant Project information and status 2015 Eionet Biodiversity Ecosystems, Indicators and Assessments NRC Workshop.
Framework service contract Lot 4 Project: CDDA in conformity with INSPIRE CDDA – INSPIRE PS mapping conclusions Meeting Darja Lihteneger.
Darja Lihteneger, November 2015, Copenhagen, Denmark INSPIRE Data Modelling for Reporting Data Flows – CDDA conceptual data model and INSPIRE Eionet.
Common Database on Designated Areas vs. INSPIRE Martin Tuchyňa, Darja Lihteneger INSPIRUJME SE, , Bratislava.
WISE Working Group D September 2009, Brussels Jon Maidens.
1 Integration of the LCP Reporting Into the E-PRTR Current status and Technical proposal August 4th.
1 Terminal Management System Usage Overview Document Version 1.1.
European Monitoring Platform for Mapping of QoS and QoE
Project Management: Messages
CDDA 2018 new reporting re-using INSPIRE services
INSPIRE Network Services
Flanders Marine Institute (VLIZ)
USAJOBS – Application Manager
Designation boundaries
Data set series Michael Lutz 44th MIG-T meeting, Ispra
Tools and guidance for data providers
INSPIRE supporting activities for IED/ E-PRTR
Integration of INSPIRE & SDMX data infrastructures for the 2021 Census
Meeting of Working Group Data & Information Sharing (DIS)
WISE and the future of WFD reporting
INSPIRE Geoportal Thematic Views Application
On behalf of the EEA/EC INSPIRE team
INSPIRE MIG-T MIWP-16 activity report.
INSPIRE Sub-GROUP VALIDATION
Availability of MS priority datasets 20/6/2018
Indicator structure and common elements for information flow
MIWP Action ”Priority List of E-Reporting Datasets”
MIG-T meeting, 19th April 2016, Ispra End-user applications
CDDA 2018 reporting Mette Lund Biodiversity group, EEA
Status on eu actions from the kick-off meeting
Priority List of Data Sets for eReporting Proposal for extension, Issues Gabriele Vincze, 23 November 2017.
Priority geospatial datasets for the European Commission
Expert Group on Reporting Nature Directives , Brussels
Metadata keywords for Reporting
Work on the coherence of data-flows / improving data-quality
JRC INSPIRE team INSPIRE MIG-T Meeting Ispra, October 2016
Implementation of INSPIRE in EIONET Core Data Flows
CDDA & INSPIRE work of EEA - preliminary lessons learnt
WFD Article 8 Schemas Yvonne Gordon-Walker.
Meeting on the implementation of the INSPIRE Directive
WP 4 - Revision of Natura 2000 dataflow
CDDA & INSPIRE work of EEA - lessons from implementation so far
ITAS Risk Reporting Integration to an ERP
Reportnet 3.0 Requirements Catalogue
Reportnet 3.0 Database Feasibility Study – Approach
4th Meeting of the MIG-P, Brussels, June 2016
INSPIRE IMPLEMENTATION IN CYPRUS
Priority list of datasets for eReporting (Priority Datasets) INSPIRE
WG standards for data access/exchange
CDDA alignment with INSPIRE
Challenges with distributed systems like WISE
Interoperability of metadata systems: Follow-up actions
State-of-play of current Water Directive integration into WISE
WISE and INSPIRE By Albrecht Wirthmann, GISCO, Eurostat
Presentation transcript:

Feasibility study on data harvesting using INSPIRE infrastructure Reportnet 3.0 Feasibility study on data harvesting using INSPIRE infrastructure INSPIRE MIG-T, 24.-25.10.2018 Darja Lihteneger

Reportnet 3.0 / Feasibility study on data harvesting Outline Finding services and data Services Downloaded data Referencing spatial objects Findings and input for Reportnet 3.0

Reportnet 3.0 / Feasibility study on data harvesting Purpose: To assess the applicability of harvesting national INSPIRE services to automate the collection of geospatial data sets falling under reporting obligations Scope: Use case 1: data harvesting Use case 2: referencing spatial objects Thematic domain: Natura 2000 sites (INSPIRE theme “Protected Sites”) The study will address INSPIRE download services providing Natura 2000 spatial data sets

Identification of data and services – needs & means Results of INSPIRE Geoportal PDS Viewer (23.08.2018) Additional manual search in INSPIRE Geoportal Additional manual refinement (selection of Natura 2000 data sets from the list of several data sets or from top Atom Feed, selection of predefined stored query in WFS) Creating WFS GetFeature requests Images – INSPIRE Geoportal snapshot on 21.10.2018

Sample - initial list of services Service type Atom feed 10 File download (http, https, ftp) 25 WFS 17 Total 52 Not always 1-1 service-country or data-service mapping

Service connection and monitoring / testing All services tested to build a better understanding of service performance and use for reporting Availability Capacity Performance Reliability GET Spatial Data Set findings: Downloading data sets completed Higher latency in some WFS Limited number of features returned Example of encountered errors: Operation timed-out Bad Gateway Purpose: Confirmation of active service Applying INSPIRE service quality criteria Detecting data change: comparing downloaded data (check-sum) from different service requests have to be combined with other more detailed data content tests Service monitoring and statistics is important in the period of official reporting data flow: documentation of data harvesting process, identification and awareness of service errors, trigger for notifications

Spatial data quality control: is it Natura 2000 data set? Data quality control on data provided as GML: Using INSPIRE ETF validator Result: high level of passed tests Some encountered errors: Test not completed (Timeout on test) GML schema errors Protected site feature was not in dataset Differences in content but not necessarily errors: Certain content is not included, e.g. SPA, SCI Other content is included (non-Natura 2000 designation scheme, no protected sites) Input for data quality control tests in Reportnet at the level of data acceptance

SWAssociatedProtectedArea INSPIRE base type: Identifier Use case 2: Referencing mechanism Challenge: For WFD water dependent protected areas that correspond to Natura 2000 sites find their related spatial data ! WFD Natura 2000 Inspire WFD surface water body <<XSDComplexType>> SurfaceWaterBody swAssociatedProtectedArea: Yes … N2000 site Site SITE_CODE MAP_ID … Inspire Protected Sites <<FeatureType>> ProtectedSite geometry: GM_Object inspireID: Identifier … Associated protected area <<XSDComplexType>> SWAssociatedProtectedArea euProtectedAreaCode: Natura 2000 site code protectedAreaType: Birds / Habitats … Associated map data Map MAP_ID MAP_INSPIRE … Inspire Identifier Type <<dataType>> INSPIRE base type: Identifier localID: CharacterString namespace: CharacterString … Each resource follows own life cycle / maintenance

INSPIRE base type: Identifier Referencing mechanism findings Positive patterns: WFD -> Natura 2000: link gives no issues N2000 - INSPIRE: Inspire ID is provided N2000 site code is part of Inspire ID Referencing of spatial data: through WFS requests based on Inspire identifier N2000 site code INSPIRE identifier WFD N2000 INSPIRE Associated map data Map MAP_ID MAP_INSPIRE … Inspire Identifier Type <<dataType>> INSPIRE base type: Identifier localId: CharacterString namespace: CharacterString ? Issues: Natura 2000 SDF reporting: Inspire ID often not filled in Inspire ID provided in different ways (localId, combined localId + namespace) WFS – spatial objects provided according to different schemas: NO STANDARDISATION > customised matching needed Alternatives for linking if N2000 <> Inspire / possible if Natura 2000 site code is provided in INSPIRE spatial data WFD -> Natura 2000 High percentage of successful links Natura 2000 reporting: Missing Inspire ID Inspire ID issues: sometimes only localId is filled in, sometimes namespace+localId Sometimes InspireId.localId=siteCode (provides reliable linking) WFS – different schemas: For different countries, different matching has to be used. Some uses INSPIRE Protected schema directly (the Romanian example uses the ps.ProtectedSite schema), other define own schema (the NL example is in Natura2000 schema, which has the advantage that it contains the natura2000 SiteCode). There are other schemas besides these 2. Linking possibilities: Give alternatives for linking N2000 <> Inspire The Protected Site schema does not contain the Natura2000 site code. This means that we can only link if a) inspireID is correctly filled in in the Natura2000 db (which at this moment is the why it should be, but isn’t) or b) The country reuses the Natura2000 sitecode as the localID part of the inspireID (which sometimes is the case)   If other schemas are used they often contain the Natura2000 SiteCode as a separate GML element(see the NL example in previous mail). So this gives us an alternative for linking, not using InspireID but directly the Natura2000 code present in the WFS answer.

Reporting data and service providers (MS) INSPIRE Protected sites Findings – requirements for focus groups ReqID Priority Reportnet Reporting data and service providers (MS) Natura 2000 SDF INSPIRE Protected sites INSPIRE F-024 M X   F-026 F-028 F-029 S F-030 T-010 F-025 C F-027 F-031 T-012 T-013 T-014 T-011 W Priority categories: MSCW (M – highest priority) M – Must S – Should C – Could W - Would

Requirements - Reportnet ReqID Priority Reportnet F-024 M Reportnet should provide the means to collect INSPIRE datasets and services, all relevant datasets have to be available (complete geographic coverage, topic) F-026 If the Inspire service is indicated in the reporting data flow, testing the availability of that service should be part of the reporting workflow. Secondly, it must be tested if the returned data follows the expected schema T-010 S Atom feed: Supplied Atom feeds should be datasets feeds (not top feeds). The Reportnet should include quality procedures to test the service and to provide the notification on findings. F-025 C If an Inspire dataset contains features related to different reporting obligations, make sure it is possible to download only the part of the dataset related to that reporting. F-027 Test of each reported entry can be reached in the Inspire dataset through the given service(s). The Reportnet should include quality procedures to test the data and matching and to provide the notification on findings. T-012 Reportnet workflows should continue to accommodate manual uploading of datasets where services are not available. T-013 WFS: Reportnet must be able to check if stored queries exist, therefore the ListStoredQueries feature is mandatory. T-014 Filenames in archived datasets should be unique. The Reportnet should include quality procedures to test the service and to provide the notification on findings. T-011 W Atom dataset feeds supplied for harvesting could contain only the entries for the datasets under the specific reporting obligation. The Reportnet should include quality procedures to test the service and to provide the notification on findings.

Reporting data and service providers (MS) Requirements – Reporting data and service providers ReqID Priority Reporting data and service providers (MS) F-028 M Improve the implementation of inspire ID in Natura 2000 SDF. F-030 S Ensuring the spatial data (downloaded through INSPIRE download services) match the reported thematic data. The responsibility to ensure this synchronisation lies with the MS. However, while checking the data, there should be a mechanism in Reportnet to indicate the discrepancies to the MS reporting contact person (reporter). T-010 Atom feed: Supplied Atom feeds should be datasets feeds (not top feeds). F-031 C WFS: It should be easy and straightforward to extract a single feature using INSPIRE identifier or thematic identifier. T-013 WFS: WFS Download Service must provide the ListStoredQueries feature for further interrogations. T-014 Compressed files: Filenames in compressed files (archived datasets) should be unique. T-011 W Atom feed: Atom dataset feeds supplied for harvesting could contain only the entries for the datasets under the specific reporting obligation.

INSPIRE Protected sites Requirements – Data models, data provision, INSPIRE ReqID Priority Natura 2000 INSPIRE Protected sites F-028 M The way the InspireID in the Natura 2000 SDF should be provided must be detailed, tackling the namespace/localID issue (one attribute, standardisation), or Natura 2000 SDF might change Standardisation for using Inspire ID for thematic data sets, e.g. Natura 2000 SDF ReqID Priority INSPIRE F-029 S In general, it would be very useful if the Inspire data models would include thematic identifiers, that would allow to relate the features to the corresponding reporting obligations. For the purpose of the Natura 2000 reporting, the thematic ID (Natura2000 siteCode) should be added to the INSPIRE Protected sites (PS) schema. F-031 C It would be much more useful to allow a standarised query on the Inspire ID or, if it is available in the Inspire dataset, on the thematic indentifier. As a possible solution, making available a stored query returning a feature with the specific Inspire ID would be very useful. Alternatively, a getFeatureByThematic ID could be used