Search Relevancy in GEO Data Access Broker

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

Report of the Architecture and Data Committee (ADC) R.Shibasaki (ADC, Japan)
GEOSS Common Infrastructure (GCI): status and evolution EC Side Event - GEO Plenary IX Foz do Iguacu, November 2012 Mirko Albani Earth Observation.
GEOSS AIP-5 – Energy SBA Leader / POC: Lionel Menard - MINES ParisTech GEO-IX Foz do Iguaçu Brazil 2 Scenarios: High-Resolution.
The GEO Web Portal New Interface Guido Colangeli.
GCI Research Activity Stefano Nativi, Mattia Santoro.
GEOSS Common Infrastructure: A practical tour Doug Nebert U.S. Geological Survey September 2008.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
Earth Data Open Search Specifications Doug Newman (NASA ECHO) CWIC January 2014.
Registration and Harvest IIB Presentation May 1,
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
ENV proposal meeting, Geneva, Sep. 24, GCI Presentation Joost van Bemmelen, ESA
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
GEOSS Clearinghouse GEO Web Portal GEOSS Common Infrastructure Components & Services Standards and Interoperability Best Practices Wiki User Requirements.
WGISS-39, Tsukuba, Japan, May 11-15, 2015 GEO Community Portals Ken McDonald/NOAA CWIC Session, WGISS–39 May 13, 2015.
What is CWIC? Authors: Doug Newman Andrew Mitchell
GCI Elements the GEOSS Portal. GCI Snapshot Within the GCI: presentation layer –The GEOSS Portal implements the functionality related to the presentation.
NDD (National Oceans Office Data Directory) development overview as at 1 July 2002 Tony Rees/Miroslaw Ryba CSIRO Marine Research, Hobart.
GEO 2014 Work Plan Symposium Opening Remarks Barbara J. Ryan Director, GEO Secretariat Geneva, Switzerland 28 April 2014.
GEOSS Common Infrastructure (GCI) for Earth Observation Networks
Report of the Architecture and Data Committee (ADC) R.Shibasaki (ADC, Japan)
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
Task IN-03 GEO Work Plan Symposium 2014 GEOSS Common Infrastructure IN-03.
CEOS Open Search Best Practices Doug Newman (NASA ECHO) CWIC January 2014.
What is ECHO? ECHO Open Search ECHO Facts NASA’s Earth Observing System ClearingHOuse (ECHO) acts as the core metadata.
CWIC Open Search Best Practices Doug Newman (NASA ECHO) CEOS WGISS-37 April 15th 2014 Presenter: Archie Warnock (A/WWW Enterprises)
GEOSS Common Infrastructure: A Practical Tour Doug Nebert U.S. Geological Survey AIP-3 Kickoff March 2010.
Ideas on Opening Up GEOSS Architecture and Extending AIP-5 Wim Hugo SAEON.
GCI Overview Steve Browdy with input from Doug Nebert May 2012.
GEOSS Common Infrastructure (GCI) The GEOSS Common Infrastructure allows Earth Observations users to search, access and use the data, information, tools.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
GCI Architecture GEOSS Information System Meeting 20 September 2013, ESA/ESRIN (Frascati, Italy) M.Albani (ESA), D.Nebert (USGS/FGDC), S.Nativi (CNR)
OGC’s role in GEO: Results from the Architectural Implementation Pilot (AIP) George Percivall Open Geospatial Consortium GEO Task IN-05 Coordinator
Genève, 06 April GEOSS KNOWLEDGE BASE Towards a Knowledge-driven Access Stefano Nativi National research Council of Italy (CNR)
Page 1 CSISS Center for Spatial Information Science and Systems IIB and GCI Meeting CSR Architecture and Current Registration Status Prof. Liping Di Director.
Common interoperability, best practices and strategic approach
GEOSS Component and Service Registry (CSR)
CWIC Status Report Yonsook Enloe yonsook. k.
M. Santoro, F. Papeschi, E. Boldrini, S. Nativi
GCI Registration: Yellow Page approach Gregory Giuliani University of Geneva Stefano Nativi, Mattia Santoro CNR Paola De Salvo, Osamu Ochiai GEO Secretariat.
Providing access to GEOSS Resources The GEOSS Common Infrastructure - GCI Giovanni Rum GEO Secretariat AfriGEOSS Symposium, Victoria Falls
How FAIR is GEOSS BlueBRIDGE Workshop 3 April, 2017
WIS and GCI/GEOSS interoperability project
GEO WP 1. INFRASTRUCTURE (Architecture and Data Management)
GCI Requirements and GEOSS Portal Functionalities
GEO DAB APIs: Introduction
Data providers needs.
CAP-378 and “Conhecer para não ignorar”
Implementing through the GCI
Brokering Agreement process Stefano Nativi and Mattia Santoro ESSI-lab of CNR-IIA San Petersburg (Russia), 07 Nov 2016.
High Level Architecture
Capacity Building Enhance the coordination of efforts to strengthen individual, institutional and infrastructure capacities, particularly in developing.
2nd Data Providers Workshop Joost van Bemmelen Guido Colangeli
Geo Data Providers Workshop
GEOSS Evolution: the GEOSS Evolve Initiative
Building Search Systems for Digital Library Collections
GEOSS Air Quality Community Infrastructure
Metadata Quality WMO Information System and GEOSS Thorsten Büßelberg Deutscher Wetterdienst 7th November 2016 St Petersburg.
Interoperability WMO Information System and GEOSS Thorsten Büßelberg Deutscher Wetterdienst 20th April 2017 Florence.
CWIC Status Report Yonsook Enloe yonsook. k.
Enhanced GEOSS Portal Joost van Bemmelen / Guido Colangeli ESA/ESRIN
The GEO DAB possible contributions
Summary of Bottom-Up Thread 2
GEOSS Future Products Workshop March 26-28, 2013 NOAA
Community Portal Interactions with GEOSS GEO-XII Plenary November 9, 2015 Ken McDonald/NOAA
The GEO Discovery and Access Broker (DAB)
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
ESIP Winter Meeting 2016 January 2016
4/5 May 2009 The Palazzo dei Congressi di Stresa Stresa, Italy
WGISS WGISS Connected Data Assets Status Report October, 2019 CWIC Team Eugene Yu (GMU), Archie Warnock (A/WWW), Li Lin (GMU)
Presentation transcript:

Search Relevancy in GEO Data Access Broker CEOS WGISS Tech Expo Webinar March 14, 2017  Search Relevancy in GEO Data Access Broker Stefano Nativi and Mattia Santoro (Earth and Space Sciences Informatics laboratory, CNR)

GEOSS Common Infrastructure GEOSS end-Users DOWNSTREAM GEOSS Applications GEOSS Applications GEOSS Applications GEOSS Applications GEOSS Portal GEOSS Application Developers (intermediate Users) GEOSS Common Infrastructure APIs MIDSTREAM Mediation modules GEOSS Supply Chain Enterprise System 1 Enterprise System 3 … . System 4 Enterprise System 2 Enterprise System 1 Enterprise System 3 … . Enterprise System 2 Enterprise System 2 System 4 Enterprise System Z System 4 Enterprise System 3 Enterprise System 1 SBA 8 … . … . Enterprise System K Enterprise System j SBA 2 UPSTREAM SBA 1 GEOSS Providers

GEOSS Common Infrastructure (GCI) More than 150 GEOSS Data Providers More than 40 million Datsets About 200 million Granules Societal Benefit Areas Ranking and Pagination of discovery results Data Providers Societal Benefit Areas are ‘implemented’ via GEO-Community Activities, GEO Initiatives and GEO Flagships (in order or becoming more ‘mature services’) SBAs need access to data/other EO-resources from data/resource providers GEO is implementing this via a GEOSS Common Infrastructure (GCI) – and the GEOSS portal is the main Graphical User Interface for the Users, while the GEO-DAB (Discovery and Access Broker) is the middleware. Machine to Machine access to the DAB is as well possible via different API’s. > 200 million data resources spanning all SBAs

Ranking (and pagination) Weighted quality scores approach Static Score Pre-calculated in batch, based on: Metadata Quality Accessibility Etc. Dynamic Score Calculated on-the-fly, based on: Query Constraints Weights Applied to scores (configurable)

Static Score for metadata record R Essential Variable Quality Access Quality 𝑆 𝑠𝑡𝑎𝑡𝑖𝑐 𝑅 = 𝑊 𝑚𝑑𝑞 ∗𝑀𝐷𝑄 𝑅 + 𝑊 𝑒𝑣 ∗𝐸𝑉 𝑅 + 𝑊 𝑔𝑑𝑐 ∗𝐺𝐷𝐶 𝑅 + 𝑊 𝑎𝑞 ∗𝐴𝑄(𝑅 Metadata Quality GEOSS Data Core Quality   𝐸𝑉 𝑅 =𝑚𝑖𝑛⁡(10, 𝑜𝑐𝑐𝑢𝑟𝑟𝑒𝑛𝑐𝑖𝑒𝑠 𝑜𝑓 𝑑𝑖𝑠𝑡𝑖𝑛𝑐𝑡 𝑒𝑠𝑠𝑒𝑛𝑡𝑖𝑎𝑙 𝑣𝑎𝑟𝑖𝑎𝑏𝑙𝑒𝑠 𝑖𝑛 𝑅) 𝐺𝐷𝐶 𝑅 =10∗ &1, 𝑖𝑓 𝑅 𝑖𝑠 𝐺𝐸𝑂𝑆𝑆 𝐷𝑎𝑡𝑎 𝐶𝑜𝑟𝑒 &0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

(discovery + access) Metadata Quality Score 𝑀𝐷𝑄 𝑅 = 𝑚𝑖𝑛 200, 𝑖=0 𝑛 𝑅 ℎ𝑎𝑠 𝐹𝑖𝑒𝑙 𝑑 𝑖 ∗𝐹 𝑊 𝑖 20 Field Description Weight DIRECT DOWNLOAD GEO-SPATIAL SERVICE The metadata contains a link to directly download the dataset through a geo-spatial service (e.g. a GetCoverage request, an OPeNDAP request, etc.). Resources provided with a preview (e.g. WMS layers) are ranked first. 60 COMPLEX DOWNLOAD DIRECT DOWNLOAD GEO-SPATIAL SERVICE The metadata contains a link to a geo-spatial service, including: the name of the data layer to which the metadata is referred to and the protocol of the geo spatial service to invoke. 30 SIMPLE DOWNLOAD GENERIC SERVICE The metadata contains a link to directly download the dataset from a non geo-spatial service (e.g. ftp links, HTTP GET request for a KML document, etc.). 15 GENERIC LINK The metadata contains a link to resource on the web (e.g. HTML pages). 10 FILE IDENTIFIER The metadata contains the File Identifier field. 5 ABSTRACT The metadata contains the Abstract field. SPATIAL EXTENT The metadata contains the Spatial Extent covered for the resource. 3 TIME The metadata contains the Temporal Extent covered for the resource 2 TITLE The metadata contains the Title field. 1 Access Discovery

Access Quality Score Iteration over the list of the OnlineResource elements characterizing the record R 𝐴𝑄 𝑅 = 𝑚𝑖𝑛 𝑂 𝑂𝑛𝑙𝑖𝑛𝑒𝑠 𝐴𝑄 𝑂 , 10 𝐴𝑄 𝑂 = &1, 𝑑𝑎𝑡𝑎 𝑐𝑎𝑛 𝑏𝑒 𝑑𝑜𝑤𝑛𝑙𝑜𝑎𝑑𝑒𝑑 & &2, 𝑑𝑎𝑡𝑎 𝑐𝑎𝑛 𝑏𝑒 𝑡𝑟𝑎𝑛𝑠𝑓𝑜𝑟𝑚𝑒𝑑 𝑏𝑦 𝑡ℎ𝑒 𝐷𝐴𝐵 & &3, 𝑝𝑟𝑒𝑣𝑖𝑒𝑤 𝑡𝑖𝑙𝑒𝑠 𝑤𝑒𝑟𝑒 𝑔𝑒𝑛𝑒𝑟𝑎𝑡𝑒𝑑

Dynamic Score of Metadata Record R for Query Q 𝑆 𝑑𝑦𝑛𝑎𝑚𝑖𝑐 𝑄, 𝑅 = 𝑊 𝑡𝑖𝑡𝑙𝑒 ∗ 𝑇𝐼𝑇𝐿𝐸 𝑄,𝑅 + 𝑊 𝑎𝑏𝑠𝑡 ∗𝐴𝐵𝑆𝑇 𝑄,𝑅 + 𝑊 𝑘𝑤𝑑 ∗𝐾𝑊𝐷 𝑄,𝑅 + 𝑊 𝑎𝑛𝑦𝑡 ∗𝐴𝑁𝑌𝑇 𝑄,𝑅 + 𝑊 𝑏𝑏𝑜𝑥 ∗𝐵𝐵𝑂𝑋(𝑄,𝑅 = 𝑊 𝑡𝑖𝑡𝑙𝑒 ∗ 𝑇𝐼𝑇𝐿𝐸 𝑄,𝑅 + 𝑊 𝑎𝑏𝑠𝑡 ∗𝐴𝐵𝑆𝑇 𝑄,𝑅 + 𝑊 𝑘𝑤𝑑 ∗𝐾𝑊𝐷 𝑄,𝑅 𝑆 𝑑𝑦𝑛𝑎𝑚𝑖𝑐 𝑄, 𝑅 = 𝑊 𝑡𝑖𝑡𝑙𝑒 ∗ 𝑇𝐼𝑇𝐿𝐸 𝑄,𝑅 + 𝑊 𝑎𝑏𝑠𝑡 ∗𝐴𝐵𝑆𝑇 𝑄,𝑅 + 𝑊 𝑘𝑤𝑑 ∗𝐾𝑊𝐷 𝑄,𝑅 + 𝑊 𝑎𝑛𝑦𝑡 ∗𝐴𝑁𝑌𝑇 𝑄,𝑅 + 𝑊 𝑏𝑏𝑜𝑥 ∗𝐵𝐵𝑂𝑋(𝑄,𝑅 𝑇𝐼𝑇𝐿𝐸 𝑄,𝑅 =10∗ &1, 𝑖𝑓 𝑄 ℎ𝑎𝑠 𝑎 𝑡𝑖𝑡𝑙𝑒 𝑐𝑙𝑎𝑢𝑠𝑒 𝑎𝑛𝑑 𝑅 𝑠𝑎𝑡𝑖𝑠𝑓𝑖𝑒𝑠 𝑖𝑡 & & &0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝐴𝐵𝑆𝑇 𝑄,𝑅 =10∗ &1, 𝑖𝑓 𝑄 ℎ𝑎𝑠 𝑎𝑛 𝑎𝑏𝑠𝑡𝑟𝑎𝑐𝑡 𝑐𝑙𝑎𝑢𝑠𝑒 𝑎𝑛𝑑 𝑅 𝑠𝑎𝑡𝑖𝑠𝑓𝑖𝑒𝑠 𝑖𝑡 & & &0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Dynamic Score of Metadata Record R for Query Q 𝐾𝑊𝐷 𝑄,𝑅 =10∗ &1, 𝑖𝑓 𝑄 ℎ𝑎𝑠 𝑎 𝑘𝑒𝑦𝑤𝑜𝑟𝑑 𝑐𝑙𝑎𝑢𝑠𝑒 𝑎𝑛𝑑 𝑅 𝑠𝑎𝑡𝑖𝑠𝑓𝑖𝑒𝑠 𝑖𝑡 & & &0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝐴𝑁𝑌𝑇 𝑄,𝑅 =10∗ &1, 𝑖𝑓 𝑄 ℎ𝑎𝑠 𝑎𝑛 𝑎𝑛𝑦𝑡𝑒𝑥𝑡 𝑐𝑙𝑎𝑢𝑠𝑒 𝑎𝑛𝑑 𝑅 𝑠𝑎𝑡𝑖𝑠𝑓𝑖𝑒𝑠 𝑖𝑡 & & &0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Dynamic Score of Metadata Record R for Query Q Q contains a spatial clause: (Qbbox) ; R has a bounding box (Rbbox); If (Rbbox)  (Qbbox), then 𝐵𝐵𝑂𝑋 𝑄,𝑅 =10∗ 𝐴𝑟𝑒𝑎( 𝑅 𝑏𝑏𝑜𝑥 𝐴𝑟𝑒𝑎( 𝑄 𝑏𝑏𝑜𝑥

Thank you !