METADATA from observation to its use

Presentation transcript:

METADATA from observation to its use. Dr Esa Falkenroth, Information Architect, SMHI. 1st Data Provider Workshop, St Petersburg, November 2016.

Three perspectives: producer, infrastructure, users.
Tasks: Create data sets; Format data sets; Provide data sets online; Write abstract; Geo-relate data sets; Classify data sets; Enter metadata; Maintain metadata; Develop metadata standard; Develop classification system; Locate datasets matching keyword; Locate datasets matching geo; List matching data sets; Show data sample; Search using classification; Search based on geolocation; Search based on keyword; Pick data sets; Provide download service; Download dataset; Open dataset; Understand dataset; Use data set.


Producer view: all very busy. "Research is done", "can't update all portals." Everything else on the task list is SOMEBODY ELSE'S RESPONSIBILITY.

Digital infrastructure view: all very busy. "I don't know the data…", "…OGC, XML, WFS, HDF!" Everything else on the task list is left to SOMEBODY ELSE.

User view: all very busy. "I just want to search, download and use the data." Everything else on the task list is left to SOMEBODY ELSE.

Helicopter view: producer, digital infrastructure, user. MIND THE GAP.

Helicopter view: producer, digital infrastructure, user. MIND THE GAP. Who writes metadata for old inactive projects? Who should make the classification: producer, user or "mediators"?

Not a new problem… "Where is that book?" …before the librarians.

We can do better with metadata for open data. "Somebody else's problem" does not help. Collaborate with providers. NOT a software issue, just hard work.

SWITCH-ON METADATA LIBRARY. Diagram labels: producer, UPLOAD TOOL, SWITCH-ON METADATA LIBRARY, SEARCH TOOL, user, DOWNLOAD, DOWNLOAD (SVN). MIND THE GAP.

SWITCH-ON METADATA LIBRARY: FILLING METADATA. Diagram labels: producer, UPLOAD TOOL, SWITCH-ON METADATA LIBRARY, SEARCH TOOL, user, "BYOD". MIND THE GAP. Library tasks: Maintain ontologies; Catalogue resources; Create metadata. Hydrologists work during the summer period to improve abstracts, geographical information and license information for data sets by contacting data providers.

8300 resources, usable metadata: varying formats; many licences; ways to access; being added to GEOSS.
LICENSE (chart labels): free, copyright, acknowledgement, science, special, non-commercial; 48%.
ACCESS: direct download 78%; request 14%; viewing 8%.
FORMAT: hdf/netcdf 24%; excel 24%; html 10%; other 10%; dat 9%; shape 9%; asc/txt 7%.
SWITCH-ON datasets: Downloadable datasets (direct); Request datasets (require registration); View services (no download); Download services (e.g. ftp servers); Other websites (with open data).

SWITCH-ON innovation in usable metadata search
- Innovative (usable) classification
- Innovative (usable) geospatial data
- Innovative (usable) interface
- Innovative budget (0.2% for "librarian work")
- Extend/correct incomplete or missing abstracts
- More detailed spatial coverage for point sources
- Reclassification for water science (user perspective)

Classification problems
(1) Generic themes give many "hits" (not specific). A generic portal has generic keywords: GEOSS Water (155035 hits), GEOSS Climate (24436 hits), GEOSS Agriculture (11866 hits).
(2) Producers and users use different sets of keywords. Producer: WFS, realtime portal, operational data store. User: mass fraction pm2p5 nitrate dry aerosol, runoff.
(3) Neither the producer, the user nor the mediators necessarily have the "whole picture" needed to make a good classification. Here, user communities can help develop usable classifications (that work for search).

Usability-driven classification: resources are catalogued based on how the users will search instead of using the producers' terms. Balancing "specialisation degree": too specific (zero hits); too generic (too many hits); good enough (7-100 hits). SWITCH-ON extended the well-known CUAHSI ontology for the hydrosphere with additional keywords to cover land-use and population data.
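The "specialisation degree" balance can be sketched as a simple check on hit counts. This is only an illustration, not SWITCH-ON code: the function name, the "few hits" label for 1 to 6 hits (a band the slide does not name) and the two hypothetical keywords are assumptions; the GEOSS hit counts and the 7-100 range come from the slides.

```python
def classify_keyword(hits, good_min=7, good_max=100):
    """Label a keyword by the number of datasets a search on it returns."""
    if hits == 0:
        return "too specific"
    if hits > good_max:
        return "too generic"
    if hits >= good_min:
        return "good enough"
    # 1-6 hits: this band is not named on the slide; the label is an assumption.
    return "few hits"


hit_counts = {
    "GEOSS Water": 155035,        # from the slides
    "GEOSS Climate": 24436,       # from the slides
    "GEOSS Agriculture": 11866,   # from the slides
    "hypothetical narrow keyword": 0,   # made up for illustration
    "hypothetical tuned keyword": 42,   # made up for illustration
}

labels = {keyword: classify_keyword(hits) for keyword, hits in hit_counts.items()}

for keyword, label in labels.items():
    print(f"{keyword}: {hit_counts[keyword]} hits, {label}")
```

A check like this could be run over a whole keyword list to flag the terms that need splitting (too generic) or merging (too specific).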

Good fit with GEOSS DAB thesaurus

Usable spatial search and the world box problem: Bounding boxes are great for describing the coverage of maps and gridded data. However, for in-situ data, bounding boxes give false positives. More detailed spatial resolution, with individual in-situ positions for datasets, facilitates search on a local or regional scale. Technically, bounding boxes / polygons are replaced with multipoint coverage.

Usable spatial search and the world box problem: What happens if the user searches for his/her area of interest? The search box matches the bounding box of the data… but there are no relevant data in the dataset found.
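The world box problem can be reproduced in a few lines. This is a minimal sketch under stated assumptions: the station coordinates and the search box are hypothetical, boxes are plain (min_lon, min_lat, max_lon, max_lat) tuples, and no real catalogue or geoportal API is used.

```python
def bounding_box(points):
    """Smallest (min_lon, min_lat, max_lon, max_lat) box around (lon, lat) points."""
    lons = [lon for lon, _ in points]
    lats = [lat for _, lat in points]
    return (min(lons), min(lats), max(lons), max(lats))


def boxes_intersect(a, b):
    """True if two (min_lon, min_lat, max_lon, max_lat) boxes overlap."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def any_point_in_box(points, box):
    """True if at least one (lon, lat) point lies inside the box."""
    return any(
        box[0] <= lon <= box[2] and box[1] <= lat <= box[3]
        for lon, lat in points
    )


# Hypothetical in-situ dataset: a few stations spread over two continents.
stations = [(11.97, 57.71), (18.07, 59.33), (151.21, -33.87)]

# Hypothetical area of interest chosen by the user.
search_box = (60.0, 10.0, 100.0, 40.0)

bbox_match = boxes_intersect(bounding_box(stations), search_box)
multipoint_match = any_point_in_box(stations, search_box)

print("bounding-box search matches:", bbox_match)
print("multipoint search matches:", multipoint_match)
```

With bounding-box metadata the dataset is returned even though no station lies in the area of interest; with multipoint coverage it is not.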

Balanced and pragmatic approach to metadata: Not all data sets are equally popular. More popular datasets need more refined/detailed metadata. Not all ISO metadata attributes are necessary for search. Water scientists mainly want to search by classification and/or spatial reference (co-location). The rest is a simple matter of automatic filtering (as implemented by the geoportal) or simply sifting through the often limited results. This means the search metadata can be simplified while still maintaining compatibility with GEOSS and ISO standards.

Agile development of user interfaces: the agile approach is a proven/established method of finding user requirements and developing software.
- Easier interfaces
- Less coding
- Easier testing
- Faster time to market
- Happier users

Very basic search tool

Welcome to the SWITCH-ON Portal
SWITCH-ON is developing a large number of commercial water-information products and services.
Open Virtual Water-Science Laboratory: research infrastructure to facilitate collaboration, transparency and repeatable computational experiments.
Tailored data, research results and marketing.
SWITCH-ON will give free access: tools for data search and knowledge brokering for development and marketing of commercial information products and services; one-stop shop with water information and tools for water scientists, consultancies and managers.

Increasing the use of GEOSS: summary from three perspectives
Producers can do better:
- Share data to enable innovation and better research, e.g. in climate
- Use clear, permissive licences! Preferably Creative Commons.
- Provide complete and correct metadata in standard machine-readable formats
Mediators (portals, brokers, data hubs) can do better:
- Monitor availability, completeness and usability of the data sets
- Encourage open data and the adoption of Creative Commons
- Active, pragmatic collaboration with data providers: increase the precision of spatial information (e.g. multipoint coverages); update broken links in collaboration (some 4% yearly loss of data in our collection); fix missing descriptions of all data sets
- Librarian effort in SWITCH-ON is less than 0.2% of the total project cost.
User communities, research organisations and product developers:
- Better communicate their primary requirements for data search
- Contribute to metadata (especially for data sets from smaller local projects)
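To give a feel for what "some 4% yearly loss" means if links are never repaired, here is a rough calculation. It assumes that the 4% rate stays constant from year to year and that it applies to the whole collection of 8300 resources quoted on an earlier slide; both assumptions are mine, not the speaker's.

```python
def resources_remaining(initial, yearly_loss, years):
    """Resources still reachable after `years`, assuming a constant yearly loss rate."""
    return initial * (1 - yearly_loss) ** years


initial_resources = 8300   # collection size quoted in the slides
yearly_loss = 0.04         # "some 4% yearly loss" from the slides

for year in range(1, 6):
    remaining = resources_remaining(initial_resources, yearly_loss, year)
    print(f"after {year} year(s): about {round(remaining)} resources still reachable")
```

Under these assumptions roughly 330 resources would drop out in the first year alone, which is the kind of upkeep the "librarian work" is meant to cover.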

Thank you!