C ommunity In ventory of E arthCube R esources for G eoscience I nteroperability data discovery is the most often cited issue in executive summaries on.

Slides:



Advertisements
Similar presentations
GEOSS ADC Architecture Workshop Clearinghouse, Catalogues, Registries Doug Nebert U.S. Geological Survey February 5, 2008.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
CAP Support in Esris Open Source Geoportal Server WMO Information System (WIS) CAP Implementation Workshop Geneva, 6-7 April 2011 Clive Reece
Where next…. Stakeholder workshop, 29 Jan To the end of the project.
Geospatial One-Stop A Federal Gateway to Federal, State & Local Geographic Data
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Doug Nebert, Senior Advisor for Geospatial Technology, System-of-Systems Architect FGDC Secretariat.
OneGeology-Europe - the first step to the European Geological SDI INSPIRE Conference 2010, Session Thematic Communities: Geology Krakow, June 24 th 2010.
National Coastal Data Development Center A division of the National Oceanographic Data Center Please a list of participants at each location to
ODM2: Developing a Community Information Model and Supporting Software to Extend Interoperability of Sensor and Sample Based Earth Observations Jeffery.
Environmental Terminology System and Services (ETSS) June 2007.
An Architecture for Creating Collaborative Semantically Capable Scientific Data Sharing Infrastructures Anuj R. Jaiswal, C. Lee Giles, Prasenjit Mitra,
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Doug Nebert Senior Advisor for Geospatial Technology CSS, FGDC Secretariat.
Managing Data Interoperability with FME Tony Kent Applications Engineer IMGS.
January, 23, 2006 Ilkay Altintas
Metadata (for the data users downstream) RFC GIS Workshop July 2007 NOAA/NESDIS/NGDC Documentation.
Status of upgrading CDI service (user interface, harvesting via GeoNetwork, CDI interoperability options following SeaDataNet D8.7) By Dick M.A. Schaap.
Information Requirements for Integrating Spatially Discrete, Feature- Based Earth Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Kerstin Lehnert,
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
Registration and Harvest IIB Presentation May 1,
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Various tools created depending on user needs: Desktop vs. Web Applications GUI vs. XML Tools for Creating and Editing ISO Metadata.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
U.S. Department of the Interior U.S. Geological Survey CWG Workshop December 4, 2007 Geospatial One-Stop Gateway for Discovery and Access Rob Dollison.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
Semantic Web, Web Services and Museums: Mapping the Road to Implementation John Perkins “MESMUSES Workshop” Florence, June 16-17, 2003.
ESIP & Geospatial One-Stop (GOS) Registering ESIP Products and Services with Geospatial One-Stop.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Data Management at the National Climate Change and Wildlife Science Center.
Finding Water Resource Data: A Discussion David Arctur Ilya Zaslavsky OGC Hydrology DWG Workshop Sept 2015, Orleans France.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
® Sponsored by Towards a Conceptual Design of a Cross-Domain Integrative Information System for the Geosciences ILYA ZASLAVSKY, DAVID VALENTINE, AMARNATH.
GIS data sources; catalogs of data and services. USGS: National Mapping.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
1 Using the GEOSS Common Infrastructure in the Air Quality & Health SBA: Wildfire & Smoke Assessment Prepared by the GEOSS AIP-2 Air Quality & Health Working.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Hydro DWG at the RDA Plenary BoF - Improve sharing of water resource data globally 24 September BREAKOUT :30-15:00.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
ESIP AQ Cluster Community Components for the Air Quality SBA in AIP-2.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
ILYA ZASLAVSKY RAQUEL CALDERON CHRIS CONDIT JEFFREY GRETHE AMARNATH GUPTA BURAK OZYURT THOMAS WHITENACK DAVID VALENTINE ALICE GILIARINI AARON GONG University.
Glossary WMS – OGC Web Mapping Services WFS – OGC Web Feature Services XML- Extensible Markup Language OGC – Open GIS Consortium ADN –
1 Using the GEOSS Common Infrastructure in the Air Quality & Health SBA: Wildfire & Smoke Assessment Prepared by the GEOSS AIP-2 Air Quality & Health Working.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
GEOSS Common Infrastructure: A Practical Tour Doug Nebert U.S. Geological Survey AIP-3 Kickoff March 2010.
Metayogi Increasing the Accessibility of the Semantic Web Karim Tharani Doug Macdonald Rachel Heidecker.
GEOSS Common Infrastructure (GCI) The GEOSS Common Infrastructure allows Earth Observations users to search, access and use the data, information, tools.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
geospatial catalogues in the Web of Data
GeoNetwork OpenSource: Geographic data sharing for everyone
Paul Eglitis [IEEE] and Siri Jodha S. Khalsa [IEEE]
Sharing Hydrologic Data with the CUAHSI* Hydrologic Information System
User Characterization in Search Personalization
Ilya Zaslavsky Jeffrey Grethe amarnath Gupta burak Ozyurt
Washington, DC USA Luis Bermudez March
Accessing Spatial Information from MaineDOT
The Re3gistry software and the INSPIRE Registry
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Session 2: Metadata and Catalogues
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
LOD reference architecture
IGARSS 2019 Dr. Ingo Simonis July 2019
Presentation transcript:

C ommunity In ventory of E arthCube R esources for G eoscience I nteroperability data discovery is the most often cited issue in executive summaries on the EarthCube web site CINERGI Ilya Zaslavsky, Steve Richard and the CINERGI team

Goals  Large inventory of high quality information resources across disciplines, with traceable provenance, usable across EarthCube research scenarios:  datasets, catalogs, vocabularies, information models, services, process models, repositories, etc.  Make it open to the community  Organize it to enable search and integration across domains and linking between information objects  Plus links between resources, people/organizations, publications, models, workflows, software, activities, etc.

Approach  Build on high-level resource inventory started at  Compile metadata for as many resources as we can (collect recommendations from geoscientists, harvest existing catalogs)  Expose through simple search interface  Use off the shelf technology: Geoportal, ISO metadata, CSW  Make it accessible through EarthCube.org

READINESS ASSESSMENT 1 Catalog Metadata M1 Has a data listing M2 Uses minimal metadata standard, such as Dublin Core M3 Uses metadata standard, such as FGDC, or INSPIRE Catalog Search S1 Search Interface S2 Search API, not following a standard S3 Complies with Opensearch API S4 Complies with OGC CSW API Catalog Harvest H1 Has a harvest API H2 OAI API H3 OGC CSW API Vocabulary – Control and Access V1 Uses controlled terminology V2 Community Managed Terminology V3 SPARQL Vocabulary -- Representation T1 Listing of terminology, such as web pages T2 Uses ontology or SKOS Data Access API A1 Bulk download A2 Static URL A3 Web Service Data Query API Q1 Simple query subset Q2 Complex query Q3 Processing Subset Information Model Conceptual C0 Unspecified C1 Domain/Conceptual Model using UML C2 Domain/Conceptual Model using UML based on OGC or ISO standards Information Model as XML X1 XML Format. Schema may not be specified X2 Xml Schema Information Model as SQL S1 Provides an SQL Schema Also evaluated: processing services; visualization services; community consensus efforts; identifier persistence

High-level inventory and readiness assessment: viewer

Staging Database Document processing components Harvest adapters Public access components Harvest adapters: components that connect to information sources and import descriptions of EarthCube resources into the staging database. Staging Database: document database that persists the originally harvested descriptions in their native state, as well as any additional information or updates resulting from subsequent processing/curation of the description Document processing components: components that pull documents from the staging database, perform various functions to upgrade content or transform presentation. The processed document may be pushed back to the staging database or out to the public access components Public access components: components that connect to document processors and implement external interfaces to present content for users Interfaces to the world Resource descriptions Ye Most Excellent EarthCube Inventory System

Then add features  Links to organizations, researchers, other systems  Validation Services  Deep registration of datasets/databases (at feature level)  Data search capabilities  Quality/interop readiness assessment  Annotation system

CINERGI Outline (without deep registration so far) Publication Staging and curation Harvesting Geoportal CSW, ISO ATOM, GeoRSS, etc. Linked data RDF, RDF store, eg Neo4j Extra metadata, provenance, links, annotations WAF w/XML ISO Staging DB: MDB MongoDB, CouchDB Geoportal, etc. ISODC other CSW, OAI-MPH, WAF, CKAN, other DISCO Validated triples 1. Metadata validation per record 2. Triggering parsers depending on metadata and validation results Spatial parser Person /org parser LOD parser Keyword parser Topic parser Time parser Finding ambiguities for manual curation Need a parser API so parsers can be added Duplicate detection, tagging, grouping Curation UI Results of parsing Provenance Duplicate flags Search UI Reporting to sources Pivot for search results Harvesting dashboard Record editor Community pivots Hot page Search in domain systems geoportal pivotDB

Challenges  Scope  Different levels of granularity  Lack of formal information models  Implicit domain semantics  Multiple metadata registry platforms and standards  Lots of data outside managed repositories  Cross-domain governance vs domain systems  Different expectations across domains (survey)

Initial inventory earthcube.org Resources from domain workshops and surveys + initial harvesting

Domain inventories: you are invited to participate!  All sources of data mentioned at domain end-user workshops – are included  Working with funded RCNs Step 1: Prepare an initial collection in a spreadsheet. Step 2: CINERGI will set up your community resource viewer and editing system, seeded with your collection Step 3: Community editing, updates and curation

Short questionnaire FunctionImportanceComments Making metadata from your facility available for search using standard metadata, via standard APIs Unimportant Essential NA DK Tracking demand for and cross-domain usage of your resources Unimportant Essential NA DK Identifying issues related to data and metadata quality and completeness Unimportant Essential NA DK Tracking search hits that become searches for resources managed by your data facility Unimportant Essential NA DK Connecting owners of relevant datasets to your facility for potential longer-term data management Unimportant Essential NA DK Connecting data from your facility with people, publications, models, and projects Unimportant Essential NA DK Identifying communities using data, tools, and models from your facility Unimportant Essential NA DK Validating published metadata and service signatures from your facility Unimportant Essential NA DK Finding and reporting to you resources that appear as duplicates across multiple registries Unimportant Essential NA DK Potential added value by a cross-domain system Integration with cross-domain search Key characteristics for CINERGI See CINERGI Survey at

Development Team  San Diego Supercomputer Center/UCSD  Ilya Zaslavsky, David Valentine, Tom Whitenack  Amarnath Gupta, Jeff Grethe (NIF project)  Lamont /Columbia Univ./IEDA  Kerstin Lehnert, Leslie Hsu  Arizona Geological Survey  Stephen Richard  University of Chicago  Tanu Malik  Open Geospatial Consortium  Luis Bermudez Community Partners Anthony Aufdenkampe: Critical Zone Observatories Shanan Peters: stratigraphy Bernhard Peucker- Ehrenbrink: Global River Observatories RCN projects that plan to organize community resources Test Enterprise Governance Building Blocks projects working on web services, brokering solutions Agencies International