Ilya Zaslavsky, Jeffrey Grethe, Amarnath Gupta, Burak Ozyurt

Presentation transcript:

Community Inventory of EarthCube Resources for Geoscience Interoperability. Ilya Zaslavsky, Jeffrey Grethe, Amarnath Gupta, Burak Ozyurt, Thomas Whitenack, David Valentine, Adam Schahne (University of California San Diego); Stephen Richard (Arizona Geological Survey); Kerstin Lehnert, Leslie Hsu (LDEO, Columbia University); Tanu Malik (University of Chicago); Luis Bermudez (Open Geospatial Consortium). RDA 9/2016 - Denver

Metadata aggregation in CINERGI. CINERGI Metadata Pipeline: Domain Inventories; RCN (Research Coordination Networks); Domain workshops; High-level assets; Catalogs.

Content enhancement components: Common enhancer API. Provenance recording: W3C PROV and Neo4J. Spatial enhancer (bounding boxes). Keyword enhancer: Materials; Processes; Equipment; Methods; Features; Activities; Science Domains; Geologic age; Organizations; Resource types. GeoSciGraph API for semantic processing. Validation and provenance components.
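As a rough illustration of what a common enhancer API could look like, the sketch below chains a spatial enhancer (bounding box from point coordinates) and a keyword enhancer over a toy metadata record, logging each change as a provenance note. Everything here is an assumption made for illustration: the `Record` fields, the `enhance` method, and the substring-based keyword matching are hypothetical and are not taken from the CINERGI code base, which records provenance with W3C PROV and Neo4J rather than plain strings.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Protocol, Tuple


@dataclass
class Record:
    """Toy metadata record; field names are hypothetical."""
    title: str
    abstract: str
    points: List[Tuple[float, float]] = field(default_factory=list)  # (lon, lat)
    keywords: List[str] = field(default_factory=list)
    bbox: Optional[Tuple[float, float, float, float]] = None  # west, south, east, north
    provenance: List[str] = field(default_factory=list)


class Enhancer(Protocol):
    """Hypothetical common interface shared by all enhancers."""
    name: str

    def enhance(self, record: Record) -> None: ...


class SpatialEnhancer:
    name = "spatial"

    def enhance(self, record: Record) -> None:
        # Derive a bounding box from point coordinates, if any are present.
        # This naive min/max ignores antimeridian crossings.
        if not record.points:
            return
        lons = [lon for lon, _ in record.points]
        lats = [lat for _, lat in record.points]
        record.bbox = (min(lons), min(lats), max(lons), max(lats))
        record.provenance.append(f"{self.name}: bbox from {len(record.points)} points")


class KeywordEnhancer:
    name = "keyword"

    def __init__(self, vocabulary: Dict[str, str]) -> None:
        self.vocabulary = vocabulary  # term -> keyword category

    def enhance(self, record: Record) -> None:
        # Plain substring matching stands in for real semantic processing.
        text = f"{record.title} {record.abstract}".lower()
        for term, category in self.vocabulary.items():
            if term in text and term not in record.keywords:
                record.keywords.append(term)
                record.provenance.append(f"{self.name}: added '{term}' ({category})")


def run_pipeline(record: Record, enhancers: List[Enhancer]) -> Record:
    """Apply each enhancer in order; every change leaves a provenance note."""
    for enhancer in enhancers:
        enhancer.enhance(record)
    return record


record = Record(
    title="Basalt samples",
    abstract="Seismometer data collected near the ridge",
    points=[(-120.0, 35.0), (-118.5, 36.2)],
)
vocab = {"basalt": "Materials", "seismometer": "Equipment"}
run_pipeline(record, [SpatialEnhancer(), KeywordEnhancer(vocab)])
```

Because each enhancer only needs an `enhance` method, new components can be added to the chain without changing the pipeline function.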

GeoSciGraph and Ontologies. GeoSciGraph: an ontology management system that provides the semantic infrastructure to integrate and search multiple data resources across sub-disciplines of Earth Science. Some included ontologies: SWEET; ENVO; CHEBI; YAGO (geo features); NASA GCMD (equipment, providers); GeoSciML Geochronology; EDAM Bioinformatics (software terms and operations). Also: VIAF.

GeoSciGraph Services API. The GeoSciGraph API exposes a set of web services for querying and exploring the CINERGI ontology. Lexical Services break text into sentences and parse them using lightweight NLP techniques. Vocabulary Services find concepts, synonyms, and term categories, and support autocomplete search and term suggestions based on similarity.
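The kinds of lookups that Vocabulary Services offer can be sketched with a small in-memory vocabulary. This is only an illustration under stated assumptions: the vocabulary entries, the function names (`find_concept`, `autocomplete`, `suggest`), and the use of Python's `difflib` for similarity are all hypothetical and say nothing about how the actual web services are implemented or called.

```python
import difflib
from typing import List, Optional

# Toy vocabulary (hypothetical data): preferred term -> synonyms and category.
VOCABULARY = {
    "basalt": {"synonyms": ["basaltic rock"], "category": "Materials"},
    "groundwater": {"synonyms": ["ground water"], "category": "Features"},
    "seismometer": {"synonyms": ["seismograph"], "category": "Equipment"},
}


def find_concept(label: str) -> Optional[str]:
    """Resolve a preferred label or a synonym to its preferred term."""
    needle = label.strip().lower()
    for term, info in VOCABULARY.items():
        if needle == term or needle in info["synonyms"]:
            return term
    return None


def autocomplete(prefix: str, limit: int = 5) -> List[str]:
    """Return preferred terms starting with the given prefix."""
    p = prefix.strip().lower()
    return sorted(t for t in VOCABULARY if t.startswith(p))[:limit]


def suggest(term: str, limit: int = 3) -> List[str]:
    """Suggest similar terms using difflib's sequence-similarity ratio."""
    return difflib.get_close_matches(term.lower(), list(VOCABULARY), n=limit, cutoff=0.6)


concept = find_concept("Seismograph")
completions = autocomplete("ground")
suggestions = suggest("grondwater")  # misspelled on purpose
```

A production service would index labels rather than scan them, but the three operations map onto the concept, autocomplete, and suggestion lookups named on the slide.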

GeoSciGraph Services API. Graph Services navigate the graph by following user-specified relationships and finding neighborhoods. Another service locates the head of a clique (a subgraph in which every pair of nodes is connected) in an ontology graph. Refine Services provide a gateway to OpenRefine (formerly Google Refine) for matching entries in a data table to an ontology. The Cypher Utility Service is a pass-through service that sends a user-specified Cypher query directly to the underlying Neo4J system. Analyze Services provide a way to add custom-defined analyses to the GeoSciGraph system.
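Neighborhood navigation over user-specified relationships can be illustrated with a breadth-first traversal of a tiny in-memory graph. The edge list, relationship names, and `neighborhood` function are hypothetical, and the Cypher string at the end assumes a made-up schema (a `label` property and a `subClassOf` relationship type); it is included only to show the shape of query that a pass-through service might forward, and is not executed here.

```python
from collections import deque
from typing import Iterable, List

# Toy ontology graph (hypothetical data): (source, relationship, target).
EDGES = [
    ("basalt", "subClassOf", "igneous rock"),
    ("igneous rock", "subClassOf", "rock"),
    ("basalt", "relatedTo", "volcano"),
    ("granite", "subClassOf", "igneous rock"),
]


def neighborhood(start: str, relationships: Iterable[str], depth: int) -> List[str]:
    """Breadth-first walk along outgoing edges of the allowed types,
    returning nodes reachable within `depth` hops, in discovery order."""
    allowed = set(relationships)
    seen = {start}
    queue = deque([(start, 0)])
    found = []
    while queue:
        node, dist = queue.popleft()
        if dist == depth:
            continue  # do not expand past the requested depth
        for src, rel, dst in EDGES:
            if src == node and rel in allowed and dst not in seen:
                seen.add(dst)
                found.append(dst)
                queue.append((dst, dist + 1))
    return found


ancestors = neighborhood("basalt", ["subClassOf"], depth=2)
direct = neighborhood("basalt", ["subClassOf", "relatedTo"], depth=1)

# Roughly equivalent Cypher under the assumed schema (not executed here).
CYPHER_EXAMPLE = (
    "MATCH (n {label: 'basalt'})-[:subClassOf*1..2]->(m) "
    "RETURN DISTINCT m.label"
)
```

The sketch follows outgoing edges only; a real neighborhood query would usually let the caller choose the direction as well.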

Manual Review of Keyword and Location Assignments (CINERGI Metadata Annotator)

Interesting issues… Re-publishing linked data: ISO 19115? RDF? JSON-LD? Semantic conflicts: selecting which ontology IDs to use when they conflict; our ability to detect concepts and assign keywords may not match the ontology's level of detail; lots of tricks in the bridge ontology. Enabling faceting and search: pre-defining upper facets; adjusting underlying ontology fragments for consistency (cinergiParent, cinergiFacet annotations). Generating a corpus of text to analyze (crawling, introspection). Curating keyword assignments: manual; tool-assisted; community curation; automated (machine learning; rules). Adding usage metadata (eventually a facet?). Communities may promote their own facets.
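One way to picture how pre-defined upper facets might be derived from annotated ontology terms is sketched below. The slide names the cinergiParent and cinergiFacet annotations but does not define them, so the semantics assumed here (cinergiParent points to the next broader term, cinergiFacet flags a term usable as a facet level) are a guess made for illustration, as are the term data and the `facet_path` function.

```python
from typing import Dict, List

# Toy annotated terms (hypothetical data and annotation semantics).
TERMS: Dict[str, dict] = {
    "basalt": {"cinergiParent": "igneous rock"},
    "igneous rock": {"cinergiParent": "rock"},
    "rock": {"cinergiParent": "material", "cinergiFacet": True},
    "material": {"cinergiFacet": True},
    "seismometer": {"cinergiParent": "equipment"},
    "equipment": {"cinergiFacet": True},
}


def facet_path(term: str) -> List[str]:
    """Walk up the assumed cinergiParent chain and keep only terms flagged
    as facets, returned from the broadest facet down to the narrowest."""
    path = []
    visited = set()
    current = term
    while current is not None and current not in visited:
        visited.add(current)  # guards against cycles in the annotations
        info = TERMS.get(current, {})
        if info.get("cinergiFacet"):
            path.append(current)
        current = info.get("cinergiParent")
    return list(reversed(path))


def group_by_top_facet(keywords: List[str]) -> Dict[str, List[str]]:
    """Group assigned keywords under their broadest facet for a search UI."""
    groups: Dict[str, List[str]] = {}
    for keyword in keywords:
        path = facet_path(keyword)
        if path:
            groups.setdefault(path[0], []).append(keyword)
    return groups


basalt_path = facet_path("basalt")
facets = group_by_top_facet(["basalt", "seismometer", "unmapped term"])
```

Keywords with no facet ancestor are simply left out of the grouping, which mirrors the concern that detected concepts may not line up with the ontology's level of detail.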