Federated Space-Time Query for Earth Science Data Using OpenSearch Conventions ESIP Federated Search Cluster Chris Lynnes Bruce Beaumont Ruth Duerr Hook.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

What is ECHO? HTTP-based Search and Ordering Using ECHOs REST and OpenSearch APIs How Can.
Earth Science Data: Why So Chris Lynnes.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
OGC and ESIP Discovery or Can’t we all just get along?? Christopher Lynnes.
Best Practices to Promote Data Interoperability Chris Lynnes Joe Glassy Technology Infusion Working Group.
Lightweight Advertising and Scalable Discovery of Services, Datasets, & Events Using Broadcast Feeds B. Wilson 1, G. Manipon 1, R. Ramachandran 2, A. Kulkarni.
Introduction to Geospatial Metadata – FGDC CSDGM National Coastal Data Development Center A division of the National Oceanographic Data Center Please .
Obtaining MISR Data and Information Jeff Walter Atmospheric Science Data Center April 17, 2009.
WGISS CNES SIT-30 Agenda Item 10 CEOS Action / Work Plan Reference 30 th CEOS SIT Meeting CNES Headquarters, Paris, France 31 st March – 1 st April 2015.
Metadata (for the data users downstream) RFC GIS Workshop July 2007 NOAA/NESDIS/NGDC Documentation.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
Introduction to NASA Goddard Space Flight Center(GSFC) Data and Information Services Center(GES DISC) National Aeronautics and Space Administration
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
1 Maintaining the momentum of OpenSearch in Earth Science data discovery Doug Newman (NASA ECHO) & Dr Chris Lynnes (GES DISC) 12/11/13 10:50am PT IN32A-03.
Community Based Initiatives GES DISC UWG 2011 Meeting.
Giovanni for AQ Gregory Leptoukh NASA Goddard Space Flight Center Goddard Earth Sciences Data and Information Services Center (GES DISC)
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Web services at TRFIC TRFIC has developed the Access Technologies to achieve its goals of interoperability and provide access to data and information on.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Integrated Model Data Management S.Hankin ESMF July ‘04 Integrated data management in the ESMF (ESME) Steve Hankin (NOAA/PMEL & IOOS/DMAC) ESMF Team meeting.
1 1 ECHO Overview and Status Enabling Interoperability with NASA Earth Science Data and Services GES DISC User Working Group May 10, 2011 Andrew E. Mitchell.
Data discovery and data processing for environmental research infrastructures Roberto Cossu ENVRI WP4 leader ESA.
EOSDIS Status 9/29/2010 Dan Marinelli, NASA GSFC
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
Creating Documentation and Metadata: Introduction to Metadata and Metadata Standards Lynn Yarmey National Snow and Ice Data Center Version 1.0 February.
Managing Your Data: Assign Descriptive File Names Robert Cook Oak Ridge National Laboratory Section: Local Data Management Version 1.0 October 2012.
National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology Pasadena, California EDGE: The Multi-Metadata.
GEOSS Air Quality Community Infrastructure ESIP, 23 July 2010 Knoxville, TN E.M. Robinson, R.B. Husar, S.R. Falke, E.T. Habermann, A. Warnock, M. Hogeweg,
ESO Developing and Emerging Standards, Practices, and Technologies Yonsook Enloe, Helen Conover, Allan Doyle ESDIS Standards Office 7/14/ Summer.
1 ECHO and EDG Status May 9, 2006 Beth Weinstein, Yonsook Enloe,
A Menagerie of ESIP OpenSearch Clients C. Lynnes, NASA/GSFC K. Keiser, U. Alabama--Huntsville.
Interoperability = Leverage + Collaboration  Chris Lynnes  GES DISC.
An HDF-EOS Data Server Based on OPeNDAP and ECHO Bob Bane, Mohammad Rabi, Weijun Su, Richard Ullman, Jingli Yang, Zhangshi Yin Data Usability Group, NASA/GSFC.
ESIP AQ Cluster Community Components for the Air Quality SBA in AIP-2.
Page 1 CSISS Center for Spatial Information Science and Systems CWIC Development Team Meeting, 2014 CWIC OpenSearch Design and Implementation Yuanzheng.
The Proliferation of Metadata Standards and the Evolution of NASA’s Global Change Master Directory (GCMD) Standard for Uses in Earth Science Data Discovery.
ORNL DAAC SPATIAL DATA ACCESS TOOL Open Geospatial Consortium (OGC) Services Bruce E. Wilson Suresh K. Santhana Vannan Yaxing Wei Tammy W. Beaty National.
What is ECHO? ECHO Open Search ECHO Facts NASA’s Earth Observing System ClearingHOuse (ECHO) acts as the core metadata.
Advertising your data Alecia Aleman 1, Ruth Duerr 2 1 National Aeronautics and Space Administration (NASA) 2 National Snow and Ice Data Center, University.
CWIC Open Search Best Practices Doug Newman (NASA ECHO) CEOS WGISS-37 April 15th 2014 Presenter: Archie Warnock (A/WWW Enterprises)
ESIP Air Quality Jan Air Quality Cluster Air Quality Cluster Technology Track Earth Science Information Partners Partners NASA NOAA EPA (?) USGS.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
Interoperability Day Introduction Standards-based Web Services Interfaces to Existing Atmospheric/Oceanographic Data Systems Ben Domenico Unidata Program.
LP DAAC Overview – Land Processes Distributed Active Archive Center Chris Doescher LP DAAC Project Manager (605) Chris Torbert.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Global Precipitation Data Access, Value-added Services and Scientific Exploration Tools at NASA GES DISC Zhong Liu1,4, D. Ostrenga1,2, G. Leptoukh4, S.
MERRA Data Access and Services
ESIP Discovery – Show & Tell ESIP Summer Meeting 2011 Matt Cechini
ESIP Federated Search Cluster
Introduction to the ESIP Discovery Cluster
GEOSS Air Quality Community Infrastructure
OpenSearch: the data search API for everyone
Proposal for an Obs4ECV Dataset and OpenSearch Capability for ESG
WGISS Connected Data Assets April 5, 2017 Yonsook Enloe
WGISS Connected Data Assets April 9, 2018 Yonsook Enloe
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
Status OpenSearch Standardisation Activities - HMA-S Project
ESIP Winter Meeting 2016 January 2016
OpenSearch and JSON-LD for enhanced Earth observation data and service discovery Dr. Ingo Simonis Workshop on making spatial data discoverable through.
IGARSS 2019 Dr. Ingo Simonis July 2019
WGISS WGISS Connected Data Assets Status Report October, 2019 CWIC Team Eugene Yu (GMU), Archie Warnock (A/WWW), Li Lin (GMU)
Presentation transcript:

Federated Space-Time Query for Earth Science Data Using OpenSearch Conventions ESIP Federated Search Cluster Chris Lynnes Bruce Beaumont Ruth Duerr Hook Hua et al.

Outline Finding Earth science data: why so difficult??? Federated search, past and present Recursive OpenSearch Client and server developments

Many phenomena require space-time searches for distributed data Effect of Arctic Oscillation on precipitation in Greenland –GC-Net station data –AO indices –AIRS atmospheric profiles –NCEP model output, etc. Chaiten volcanic plume –OMI SO2+Aerosols –MODIS Aerosols –MISR Aerosols –CALIPSO Aerosol / Cloud Classification, etc. Data sources include: –International and national data centers –Regional data centers and data collection sites –Value-added providers and individual investigators

Obtaining satellite data today is tedious, hit-or-miss Step 1: Search through multiple directories for the right datasets –“Did I find them all?” Steps 2-N: Foreach data_provider Learn_search_interface() Search_for_data_files() Fetch_data_files() Load_data_into_analysis_tool() End foreach Ideally, you would want your analysis tool to find and fetch data based on the current work context

EOSDIS Version 0 offered “one-stop shopping” in the ‘90s Earth Observing System Data and Information System –Search servers using a common protocol at 8 distributed data centers Datasets Files –Early federated search But: –Slow –Idiosyncratic protocol –Limited clients

Federated search can be better today Simple: facilitates adoption Standards-based, but extensible Machine-callable: enables clients Embeddable –In web pages, documents, workflows, analysis tools…

OpenSearch is a simple, extensible, embeddable, machine-callable convention –“a collection of simple formats for the sharing of search results” OpenSearch Description Document (XML) –Describes URL-based (REST-like) queries –Describes a search engine so that it can be used by search clients (incl. Firefox and IE) OpenSearch response elements –Extend syndication formats (e.g., RSS and Atom) with extra metadata in search results Extensions –Have been proposed for Geospatial and Time queries

Space-time data query works better as a 2-step process Search for datasets, then files within selected datasets Most dataset-level queries have –small results set (dozens) –low "precision": precision = desiderata / total Space-time granule queries for a given dataset have –large results set (tens of thousands), but –high precision Combining both in one step would produce –mammoth results set (dozens * tens of thousands) –with low precision OpenSearch Description Documents provide a path to a recursive two-step search

The ESIP Federated Search Cluster is defining guidelines for a 2-step space time query Earth Science Information Partners –Consortium of >90 organizations working with remotely sensed Earth observation information –Clusters: focus groups to work specific topics Federated Search cluster for ESIP community conventions –2-Step (Recursive) OpenSearch –Atom response details

Recursive OpenSearch begins with a dataset discovery phase Dataset Discovery Granule Search Client Dataset Query Engine OpenSearch Description Document Store Granule Query Engine dataset query dataset results with link to OpenSearch Description Document for each dataset

Dataset results link to OpenSearch Description documents Client Dataset Query Engine Granule Query Engine OpenSearch Description Request OpenSearch Description Document with template for granule queries Granule Search Dataset Discovery OpenSearch Description Document Store dataset query dataset results

Templates from OpenSearch Description Documents enable granule query construction Client Dataset Query Engine Granule Query Engine granule query granule results OpenSearch Description Request Granule Search Dataset Discovery OpenSearch Description Document Store OpenSearch Description Document dataset query dataset results

A client can be as simple as an XSLT Attach a stylesheet to the Dataset OpenSearch Description Document –Renders the document in the browser as a search form

Several groups are developing servers and clients Servers –ACCESS-NEWS –EOS Clearinghouse (ECHO) –Global Hydrology Resource Center –Goddard Earth Sciences Data and Information Services Center (GES DISC) –MODIS Adaptive Processing System –National Snow and Ice Data Center Clients –Mirador (GES DISC) –Talkoot (University of Alabama--Huntsville) –Reference implementation / test script (GES DISC)

Next Steps Integration with Web Services –Format conversion, subsetting, standard data protocols (OPeNDAP, OGC) –Servicecasting: Atom-based approach to advertising services for ESIP data Develop / recruit clients