ECHO: Foundational Middleware for a Science Cyberinfrastructure Robin Pfister Keith Wichmann Beth Weinstein.

Slides:



Advertisements
Similar presentations
RP Designs Semi-Custom e-Commerce Package. Overview RP Designs semi- custom e-commerce package is a complete website solution. Visitors can browse a catalog.
Advertisements

ECHO Services Discussion WGISS 18 September 6-10, 2004 Beijing, Peoples Republic of China Robin Pfister NASA/GSFC.
ECHO Services – Foundational Middleware for a Science Cyberinfrastructure WGISS – March 2005.
© 2006 Blueprint Technologies, Inc. All Rights Reserved ECHO – NASA’s Middleware for an Earth Science Marketplace Overview and Status WGISS 22.
BEDI -Big Earth Data Initiative
MODIS Data at NSIDC MODIS Collection 5/Long Term Data Record Workshop Molly McAllister & Terry Haran January
Matthew Cechini Raytheon - EED ID: IN31C-07.  ECHO Metadata Overview  Introduction  Problem Space  Solutions ISO Lessons Learned – Perceived.
May 17, Capabilities Description of a Rapid Prototyping Capability for Earth-Sun System Sciences RPC Project Team Mississippi State University.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
02/07/2001 EOSDIS Core System (ECS) COTS Lessons Learned Steve Fox
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Android Core Logging Application Keith Schneider Introduction The Core Logging application is part of a software suite that is designed to enable geologic.
NASA/ESA Interoperability Efforts CEOS Subgroup - CINTEX Alexandria, Sept 12, 2002 Ananth Rao Yonsook Enloe SGT, Inc.
ECHO For International Partners March 8, 2005 Robin Pfister Yonsook Enloe.
Earth Observing System Data and Information System (EOSDIS) provides access to more than 3,000 types of Earth science data products and specialized services.
Why We Create Metadata and How it is Useful Bruce Godfrey University of Idaho Library INSIDE Idaho
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
1 OPeNDAP/ECHO Demo Integrating and Chaining services September, 2006 CEOS WGISS 22 Annapolis, MD.
1 EOS Clearinghouse Robin Pfister, NASA/GSFC CEOS WGISS May 10-14, 2004.
1 Interoperability Among EOS Data Gateway, ECHO and CEOS’ INFEO Systems CEOS WGISS Subgroup Meeting May 9, 2002 Frascati Italy Chao-Hsi Chang NASA/EDG.
Why do I want to know about HDF and HDF- EOS? Hierarchical Data Format for the Earth Observing System (HDF-EOS) is NASA's primary format for standard data.
1 1 ECHO Extended Services WGISS – 23 Hanoi, Vietnam May 15, 2007.
U.S. Department of the Interior U.S. Geological Survey Access to MODIS Land Data Products Through the Land Processes DAAC John Dwyer and Carolyn Gacke,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 1 ECHO Overview and Status Enabling Interoperability with NASA Earth Science Data and Services GES DISC User Working Group May 10, 2011 Andrew E. Mitchell.
1 ECHO and EDG Status Sept 12, 2006 Beth Weinstein, Yonsook Enloe,
EOSDIS Status 9/29/2010 Dan Marinelli, NASA GSFC
NASA’s EOSDIS Clearinghouse - ECHO - WGISS – 24. Status ECHO 9 is operational ECHO 10 Focused on a re-architecture of Ingest services.
What is CWIC? Authors: Doug Newman Andrew Mitchell
EOSDIS Status 10/16/2008 Dan Marinelli, Science Systems Development Office.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EOSDIS User Registration System (URS) 1 GES DISC User Working Group May 10, 2011 GSFC, NASA.
1 ECHO and EDG Status May 9, 2006 Beth Weinstein, Yonsook Enloe,
PoDAG XXI: SEEDS SEED: NSIDC Potential Interactions NSIDC DAAC should prepare an evaluation of their desired future roles in "core activities" and in mission.
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
Cyberinfrastructure to promote Model - Data Integration Robert Cook, Yaxing Wei, and Suresh S. Vannan Oak Ridge National Laboratory Presented at the Model-Data.
1 1 ECHO Extended Services February 15, Agenda Review of Extended Services Policy and Governance ECHO’s Service Domain Model How to…
Evolving a Legacy System Evolution of the Earth Observing Data and Information System M. Esfandiari 1, H. Ramapriyan 1, J. Behnke 1, E. Sofinowski 2 1.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
MODIS SDST, STTG and SDDT MODIS Science Team Meeting (Land Discipline Breakout Session) July 13, 2004 Robert Wolfe Raytheon NASA GSFC Code 922.
Science Review Panel Meeting Biosphere 2, Tucson, AZ - January 4-5, 2011 Vegetation Phenology and Vegetation Index Products from Multiple Long Term Satellite.
MODIS Data at NSIDC MODIS Science Team Meeting - Nov. 2, 2006.
What is ECHO? ECHO Open Search ECHO Facts NASA’s Earth Observing System ClearingHOuse (ECHO) acts as the core metadata.
1 CLASS – Simple NOAA Archive Access Portal SNAAP Eric Kihn and Rob Prentice NGDC CLASS Developers Meeting July 14th, 2008 Simple NOAA Archive Access Portal.
SPDF Science Advisory Group - September 29-30, 2005 Page 12/24/2016 9:09:48 PM Services of the Space Physics Data Facility (SPDF) / Sun-Earth Connection.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
Advertising your data Alecia Aleman 1, Ruth Duerr 2 1 National Aeronautics and Space Administration (NASA) 2 National Snow and Ice Data Center, University.
1 ECHO ECHO 9.0 for Data Partners Rob Baker January 23, 2007.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
Virtual Lab Overview 5/21/2015 xxxxxxxxxx NWS/MDL/CIRA.
LP DAAC Overview – Land Processes Distributed Active Archive Center Chris Doescher LP DAAC Project Manager (605) Chris Torbert.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
International Planetary Data Alliance Registry Project Update September 16, 2011.
AIRS Meeting GSFC, February 1, 2002 ECS Data Pool Gregory Leptoukh.
Page 1 CSISS Center for Spatial Information Science and Systems IIB and GCI Meeting CSR Architecture and Current Registration Status Prof. Liping Di Director.
GEOSS Component and Service Registry (CSR)
EOS ClearingHOuse Robin Pfister Code 423, NASA/GSFC
OGSA Data Architecture Scenarios
Flowserve Distributor Online Store & Portal
WGISS Connected Data Assets April 9, 2018 Yonsook Enloe
Flowserve Distributor Online Store & Portal
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
SDMX IT Tools SDMX Registry
Presentation transcript:

ECHO: Foundational Middleware for a Science Cyberinfrastructure Robin Pfister Keith Wichmann Beth Weinstein MODIS Science Team Meeting March 22, 2005

ECHO is… Is an Enabling Framework that allows interoperability among diverse and distributed, data, service and client systems Is a metadata clearinghouse and order broker. In the near-future will also be a granule-level service broker Is an Open System: Enabling a Collaborative Community

Background Users of Earth Science Data were confined to a single user interface built as a one-size-fits-all solution Search and retrieval data access paradigm, searches were executed at the archives. Many factors contributed to slow or non-performance Scientists spent a significant amount of time performing data access activities NASA was paying a single contractor to develop all needed functionality into a single system Messages: Search, Browse & Order EOS Data Gateway User Interface Collection Metadata Catalog & Interoperability Middleware EOSDIS Data Providers Data Holdings Inventory Metadata Catalog Browse Images WWW Browsers

Background Individuals and communities can share data, services and tools. Scientists save time by using tools tailored for their data access needs, cutting out unnecessary steps in the data access process Tools can be developed by anyone, saving NASA funds and increasing quality through more competition ECHO Data & Service Providers - Any Orders Collection & Inventory Catalog + Sample Images Service & Order Broker Provider APIs Client APIs WWW Browsers Data Holdings EDG & Other Clients: Machine - Machine & Human - Machine User Interface Search, Browse & Order Requests

ECHO Context ECHO acts as middleware between resource (data or service) partners and client partners.  Clearinghouse of spatial, temporal, numeric and textual Earth Science metadata  Order Broker  User and provider account service  Services clearinghouse and broker (future)  Capabilities are in line with (and expected to be driven by) GEOSS. Data Partners provide information about their data holdings, and client partners develop software (either machine-machine or human- machine) to access this information. End users who want to search ECHO's metadata must use one of the ECHO clients. OGC Clients (e.g. ESG) MODIS Clients (e.g. NEO, SIMECC) Others…. LP DAAC ORNL DAAC Other Data Partners Collection & Granule Catalog ECHO Browse Images Partner APIs Client APIs

ECHO Context Clearinghouse Catalog CLIENT APICLIENT API SERVICE API PROVIDERAPIPROVIDERAPI ECHO Order & Service Brokers Client Extensibility Data Extensibility Service Extensibility Graphical User Interfaces (applets, active pages, etc.) Machine-to- Machine Interfaces New Data Partners - Can establish policies for their own data New Collections/Data Types Access Mechanisms Types of services: -Data Services: e.g. collection, subsetting - Search Services: e.g. gazetteer, thesaurus (future) - Administrative Services: e.g. accounting (future) Service interactions: - Order Options - Advertised - Brokered (future- ECHO 8.0) - Orchestration/Chaining (future) Service Views: - Service View - Data View

Why use ECHO? The multi-organizational content of ECHO provides a broadened set of resources to a growing number of Earth science applications and interdisciplinary research efforts.  The more resources provided, the richer the set of available resources. ECHO streamlines access to digital data and materials and brokers orders and other services from Clients to Data Partners.  More data access paradigms and approaches are supported. Human-Machine and Machine to Machine “Navigation and discovery” and “Search and retrieval” Subscriptions  Virtual seamless access from the end user’s perspective.  Because scientists will have access to clients tailored to their needs, valuable time can be saved and refocused on research.

Why use ECHO? (cont.) ECHO puts the control of the resources in the hands of the providers of those resources.  ECHO has tools for providers to mange and control access to their contributed resources.  Establish the best spatial search approach (details in back-up slides) for their data (orbital search, geodetic, or Cartesian) Improved availability over previous architecture (details in back-up) ECHO provides tracking services for both the Data and Client Partner. In the future, new technologies can be infused using the services APIs. Back-up slides also illustrate:  Recent performance improvement. Performance improvement is an ongoing effort (more details on availability and performance improvements in back-up slides)

Current State of ECHO Current Operational Version: Version 6.0 is being externally tested Version 7.0 is ready for external test Version 8.0 development is getting started Information on future releases ECHO Web site.

Client API Improvements Client Workshop (March 10-11) kicked off 2005 as the "year of the client".  Prioritize API areas for development of the Client Libraries and the Reference client.  Reference Client will provide a view with 'best practices' code that can be copied and pasted into other applications.  The workshop also encouraged component sharing among the existing client providers. Additionally, We've repackaged the complete set of toolkit source code, java documentation, and libraries.  This repackaging fills in all the remaining "holes" in the toolkit library  Included is a sample demonstration client- this demo client was always there - it is updated to reflect the 6.x changes.

ECHO Holdings Report Operational Holdings Summary as of 3/1/2005 CollectionsGranulesBrowse Public ,178,3945,308,581 Restricted 1766,483,4602,047,530 Total1,40941,661,8547,356,111

ECHO Data Partners Operational  ORNL DAAC (Oak Ridge National Laboratory Distributed Active Archive Center)  SEDAC (Socioeconomic Data and Applications Center)  LP DAAC (Land Processes EOS Core System (ECS) DAAC)  GES DAAC (GSFC Earth Sciences ECS DAAC)  ASF DAAC (Alaska SAR Facility DAAC) In Test  LaRC DAAC (Langley Research Center ECS DAAC)  NSIDC DAAC (National Snow and Ice Data Center ECS DAAC) In Development  PO.DAAC (Physical Oceanography DAAC)  SSC SDP (NASA Stennis Space Center Science Data Purchase)  GES DAAC V0 (GSFC Earth Sciences Version 0 DAAC)

ECHO Client Partners Operational  Mercury EOS – ORNL client  Power User Interface – script for bulk ordering  SIMECC (Simple MODIS ECHO Client) – MODIS data search and order In Test  WIST – EOS Data Gateway using ECHO  ASF ECHO Client - Alaska SAR Facility client  WISRD (Web Interface for Searching, Subsetting, Stitching, Resampling, Regriding, and Reformating Data) – NSIDC Client to search for swath, scene, and gridded data  MODIS Website – Client via existing MODIS website and ORNL’s shopping cart  Data Validation User Interface – Client for MODIS Land Data Validation Team Under development  Annoterra Version2 – Link between EOS newsfeeds, GCMD, and ECHO granules  SNOWI-E – NSIDC client  NEO - NASA Earth Observations client  Invasive Species Data Service  OGC/NSDI Client Adaptor - will allow any OGC/NSDI compliant client to plug in. Earth-Sun Gateway (ESG) is an example ) There are plans for other clients (MODIS rapid response, Adaptive Sensor Fleet)

Client Partner Support The ECHO Operations Team (ECHO Ops) is the point of contact for direct interaction between ECHO, its Partners, and end users ECHO Ops is responsible for the operation and maintenance of the ECHO operational and partner test systems ECHO Ops general support for Partners includes:  Assisting Partners in understanding the ECHO API and DTDs  Ingest management and accounting - weekly summaries and metrics on all ingest jobs  Problem tracking and resolution  Advertising Data Partner holdings and availability of new datasets  Promoting and engaging new Client Partners Reference client in progress (best practices code reuse) Improve website content to provide “front office” for client developer support  Improve overview materials  Create sense of thriving project and community  Reorganize developer reference materials into developer center Develop multilevel documentation to ease use of APIs

Contact Information ECHO Ops   ECHO website   Holdings Summary, Upcoming Functionality, APIs and DTDs  Real-time status ECHO mailing lists  echo-all, echo-status, echo-client, echo-data ECHO Technical Committee (ETC) Meetings  3:00pm ET  Contact ECHO Ops for dial-in information

DVUI Demonstration Data Validation User Interface Proof-of-concept Illustrates ECHO can support very different search and access paradigms from EDG Resulting from this effort were improvements to ECHO that are being released in upcoming iterations This is an example of something that couldn’t be done in the old heritage architecture Potential re-users of this concept (or code) need to make sure that the data they need for their application are made visible by the providers of those datasets to the potential users of the client (this was a major reason DVUI wasn’t able to go operational before)

Backup

DVUI Proof-of-Concept Client

Architecture Overview: System Drivers Design for Change - with ever-changing technology, science needs and policies we needed to build a highly-evolvable system Present an open API for organizations to connect their own data or service or to build user interfaces for specialized views of these resources Make it easy for providers of Earth Science data and services to participate in the system Provide searches that respond quickly Broker orders for both data and services Minimize operational costs Build upon advances in industry and use e-commerce systems as a model Build a system that can be scaled up to handle large numbers of requests

Access Control Lists ECHO supports allowing the Data Partner to restrict visibility of their Collection or Granule metadata, as well as the ability to order Since ECHO contains 10s of millions of records, granules to be access controlled are determined based on a set of rules  Fixed time rules (i.e. hide data acquired between September 11th, 2001 and September 30th, 2001)  Relative time rules (i.e. prevent orders for data for 30 days after it is produced)  Restriction flag setting  Specific collection or granule Access controls are evaluated at the last possible moment so that changes are taken into account Providers block access to everyone, and then grant access to groups ECHO provides a separate function for managing what users are in a group PUMP (Provider User Management Program) is a GUI tool for managing access controls and other Data Partner related functions

About Spatial Data ECHO leverages Oracle’s spatial capabilities to perform spatial search and to represent Partner metadata Both collections and granules can be spatially represented  If a collection or granule represents the entire Earth, there is no need to spatially index it  ECHO is asking Providers to set a spatial keyword to “Global” in order to indicate this condition Version 5.5 will have a special Global flag for the Partner to set Oracle 9i version 2

Cartesian, Geodetic and Orbit Models Cartesian Model  The Earth is represented by a rectangle (-180, -90, 180, 90)  Spatial objects cannot cross the International Date Line  Spatial objects cannot cross the poles  Polygons can be as large as the whole Earth Geodetic Model  The Earth is represented by a spheroid  Spatial objects may cross the International Date Line  Spatial objects may cross the poles  No single polygon may have an area equal to or larger than half of the Earth’s area Orbit Model

Cartesian View of Sample Polygon

Geodetic View of Sample Polygon

Services Provided For Searching Spatial Search  Cartesian  Geodetic  Orbit-based Temporal Range Searches Keyword Searches Numeric Searches Product Specific Attribute Searches Extensibility to support varying, specialized inventory search capabilities. E.g spatial models

Improved Availability: Old Architecture 200TB 8TB 100TB 1PB 1GB Net Result: 200 terabytes of data not searched. Search time dependent on lowest common denominator. Partner down for maintenance, network not available, etc. Data Partner Data Partner Data Partner Data Partner Data Partner EDG

Improved Availability: ECHO Architecture Partner down for maintenance, network not available, etc. 200TB (35MB New) 8TB (2MB New) 100TB (0 New) 1PB (670MB New) 1GB (100MB New) Net Result: 35 megabytes of data not searched. Search time dependent on ECHO only. ECHO Client EDG Client Data Partner Data Partner Data Partner Data Partner Data Partner

Online Access or Order Some Data Partners make the actual data available online. In this case, the OnlineAccessURL is included in the query result, and users can access the data without going through the order process provided in Order Entry Service. If the data is not online, the Order Entry Service allows users to create, modify, quote, and submit orders. The user can also check the status of the order if it is updated by the Data Partner. One order consists of one or more provider orders. Each provider order represents the catalog items that a user wants from a particular Data Partner. Each provider order consists of one or more order line items, with each line item describing the catalog item that is being requested including any requested options for that item.  Catalog Items in ECHO are either granules or collections  Only items that are orderable are Catalog Items

Subscription Service The subscription service automatically keeps users or clients informed of updated metadata  Don’t have to issue a query every night  Only updated metadata is sent The user can subscribe to the following options  Collection metadata updates of one or all collections  Granule metadata updates of a collection  Both collection metadata and granule metadata updates of a collection The metadata can be delivered by either FTP or Must be a registered user to use this service  address of user is used to report errors The subscription service honors the Access Control Lists by only delivering the data that the registered user has access to

Query Performance GranuleUR  # of Runs  Min:  Max:  Mean: DataSetID, Browse Only  # of Runs  Min:  Max:  Mean: DataSetID, Temporal  # of Runs  Min:  Max:  Mean: DataSetID, Temporal, Spatial  # of Runs  Min:  Max:  Mean:

System Availability 12/1/ /1/2005 Operational System Uptime  Overall average: 98.5%  December 2004: 96.4%  January 2005: 99.6%  February 2005: 99.4% Partner Test System Uptime  Overall average: 99.8%  December 2004: 99.8%  January 2005: 99.8%  February 2005: 99.7%

WIST Warehouse Inventory Search Tool – general ECHO client based on EDG GUI WIST 0.2  ECHO 6.0 compatible  June 1, 2005 WIST 0.3  ECHO 7.0 compatible  Sept 1, 2005 Draft of EDG to WIST transition plan  Distributed to the DAAC Managers this week  Telecons