OPeNDAP-Unidata Development of DAP4 (a Data Access Protocol) Describing Progress and Seeking Input at the ESIP Summer Meeting 2012 by Dave Fulker (OPeNDAP.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

James Gallagher OPeNDAP 1/10/14
OASIS OData Technical Committee. AGENDA Introduction OASIS OData Technical Committee OData Overview Work of the Technical Committee Q&A.
Chapter 10: Designing Databases
Unidata Seminar Series - 30 January 2004 OPeNDAP and THREDDS: Access and Discovery of Distributed Scientific Data Yuan Ho Ethan Davis UCAR Unidata.
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
View, through an architectural lens, of OPeNDAP’s Data Access Protocol (DAP2) A candidate OGC Standard (OGC Pending Document ) by James Gallagher.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.)
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
PHP (2) – Functions, Arrays, Databases, and sessions.
1 COS 425: Database and Information Management Systems XML and information exchange.
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Serving unstructured grids using OPeNDAP: Using server-side operations to subset and subsample data Christopher Barker NOAA Office of Response & Restoration.
OPeNDAP Present and Future An Overview Encompassing Current Projects & Potential New Directions Dave Fulker and James Gallagher.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
Session 2: Using OPeNDAP-enabled Applications to Access Australian Data Services and Repositories eResearch Australasia 2011, ½ Day Morning Workshop, Thursday.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Chapter 3 The Relational Model Transparencies Last Updated: Pebruari 2011 By M. Arief
Toward Rich, User- Defined Aggregation & Subset-Selection Services Dave Fulker, President, OPeNDAP, Inc ESIP Summer Meeting on 9-12 July 2013 Thursday.
OPeNDAP and the Data Access Protocol (DAP) Original version by Dave Fulker.
Implementation of Model Data Interoperability for IOOS: Successes and Lessons Learned Rich Signell USGS Woods Hole, MA / NOAA Silver Spring USA Model Data.
An Introduction to Software Architecture
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Coverages and the DAP2 Data Model James Gallagher.
Weathertop Consulting, LLC Wednesday, January 14, 2009 IIPS 11A.2 1 A General Purpose System for Server-side Analysis of Earth Science Data Roland Schweitzer.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII10/17/2008.
Integrating netCDF and OPeNDAP (The DrNO Project) Dr. Dennis Heimbigner Unidata Go-ESSP Workshop Seattle, WA, Sept
DAP4 James Gallagher & Ethan Davis OPeNDAP and Unidata.
Unidata TDS Workshop THREDDS Data Server Overview
1 NASA CEOP Status & Demo CEOS WGISS-24 Oberpfaffenhofen, Germany October 15, 2007 Yonsook Enloe.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
HDF4 OPeNDAP Project Progress Report MuQun Yang and Hyo-Kyung Lee 1 HDF Developers' Meeting11/24/2015.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
A Data Access Framework for ESMF Model Outputs Roland Schweitzer Steve Hankin Jonathan Callahan Kevin O’Brien Ansley Manke.
1 NASA CEOP Final Summary CEOS WGISS-26 Boulder, Colorado September 23, 2008 Yonsook Enloe
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1 HDF and HDF-EOS Workshop XII10/17/2008.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
Semantic Phyloinformatic Web Services Using the EvoInfo Stack Speaker: John Harney LSDIS Lab, Dept. of Computer Science, University of Georgia Mentor(s):
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
Data Interoperability at the IRI: translating between data cultures Benno Blumenthal International Research Institute for Climate Prediction Columbia University.
Weathertop Consulting, LLC Server-side OPeNDAP Analysis – Concrete steps toward a generalized framework via a reference implementation using F-TDS Roland.
11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.
LAS and THREDDS: Partners for Education Roland Schweitzer Steve Hankin Jonathan Callahan Joe Mclean Kevin O’Brien Ansley Manke Yonghua Wei.
OGC Web Services with complex data Stephen Pascoe How OGC Web Services relate to GML Application Schema.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
Update on Unidata Technologies for Data Access Russ Rew
THREDDS Data Server (TDS) and Data Discovery John Caron Unidata/UCAR May 15, 2006.
TSDS (HPDE DAP). Objectives (1) develop a standard API for time series-like data, (2) develop a software package, TSDS (Time Series Data Server), that.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Logical Database Design and the Rational Model
DAP+NETCDF Using the netCDF-4 Data Model
XML QUESTIONS AND ANSWERS
Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)
Remote Data Access Update
Future Development Plans
OPeNDAP/Hyrax Interfaces
Presentation transcript:

OPeNDAP-Unidata Development of DAP4 (a Data Access Protocol) Describing Progress and Seeking Input at the ESIP Summer Meeting 2012 by Dave Fulker (OPeNDAP President)

2 Overarching Concept of OPeNDAP’s Data Access Protocol (DAP): Clients Get Only Needed Data, When They Need them Accessing data through web services (i.e., URL ≈ dataset) Appending query strings to invoke server functions, esp. subsetting Getting responses of 2 major types: Metadata - dataset descriptions & catalogs (textual) Content - values and metadata (binary or textual) Using responses in diverse client contexts, e.g., MATLAB maps DAP responses directly to its internal math types DAP libraries (netCDF, e.g.) simplify the programming of apps

3 Some of DAP Users’ Distinguishing Needs Data often depict (scientific) phenomena where Geospatial maps are among the useful views But other views are important as well Coordinates often are 2-, 3-, 4- & even 5- dimensional These may include (time-dependent) coordinate-proxies Users often wish to use data whose source files Are in a variety of inconvenient formats With insufficient or obsolete metadata

4 Present State of DAP The DAP2 specification (after nearly 2 decades!) has multiple contemporary realizations on servers and clients Clients include: MATLAB, GRADS, IDL, IDV... Python apps that employ the PyDAP library Fortran, C, C++ & Java apps that employ the netCDF library Servers include: PyDAP, ERDAP... (often with augmented services) Most widely deployed: TDS (Unidata) & Hyrax (OPeNDAP) Widely used by data providers and users, including cases where DAP servers provide translations of inconveniently formatted source files

5 Branching: Hyrax & THREDDS Multiple implementations of a protocol often is considered a good thing (per IETF, e.g.) This can be a problem, however, if the implementations embody excessive redundancy or confuse users Our view: co-existence of TDS (Unidata) & Hyrax (OPeNDAP) reflects some redundancy & creates some inconsistencies for users Need #1: achieve conformance ⇒ consistency for users Need #2: more software reuse ⇒ more advancement

6 NOAA/BAA grant for OPeNDAP-Unidata Linked Servers (OPULS) Goal 1: OPeNDAP/Unidata conformance & linkage New data-model/protocol specs (DAP4), with conformance tests & extensibility demos: Modes of asynchronous access (to near-line data, e.g.) Server-side subsetting of data on irregular meshes Goal 2: common software for OPeNDAP & Unidata servers Work yet to begin...

7 OPeNDAP Data-Type Philosophy (reflected in DAP2 & now DAP4) Data model has few data types For simplified programming & lowered risk of errors Data types are deliberately domain-neutral For better trans-domain utility & programmer uptake But they allow both syntactic & semantic structures/metadata These Types do in fact support domain needs NetCDF-like (can represent functions on 4-D domains, e.g.) Sequences & selections match DBMS sensibilities

8 DAP4 Data Model (simplified) dataset ≈ unique URL (with no query string) a dataset holds a hierarchy of groups, each a namespace /container for variables, dimensions & attributes each variable comprises a name (unique in the group) a type (which applies to all values) value(s) (organized as dimensioned arrays) attributes* (optional)  Attributes are like variables but with a semantic purpose, making a variable or a group more meaningful. E.g., variables often have an attribute (of type string) named “units.”

9 DAP4 Data Types & Relations as in C or Java, e.g., a variable’s type may be structured or atomic: integer, float, byte, string... DAP variables may be (semantically) related to one another via two key grouping constructs relations link 1-D variables as columns in a table; sampled functions link coordinate-map variables (domain) to function-value variables (ranges) having common indexes in turn, relations can be linked via variables that serve as foreign keys

10 DAP4 Operations (invoked as query strings) 3 kinds of constraint expressions (i.e. query strings) yield subsets or invoke (server-side) processing projection (returns a subset) selection (returns a subset) function (today’s town hall!) specify included variables (by name) as well as indices of included array elements limit tuples (rows) of a relation to those with variable values satisfying a DBMS- style predicate invoke server functions to calculate a return [we intend to target critical needs]

11 Like netCDF, but as a Web service, users may Skip indices Limit index ranges Reduce dimensionality OPeNDA P Projecti on Operato rs

12 Other DAP-Related Serverices Note: these were not part of the DAP2 specification... Many DAP-based servers (from Unidata & OPeNDAP, e.g.) Accept multiple types of data as inputs Offer several views of them over the web Native DAP web services: for DAP-enabled clients Source format (lossless): netCDF-to-netCDF or HDF4-to-HDF4, e.g. Alternative web services: html (browser views), XML, WCS, etc. Town-Hall: what other services should be offered?

13 Other OPULS Accomplishments Irregular mesh subsetting Progress with U WA (Bill Howe) To be released soon... Asynchronous access Preliminary trials... Cloud-based service provision (with parallelism) MODIS reprojection (related, but not OPULS funding)

14 OPULS Process Transparency Public documentation updated weekly (just Google OPULS!) Advisory committee Jeff de La Beaujardiere, James Frew, Mike Folk, Steve Hankin, Eric Kihn, Rich Signell Welcoming input (per this town hall)

15 Town-Hall Questions What server functions ought to be specified in the DAP4 protocol? Simple point-wise mathematics Mathematics on sampled functions Truly domain-specific functions (involving the datum, e.g.) Which (other) web-service protocols should be leveraged by DAP servers, & what are the pertinent use cases? To facilitate open search (exploiting ATOM), e.g. To facilitate semantic analysis (providing RDF output, e.g.) Others?

16 i thank you OPeNDAP, Inc g g increasing data’s visibility increasing data’s visibility OPeNDAP, Inc g g increasing data’s visibility increasing data’s visibility