1 Earth System Grid Center for Enabling Technologies OPeNDAP Services for ESG March 9, 2016 Peter Fox, Patrick West, Stephan Zednik RPI Performance Measures.

Slides:



Advertisements
Similar presentations
웹 서비스 개요.
Advertisements

LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Data Management Expert Panel - WP2. WP2 Overview.
Chapter 17: WEB COMPONENTS
OPeNDAP in the Cloud Optimizing the Use of Storage Systems Provided by Cloud Computing Environments OPeNDAP James Gallagher, Nathan Potter and NOAA/NODC.
OPeNDAP’s Server4 Building a High Performance Data Server for the DAP Using Existing Software Building a High Performance Data Server for the DAP Using.
A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
Copyright 2004 Monash University IMS5401 Web-based Systems Development Topic 2: Elements of the Web (g) Interactivity.
Gateway Node Security Block Diagram ESG Gateway Node Confluence Server OpenID Filter Authz Service Callout Authorization Service (SSL) F-TDS OpenID Filter.
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Web browsers.
INTERNET DATABASE Chapter 9. u Basics of Internet, Web, HTTP, HTML, URLs. u Advantages and disadvantages of Web as a database platform. u Approaches for.
INTERNET DATABASE. Internet and E-commerce Internet – a worldwide collection of interconnected computer network Internet – a worldwide collection of interconnected.
Definitions, Definitions, Definitions Lead to Understanding.
Kitchen Sinks, Plumbing and Virtual Observatories Peter Fox June 4, 2010 – CSIRO Aspendale.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Hyrax Installation and Customization Dan Holloway James Gallagher.
OPeNDAP Hyrax Back-End Server (BES) Authentication and Authorization Patrick West
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
FALL 2005CSI 4118 – UNIVERSITY OF OTTAWA1 Part 4 Web technologies: HTTP, CGI, PHP,Java applets)
Introduction to ASP.NET. Prehistory of ASP.NET Original Internet – text based WWW – static graphical content  HTML (client-side) Need for interactive.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
Principles of Computer Security: CompTIA Security + ® and Beyond, Third Edition © 2012 Principles of Computer Security: CompTIA Security+ ® and Beyond,
Hyrax Architecture Two cooperating processes: –Front-end provides DAP interface –Back-end reads data Both parts can be customized –Front-end: different.
Chapter 4: Core Web Technologies
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
OOI CyberInfrastructure: Technology Overview - Hyrax January 2009 Claudiu Farcas OOI CI Architecture & Design Team UCSD/Calit2.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
OPeNDAP Developer’s Workshop Feb OPeNDAP 4 Data Server – Hyrax James Gallagher and Nathan Potter 21 Feb 2007.
Lecturer: Prof. Piero Fraternali, Teaching Assistant: Alessandro Bozzon, Advanced Web Technologies: Struts–
GridFE: Web-accessible Grid System Front End Jared Yanovich, PSC Robert Budden, PSC.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Topaz : A GridFTP extension to Firefox M. Taufer, R. Zamudio, D. Catarino, K. Bhatia, B. Stearn University of Texas at El Paso San Diego Supercomputer.
1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
Web Services BOF This is a proposed new working group coming out of the Grid Computing Environments Research Group, as an outgrowth of their investigations.
1 Welcome to CSC 301 Web Programming Charles Frank.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Web Portal Design Workshop, Boulder (CO), Jan 2003 Luca Cinquini (NCAR, ESG) The ESG and NCAR Web Portals Luca Cinquini NCAR, ESG Outline: 1.ESG Data Services.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
1 WWW. 2 World Wide Web Major application protocol used on the Internet Simple interface Two concepts –Point –Click.
OPeNDAP Hyrax Harnessing the power of the BES OPeNDAP Hyrax Back-End Server Patrick West
Web Technologies Lecture 8 Server side web. Client Side vs. Server Side Web Client-side code executes on the end-user's computer, usually within a web.
JS (Java Servlets). Internet evolution [1] The internet Internet started of as a static content dispersal and delivery mechanism, where files residing.
CSI 3125, Preliminaries, page 1 SERVLET. CSI 3125, Preliminaries, page 2 SERVLET A servlet is a server-side software program, written in Java code, that.
1 Earth System Grid Center for Enabling Technologies ESG-CET Security January 7, 2016 Frank Siebenlist Rachana Ananthakrishnan Neill Miller ESG-CET All-Hands.
Interactive Data Analysis on the “Grid” Tech-X/SLAC/PPDG:CS-11 Balamurali Ananthan David Alexander
April 2008ESG All-Hands meeting ESG Product Services Overview of components Issues in need of discussion Steve Hankin, NOAA/PMEL Roland Schweitzer, Weathertop.
Weathertop Consulting, LLC Server-side OPeNDAP Analysis – Concrete steps toward a generalized framework via a reference implementation using F-TDS Roland.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
OPeNDAP’s Server4: Building a High Performance Data Server for the DAP Using Existing Software James Gallagher*, Nathan Potter*, Patrick West**, Jose Garcia**
9/21/04 James Gallagher Server-Side: The Basics This part of the workshop contains an overview of the two servers which OPeNDAP has developed. One uses.
Update on Unidata Technologies for Data Access Russ Rew
TSDS (HPDE DAP). Objectives (1) develop a standard API for time series-like data, (2) develop a software package, TSDS (Time Series Data Server), that.
12. DISTRIBUTED WEB-BASED SYSTEMS Nov SUSMITHA KOTA KRANTHI KOYA LIANG YI.
Using Community-Driven Metadata Conventions with On-Line Data
CX Introduction to Web Programming
Data Bridge Solving diverse data access in scientific applications
Other Services in Hyrax
Distribution and components
Processes The most important processes used in Web-based systems and their internal organization.
Remote Data Access Update
OPeNDAP’s Server4: Building a High Performance Data Server for the DAP
OPeNDAP/Hyrax Interfaces
Presentation transcript:

1 Earth System Grid Center for Enabling Technologies OPeNDAP Services for ESG March 9, 2016 Peter Fox, Patrick West, Stephan Zednik RPI Performance Measures x.x, x.x, and x.x

2 Earth System Grid Center for Enabling Technologies: (ESG- CET) In ESG II (in regard to data)  Server side aggregation  DAP object transfer via HTTP and GridFTP  GSIFTP integration, myProxy support  OPeNDAP-g -> led to architecture of OPeNDAP/ Hyrax (primarily BES is from OPeNDAP-g)

3 Earth System Grid Center for Enabling Technologies: (ESG- CET) Requirements leading to OPeNDAP-g  Separation of the core Data Access Protocol (DAP) from the transport protocol (HTTP).  High Performance Computing. The previous CGI based servers did not have the capacity required by ESG. Error and memory handling added.  Security. Once the OPeNDAP was independent of the transport protocol, adding security was possible by relying on the Globus gsiFTP system.  Aggregation. OPeNDAP 3.0 did not operate on aggregated datasets. OPeNDAP-g does.  Transport protocol independence and HPC were incorporated back into OPeNDAP leading to the current version. Security and aggregation remain an ESG only feature.

4 Earth System Grid Center for Enabling Technologies: (ESG- CET) OPeNDAP-g Architecture DispatcherBES  OPeNDAP-g Dispatcher (e.g. ESG Front-end Server)  Receives requests and asks the BES to fill them  Uses Apache Modules  Does not directly ‘touch’ data, handles URLs Data  Back End Server (BES)  Reads data files, Databases, et c., returns info  May return DAP objects or other data (netCDF)

5 5 OPeNDAP Hyrax Architecture OLFSBES  OPeNDAP Lightweight Front end Server (OLFS)  Receives requests and asks the BES to fill them  Uses Java Servlets  Does not directly ‘touch’ data  Multi-protocol Data  Back End Server (BES)  Reads data files, Databases, et c., returns info  May return DAP2 objects or other data  Does not require web server Client

6 6 GridFTP DAP2 GridFTP DAP2 HTTP DAP2 HTTP DAP2 ASCII output HTML form Info output OPeNDAP Lightweight Front end Server THREDDS** Request Formulation** Request from client Response to client BES SOAP-DAP (HTTP) DAP2 (GridFTP, HTTP) RDF, OWL, JSON (HTTP) PML output

7 7 Hyrax/ Back-end Server Network Protocol and Process start/stop activities Data Store Interfaces e.g. IOSP BES Framework PPT/PPTS* Initialization/ Termination DAP2 Access NetCDF3HDF4RDF/ SPARQL … Provenance Commands** e.g. server-side, Ferrett BES Commands/ XML Documents *PPT is built in (other protocols) **Some commands are built in Data Catalogs

8 Earth System Grid Center for Enabling Technologies: (ESG- CET) OPeNDAP-g services for ESG  Data access via Dataportals. In this Use Case users interact with the portal, browse the catalogs and decide what data to download. The portal passes the request to OPeNDAP-g which executes it and returns the data to the portal. The portal returns a URL to the user to download the data.  Data access via netCDF library. In this Use Case users link their applications with OPeNDAP-g client library for netCDF. The user can “open” a URL that refers to ESG data. The library fetches the data for the user and makes it a local netCDF file  The performance requirements for ESG II were met. However, ESG-CET scales up these requirements.

9 Earth System Grid Center for Enabling Technologies: (ESG- CET) Status of the Community OPeNDAP Server Software  Together the OLFS and the BES are known as Hyrax  Hyrax 1.6 provides support for NcML-based aggregation  Data response streamed back as netCDF file  RDF response type  Updated DDX response type (Data Definition XML)  Beginning development of DataDDX – multi-mime response with data and DDX  Full security audit and static code analysis certification to comply with NOAA and NASA requirements

10 Earth System Grid Center for Enabling Technologies: (ESG- CET) ESG-CET and data  Large data sets, numbers and sizes High performance Flexible architecture, both client and several types and numbers of servers Aggregation Server side operations Multiple transport protocol options  Full ESG security support as well as loose federation  Read-only client access via API (netCDF/CDM) To satisfy the new goals, the OPeNDAP services for ESG have been re-architected. We now use parts of the standard OPeNDAP framework Hyrax, focusing on high performance for the client side and extended flexibility.

11 ESG-CET and Products (server side functions)  Goal: drop in replacement for the TDS part of FTDS in LAS  Requires netCDF-Java Input-Output Service Provider (IOSP) adapter for Hyrax/BES  Use case examination will be required Earth System Grid Center for Enabling Technologies: (ESG- CET)

12 Security Infrastructure status  OPeNDAP BES Security SSL authentication between gridFTP middle tier and the BES. No persistent SSL connections are maintained  RNI Integration with ESG Security Infrastructure RNI client supports gsiFTP connections to ESG GridFTP servers ESG GridFTP server handles authentication of user Neill's ESG GridFTp authz callout plugin handles authorization of data request ESG/RNI GridFTP DSI module handles data request, forwards request to ESG BES server running RNI module Earth System Grid Center for Enabling Technologies: (ESG- CET)

13 Client status  RNI Version 0.1 implemented using netCDF version 3 and OPeNDAP’s libnc-dap  In communication with Unidata regarding integration of RNI client with new NetCDF version 4  Developing ncml aggregation in both client and server RNI  Full ESG security support Earth System Grid Center for Enabling Technologies: (ESG- CET)

14 Earth System Grid Center for Enabling Technologies: (ESG- CET) The Remote NetCDF Invocation (RNI)  The client is the netCDF library. It has exactly the same API as the standard C library netCDF, but it can deal with local files or files reachable via HTTP, PPT or gridFTP.  The third tier, the BES server can be reached only via PPT. NetCDF services for all NetCDF calls are implemented a a BES module.  The middle tier, acts like a proxy between the RNI client and server and deals with security.

15 Earth System Grid Center for Enabling Technologies: (ESG- CET) RNI Architecture CLIENT DATA GridFTP OPeNDAP BES NetCDF Library RNI Module connection acts like RNI Library

16 Earth System Grid Center for Enabling Technologies: (ESG- CET) Characteristics of the RNI as part of a data access system  Full Support of standard OPeNDAP URLs. RNI is being developed with the integrated Unidata/OPeNDAP netCDF library (and CDM)  Transparent access to either standard netCDF files and aggregated datasets via the NetCDF Markup Language (NCML).  For remote containers, all write operations are disable for security. That is, for HTTP/HTTPS, PPT and gridFTP/gsiFTP the RNI system is a read only API.  RNI utilizes Just in Time access. Caching is only for metadata. No pre-fetching of data.  RNI transparently accesses secure (gsiFTP, HTTPS) or insecure (gridFTP, HTTP) remote data.

17 Earth System Grid Center for Enabling Technologies: (ESG- CET) Status of the RPI work (OPeNDAP)  The primary accomplishments for this subproject in the past year has been The complete request-response for all netCDF API calls has been completed, that is, the two ends have been developed. We have highly optimized core components of the standard OPeNDAP framework to support the performance goals. We have established the knowledge of how the middle tier (the proxy) will be incorporated into the complete system. Our work was presented in AGU fall 2007 and EGU  So far all of the goals established for the first stage of the project has been completed.  Next stage integration with product server for gateway and data node data access Functional (with enhancements) replacement for TDS

18 Future, will not elaborate  Storage Resource Manager / DMLite as a client  Return as RDF  Return as PML (Provenance – Proof Markup Language) Earth System Grid Center for Enabling Technologies: (ESG- CET)