OPeNDAP in the Cloud Optimizing the Use of Storage Systems Provided by Cloud Computing Environments OPeNDAP James Gallagher, Nathan Potter and NOAA/NODC.

James Gallagher OPeNDAP 1/10/14
Presentation transcript:

OPeNDAP in the Cloud: Optimizing the Use of Storage Systems Provided by Cloud Computing Environments
OPeNDAP: James Gallagher, Nathan Potter
NOAA/NODC: Deirdre Byrne, Jefferson Ogata, John Relph
26 June 2013

Cloud Systems Now*
Providers: IBM, Microsoft, Amazon, Google, Rackspace, …
Microsoft: Azure “…handles 100 petabytes of data a day”
Amazon: “…hundreds of thousands of users”
Netflix: “…stopped building its own data centers in 2008;” all in Amazon by 2012
Snapchat: 4000 pictures per second; “…never owned a computer server.” (Google cloud)
*Quentin Hardy, “Google Joins a Heavyweight Competition in Cloud Computing,” NY Times, 3 December 2013

Why use OPeNDAP?
The OPeNDAP request is smaller and contains just the data the person wants.
In cloud systems, cost is a function of data transfer in addition to data stored, so smaller, targeted requests reduce costs.
OPeNDAP request: 4% download
Full dataset: 100% download
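The cost argument on the slide above can be worked through with a little arithmetic. This is a minimal sketch, not a real pricing model: the dataset size and the per-gigabyte transfer price are assumptions chosen for illustration, and only the 4% and 100% fractions come from the slide.

```python
def transfer_cost(dataset_gb: float, fraction: float, price_per_gb: float) -> float:
    """Cost of transferring `fraction` of a dataset out of the cloud."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return dataset_gb * fraction * price_per_gb


# Assumed inputs (hypothetical, not from the slides):
DATASET_GB = 50.0      # size of the full dataset
PRICE_PER_GB = 0.10    # transfer price in dollars per GB

full_cost = transfer_cost(DATASET_GB, 1.00, PRICE_PER_GB)    # full dataset download
subset_cost = transfer_cost(DATASET_GB, 0.04, PRICE_PER_GB)  # targeted OPeNDAP request

print(f"full download: ${full_cost:.2f}, subset request: ${subset_cost:.2f}")
```

Whatever the actual price, the transfer cost scales linearly with the fraction requested, so the subset request costs 4% of the full download under this simple model.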

NOAA Environmental Data Management Conceptual Cloud Architecture*
Potential locations of cloud-enabled OPeNDAP instances
*Adapted from NOAA Environmental Data Management Framework Draft v0.3, Appendix C - Dr. Jeff de La Beaujardière, NOAA Data Management Architect

Constraints
No vendor lock-in!
No stovepipes! Flexible storage method
What will be the client of 2020?
Hierarchical/human browsable
(Diagram labels: file, dataset, file)

Data stores: S3 and Glacier
S3
o Spinning disk with a flat file system
o Designed to make web-scale computing easier
Glacier
o Near-line device with 4-hour (or >) access times
o Secure and durable storage
EC2
o EC2 was used to run the OPeNDAP data server
o Linux

Using S3 as a Data Store
(Diagram labels: Catalog, Data, S3, HTTP GET & HEAD requests)
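As an illustration of the HTTP GET and HEAD requests named on the slide above, the sketch below builds both kinds of request with Python's standard library. The bucket URL and object keys are hypothetical placeholders, and nothing is sent over the network; this is not the server's actual code.

```python
import urllib.request

# Hypothetical bucket endpoint; the real bucket name is not given in the slides.
BUCKET_URL = "https://example-bucket.s3.amazonaws.com"


def build_request(key: str, method: str = "GET") -> urllib.request.Request:
    """Build a GET or HEAD request for one object in the bucket."""
    if method not in ("GET", "HEAD"):
        raise ValueError("only GET and HEAD are used here")
    return urllib.request.Request(f"{BUCKET_URL}/{key.lstrip('/')}", method=method)


# HEAD asks only for the object's headers (size, modification time, and so on).
head_request = build_request("catalog/index.xml", method="HEAD")
# GET asks for the object itself.
get_request = build_request("data/example.nc")

# To actually send a request (requires network access and a real bucket):
#   with urllib.request.urlopen(head_request) as response:
#       size = response.headers.get("Content-Length")
```

A HEAD request is useful for checking whether an object exists or has changed without paying to transfer its body.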

Web requests
(Diagram: a catalog or data request goes to S3; S3 returns an XML or data file)

OPeNDAP Catalog requests
To enhance performance, data were accessed from S3 only when not already cached.
(Diagram labels: User, Catalog Request, OPeNDAP Server, EC2, catalog cache, data cache, Catalog Access, S3, XML File, THREDDS catalog or HTML)

OPeNDAP Data requests
To enhance performance, data were accessed from S3 only when not already cached.
(Diagram labels: User, Data Request, OPeNDAP Server, EC2, catalog cache, data cache, Data Access, S3, Data File, Data Slice)
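The "fetch from S3 only when not already cached" behavior on the two slides above is a read-through cache. The sketch below shows the idea under stated assumptions: the class name, the in-memory dictionary, and the stand-in fetch function are all hypothetical, and the real server's cache (on the EC2 instance) is not described in enough detail to reproduce.

```python
from typing import Callable, Dict


class ReadThroughCache:
    """Return cached bytes when present; otherwise fetch once and remember."""

    def __init__(self, fetch: Callable[[str], bytes]) -> None:
        self._fetch = fetch                   # called only on a cache miss
        self._store: Dict[str, bytes] = {}    # in-memory stand-in for the local cache
        self.misses = 0                       # number of times the backing store was hit

    def get(self, key: str) -> bytes:
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._fetch(key)
        return self._store[key]


def fake_s3_fetch(key: str) -> bytes:
    """Stand-in for an HTTP GET against S3 (hypothetical, no network access)."""
    return f"contents of {key}".encode()


cache = ReadThroughCache(fake_s3_fetch)
first = cache.get("data/example.nc")   # miss: goes to the backing store
second = cache.get("data/example.nc")  # hit: served from the cache
```

The same class could front both the catalog cache and the data cache; a production version would also need eviction and a way to notice stale objects, which this sketch leaves out.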

Observations
S3FS & Amazon's APIs: vendor lock-in
XML catalogs were flexible:
o Support both direct web and… subsetting server access
o Likely adaptable to other use-cases
o Easily support hierarchical structure
o Catalogs didn't need to be stored in S3

Glacier and Asynchronous Responses
To use Glacier, a web service protocol must support asynchronous access!
Glacier is a near-line device, not a spinning disk.
Support via protocol is not enough: typical use cases cannot be met without caching ‘metadata’
o To support web interfaces/clients, DAP metadata objects should be cached
o To support smart clients, may need domain data in cache
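One way to picture the asynchronous access described above: metadata is answered immediately from a cache, while a request for data still in the archive gets a "come back later" reply. The sketch below is an assumption-laden illustration, not the presenters' design. The class and method names are hypothetical, and the use of HTTP status 202 (Accepted) to signal a pending retrieval is a choice made here, since the slides do not say how the protocol expresses it.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class Response:
    status: int
    body: str


@dataclass
class ArchiveFrontEnd:
    """Hypothetical front end for a near-line archive with cached metadata."""

    metadata_cache: Dict[str, str]                            # always available locally
    retrieved: Dict[str, str] = field(default_factory=dict)   # data already restored
    pending: Set[str] = field(default_factory=set)            # retrievals in progress

    def get_metadata(self, name: str) -> Response:
        # Browsing clients get an immediate answer from the cache.
        if name in self.metadata_cache:
            return Response(200, self.metadata_cache[name])
        return Response(404, "unknown dataset")

    def get_data(self, name: str) -> Response:
        if name not in self.metadata_cache:
            return Response(404, "unknown dataset")
        if name in self.retrieved:
            return Response(200, self.retrieved[name])
        # Data is still in the archive: start (or continue) a retrieval.
        self.pending.add(name)
        return Response(202, "retrieval requested; retry later")

    def complete_retrieval(self, name: str, data: str) -> None:
        # Called when the archive (hours later) delivers the object.
        self.pending.discard(name)
        self.retrieved[name] = data


server = ArchiveFrontEnd(metadata_cache={"sst.nc": "variables: sst(time, lat, lon)"})
browse = server.get_metadata("sst.nc")   # answered at once from the cache
early = server.get_data("sst.nc")        # data not yet restored
server.complete_retrieval("sst.nc", "…data bytes…")
late = server.get_data("sst.nc")         # now available
```

The sketch shows why the cached metadata matters: without it, even a simple browse request would have to wait on the archive.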

Glacier Implementation
Caching
o Catalog
o DAP metadata
Support for programmatic and web clients
o Web clients are the primary users of the DAP metadata because of their ‘click and browse’ behavior
XML with an embedded XSL style sheet
o Single response (XML)
o Multiple target clients: smart and browser
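The "single response, multiple target clients" idea can be sketched as one XML document that a program parses directly and a browser renders through XSL. The example below attaches the style sheet by reference with the standard `xml-stylesheet` processing instruction. That is an assumption: the slides say the style sheet was embedded but do not show how. The element names and the `catalog.xsl` file name are hypothetical.

```python
from typing import Iterable
from xml.dom import minidom


def build_catalog(dataset_names: Iterable[str], stylesheet_href: str = "catalog.xsl") -> str:
    """Return catalog XML that carries a pointer to an XSL style sheet."""
    doc = minidom.Document()

    # Browsers apply this style sheet; programmatic clients simply ignore it.
    instruction = doc.createProcessingInstruction(
        "xml-stylesheet", f'type="text/xsl" href="{stylesheet_href}"'
    )
    doc.appendChild(instruction)

    root = doc.createElement("catalog")  # hypothetical element names
    doc.appendChild(root)
    for name in dataset_names:
        dataset = doc.createElement("dataset")
        dataset.setAttribute("name", name)
        root.appendChild(dataset)

    return doc.toxml()


catalog_xml = build_catalog(["sst_2012.nc", "sst_2013.nc"])

# A "smart" client reads the same response as plain XML.
parsed = minidom.parseString(catalog_xml)
names = [d.getAttribute("name") for d in parsed.getElementsByTagName("dataset")]
```

Serving one document for both audiences means the server keeps a single cached response per catalog rather than separate XML and HTML versions.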

Comparison: S3 and Glacier*
Glacier provides “secure and durable storage”
S3 is “designed to make web-scale computing easier”
These graphs show only a tiny part of a complex cost model. They do not include the cost to move data out of the Amazon cloud, EC2 instances, etc.
*

Summary
OPeNDAP server with minimal changes
Data stored in S3 and Glacier
Solution widely applicable: Web + Smart clients
Complexity of the cost model → combination of both S3 and Glacier likely
Modeling & Monitoring use required