University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn 1 Quinn Hart 1 Bertram Ludäscher 1 Yogesh Simmhan.

Slides:



Advertisements
Similar presentations
A MapReduce Workflow System for Architecting Scientific Data Intensive Applications By Phuong Nguyen and Milton Halem phuong3 or 1.
Advertisements

Hello i am so and so, title/role and a little background on myself (i.e. former microsoft employee or anything interesting) set context for what going.
SALSA HPC Group School of Informatics and Computing Indiana University.
The Who, What, Why and How of High Performance Computing Applications in the Cloud Soheila Abrishami 1.
IPDPS - April 20, 2010 Jie Li 1, Deb Agarwal 2, Marty Humphrey 1, Keith Jackson 2, Catharine van Ingen 3, Youngryel Ryu 4 University of Virginia eScience.
Meeting Service Level Objectives of Pig Programs Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo University of Pennsylvania Hewlett-Packard.
High memory instances Monthly SLA : Virtual Machines Validated & supported Microsoft workloads Price reduction: standard Windows (22%) & Linux (29%)
Azure Services Platform Piotr Zierhoffer. Agenda Cloud? What is Azure? Environment Basic glossary Architecture Element description Deployment.
Microsoft Cloud Futures 2010 April 9, 2010 Jie Li 1, Youngryel Ryu 2, Deb Agarwal 3, Keith Jackson 3, Marty Humphrey 1, Catharine van Ingen 4 University.
MCell Usage Scenario Project #7 CSE 260 UCSD Nadya Williams
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Platform as a Service (PaaS)
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
Cloud Computing Systems Lin Gu Hong Kong University of Science and Technology Sept. 21, 2011 Windows Azure—Overview.
WINDOWS AZURE STORAGE 11 de Mayo, 2011 Gisela Torres – Windows Azure MVP Aventia-Renacimiento Twitter:
Abstract Cloud data center management is a key problem due to the numerous and heterogeneous strategies that can be applied, ranging from the VM placement.
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
A Workflow-Aware Storage System Emalayan Vairavanathan 1 Samer Al-Kiswany, Lauro Beltrão Costa, Zhao Zhang, Daniel S. Katz, Michael Wilde, Matei Ripeanu.
Fault tolerance in BlobSeer Bogdan Nicolae University of Rennes 1 Jesús Montes Sánchez CesViMa – Universidad Politécnica de Madrid.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
Migrating Business Apps to Windows Azure Marc Müller Principal Consultant, 4tecture GmbH
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Larisa kocsis priya ragupathy
Austin code camp 2010 asp.net apps with azure table storage PRESENTED BY CHANDER SHEKHAR DHALL
UI and Data Entry UI and Data Entry Front-End Business Logic Mid-Tier Data Store Back-End.
Emalayan Vairavanathan
Yongzhi Wang, Jinpeng Wei VIAF: Verification-based Integrity Assurance Framework for MapReduce.
The Center for Autonomic Computing is supported by the National Science Foundation under Grant No NSF CAC Seminannual Meeting, October 5 & 6,
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Threads, Thread management & Resource Management.
Your First Azure Application Michael Stiefel Reliable Software, Inc.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
Mostafa Abdollahi Mazandaran University Of Science And Technology January 2011.
Overview of Cloud Computing Sven Rosvall ACCU
Windows Azure Conference 2014 Designing Applications for Scalability.
Fragmentation in Large Object Repositories Russell Sears Catharine van Ingen CIDR 2007 This work was performed at Microsoft Research San Francisco with.
SALSA HPC Group School of Informatics and Computing Indiana University.
Data centers Account Container Blobs Table Entities Queue Messages
Server to Server Communication Redis as an enabler Orion Free
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
Compiler and Runtime Support for Enabling Generalized Reduction Computations on Heterogeneous Parallel Configurations Vignesh Ravi, Wenjing Ma, David Chiu.
Department of Computer Science MapReduce for the Cell B. E. Architecture Marc de Kruijf University of Wisconsin−Madison Advised by Professor Sankaralingam.
Problem-solving on large-scale clusters: theory and applications Lecture 4: GFS & Course Wrap-up.
North America Europe Asia Pacific Data centers.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Threads, Thread management & Resource Management.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Experiences Running Seismic Hazard Workflows Scott Callaghan Southern California Earthquake Center University of Southern California SC13 Workflow BoF.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
The EUBrazilOpenBio-BioVeL Use Case in EGI Daniele Lezzi, Barcelona Supercomputing Center EGI-TF September 2013.
Cloud Computing from a Developer’s Perspective Shlomo Swidler CTO & Founder mydrifts.com 25 January 2009.
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Platform as a Service (PaaS)
Platform as a Service (PaaS)
BEST CLOUD COMPUTING PLATFORM Skype : mukesh.k.bansal.
Introduction to Data Management in EGI
Platform as a Service.
Windows Azure Migrating SQL Server Workloads
Replication Middleware for Cloud Based Storage Service
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Orchestration and data movement with Azure Data Factory v2
Distributing META-pipe on ELIXIR compute resources
MapReduce: Simplified Data Processing on Large Clusters
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn 1 Quinn Hart 1 Bertram Ludäscher 1 Yogesh Simmhan 2 1 University of California at Davis 2 University of Southern California Presented at WORKS 2010 Streaming Satellite Data to Cloud Workflows for On-Demand Computing of Environmental Data Products Streaming Satellite Data to Cloud Workflows for On-Demand Computing of Environmental Data Products

University of California, Davis Daniel Zinn 2 Motivation Growing prevalence of sensors, continuously gathering observational data with high spatial and time granularity Increased resource needs for storing, managing, analyzing and sharing  Investigate Streams as first-class entities in Workflow Systems over heterogeneous resources What programming abstractions? How to provide these in Cloud/heterogeneous environments? What (new) application areas can benefit from them?

University of California, Davis Daniel Zinn 3 Use-case: Reference Evapotranspiration Reference Evapotranspiration (ETo) Planning daily water use (CA farmers, turf managers) Defining water resource policies  improve irrigation scheduling and monitor water stress. Current State Single-machine, monolithic ‘workflow’ orchestrated by make Executed once a day to create ETo maps for CA Computed from streaming observational data: Geostationary Operational Environmental Satellite (GOES) (GOES-WEST) California Irrigation Management Information System (CIMIS) stations

University of California, Davis Daniel Zinn 4 Use-case High-level Overview GOES West imagery CIMIS weather station point data Hourly cloud cover Reference ETo Complex Makefile Using GRASS GIS

University of California, Davis Daniel Zinn 5 Use-case Overview

University of California, Davis Daniel Zinn 6 App Domain Requirements Need to scale existing applications for Increased data quality [spatial and time] Batch-processing of historic data with novel analysis algorithms Need to share computation and data with a large community Benefits from Scientific Workflow Technology and Streaming Streaming as abstraction maps well to application domain Better management of dataflow pipeline Increased sharing of data and pipeline steps Benefits of the Cloud as computational platform Theoretic CPU scale-out limited only by $$$ Pay-as you go storage abstractions (BLOB/Tuple store) Simple service abstractions (BLOB, table, worker, …) Relatively easy to access/maintain (vs. GRIDS, vs. owning cluster)

University of California, Davis Daniel Zinn 7 Challenges for Cloud Usage Application migration Data movement slower / more complex Data-source to cloud Cloud to desktop Desktop to cloud Intra-cloud movement between workers and persistent storage Unpredictability Non-homogeneous data transfer rates Non-homogeneous disk access rates Non-homogeneous computation speeds

University of California, Davis Daniel Zinn 8 Contributions Migrating legacy script to a cloud-enabled workflow model Investigating data movement strategies BLOB storage as easy-accessible persistent cloud storage Direct streaming from Client to Cloud via TCP socket  Streaming faster by a factor of 5 Investigating scale-out behavior Ran 7 workflows in parallel  Almost linear speed-up for workflow steps and data movement

University of California, Davis Daniel Zinn 9 Workflow Design Workflow engine on client, that orchestrates workflow tasks Here, we use Restflow workflow system Light-weight, Java-based workflow engine with scripting capabilities and automated tracing of data flow and invocation timings Workflow tasks run on the Cloud, implement application logics Here, we use windows Azure cloud We used the PaaS (.Net program) infrastructure Access queues, tables, and BLOB via REST interface

University of California, Davis Daniel Zinn 10 Client – Cloud Communication Instructions via Message queues [Request/Response] Input / Output data via BLOB-store Workflow tasks are stateful, i.e. locally written files from earlier tasks are re-used by later tasks.  all tasks of a single ETo computation are run on the same cloud machine  achieved via reserving a cloud machine as first task and switching to private request/response queue BLOB Disk REST Storage Service API Upload input files Put Workflow Tasks Get Workflow Tasks Get input from BLOB to local disk Publish output from local disk to BLOB Workflow Engine on desktop

University of California, Davis Daniel Zinn 11 Cloud-Migrating of ETo computation Stage Input Reserve worker, data up+download via BLOB; 14 days (315MB) Make Cloud-cover & Make ETo Re-used legacy Make implementation that calls GRASS, Perl and bash scripts Publish Output data Upload output data to BLOB and return URL for user Served by Azure Web-hosting worker Make Clean and release worker VM Stage Input Data Make Cloud- Cover Make ET0 Publish Output Data Satellite Data Files on Desktop, Temporal Range Satellite Data Files in Cloud, Temporal Range Satellite Data Files in Cloud, Output from Cloud- Cover Satellite Data Files in Cloud, Output from Cloud-Cover, ET0 Output images from Cloud-Cover, ET0 Make Clean

University of California, Davis Daniel Zinn 12 Improved Data Movement using Streams BLOB Disk REST Storage Service API Upload input files Put Workflow Tasks Get Workflow Tasks Get input from BLOB to local disk Publish output from local disk to BLOB Workflow Engine on desktop Direct transfer Application can work on incoming data incrementally  Fault tolerance Simple API Fault tolerance  2 Data transfers  Slower bandwidth Disk BLOB Stream input files Put Workflow Tasks Get Workflow Tasks Socket Load Balancer Publish output from local disk to BLOB REST Storage Service API Workflow Engine on desktop

University of California, Davis Daniel Zinn 13 Changes for Streaming Stage Input Reserve worker via “Socket Load Balancer” Stream input data to CloudVM directly Rest is unaltered. We still communicate over queues, And store the final output in BLOB for persistency. Make Cloud-cover & Make Et0 Publish Output data Make Clean and release worker VM Stage Input Data Make Cloud- Cover Make ET0 Publish Output Data Satellite Data Files on Desktop, Temporal Range Satellite Data Files in Cloud, Temporal Range Satellite Data Files in Cloud, Output from Cloud- Cover Satellite Data Files in Cloud, Output from Cloud-Cover, ET0 Output images from Cloud-Cover, ET0 Make Clean

University of California, Davis Daniel Zinn 14 Evaluation Investigate effectiveness of Cloud Implementation Compare local vs. Cloud execution Investigate Scale-out for 7 concurrent workflows Compare BLOB vs. Streaming strategy Experimental Setup Local production machine: 2-core, 2.8GHz, 2GB RAM, NFS- mounted home+data (20MB/s) Client machine: gigabit to the internet Azure worker machine: 1.6GHz, 1.7GB RAM, 250GB local disk space, running 64bit Windows Server Co-located with data storage account US North Central Data Center Time measured by Restflow actor-invocation tracing capabilities Experiments run 4 times, averages and std-dev reported

University of California, Davis Daniel Zinn 15 Single Local and Cloud Executions Cloud-cover comparable ETo performed better on cloud than on local machine (NFS) Local outperforms BLOB Streaming beats local by 11%

University of California, Davis Daniel Zinn 16 Speedup of Concurrent Cloud Workflows Ran 7 workflows in parallel (315MB input data each) Linear speedup of computational tasks 25% overhead for data movement (220s vs. 177s for streaming) Overhead for zipping input data on client Summary: BLOB: speedup/#proc = 0.80 Stream: speedup/#proc = 0.92

University of California, Davis Daniel Zinn 17 BLOB vs. Streaming Single WF 315 MB of data in 15 zip files BLOB Client->BLOB: 1030s 13% stdev (300KB/s) BLOB->VM: 32s (10MB/s) Stream 180s 6%stdev (1.75MB/s)  Streaming 5x faster, and more stable

University of California, Davis Daniel Zinn 18 BLOB vs. Streaming: Parallel WFs BLOB Client->BLOB: 800s—1800s; avg:1172s; (270KB/s); stdev 44% (1.8MB/s total) BLOB->VM: 32s (10MB/s) stdev 35% (70MB/s total) Stream Avg: 227s 1.35MB/s stdev 6% (9.7MB/s total)  Streaming faster by >5x and more stable  Overall time: streaming 130% / 160% faster (single/parallel)

University of California, Davis Daniel Zinn 19 Findings / Summary Migrating ETo to cloud resources feasible from performance-point-of-view Streaming paradigm fits application domain well Streaming data transport beneficial for performance Much faster than going through BLOB storage More consistent performance Streaming on average 5x faster than via BLOB; total wall-clock time improvement 130% / 160%

University of California, Davis Daniel Zinn 20 Related Work

University of California, Davis Daniel Zinn 21 Ongoing / Future Work Compare the impact of streaming on different cloud platforms Investigate service abstraction for streams Easy to use such as BLOB, table, queues Built-in fault-tolerance, persistence, and sharing Investigating novel streaming apps Energy management (incoming smart-meter streams) Collaborative execution of workflows in the Cloud Incorporate pay-as-you-go cost model

University of California, Davis Daniel Zinn 22 University of California, Davis Daniel Zinn 22