OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Common Execution Infrastructure Kate Keahey, Tim Freeman, Alex Clemesha, David LaBissoniere,

Slides:



Advertisements
Similar presentations
Performance Testing - Kanwalpreet Singh.
Advertisements

NSF ANNUAL REVIEW June 2010 Ocean Observatories Initiative August Release 1 Life Cycle Architecture (LCA) Review Status of CI Project Matthew Arrott.
Database Architectures and the Web
R3 Kickoff Meeting Ocean Observatories Initiative Common Execution Infrastructure (CEI) Subsystem OOI CI System Architecture Team: 1.
Management Framework for Amazon EC2 Speaker: Frank Bitzer
COMS E Cloud Computing and Data Center Networking Sambit Sahu
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Microsoft ® Application Virtualization 4.5 Infrastructure Planning and Design Series.
Cloud Computing Systems Lin Gu Hong Kong University of Science and Technology Sept. 21, 2011 Windows Azure—Overview.
Microsoft ® Application Virtualization 4.6 Infrastructure Planning and Design Published: September 2008 Updated: February 2010.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Ocean Observatories Initiative Common Execution Environment Kate Keahey OOI Cyberinfrastructure Life Cycle Objectives Milestone Review, Release 1 San Diego,
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
OOI CI R2 Life Cycle Objectives Review Aug 30 - Sep Ocean Observatories Initiative OOI CI Release 2 Life Cycle Objectives Review CyberPoPs & Network.
RUP Fundamentals - Instructor Notes
Click to add text TWA Cloud Integration with Tivoli Service Automation Manager TWS Education.
Ocean Observatories Initiative EOI User Review June OOI Cyberinfrastructure Release 1 Scope OOI CI - EOI User Review 16 June 2011.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Concepts of Operations John Graybeal Life Cycle Architecture Review La Jolla, CA.
Ocean Observatories Initiative Common Execution Infrastructure (CEI) Overview Michael Meisinger September 29, 2009.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Common Execution Environment Kate Keahey, Tim Freeman, Alex Clemesha, John Bresnahan, David.
OOI CI R2 Life Cycle Objectives Review Aug 30 - Sep Ocean Observatories Initiative OOI CI Release 2 Life Cycle Objectives Review Common Execution.
= WEEKS, MONTHS, YEARS OF DELAYED APPLICATION VALUE MISSED REVENUE OPPORTUNITIES, INCREASED COST AND RISK DEV QA PACKAGE COMMERCIAL SOFTWARE CUSTOM APPLICATION.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Common Execution Environment Kate Keahey, Tim Freeman, Alex Clemesha, John Bresnahan, David.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 R3 Life Cycle Objective Review for Common Execution.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Construction Plan John Graybeal Life Cycle Architecture Review La Jolla, CA.
OOI CI EOI LCA REVIEW December 13, 2010 Ocean Observatories Initiative External Observatory Integration Christopher Mueller Life Cycle Architecture Review.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative External Observatory Integration Christopher Mueller, Matt Arrott, John Graybeal Life Cycle.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Common Operating Infrastructure Subsystem Michael Meisinger Life Cycle.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Assessment John Graybeal, Michael Meisinger Life Cycle.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
Ocean Observatories Initiative OOI Cyberinfrastructure Overview Matthew Arrott VMware Presentation March 5, 2010.
37 Copyright © 2007, Oracle. All rights reserved. Module 37: Executing Workflow Processes Siebel 8.0 Essentials.
CS551 - Lecture 5 1 CS551 Lecture 5: Quality Attributes Yugi Lee FH #555 (816)
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Common Operating Infrastructure Subsystem Michael Meisinger, Brian.
LegendCorp What is System Center Virtual Machine Manager (SCVMM)? SCVMM at a glance Features and Benefits Components / Topology /
Ocean Observatories Initiative CEI Demonstrations Tim Freeman OOI Cyberinfrastructure Life Cycle Objectives Milestone Review, Release 1 San Diego, CA February.
Software Development Risk Assessment for Clouds National Technical University of Ukraine “Kiev Polytechnic Institute” Heat and energy design faculty Department.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Anubha Gupta | Software Engineer Visual Studio Online Microsoft Corp. Visual Studio Enterprise Leveraging modern tools to streamline Build and Release.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Web Technologies Lecture 13 Introduction to cloud computing.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Sensing and Acquisition Subsystem Arjuna Balasuriya Life Cycle Architecture Review La Jolla,
Ocean Observatories Initiative Serving Ocean Model Data on the Cloud M. Meisinger, C. Farcas, E. Farcas, C. Alexander, M. Arrott, J. de La Beaujardière,
Ocean Observatories Initiative OOI Cyberinfrastructure Overview Matthew Arrott VMware Presentation March 5, 2010.
Introduction to Performance Testing Performance testing is the process of determining the speed or effectiveness of a computer, network, software program.
Ocean Observatories Initiative OOI Cyberinfrastructure Common Execution Infrastructure Michael Meisinger OOI Cyberinfrastructure Life Cycle Objectives.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative User Experience Susanne Jul, Carolanne Fisher, Tom Auer, Chris Quintana Life Cycle Architecture.
Ocean Observatories Initiative Integrating Marine Observatories into a System-of-Systems: Messaging in the US Ocean Observatories Initiative M. Arrott,
Cloud Computing from a Developer’s Perspective Shlomo Swidler CTO & Founder mydrifts.com 25 January 2009.
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INFSO-RI Demonstration StratusLab First.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey Nimbus Project University of Chicago Argonne National Laboratory.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INFSO-RI Work Package 5 Infrastructure.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Structured Container Delivery Oscar Renalias Accenture Container Lead (NOTE: PASTE IN PORTRAIT AND SEND BEHIND FOREGROUND GRAPHIC FOR CROP)
Understanding The Cloud
Cloud Computing Platform as a Service
Cloud Computing Dr. Sharad Saxena.
Simplified Development Toolkit
Cloud Computing: Concepts
Using and Building Infrastructure Clouds for Science
OOI Cyberinfrastructure Integrated Elaboration Demo
Microsoft Virtual Academy
Presentation transcript:

OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Common Execution Infrastructure Kate Keahey, Tim Freeman, Alex Clemesha, David LaBissoniere, John Bresnahan Life Cycle Architecture Review La Jolla, CA

OOI CI LCA REVIEW August 2010 Common Execution Infrastructure Purpose Basic capabilities in resource provisioning on IaaS clouds Commercial National infrastructure Highly Available (HA) services Allow OOI computations to scale to demand by leveraging elastically provisioned resources

OOI CI LCA REVIEW August 2010 R1 Use Cases IDTitleDescription UC.R1.14Use Service AnywhereMessages go to services wherever they are UC.R1.15Put Services AnywhereAllocate services where need is greatest UC.R1.16Scale the ProcessingIncrease processing quickly to meet demand UC.R1.17Replicate ServiceConfigure service once, deploy many times UC.R1.20Command A ResourceSend typical commands to specific resource UC.R1.25Assure ReliabilityComputer fails, messages resent, work resumes UC.R1.26Virtualize EverythingVirtual processes embody all system services UC.R1.28Operate SystemConfigure system and respond to requests UC.R1.30Troubleshoot SystemDiagnose issues using logs, feeds, tools

OOI CI LCA REVIEW August 2010 User’s View of the Architecture EPU EPU Worker (Operational Unit) EPU Worker (Operational Unit) HA Service (OOI Application) VM (Deployable Unit) Application Software (Deployable Type) EPU Worker (Operational Unit) EPU Worker (Operational Unit) EPU Worker (Operational Unit) EPU Worker (Operational Unit) VM (Deployable Unit) VM (Deployable Unit)

OOI CI LCA REVIEW August 2010 Overall Architecture HA App- v1 Client VM Exchange Point …and then a miracle occurs…

OOI CI LCA REVIEW August 2010 Provisioner-2 Provisioner-0 Overall Architecture Capability Container App-v1 cc-agent EPU Worker ctx-agent IaaS Context Broker HA App- v1 Client Provisioner-0 Deployable Type Registry Service EPU Controller (App-v1) DE (Planne r) Sensor Aggregator (App-v1) A HA-P VM updates Queue length uses queries contextualization Launches VM Health report Per-node status Exchange Point

OOI CI LCA REVIEW August 2010 Capability Container One VM HA Provisioner Provisioner-2 Provisioner-0 IaaS Context Broker Provisioner-0 A HA-P Provisioner (Provisioner) Controller (HA-Provisioner) Sensor Aggregator (HA-Provisioner) Per-node status Queue length Base CEI Instance All other EPU controllers Bottom Turtle: Operations Monitors and restarts

OOI CI LCA REVIEW August 2010 Daemonize and monitor Bootstrapping and Monitoring Provisioner (Provisioner) Controller (HA-Provisioner) Sensor Aggregator (HA-Provisioner) Base CEI Instance Context Broker Messaging Service Core Services epu_control launch test monitor launch test monitor launch test monitor launch test monitor Provisioner-2 Provisioner-0 HA-P Service launches

OOI CI LCA REVIEW August 2010 Summary of Implementation Status Detailed design and implementation documents All major components implemented: Provisioner, EPU Controller, Decision Engine and Planner, Sensor Aggregator, DTRS Integrated with ION Some components needing refinement: bootstrap process, draft user and administrator process, image building and management Tested on infrastructure from Magellan to EC2

OOI CI LCA REVIEW August 2010 Technology Choices ION: Integrated Observatory Network boto txrabbitmqTwotp Nimboss Context Broker Fabric

OOI CI LCA REVIEW August 2010 The Testfest Objectives: Test a fully experimental system queue_length excepted to make progress Identify areas needing potential redesign Test the “muscle” of the system: no optimizations, no policies, no fancy improvements Scalability target: up to 1000 VMs 237 achieved so far

OOI CI LCA REVIEW August 2010 R1 Use Cases Demonstrated UC.R1.16: Scale the processing A load is put on the system Additional demand is recognized via different sensors Message queue length, CPU loads, disk usage System scales up to meet increased demand System scales down when demand goes away UC.R1.25: Assure reliability Failures happen Remedial actions happen No significant impact on observatory operation

OOI CI LCA REVIEW August 2010 Testing Environment Provisioner-2 Provisioner-0 Capability Container App-v1 cc-agent EPU Worker ctx-agent IaaS Context Broker HA App- v1 Client Provisioner-0 Deployable Type Registry Service EPU Controller (App-v1) DE (Planne r) Sensor Aggregator (App-v1) A HA-P VM updates Queue length uses queries contextualization Launches VM Health report Per-node status Exchange Point EC2 small EC2 High-CPU XL EC2 Small UC EC2 small

OOI CI LCA REVIEW August 2010 Scale the Processing Average load scenario 70 jobs, infinitely long One job per VM Submitted over 28 minutes, 5 jobs every 2 minutes Worst-case scenario 70 jobs, infinitely long One job per VM Saturating the system

OOI CI LCA REVIEW August 2010 Assure Reliability How does the system react to failure? Saturate the system with 10s jobs Bounded policy: 20 VMs Kill 2 VMs every 5 minutes

OOI CI LCA REVIEW August 2010 Lessons Learned Many, MANY, tractable small issues and lessons learned a.k.a., “an endless stream of simple bugs” ;-) Most significant unresolved issues: Messaging system connections close unexpectedly Currently prevents us from running at scale, need for scalability testing in COI Inspecting message queue remotely needs to be rethought Need for concurrency in the container Unresolved issue in “pulling work” Lots of work to do!

OOI CI LCA REVIEW August 2010 Risk Assessment -CEI Use Cases IDNameDescription Risk of Not Availability Level of Maturity Target Use UC.R1.15Put Services Anywhere Allocate services where need is greatest LowExpectedDeveloper UC.R1.16Scale the ProcessingIncrease processing quickly to meet demand LowExpectedDeveloper UC.R1.17Replicate ServiceConfigure service once, deploy many times LowExpectedDeveloper UC.R1.26Virtualize Everything Virtual processes embody all system services LowExpectedDeveloper UC.R1.25Assure ReliabilityComputer fails, messages resent, work resumes MediumNecessaryDeveloper UC.R1.28Operate SystemConfigure system and respond to requests MediumNecessaryOperator Services Name Risk of Not Availability Level of MaturityTarget Use Elastic ComputingLowExpectedDeveloper Exec Engine RepositoryLowExpectedDeveloper Resource Management ServicesMediumNecessaryDeveloper

OOI CI LCA REVIEW August 2010 Roadmap Iteration 1: Finalize components and interactions - Continue stress testing - Refine Deployable Type Creation and Management - Integration with Data Management - Bootstrapping Iteration 2: Prepare an Internal Release - Refine the policy engine - Continue testing - Build&test harness - Preliminary documentation Iteration 3: Prepare an External Release -Testing and robustness - User and admin process - Improve quality and documentation

OOI CI LCA REVIEW August 2010 Questions?

OOI CI LCA REVIEW August 2010 Use Cases at (Medium) Risk for Release 1 TypeTitleImpact UC.R1.16Scale ProcessingPotential known obstacles to scalability UC.R1.25Assure ReliabilityPotential known unreliable scenarios UC.R1.28Operate SystemScaled down functionality, ease of use UC.R1.30Troubleshoot System Scaled down functionality, ease of use