Download presentation
Presentation is loading. Please wait.
Published byLester Glenn Modified over 9 years ago
1
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 R3 Life Cycle Objective Review for Common Execution Infrastructure (CEI) Subsystem Kate Keahey David LaBissoniere Patrick Armstrong Pierre Riteau 1
2
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Subsystem Purpose Allow OOI applications and system to –Provide Highly Available (HA) services –Scale to demand Enact OOI deployment policies in elastic environment Provide a deployment foundation for OOI CI 2
3
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Overview CEI Overview R3 Scope Cloud Provider Options Risks Elaboration Plan 3
4
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Resources for HA and Scaling 10/16/201510/16/201510/16/201510/16/2015 4 EPU Management Monitor and regulate set properties based on system-specific and application-specific metrics –Cloud resources are available on-demand, but any particular resource may fail at any time –Applications/processes can absorb new resources –Applications/processes can tolerate failures EPU
5
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 EE ioncore 1.3 EPU Management Elastic Processing Unit (EPU) Management 5 EE ioncore 1.2 context-agent ou-agent EE matlab 6.1 context-agent ou-agent Decision Engine context-agent ou-agent Provisioner IaaS create instance AMQP Other DTRS CB
6
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Making the EPU HA ou-agent EPU Worker Bootstrap EPU Dedicated DE Provisioner/DTRS IaaS create instance AMQP Other cloudinit.d
7
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Managing Processes: Creating a Process I 7 Process Definition Registry Process Dispatcher EE type A instance Process Instance Registry request to activate process X ee-agent Decision Engine lookup launch enter AMQP Other
8
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Managing Processes: Creating a Process II 8 Process Definition Registry Process Dispatcher Provisioner/DTRS IaaS EE type A instance EPU Management Process Instance Registry request to activate process X ee-agent Decision Engine lookup launch enter request instance create instance AMQP Other
9
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 CC instance Managing Processes: Inside an Execution Engine 9 EE type A instance context-agent ee-agent ou-agent supervisord Kepler C C M CMR CMK CMKO datastream subscriptionresult Process Dispatcher EPU Management Package Server process (adapter) 1 AMQP Other C – create M – monitor R – restart K – kill O – I/O
10
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Adventures in Availability Time to repair (TTR) –Diagnosis –Time to scale (TTS) PENDING (request) STARTED (deployment) RUNNING (contextualization) 10/16/201510/16/201510/16/201510/16/2015 10 A = MTBF MTBF+MTTR mean time between failures mean time to repair TTS: preliminary results for 2,000 VMs provisioned on AWS EC2
11
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 CEI R3 Proposed Scope Robustness –upgrade mechanisms, maintainability code refactor, more unit tests, scale and stress testing, documentation, packaging, support, etc. Integration –Component interaction update, tight inter-component integration New features –Process and resource management Process activation and validation New execution site registration –Integration with National Infrastructure Framework for integration of academic cloud providers such as XSEDE and OSG –Support for a new cloud provider –SLA management 11
12
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Cloud Provider Options Windows Azure –Initially PaaS, now offers IaaS with Windows & Linux –Pros: 8 regions in North America, Europe, and Asia –Cons: no libcloud support, still in preview mode (no SLA yet) Rackspace –Pros: based on OpenStack, libcloud support –Cons: only 3 regions with 2 in the USA Google Compute Engine –Pros: targets high performance/throughput clusters, advertises 50% more CPU per $ compared to EC2 –Cons: still in limited preview, no libcloud support, few regions for now 12
13
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Risks Scope –Mitigation: scope prioritization with the architects Handoff process 13
14
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 R3 Elaboration 1 Theme: focus on support and high-risk elements Support activities (includes R2 Transition and R2.1 features) Assist with Kepler integration Design and prototype process package download and installation scheme (with COI) Initial prototype of Chef server integration (upgrades) Integration: eliminate resource registry mirroring in Process Dispatcher 14
15
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 R3 Elaboration 2 Theme: continue to support existing deployments and fix issues while emphasizing integration. Continued support activities Assist with Kepler Integration Integration: pyon capabilities in EPUM, DTRS, and Provisioner New execution site registration 15
16
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 CEI R3 Team 16 10/16/2015 CEI Developer Patrick Armstrong University of Chicago (location: Victoria, Canada) CEI Developer Pierre Riteau University of Chicago (location: Oxford, England) (part-time) CEI Senior Developer David LaBissoniere University of Chicago (location: Chicago, IL) CEI Designer Kate Keahey Argonne National Lab University of Chicago
17
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Questions? 17
18
OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 CEI R3 Proposed Scope Robustness –upgrade mechanisms, maintainability code refactor, more unit tests, scale and stress testing, documentation, packaging, support, etc. Integration –Component interaction update, tight inter-component integration New features –Process and resource management Activation and validation New execution site registration –Integration with National Infrastructure Framework for integration of academic cloud providers such as XSEDE and OSG –Support for a new cloud provider –SLA management 18
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.