Managing allocatable resources

Slides:



Advertisements
Similar presentations
2  Industry trends and challenges  Windows Server 2012: Beyond virtualization  Complete virtualization platform  Improved scalability and performance.
Advertisements

© 2012 IBM Corporation Architecture of Quantum Folsom Release Yong Sheng Gong ( 龚永生 ) gongysh #openstack-dev Quantum Core developer.
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
Cloud Computing to Satisfy Peak Capacity Needs Case Study.
PlanetLab Operating System support* *a work in progress.
Clouds C. Vuerli Contributed by Zsolt Nemeth. As it started.
11 HDS TECHNOLOGY DEMONSTRATION Steve Sonnenberg May 12, 2014 © Hitachi Data Systems Corporation All Rights Reserved.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
CSE 490dp Resource Control Robert Grimm. Problems How to access resources? –Basic usage tracking How to measure resource consumption? –Accounting How.
VMware vCenter Server Module 4.
Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.
VMware vSphere 4 Introduction. Agenda VMware vSphere Virtualization Technology vMotion Storage vMotion Snapshot High Availability DRS Resource Pools Monitoring.
Windows Server 2008 R2 CSIT 320 (Blum) 1. Server Consolidation – Today’s chips have enhanced capabilities compared to those of the past. In particular.
Paper on Best implemented scientific concept for E-Governance projects Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
An Introduction to IBM Systems Director
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
FutureGrid Connection to Comet Testbed and On Ramp as a Service Geoffrey Fox Indiana University Infra structure.
VMware vSphere Configuration and Management v6
Computing Research Testbeds as a Service: Supporting large scale Experiments and Testing SC12 Birds of a Feather November.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
CHAPTER 7 CLUSTERING SERVERS. CLUSTERING TYPES There are 2 types of clustering ; Server clusters Network Load Balancing (NLB) The difference between the.
Scaling the CERN OpenStack cloud Stefano Zilli On behalf of CERN Cloud Infrastructure Team 2.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
Md Baitul Al Sadi, Isaac J. Cushman, Lei Chen, Rami J. Haddad
OpenStack.
Instructor Materials Chapter 7: Network Evolution
rain technology (redundant array of independent nodes)
Chapter 6: Securing the Cloud
Lab A: Planning an Installation
Use Case for Distributed Data Center in SUPA
The advances in IHEP Cloud facility
Workload Distribution Architecture
Distributed Cache Technology in Cloud Computing and its Application in the GIS Software Wang Qi Zhu Yitong Peng Cheng
Grid Services for Collaborative Science
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Multi-VIM/Cloud High Level Architecture
Tools and Services Workshop Overview of Atmosphere
Chapter 21: Cloud Computing and Related Security Issues
AWS COURSE DEMO BY PROFESSIONAL-GURU. Amazon History Ladder & Offering.
Introduction to Cloud Computing
Chapter 22: Cloud Computing Technology and Security
OpenStack Ani Bicaku 18/04/ © (SG)² Konsortium.
Network+ Guide to Networks 6th Edition
ONOS Drake Release September 2015.
Indigo Doyoung Lee Dept. of CSE, POSTECH
Virtualization Meetup Discussion
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Design Unit 26 Design a small or home office network
NSF cloud Chameleon: Phase 2 Networking
QNX Technology Overview
HC Hyper-V Module GUI Portal VPS Templates Web Console
The Globus Toolkit™: Information Services
Sky Computing on FutureGrid and Grid’5000
Specialized Cloud Mechanisms
Cloud computing mechanisms
Principles/Paradigms Of Pervasive Computing
Technical Capabilities
* Introduction to Cloud computing * Introduction to OpenStack * OpenStack Design & Architecture * Demonstration of OpenStack Cloud.
Cloud Computing Architecture
GENI Exploring Networks of the Future
Cloud Computing: Concepts
Basics of Cloud Computing
Module 1: Overview of Systems Management Server 2003
Sky Computing on FutureGrid and Grid’5000
Presentation transcript:

Managing allocatable resources Kate Keahey, Pierre Riteau, Jason Anderson, and Zhuo Zhen Argonne National Laboratory, University of Chicago keahey@anl.gov Cloud 2019 July 11, 2019

Chameleon in a Nutshell We like to change: testbed that adapts itself to your experimental needs Deep reconfigurability (bare metal) and isolation (CHI) – but also ease of use (KVM) CHI: power on/off, reboot, custom kernel, serial console access, etc. We want to be all things to all people: balancing large-scale and diverse Large-scale: ~large homogenous partition (~15,000 cores), 5 PB of storage distributed over 2 sites (now +1!) connected with 100G network… …and diverse: ARMs, Atoms, FPGAs, GPUs, Corsa switches, etc. Cloud on cloud: leveraging mainstream cloud technologies Powered by OpenStack with bare metal reconfiguration (Ironic) + “special sauce” Chameleon team contribution recognized as official OpenStack component We live to serve: open, production testbed for Computer Science Research Started in 10/2014, testbed available since 07/2015, renewed in 10/2017 Currently 3,000+ users, 500+ projects, 100+ institutions

Allocatable resources Definition: object within a system that can be automatically allocated and managed for exclusive, metered usage, delimited by well- defined time events Exclusive usage/isolation: System versus performance isolation Examples: GENI slice, containers, virtual machines, instances on commercial clouds, physical modes Implementation costs: e.g., hypervisor overhead, or default state recovery Time-bounded: e.g., advance reservation (including on-demand) Metered: policy enforcement and incentive management (via monetary or non-monetary means) Automated lease management: extends over both time and resource types; ensures scalability

Non-fungible Allocatable resource availability representation

Architecture Components Third-party Services Service interfaces Lease Manager (functionality generic to all leases) Enactment Plugins (resource-specific functionality) Third-party Services Adaptation of non-allocatable resources Examples: Nova, Neutron

Service interfaces Inventory management (operators) Objective: manage resource database Create, update, delete manage resource database Informational (show and list) Lease management (resource clients) Objectives: create and manage records in lease database Leases versus reservations Create and delete: range from a very partial description of a resource, (e.g., “two nodes on the same rack”, “node with at least 2GB of memory”) to very specific (“node X”) and can include multiple resources (e.g., nodes, IPs, VLANs) Update (active or inactive) lease: change resources, numbers, or temporal constraints

Lease Manager Interface to lease database Early or late assignment Information persisted includes original constraints Early or late assignment Simplicity versus efficiency Resolved at activation time at the latest Selection targets ranging from user choice to operation optimization Event-based assignment management E.g., reassignment in case of resource failure

Resource Plugins Allow lease manager to manage diverse resources Inventory management: separate reservable resources from freely available resources Lease management: allocating and deallocating resources to a lease Lease management functions on_start: resource allocation on_end: resource deallocation before_end: trigger an action at a configurable time before the end of a reservation (e.g., snapshot instances) update_reservation: e.g., add nodes to a lease

implementation Allocatable resources in Chameleon Heterogeneous bare metal machines Isolated network segments (VLANs) Public IP addresses OpenStack Blazar implementation with plugins for each HTTP/JSON (Keystone for authentication), SQL Alchemy, early binding with optimizations Bare-metal nodes: system and performance isolation, implemented as OpenStack Nova/Ironic plugin VLANs: system isolation, implemented as OpenStack Neutron plugin IP addresses: implemented as OpenStack Neutron plugin

Allocatable resource usage on Chameleon

Allocatable resource usage over time

How responsibly are reservations used Fully used: ~40% 80% used: ~60% 20% used: ~ 20% Not used: ~15% Management Idle at beginning or end: ~5% each ~20% early release

Conclusions Articulated resource management service Implemented in OpenStack Blazar Collaboration with OpenStack Blazar community Can be used independently of OpenStack – anticipate future work with IoT devices Effective tool in the management of scarce resources Managing user incentives Come and give it a shot! www.chameleoncloud.org