André Bauer, Simon Spinner, Nikolas Herbst, Samuel Kounev

Slides:

Advertisements

Similar presentations

Hadi Goudarzi and Massoud Pedram

Advertisements

Achieving Elasticity for Cloud MapReduce Jobs Khaled Salah IEEE CloudNet 2013 – San Francisco November 13, 2013.

SLA-Oriented Resource Provisioning for Cloud Computing

System Center 2012 R2 Overview

CLOUD COMPUTING AN OVERVIEW & QUALITY OF SERVICE Hamzeh Khazaei University of Manitoba Department of Computer Science Jan 28, 2010.

Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.

st International Conference on Parallel Processing (ICPP)

CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.

U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Provisioning for Multi-tier Internet Applications Bhuvan Urgaonkar, Prashant.

Providing Performance Guarantees for Cloud Applications Anshul Gandhi IBM T. J. Watson Research Center Stony Brook University 1 Parijat Dube, Alexei Karve,

Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.

Dynamically Scaling Applications in the Cloud Presented by Paul.

Automatic Resource Scaling for Web Applications in the Cloud Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer Science.

Energy Efficient Prefetching – from models to Implementation 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software Engineering.

COMS E Cloud Computing and Data Center Networking Sambit Sahu

INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.

Energy Aware Network Operations Authors: Priya Mahadevan, Puneet Sharma, Sujata Banerjee, Parthasarathy Ranganathan HP Labs IEEE Global Internet Symposium.

Towards Autonomic Hosting of Multi-tier Internet Services Swaminathan Sivasubramanian, Guillaume Pierre and Maarten van Steen Vrije Universiteit, Amsterdam,

Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.

SOFTWARE AS A SERVICE PLATFORM AS A SERVICE INFRASTRUCTURE AS A SERVICE.

Generating Adaptation Policies for Multi-Tier Applications in Consolidated Server Environments College of Computing Georgia Institute of Technology Gueyoung.

Capacity Scaling for Elastic Compute Clouds Ahmed Aleyeldin Hassan

A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.

Author : Chengwei Wang, Vanish Talwar*, Karsten Schwan, Parthasarathy Ranganathan* Conference: IEEE 2010 Network Operations and Management Symposium (NOMS)

Department of Computer Science Engineering SRM University

Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.

OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.

Windows Azure Team 9 Ben Holland Bao Nguyen Eric Petrowiak Barret Schloerke.

November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.

Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis [1] 4/24/2014 Presented by: Rakesh Kumar [1 ]

Adaptive software in cloud computing Marin Litoiu York University Canada.

An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.

Autonomic SLA-driven Provisioning for Cloud Applications Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer Presented by Ismail Alan.

Adaptive Virtual Machine Provisioning in Elastic Multi-tier Cloud Platforms Fan Zhang, Junwei Cao, Hong Cai James J. Mulcahy, Cheng Wu Tsinghua University,

1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.

Copyright © 2011, Performance Evaluation of a Green Scheduling Algorithm for Energy Savings in Cloud Computing Truong Vinh Truong Duy; Sato,

Using SaaS and Cloud computing For “On Demand” E Learning Services Application to Navigation and Fishing Simulator Author Maha KHEMAJA, Nouha AMMARI, Fayssal.

Reporter ： Yu Shing Li 1.  Introduction  Querying and update in the cloud  Multi-dimensional index R-Tree and KD-tree Basic Structure Pruning Irrelevant.

© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Injecting Realistic Burstiness to.

June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.

Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),

03/03/051 Performance Engineering of Software and Distributed Systems Research Activities at IIT Bombay Varsha Apte March 3 rd, 2005.

EuroSys Doctoral Workshop 2011 Resource Provisioning of Web Applications in Heterogeneous Cloud Jiang Dejun Supervisor: Guillaume Pierre

Capsule Placement in the Service Platform Bhuvan Urgaonkar Timothy Roscoe Systems Group, Sprint ATL.

Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.

Online Parameter Optimization for Elastic Data Stream Processing Thomas Heinze, Lars Roediger, Yuanzhen Ji, Zbigniew Jerzak (SAP SE) Andreas Meister (University.

Business Service Governance Managing from Above the Clouds Page 1 Michael Salsburg, Ph.D. Ohio CMG May

RESERVOIR Service Manager NickTsouroulas Head of Open-Source Reference Implementations Unit Juan Cáceres

Energy Aware Network Operations

Organizations Are Embracing New Opportunities

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING

Introduction to Load Balancing:

CSE 591: Energy-Efficient Computing Lecture 17 SCALING: survey

Hybrid Cloud Architecture for Software-as-a-Service Provider to Achieve Higher Privacy and Decrease Securiity Concerns about Cloud Computing P. Reinhold.

Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.

Standards and Patterns for Dynamic Resource Management

Provisioning Performance of name server Software

Adaptive Cloud Computing Based Services for Mobile Users

Introduction to Cloud Computing

Frequency Governors for Cloud Database OLTP Workloads

Graduation Project Kick-off presentation - SET

Managing Clouds with VMM

Extending BUNGEE Elasticity Benchmark for Multi-Tier Cloud Applications - Talk - André Bauer.

Cloud computing mechanisms

ICSOC 2018 Adel Nadjaran Toosi Faculty of Information Technology

The Vision of Self-Aware Performance Models

André Bauer, Johannes Grohmann, Nikolas Herbst, and Samuel Kounev

Tailoring Performance Prediction Processes to Situational Requirements

Tailoring Performance Prediction Processes to Situational Requirements

Harrison Howell CSCE 824 Dr. Farkas

Presentation transcript:

Chameleon: Design and Evaluation of a Proactive, Application-Aware Elasticity Mechanism André Bauer, Simon Spinner, Nikolas Herbst, Samuel Kounev Chair of Software Engineering University of Würzburg http://se.informatik.uni-wuerzburg.de/ Symposium on Software Performance Kiel, November 09, 2016 http://descartes.tools/chameleon

Motivation Cloud computing is a paradigm which faces the increasing scale and complexity of modern internet services To guarantee a reliable service, most application run with a fixed amount of resources High energy consumption, if the system is not fully utilized Bad performance, if unexpected peaks appear High quality auto-scalers are required, which reconfigure the system regarding its load 2 A. Bauer Introduction Approach Evaluation Conclusion

State of the Art Auto-scalers can be classified into 5 groups [Lorido-Botran14] Prominent examples of each group: Mechanism are either generic or application-specific Threshold-based Rules Queueing Theory Control Theory [Urgaonkar08] (Hist) [Adhikari12] (Adapt) [Maurer11] Reinforcement Learning Time Series Analysis Chameleon [Rao09] [Iqbal11] (Reg) Delimitation: Chameleon is both generic and application-aware 3 A. Bauer Introduction Approach Evaluation Conclusion

In a Nutshell… Problem Idea Benefit Action Existing auto-scalers are either application-specific or generic. Bad performance can be translated into loss of money or customers Controller combines time series analysis and query theory enriched with application knowledge Expected: More accurate scaling decisions compared to other controllers or standard reactive Taking forecasting, resource demand estimation and application model into account to increase the performance of the reconfiguration Problem Idea Benefit Action 4 A. Bauer Introduction Approach Evaluation Conclusion

Data Flow of Chameleon DML Model http://descartes.tools/ 6: reconfigures cloud 5: determines resource demands Cloud Chameleon Controller Resource Demand Estimation Component - LibReDE - Business Application monitors cloud 3: send data saves the observed data has DML model 1: requests historical data Forecast Component - WCF - 4: forecasts load Performance Repository DML Model Time Series Storage 2: sends data http://descartes.tools/ 5 A. Bauer Introduction Approach Evaluation Conclusion

Activity diagram of Chameleon 6 A. Bauer Introduction Approach Evaluation Conclusion

Assumptions & Evaluation Overview Availability of at least 4 days of historical data The SLO is the response time Chameleon is limited to scale CPUs A DML model is already provided Chameleon is driven with two different scenarios CPU intensive scenario The business application SPECjEnterprise2010 Both scenarios are deployed in a private CloudStack environment 7 A. Bauer Introduction Approach Evaluation Conclusion

Workload FIFA world cup 1998 is used for an authentic workload A sequence of 7 days were cropped out 4 days as historical data for WCF 3 days for the evaluation 8 A. Bauer Introduction Approach Evaluation Conclusion

Benchmarking Evaluation with Bungee experiment controller Perform each scenario with Chameleon Perform each scenario with other auto-scalers Hist [Urgaonkar08] Reg [Iqbal11] Adapt [Adhikari12] ConPaaS [Pierre12] Compare the results with benchmarking metrics Benchmarking Metrics Elasticity metrics User metrics Number of SLO violations Average response time 90th percentile of response time Resource Demand #Running VMs Load requests 9 A. Bauer Introduction Approach Evaluation Conclusion

CPU Intensive Scenario Workload is accelerated 8 times: 3 d  9 h Between 40 and 200 request/second 3.5 million requests Setup contains up to 20 WildFly slaves, 1 Domain controller and 1 Citrix Netscaler (LB) 2 cores @2.6GHz 4 GB memory Each slave is a http application Request parameter n, here 600 LU decomposition of a random generated nxn matrix SLO: 90% of responses < 5 seconds 10 A. Bauer Introduction Approach Evaluation Conclusion

Evaluation Part I Scaling Chameleon 11 A. Bauer Introduction Approach Conclusion

Evaluation Part II No Auto-Scaling Scaling Chameleon Scaling Hist Scaling Reg 12 A. Bauer Introduction Approach Evaluation Conclusion

Evaluation Part III Scaling Chameleon Scaling Adapt Scaling ConPaaS 13 A. Bauer Introduction Approach Evaluation Conclusion

Metric Comparison Metrics No Scaler Chameleon Hist Adapt Reg ConPaaS accuracyo 0.282 0.350 0.387 0.079 0.289 accuracyu 0.147 0.030 0.055 0.613 0.078 0.105 timeshareo 0.453 0.755 0.608 0.181 0.52 0.535 timeshareu 0.491 0.129 0.285 0.788 0.371 0.355 jitter -0.924 -0.723 -0.654 -0.615 -0.719 -0.466 instability 0.478 0.215 0.22 0.418 0.196 0.235 #Adaptions 85 110 618 93 204 Avg. #VMs 10 12.5 13.4 8.1 11.1 13.2 scoreNoScaler 1 1.330 1.151 0.896 1.177 1.143 Avg. response time 1251 ms 671 ms 1267 ms 4339 ms 2018 ms 2082 ms P90 response time 5002 ms 1051 ms Violations 18.07% 7.50% 19.24% 82.75% 32.25% 35.98% Chameleon achieves highest score and lowest SLO violations 14 A. Bauer Introduction Approach Evaluation Conclusion

Future Work For Chameleon: For evaluation: Add vertical scaling and migration Improve precision by dynamic method selection Integration of a DML extractor and solver Taking billing of reconfigurations into account For evaluation: Distributed JMeter usage Calibration search vs. bottleneck shadowing Retaking the experiments in a public cloud Multitier scaling of SPECjEnterprise2010 Scale another multi-tier application (e.g., RuBiS) Evaluate energy savings by auto-scaling 16 A. Bauer Introduction Approach Evaluation Conclusion

Conclusion Problem Idea Results Ongoing Work Existing auto-scalers are either application-specific or generic. Bad performance can be translated into loss of money or customers Chameleon combines time series analysis and query theory enriched with application knowledge Chameleon achieves the best score in the CPU intensive scenario compared with 4 existing auto-scaler Release Chameleon as open source, … Problem Idea Results Ongoing Work 17 A. Bauer Introduction Approach Evaluation Conclusion

Thank you for your attention andre.bauer@uni-wuerzburg.de http://descartes.tools/chameleon http://se.informatik.uni-wuerzburg.de

Literature [Adhikari12] R. Adhikari and R. Agrawal, “An Introductory Study on Time Series Modeling and Forecasting,” arXiv preprint arXiv:1302.6613, 2013. [Iqbal11] W. Iqbal, M. N. Dailey, D. Carrera, and P. Janecek, “Adaptive Resource Provisioning for Read Intensive Multi-tier Applications in the Cloud,” Future Generation Computer Systems, vol. 27, no. 6, pp. 871-879, 2011. [Lorido-Botran14] T. Lorido-Botran, J. Miguel-Alonso, and J. A. Lozano, “A Review of Autoscaling Techniques for Elastic Applications in Cloud Environments,” Journal of Grid Computing, vol. 12, no. 4, pp. 559-592, 2014. [Maurer11] M. Maurer, I. Brandic, and R. Sakellariou, “Enacting Slas in Clouds Using Rules,“ in Euro-Par 2011 Parallel Processing. Springer, 2011, pp. 455-466. [Rao09] J. Rao, X. Bu, C.-Z. Xu, L. Wang, and G. Yin, “VCONF: a Reinforcement Learning Approach to Virtual Machines Auto-conguration,” in Proceedings of the 6th international conference on Autonomic computing. ACM, 2009,pp. 137-146. [Urgaonkar08] B. Urgaonkar, P. Shenoy, A. Chandra, P. Goyal, and T. Wood, “Agile Dynamic Provisioning of Multi-tier Internet Applications,“ ACM Transactions on Autonomous and Adaptive Systems (TAAS), vol. 3, no. 1, p. 1, 2008. [Pierre12] Pierre, Guillaume, and Corina Stratan. "ConPaaS: a platform for hosting elastic cloud applications." IEEE Internet Computing 16.5 (2012): 88-92. 19 A. Bauer

Elasticity Metrics 20 A. Bauer

Reprodicibility Reproducibility of the scaling behaviour 3 runs with the same setting Elasticity score has a standard variation of 0.017 21 A. Bauer

Logic I Reactive scaling events vs. proactive scaling events Chameleon plans reconfiguration steps in advance 22 A. Bauer

Logic II 23 A. Bauer

Resource Demand Resource demand for Chameleon is determined via LibReDE Resource demand for the existing auto-scalers is determined via the server speed 24 A. Bauer

Without Historical knowledge Chameleon without historical data Until minute 390: reactive scaling Afterwards proactive scaling 25 A. Bauer

Forecasting Modified WCF forecast strategy: Forecasts on demand Forecast model contains the next 24 predictions If MAPE > tolerance then build new forecast model 26 A. Bauer

SPECjEnterprise2010 Setup contains 8 different tiers (= WildFly slaves), 1 Domain controller and 1 Citrix Netscaler (LB) Problems with benchmarking Very small response time (about 10 ms) High load generation burden Difficult calibration (reproducibility, hidden bottlenecks,…) SLO: 90% responses < 100 milliseconds Only user metrics can be determined Number of SLO violations Average & median response time 90th percentile of response time 27 A. Bauer

Metric Comparison Chameleon achieves the second best user metrics Reactive Chameleon Hist Adapt Reg Avg. response time 669 ms 476 ms 404 ms 592 ms 500 ms P50 response time 9 ms 7 ms 12 ms 10 ms P90 response time 2005 ms 1876 ms 1754 ms 1899 ms 1915 ms Violations 33,02 % 27,19 % 26,57 % 27,32 % 30,16 % Chameleon achieves the second best user metrics 28 A. Bauer