Adaptive Scheduling with QoS Satisfaction in Hybrid Cloud Environment 研究生:李羿慷 指導老師:張玉山 老師.

Slides:



Advertisements
Similar presentations
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Advertisements

Evaluating the Cost-Benefit of Using Cloud Computing to Extend the Capacity of Clusters Presenter: Xiaoyu Sun.
Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
Hadi Goudarzi and Massoud Pedram
Class-constrained Packing Problems with Application to Storage Management in Multimedia Systems Tami Tamir Department of Computer Science The Technion.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
SLA-Oriented Resource Provisioning for Cloud Computing
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Tai, Yu-Chang 4/29/2013 Future Generation Computer Systems(FGCS.J) journal homepage: Saeid Abrishami a, ∗, Mahmoud Naghibzadeha,
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
Fair Scheduling in Web Servers CS 213 Lecture 17 L.N. Bhuyan.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Cloud Computing (101).
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
HeteroPar 2013 Optimization of a Cloud Resource Management Problem from a Consumer Perspective Rafaelli de C. Coutinho, Lucia M. A. Drummond and Yuri Frota.
Cloud Computing All Copyrights reserved to Talal Abu-Ghazaleh Organization
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
Applying Twister to Scientific Applications CloudCom 2010 Indianapolis, Indiana, USA Nov 30 – Dec 3, 2010.
MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Bargaining Towards Maximized Resource Utilization in Video Streaming Datacenters Yuan Feng 1, Baochun Li 1, and Bo Li 2 1 Department of Electrical and.
A Budget Constrained Scheduling of Workflow Applications on Utility Grids using Genetic Algorithms Jia Yu and Rajkumar Buyya Grid Computing and Distributed.
OPERATING SYSTEMS CPU SCHEDULING.  Introduction to CPU scheduling Introduction to CPU scheduling  Dispatcher Dispatcher  Terms used in CPU scheduling.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
 Escalonamento e Migração de Recursos e Balanceamento de carga Carlos Ferrão Lopes nº M6935 Bruno Simões nº M6082 Celina Alexandre nº M6807.
Agent-based Federated Hybrid Cloud Prof. Yue-Shan Chang Distributed & Mobile Computing Lab. Dept. of Computer Science & Information Engineering National.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
ROBUST RESOURCE ALLOCATION OF DAGS IN A HETEROGENEOUS MULTI-CORE SYSTEM Luis Diego Briceño, Jay Smith, H. J. Siegel, Anthony A. Maciejewski, Paul Maxwell,
Cloud Resource Scheduling for Online and Batch Applications Kick-off meeting.
Kevin Ross, UCSC, September Service Network Engineering Resource Allocation and Optimization Kevin Ross Information Systems & Technology Management.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
Plan  Introduction  What is Cloud Computing?  Why is it called ‘’Cloud Computing’’?  Characteristics of Cloud Computing  Advantages of Cloud Computing.
The Owner Share scheduler for a distributed system 2009 International Conference on Parallel Processing Workshops Reporter: 李長霖.
An Architecture for Distributed High Performance Video Processing in the Cloud 作者 :Pereira, R.; Azambuja, M.; Breitman, K.; Endler, M. 出處 :2010 IEEE 3rd.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Advanced Spectrum Management in Multicell OFDMA Networks enabling Cognitive Radio Usage F. Bernardo, J. Pérez-Romero, O. Sallent, R. Agustí Radio Communications.
IT Infrastructure for Providing Energy-as-a-Service to Electric Vehicles Smruti R. Sarangi, Partha Dutta, and Komal Jalan IEEE TRANSACTIONS ON SMART GRID,
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
Data Replication and Power Consumption in Data Grids Susan V. Vrbsky, Ming Lei, Karl Smith and Jeff Byrd Department of Computer Science The University.
Downlink Scheduling With Economic Considerations to Future Wireless Networks Bader Al-Manthari, Nidal Nasser, and Hossam Hassanein IEEE Transactions on.
Service-oriented Resource Broker for QoS-Guaranteed in Grid Computing System Yichao Yang, Jin Wu, Lei Lang, Yanbo Zhou and Zhili Sun Centre for communication.
Using Map-reduce to Support MPMD Peng
1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Static Process Scheduling
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Architecture for Resource Allocation Services Supporting Interactive Remote Desktop Sessions in Utility Grids Vanish Talwar, HP Labs Bikash Agarwalla,
Basic Concepts Maximum CPU utilization obtained with multiprogramming
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
Dynamic Mobile Cloud Computing: Ad Hoc and Opportunistic Job Sharing.
Md Baitul Al Sadi, Isaac J. Cushman, Lei Chen, Rami J. Haddad
R-Storm: Resource Aware Scheduling in Storm
Introduction to Load Balancing:
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
Operating systems Process scheduling.
CPU SCHEDULING.
Towards Predictable Datacenter Networks
Presentation transcript:

Adaptive Scheduling with QoS Satisfaction in Hybrid Cloud Environment 研究生:李羿慷 指導老師:張玉山 老師

Outline 1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 2

1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 3

1. Introduction Cloud computing – Huge data store and highly parallel computing – Cloud services: SaaS, PaaS, IaaS Private cloud – Control and security issue – One-time purchase and long term maintain Public cloud – Flexible, scalable – Pay-per-use 4

Introduction (cont.) Cloud environment workload status – Ex: Yahoo! Video 5

Introduction (cont.) Hybrid Cloud – Combine Private and Public cloud – Private cloud Regular workload Constant maintenance cost – Public cloud Transit overloading Pay-per-use (cost issue) 6

Introduction (cont.) Cloud services – Efficiency – Reliability – Cost Quality of Service (QoS) – Response time ↓ – Payment ↓ 7

Introduction (cont.) Hybrid Cloud – Guarantee user QoS demand Workload dispatching – Private Cloud Maximize utilization Minimize execution time – Public Cloud Minimize cost expense 8

Introduction (cont.) To improve QoS satisfaction in hybrid cloud – We propose: Adaptive Scheduling with QoS Satisfaction in Hybrid Cloud Environment 9

MMKP Mapping the QoS satisfaction and cost function into a MMKP – MMKP (Multi-dimension Multi-choice Knapsack Problem) – MMKP is proved as NP-complete – Maximal utilization – Minimal cost value – QoS deadline constraint 10

Introduction (cont.) We may solve our problem by finding a near optimal heuristic solutions in polynomial time Using dynamic programming finding a heuristic solution (near optimal) – Solving complex problem into smaller sub- problems Using CloudSim to evaluate the experiment 11

1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 12

2. Related Work FIFO: first come first serve – Most common – Drawback: Convoy Effect Fair (Facebook), Capacity (Yahoo) Scheduling – Solve multi-user problem in FIFO – Ensure every task has approximately equal computational resource/time 13

Related Work (cont.) Intelligent Workload Factoring for A Hybrid Cloud Computing Model – Split the workload into two parts – Base load and trespassing load (privately-owned data center and public cloud service) – Reduce data cache/replication overhead – But not support real-time QoS constraint computing 14

Related Work (cont.) GA-Based Task Scheduler for the Cloud Computing Systems – Genetic Algorithm based for task level scheduling in Hadoop MapReduce – Achieve better load balancing – GA for making the optimal decision – Not for Hybrid Cloud – Not supporting QoS constraint 15

Related Work (cont.) Cost-Minimizing Scheduling of Workflows on a Cloud of Memory Managed Multicore Machines – Service-oriented architecture framework – Cost function, maps values of workflow tardiness to corresponding cost function value – To minimize the sum of cost function values for all workflows – Cost function not for user-aspect design – Not supporting Hybrid Cloud 16

1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 17

3. Problem Definition and Formulation 3.1 Resource Slot Definition 3.2 Request Job Definition 3.3 Problem Formulation 18

3.1 Resource Slot Definition Private resource slot – One node (machine) in private cloud can generate more slots – Base on virtual machine infrastructure – One slot require one CPU resource ability – Basic unit for handling request task 19

Resource Slot Definition (cont.) 20 Example of Private Slots

Resource Slot Definition (cont.) Resource Slots has different computing ability, depends on CPU speed, memory…etc – Unit: Million Instruction Per Second Private Cloud set data replications between resource slots 21

Resource Slot Definition (cont.) Public Resource Slot – Resource from charging public cloud provider – Based on different instance type – Unify charging policy by Computing price Storage price Data transfer price 22

Resource Slot Definition (cont.) 23 Example of Public Slots

3.2 Request Job Definition Target applications – Internet-based applications – Focus on data sets on certain kinds of distributable, parallel problems – Ex: image and video rendering codes and highly parallel data analysis codes – Each application has a completion deadline 24

25 Example of Request Jobs

3.3 Problem Formulation For guarantee QoS demand deadline constraint – Maximize private slot utilization – Minimize task execution time – Minimize cost value 26

Definition Deadline constraint: – For Job J i = {V i1 ~ V in }, and deadline D i – Code size SC ij for task V ij – For private slot PrR k and computing ability Prμ k, k = 1 to m 27

Definition (cont.) Budget control – For Job J i = {V i1 ~ V in }, and cost budget M i – Code size SC ij for task V ij – Information data size SD ij for task V ij – For public slot PuR q and computing price x q – For public slot PuR q and storage price y q k = 1 to m 28

Definition (cont.) Estimated Finish Time (Est) – For task V ij on private slot PrR k – Code size SC ij for V ij – Computing ability Prμ k for PrR k 29

Definition (cont.) Estimated Execution Time (EEt) – For task V ij on private slot PrR k – Code size SC ij for V ij – Computing ability Prμ k for PrR k Data Transmission Time (Dtt) – For task V ij and code size SC ij – Network bandwidth NB – Disk speed DS k on resource slot k 30,

Definition (cont.) Cost Function (CostF) – Code size SC ij and information data size SD ij for task V ij – Computing price x k, storage price y k, data transfer in price dti k and data transfer out price dti o for public resource slot PuR k 31

MMKP Mapping our mathematical formulate problems into MMKP (NP-complete) – MMKP (Multi-dimension Multi-choice Knapsack Problem) – Maximal utilization – Minimal cost value – QoS deadline constraint We may solve our problem by finding a near optimal heuristic solutions in polynomial time 32

33

1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 34

4. Adaptive Scheduling Algorithm with QoS Satisfaction 4.1 Resource Needed Weight 4.2 Execution Time Estimation with Task on Different Slots 4.3 Dynamic Programming for Dispatching to Candidate Slots 4.4 Dispatch Selection from Slot Queue 4.5 Dynamic Programming for Minimal Cost on Public Slot 35

4.1 Resource Needed Weight If multi Jobs arrive in the pool at the same time, they’ll share the resource by the % of Resource Needed Weight W i Differ from Fair Scheduling, guarantee the resource amount base on code size and deadline – W x :W y :W z ->Slot distributed rate, for Job x, y, z 36

Resource Needed Weight (cont.) 37 Fair SchedulingAsQ

4.2 Execution Time Estimation with Task on Different Slots Collect private cloud resources’ current status and information – Remain code size – Computation ability Calculate estimated finish time (Est) Can find out when the slot will be available 38

Example of Est 39

Execution Time Estimation with Task on Different Slots (cont.) Calculate estimated execution time (EEt) of current tasks on every private resource from 1 to k – Estimated Execution time – Data Transfer Time If, Dtt=0 Else if, 40

Example of EEt 41

Execution Time Estimation with Task on Different Slots (cont.) By having Est and EEt, the slots which were able to finish the task before the deadline can be selected The slots which can reach the QoS (deadline) will be collect in a candidate set 42 Example of Est + EEt

Example of Overloading Dispatch 43

4.3 Dynamic Programming for Dispatching to Candidate Slots The optimal scheduling has been mapping to MMKP Using dynamic programming to solve the NP- complete problem Finding the minimal runtime of every tasks and slots – Data location, computation ability, network bandwidth…etc, will effect the total runtime 44

Example of Scheduling Job 2 45

Dynamic Programming for Dispatching to Candidate Slots (cont.) Dynamic programming will make the decision with minimal execution time of all The less execution time we take, the more task we can serve on the same private cloud resources with same operation cost More on private, less on charging public 46

4.4 Dispatch Selection from Slot Queue When transit overloading or strict deadline – Private slots can not handle in QoS demand Need to dispatch into charging public slots Examining the possibility of task in queue with dispatching into public slots – Data transmission time 47

Example of Job 3 Arrive 48

Example of Examining Dtt in Queue 49

4.5 Dynamic Programming for Minimal Cost on Public Slot Trying to minimize the cost in renting public resource slots Cost function with knapsack problem can be solve by dynamic programming Find out the minimal cost and reach the QoS deadline 50

Example of Minimum Cost Selection 51

Algorithm of [Execution Time Estimation with Task on Different Slots] & [Dynamic Programming for Dispatching to Candidate Slots] 52

Algorithm of [Dispatch Selection from Slot Queue] 53

Algorithm of [Dynamic Programming for Minimal Cost on Public Slot] 54

1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 55

5. Experiments and Discussion CloudSim Support for modeling and simulation with customizable policies for resources scheduler on Cloud computing 56 Slot experiment setup image size5 GB RAM512 MB BW1,000 CPU Number1 computing ability[10, 50] MIPS Task experiment setup file size[200, 400] MB output size[20, 40] MB code size[400, 1000] MI

5.1 Measurement of AsQ, FIFO and Fair – Latency measurement – QoS satisfaction rate measurement – Cost analysis 5.2 Measurement of AsQ and COSHIC – Cost analysis – Latency measurement – QSR spending time measurement – Normalized Violated Quality Value measurement 57

5.1 Latency Measurement Latency measurement – No deadline limit – Private resource only – 5, 10, 20, 50 tasks – Waiting time – Execution time – Finish time 58

Task Waiting Time Measurement 59

Task Execution Time Measurement 60

Task Finish Time Measurement 61

Measurement of AsQ, FIFO and Fair (cont.) QoS satisfaction rate measurement – Percentage of complete in time tasks of all – QSR = k/n n=total task number, k=task number which response before deadline, 0 ≦ k ≦ n – 20, 50, 70 tasks – Private slots only – Deadline: loose → strict 62

QSR Measurement (20 tasks) 63

QSR Measurement (50 tasks) 64

QSR Measurement (70 tasks) 65

Measurement of AsQ, FIFO and Fair (cont.) QSR – Cost measurement – 50, 70 tasks – Using public slots – Paying more for higher QSR 66 Public Cloud Slots Computing ability (MIPS) Computing price ($/MI) Storage price ($/MB) 100.1[0.01, 0.05] 200.2[0.01, 0.05] 500.5[0.01, 0.05]

QSR – Cost Measurement (50 tasks) 67

QSR – Cost Measurement (70 tasks) 68

Measurement of AsQ, FIFO and Fair (cont.) Cost analysis – 20, 50, 70 tasks – Deadline: loose → strict 69

Cost Analysis (20 tasks) 70

Cost Analysis (50 tasks) 71

Cost Analysis (70 tasks) 72

5.2 Measurement of AsQ and COSHIC Compare with “Cost-optimal Scheduling in Hybrid IaaS Clouds” – Linear programming formulation – Assume that applications are CPU and network intensive – Scheduling applications in the public cloud, in terms of cost minimization 73

Cost Analysis 74 Cost 22.7% as AsQ

Task Execution Time Measurement 75 Time spend 10.7%

Task Finish Time Measurement 76 Time spend 16.8%

QSR Spending Time Measurement 77 COSHIC spend 4.9 times than AsQ

NVQV Measurement Normalized Violated Quality Value For normalized the performance between execution time and cost value 78

NVQV Measurement (cont.) 79

Comparison with other Scheduling Algorithm 80

1.Introduction 2.Related Work 3.Problem Definition and Formulation 4.Adaptive Scheduling Algorithm with QoS Satisfaction 5.Experiments and Discussion 6.Conclusions and Future Work 81

6. Conclusions and Future Work We propose Adaptive Scheduling Algorithm with QoS Satisfaction Satisfy user QoS demand Near optimal resource allocation – Better resource utilization Lower cost spend for service provider 82

Finding suitable workload on private cloud with better tradeoff between operation cost and computing efficiency Reliability Implement on a real cloud environment 83

End 84