Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu

Slides:

Advertisements

Similar presentations

Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.

Advertisements

Clustering Categorical Data The Case of Quran Verses

SLA-Oriented Resource Provisioning for Cloud Computing

Hongliang Li, Senior Member, IEEE, Linfeng Xu, Member, IEEE, and Guanghui Liu Face Hallucination via Similarity Constraints.

Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu Toward Optimal Deployment of Communication-Intensive Cloud Applications 1.

An Energy Efficient Hierarchical Heterogeneous Wireless Sensor Network

© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.

1 Abstract This paper presents a novel modification to the classical Competitive Learning (CL) by adding a dynamic branching mechanism to neural networks.

An Authentication Service Against Dishonest Users in Mobile Ad Hoc Networks Edith Ngai, Michael R. Lyu, and Roland T. Chin IEEE Aerospace Conference, Big.

1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman

Grid Load Balancing Scheduling Algorithm Based on Statistics Thinking The 9th International Conference for Young Computer Scientists Bin Lu, Hongbin Zhang.

MULTICOMPUTER 1. MULTICOMPUTER, YANG DIPELAJARI Multiprocessors vs multicomputers Interconnection topologies Switching schemes Communication with messages.

A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.

MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.

1 IEEE Trans. on Smart Grid, 3(1), pp , Optimal Power Allocation Under Communication Network Externalities --M.G. Kallitsis, G. Michailidis.

Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.

Presented by Tienwei Tsai July, 2005

2015/10/1 A color-theory-based energy efficient routing algorithm for mobile wireless sensor networks Tai-Jung Chang, Kuochen Wang, Yi-Ling Hsieh Department.

Y. Kotani · F. Ino · K. Hagihara Springer Science + Business Media B.V Reporter: 李長霖.

BFTCloud: A Byzantine Fault Tolerance Framework for Voluntary-Resource Cloud Computing Yilei Zhang, Zibin Zheng, and Michael R. Lyu

« Performance of Compressed Inverted List Caching in Search Engines » Proceedings of the International World Wide Web Conference Commitee, Beijing 2008)

Online Learning for Collaborative Filtering

The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Chapter 8-2 : Multicomputers Multiprocessors vs multicomputers Multiprocessors vs multicomputers Interconnection topologies Interconnection topologies.

Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.

Probabilistic Coverage in Wireless Sensor Networks Authors : Nadeem Ahmed, Salil S. Kanhere, Sanjay Jha Presenter : Hyeon, Seung-Il.

Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.

Computing Scientometrics in Large-Scale Academic Search Engines with MapReduce Leonidas Akritidis Panayiotis Bozanis Department of Computer & Communication.

Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.

A User Experience-based Cloud Service Redeployment Mechanism KANG Yu Yu Kang, Yangfan Zhou, Zibin Zheng, and Michael R. Lyu {ykang,yfzhou,

WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.

WS-DREAM: A Distributed Reliability Assessment Mechanism for Web Services Zibin Zheng, Michael R. Lyu Department of Computer Science & Engineering The.

Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉教授 : 許毅然作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.

IEEE CLOUD’2012 Topology-Aware Deployment of Scientific Applications in Cloud Computing Pei Fan 1, Zhenbang Chen 1, Ji Wang 1, Zibin Zheng 2, Michael R.

A Clustering-based QoS Prediction Approach for Web Service Recommendation Shenzhen, China April 12, 2012 Jieming Zhu, Yu Kang, Zibin Zheng and Michael.

Energy-Efficient Randomized Switching for Maximizing Lifetime in Tree- Based Wireless Sensor Networks Sk Kajal Arefin Imon, Adnan Khan, Mario Di Francesco,

A Bandwidth Scheduling Algorithm Based on Minimum Interference Traffic in Mesh Mode Xu-Yajing, Li-ZhiTao, Zhong-XiuFang and Xu-HuiMin International Conference.

Reputation-aware QoS Value Prediction of Web Services Weiwei Qiu, Zhejiang University Zibin Zheng, The Chinese University of HongKong Xinyu Wang, Zhejiang.

BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.

A Collaborative Quality Ranking Framework for Cloud Components

Presented by Edith Ngai MPhil Term 3 Presentation

2010 IEEE Global Telecommunications Conference (GLOBECOM 2010)

Talal H. Noor, Quan Z. Sheng, Lina Yao,

A Signal Processing Approach to Vibration Control and Analysis with Applications in Financial Modeling By Danny Kovach.

Golrezaei, N. ; Molisch, A.F. ; Dimakis, A.G.

CSCI5570 Large Scale Data Processing Systems

Discrete ABC Based on Similarity for GCP

WSRec: A Collaborative Filtering Based Web Service Recommender System

Edinburgh Napier University

Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting

Load flow solutions Chapter (3).

Parallel Density-based Hybrid Clustering

A Scalable Routing Architecture for Prefix Tries

Efficient Load Balancing Algorithm for Cloud

Nithin Michael, Yao Wang, G. Edward Suh and Ao Tang Cornell University

Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.

Multy- Objective Differential Evolution (MODE)

Yue Zhang, Nathan Vance, and Dong Wang

Understanding and Exploiting Amazon EC2 Spot Instances

Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.

Effective Social Network Quarantine with Minimal Isolation Costs

Effective Replica Allocation

Resource-Efficient and QoS-Aware Cluster Management

Korea University of Technology and Education

Exploring Latent Features for Memory-Based QoS Prediction in Cloud Computing Yilei Zhang 17/05/2011.

Presented By Riaz (STD ID: )

Data Communication: Routing algorithms

GhostLink: Latent Network Inference for Influence-aware Recommendation

Edinburgh Napier University

Presentation transcript:

Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu Toward Optimal Deployment of Communication-Intensive Cloud Applications Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu peifan@nudt.edu.cn

Content Introduction Related work System Architecture Cluster-Based Method Experiments Conclusion and Future Work

Introduction (1/4) Similar to traditional component-based systems , the cloud service provider need to select a number of cloud nodes to user for deploying a cloud application in a cloud. How to make optimal deployment of cloud applications is a challenging and urgent required research problem.

Introduction (2/4) There are two types of common cloud applications: computation- intensive and communication-intensive applications Computation-intensive: cloud nodes do not communicate with each other frequently (e.g., BONIC). Ranking-based can select a set of optimal cloud nodes for optimal deployment purpose. Communication-intensive: use ranking-based or other methods selecting optimal cloud nodes for communication-intensive application is not proper, since communication performance between cloud nodes needs to be considered.

Introduction (3/4) An example Assuming a user wants to deploy a MPI program on a cloud and needs to select two cloud nodes for this MPI application. As illustrated in Figure 1, there are totally four available cloud nodes in the cloud Figure 1. Cloud Node Ranking by Average Response Time If we rank these available node candidates via their average response time, then nodes A and D will be selected as the best performing nodes for the MPI application. However, the response time between A and D is 3 seconds, and not the optimal select.

Introduction (4/4) The contribution of this paper is two-fold: We identify the critical problem of selecting optimal cloud nodes for communication- intensive cloud applications and propose a clustering-based method to address this problem. Based on our method, optimal cloud nodes can be efficiently and effectively determined for communication-intensive cloud applications. Real-world experiments are conducted to compare our method with other methods. We deploy several well known MPI programs on a real-world cloud, PlanetLab The experimental results show the effectiveness of our proposed approach.

Related Work Based on the cloud node QoS performance, a number of selection and schedule strategies have been proposed in the recent literature. The major approaches can be divided into three types: Random appraoches (use random methods to select nodes) Ranking or rating approaches (cloud nodes is ranked by the order of QoS performance). Matching approaches (matching algorithms are employed to compare the users’requirements and the QoS values of cloud nodes).

System Architecture (1/2) The cloud node selection problem as shown in Figure 2: Figure 2. cloud nodes selection

System Architecture (2/2) The optimal cloud nodes selection framework as shown in Figure 3: Figure 3. cloud nodes selection Architecture Details of steps please see in paper.

Cluster-Based Method (1/8) Cluster-Based Method designed as a three-phase process: Step 1: Selecting initial centroids Step 2: Clustering Analysis Step 3: Selection Details of these phase are presented in the following.

Cluster-Based Method (2/8) Select Initial Centroids Choosing proper initial centroids is a key step of the cluster analysis procedure. Although it is easy to choose initial centroids randomly, the cluster results are often poor. In general, in a data space, data objects in lower density area are usually regarded as noise objects. As shown in Figure 3. p1, p2 and p3 are noise points. Figure 3. High and low-density areas

Cluster-Based Method (3/8) Select Initial Centroids In our approach, we use the response time between two nodes to represent the distance between them. Definition 1: The neighborhood of a cloud node p, denoted by N(p), is defined by N(p)={q∊D|dist(p,q)<DIST}, where DIST is a threshold of response time between two cloud nose. D is a set of existing cloud nodes. dist(pi, pj) denotes the distance between two cloud nodes. Definition 2: A cloud node p that in the high density ara should satisfy the following condition Num(N(p))>NUMBER Where NUMBER is a threshold of the number of neighborhood nodes.

Cluster-Based Method (4/8) Select Initial Centroids Let H={yi|1 ≤i ≤m} be the set of cloud node in the high-density areas. The initial centroids will be selected from H. The cloud node which has the largest number of neighbors is selected as the centroid z1, and should satisfy the following condition: Num(N(z1)) ≥ max{Num(N(yi))|yi ∈ H}. (1) We select second centroid z2 is the node that has the greatest distance from z1, and the third one z3 has satisfy the below condition: min{dist(z3, z1), dist(z3, z2)} =max{min{dist(yi, z1), dist(yi, z2)}|yi ∈ H}. Similarly, the kth centroid zk needs to satisfy: min{dist(zk, zi)|1 ≤ i < k} =max{min{dist(yj, zi)|1 ≤ i < k}|yj ∈ H}. （2）

Cluster-Based Method (5/8) Clustering analysis In our approach, we divide the cloud nodes into different clusters based on the response time between different nodes. The response times between nodes can be represented as an n by n matrix. Figure . 4 response time matrix We use pi to represent the vector of response times from node i to other nodes. i.e., pi = (xi1, xi2, . . . , xin).

Cluster-Based Method (6/8) Clustering analysis A cluster analysis algorithm is designed to divide the cloud nodes into K clusters, denoted by C = {C1, C2, . . . , CK} In our approach, we use the average distance between pi and all the cloud nodes of one cluster to represent the distance between a node and a cluster. The average distance calculate by Eq. (3) (3) where d is the number of cloud nodes in the k-th cluster Ck.

Algorithm 1: The algorithm of cluster analysis

Cluster-Based Method (8/8) Selection After clustering, these cloud nodes are assigned to different clusters. Therefore, we can select the cluster by the RTT of every cluster that user require. RTT calculated by: (4) After selecting clusters, we can rank the nodes in selected cluster by their performance. perf=λ×calc+(1-λ) ×comm (5)

Experiments (1/7) Experiment Setup We have deployed our experiments on PlanetLab. Our experimental environment consists of 100 distributed nodes which serve as cloud nodes. The schedule node and database server are also deployed on PlanetLab nodes. The parameter of experiment shown in Table 1. Cluster number 4 λ 0.5 DIST 100 ms NUMBER 25

Experiments (2/7) Experiments Setup In the experiments, we run different cloud node selection approaches for a MPI benchmark, called NASA NPB. To compare the performance of Cluster-based method against other schedule algorithm, we use the following metric via NPB: Makespan: The makespan of a job is defined as the duration between sending out a job and receiving a correct result Throughput: The throughput of a job is defined as the total million operations per second rate (Mop/s) rate over the number of processes.

Experiments (3/7) Performance Comparison To study the cluster-based method performance, we compare our method with the following four methods: Random: Random-based cloud nodes selection method. RankRes: Ranking cloud nodes with respect to the free memory and CPU time RankComm: Ranking cloud nodes with respect to communication performance. RankAll: ranking cloud nodes with respect to both the computing power and communication ability.

Experiments (4/7) Table 2 and Table 3 show the running of the different benchmarks. The numbers 8,16 indicate the numbers of the used cloud nodes Table 2. Comparison of makespan Table 3. Comparison of Throughput

Experiments (5/6) Compare of initial centroids selection method Table 4. Comparison of centroids selection method

Experiments (6/7) Impact of parameters 1) Impact of Class number: In this section, we will analyze the impact of different number of clusters by vary it from 2 to 10 with the step value 1. Figure .5 shown the results Figure 5. Makespan of different cluster number

Experiments (7/7) Impact of parameters 1) Impact of λ : We change the value of λ from 0 to 1 with a step value of 0.1. The results shown in Figure 6. Figure 6. Makespan of different λ values

Conclusion and Future Work In this paper, we propose a clustering-based cloud node selection approach for communication-intensive cloud applications. By taking advantage of the cluster analysis, our approach not only considers the QoS values of cloud nodes, but also considers the relationship (i.e., response time) between cloud nodes. Our future work include consider the topology structure of cloud node , the load balance for cloud nodes and fault tolerant cluster

Thanks Q&A