Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.

Slides:



Advertisements
Similar presentations
Lecture 12: MapReduce: Simplified Data Processing on Large Clusters Xiaowei Yang (Duke University)
Advertisements

MAP REDUCE PROGRAMMING Dr G Sudha Sadasivam. Map - reduce sort/merge based distributed processing Best for batch- oriented processing Sort/merge is primitive.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
LIBRA: Lightweight Data Skew Mitigation in MapReduce
SLA-Oriented Resource Provisioning for Cloud Computing
A Hadoop Overview. Outline Progress Report MapReduce Programming Hadoop Cluster Overview HBase Overview Q & A.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Center-of-Gravity Reduce Task Scheduling to Lower MapReduce Network Traffic Mohammad Hammoud, M. Suhail Rehman, and Majd F. Sakr 1.
Look Who’s Talking: Discovering Dependencies between Virtual Machines Using CPU Utilization HotCloud 10 Presented by Xin.
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds School of Computer Engineering Nanyang Technological University,
Quincy: Fair Scheduling for Distributed Computing Clusters Microsoft Research Silicon Valley SOSP’09 Presented at the Big Data Reading Group by Babu Pillai.
IMapReduce: A Distributed Computing Framework for Iterative Computation Yanfeng Zhang, Northeastern University, China Qixin Gao, Northeastern University,
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Mesos A Platform for Fine-Grained Resource Sharing in Data Centers Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Delay.
Homework 2 In the docs folder of your Berkeley DB, have a careful look at documentation on how to configure BDB in main memory. In the docs folder of your.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Improving MapReduce Performance Using Smart Speculative Execution Strategy Qi Chen, Cheng Liu, and Zhen Xiao Oct 2013 To appear in IEEE Transactions on.
L22: SC Report, Map Reduce November 23, Map Reduce What is MapReduce? Example computing environment How it works Fault Tolerance Debugging Performance.
MapReduce Simplified Data Processing On large Clusters Jeffery Dean and Sanjay Ghemawat.
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
MATE-EC2: A Middleware for Processing Data with Amazon Web Services Tekin Bicer David Chiu* and Gagan Agrawal Department of Compute Science and Engineering.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Department of Computer Science Engineering SRM University
A Dynamic MapReduce Scheduler for Heterogeneous Workloads Chao Tian, Haojie Zhou, Yongqiang He,Li Zha 簡報人:碩資工一甲 董耀文.
Click to add text TWA Cloud Integration with Tivoli Service Automation Manager TWS Education.
Map Reduce for data-intensive computing (Some of the content is adapted from the original authors’ talk at OSDI 04)
Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.
Improving Network I/O Virtualization for Cloud Computing.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Cloud Distributed Computing Platform 2 Content of this lecture is primarily from the book “Hadoop, The Definite Guide 2/e)
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
1 Time & Cost Sensitive Data-Intensive Computing on Hybrid Clouds Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
1 A Framework for Data-Intensive Computing with Cloud Bursting Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The Ohio.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
MapReduce How to painlessly process terabytes of data.
The Limitation of MapReduce: A Probing Case and a Lightweight Solution Zhiqiang Ma Lin Gu Department of Computer Science and Engineering The Hong Kong.
Benchmarking MapReduce-Style Parallel Computing Randal E. Bryant Carnegie Mellon University.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Synchronized Co-migration of Virtual Machines for IDS Offloading in Clouds Kenichi Kourai and Hisato Utsunomiya Kyushu Institute of Technology, Japan.
Using Map-reduce to Support MPMD Peng
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Shanjiang Tang, Bu-Sung Lee, Bingsheng He, Haikun Liu School of Computer Engineering Nanyang Technological University Long-Term Resource Fairness Towards.
Dynamic Slot Allocation Technique for MapReduce Clusters School of Computer Engineering Nanyang Technological University 25 th Sept 2013 Shanjiang Tang,
Virtualization and Databases Ashraf Aboulnaga University of Waterloo.
1/22 Optimization of Google Cloud Task Processing with Checkpoint-Restart Mechanism Speaker: Sheng Di Coauthors: Yves Robert, Frédéric Vivien, Derrick.
NTU Cloud 2010/05/30. System Diagram Architecture Gluster File System – Provide a distributed shared file system for migration NFS – A Prototype Image.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Ian Gable HEPiX Spring 2009, Umeå 1 VM CPU Benchmarking the HEPiX Way Manfred Alef, Ian Gable FZK Karlsruhe University of Victoria May 28, 2009.
PDAC-10 Middleware Solutions for Data- Intensive (Scientific) Computing on Clouds Gagan Agrawal Ohio State University (Joint Work with Tekin Bicer, David.
Using Map-reduce to Support MPMD Peng
Load Rebalancing for Distributed File Systems in Clouds.
Next Generation of Apache Hadoop MapReduce Owen
Presented by Qifan Pu With many slides from Ali’s NSDI talk Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion Stoica.
Part III BigData Analysis Tools (YARN) Yuan Xue
PACMan: Coordinated Memory Caching for Parallel Jobs Ganesh Ananthanarayanan, Ali Ghodsi, Andrew Wang, Dhruba Borthakur, Srikanth Kandula, Scott Shenker,
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
CS 425 / ECE 428 Distributed Systems Fall 2016 Nov 10, 2016
Abstract Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for.
PA an Coordinated Memory Caching for Parallel Jobs
湖南大学-信息科学与工程学院-计算机与科学系
Cloud Computing Large-scale Resource Management
Presentation transcript:

Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng

Virtual Clusters on Cloud  Private cluster on public cloud  Distributed computing platforms  MapReduce [OSDI ’04], Hadoop, Dryad [Eurosys ’07]  New York Times used 100 nodes on Amazon EC2 [OSDI ’08]  Each VM in a virtual cluster has static configuration Instance typesConfiguration Small1 virtual core, 1.7GB memory Large2 virtual cores, 7.5GB memory Extra Large4 virtual cores, 15GB memory master virtual cluster e.g. Amazon EC2 VM instance types vm

Resource Utilization Management  Physical cluster  Load balancing is the only mechanism for higher utilization  Virtual cluster  Dynamic resource management is also possible  With using resource hot-plug technique  Possible resource types: core and memory We focus on core hot-plugging in this work

Dynamic Resource Management physical cluster virtual cluster virtual machine Physical MachineVirtual Machine

Management by Whom?  Requirements 1. Current resource utilization monitoring 2. Platform-level information 3. Privileged permission to hot-plug resource 4. Support management for multiple users  Resource management as Platform-as-a-Service (PaaS) service  Provider offers platform with dynamic resource management for various users  e.g. Amazon Elastic MapReduce

MapReduce master job task Distributed File System disk slave

Data Locality on MapReduce  Disadvantages from low data locality 1. Network performance degradation because of network bottleneck 2. Under-utilization of computing resource disk task disk task disk task disk task Satisfy data locality Does not satisfy data locality slave

slave 2 slave 1 Hadoop Fair Scheduler  Hadoop  Open source implementation of MapReduce  Hadoop Fair Scheduler  Generally used scheduler  Guarantee fairness between submitted jobs on Hadoop master job A task 1 task 2 job B Job# Running tasks job A job B task 1 1

Main Idea  Approach  Move available resource to a node satisfying data locality and assign a task to the node Dynamic Resource Reconfiguration

slave 1 slave 2 Dynamic Resource Reconfiguration master task 2 job B 1. A node(source node) does not satisfy data locality 2. Master schedule to another node(target node) satisfying data locality 3. Reconfigure both source and target nodes target node task 1 source node job A task 1

 Resource hot-plugging  De-allocation  Giving up and giving back resource to provider  Always possible  Allocation  Taking new resource from provider  Not always possible  Two solutions  Synchronous DRR  Queue-based DRR Dynamic Resource Reconfiguration

Synchronous DRR  Headroom  Remained by provider  Idle and available resource on each physical machine  Shared by all VMs on a physical machine master source node headroom target node headroom physical machine slave

Queue-based DRR master AQ DQ AQ DQ 1. Reconfiguration from vm A to vm C 2. Reconfiguration from vm D to vm B vm A vm Cvm B vm D physical machine vm A vm B vm C vm D

Queue-based DRR master AQ DQ AQ DQ 1. Reconfiguration from vm A to vm C 2. Reconfiguration from vm D to vm B 3. Reconfigure (vm A, vm B) and (vm C, vm D) vm A vm Cvm B vm D physical machine vm B vm C vm D vm A

Synchronous vs. Queue-based  Synchronous DRR  No waiting time until reconfiguration  Synchronously executed allocation and deallocation  Overall resource under-utilization because of headroom  Queue-based DRR  Realistic and industry-applicable mechanism  Performance degradation if queuing delay is large

Evaluation  Environment  EC2 cluster: 100 VM instances  8 virtual cores, 7 GB memory (High-CPU Extra Large Instance)  Synchronous DRR only  Private cluster: 30 VMs on 6 physical machines  6 cores, 16GB memory  Synchronous DRR + Queue-based DRR  Workloads  Hive performance benchmark  grep, select, join, aggregation, inverted index  Job schedule  Randomly generated schedule based on the trace of the industry [Eurosys’10]

Large-scale Evaluation  Overall speedup : 15% Locality (%) Speed-up (Workloads, # of map tasks) Original HadoopSynchronous DRR (Workloads, # of map tasks)

Evaluation on the Private Cluster  Overall speedup  Synchronous DRR : 41%  Queue-based DRR : 35% Locality (%) Speed-up Original HadoopSynchronous DRR Queue-based DRR (Workloads, # of map tasks) Queue-based DRR

Conclusion  Propose a dynamic VM reconfiguration mechanism for distributed data-intensive platforms on virtualized cloud environment  Improve the input data locality of a virtual MapReduce cluster, by temporarily increasing cores to VMs to run local tasks, and it is called Dynamic Resource Reconfiguration (DRR)