Matchmaking: A New MapReduce Scheduling Technique

Slides:



Advertisements
Similar presentations
Simulation of Feedback Scheduling Dan Henriksson, Anton Cervin and Karl-Erik Årzén Department of Automatic Control.
Advertisements

MapReduce.
SDN + Storage.
LIBRA: Lightweight Data Skew Mitigation in MapReduce
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
SkewTune: Mitigating Skew in MapReduce Applications
SecureMR: A Service Integrity Assurance Framework for MapReduce Wei Wei, Juan Du, Ting Yu, Xiaohui Gu North Carolina State University, United States Annual.
UC Berkeley Job Scheduling for MapReduce Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Scott Shenker, Ion Stoica 1 RAD Lab, * Facebook Inc.
1 Degraded-First Scheduling for MapReduce in Erasure-Coded Storage Clusters Runhui Li, Patrick P. C. Lee, Yuchong Hu The Chinese University of Hong Kong.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Meeting Service Level Objectives of Pig Programs Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo University of Pennsylvania Hewlett-Packard.
UC Berkeley Job Scheduling with the Fair and Capacity Schedulers Matei Zaharia Wednesday, June 10, 2009 Santa Clara Marriott.
Towards Energy Efficient Hadoop Wednesday, June 10, 2009 Santa Clara Marriott Yanpei Chen, Laura Keys, Randy Katz RAD Lab, UC Berkeley.
Quincy: Fair Scheduling for Distributed Computing Clusters Microsoft Research Silicon Valley SOSP’09 Presented at the Big Data Reading Group by Babu Pillai.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Delay.
Towards Energy Efficient MapReduce Yanpei Chen, Laura Keys, Randy H. Katz University of California, Berkeley LoCal Retreat June 2009.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Distributed MapReduce Team B Presented by: Christian Bryan Matthew Dailey Greg Opperman Nate Piper Brett Ponsler Samuel Song Alex Ostapenko Keilin Bickar.
CPS216: Advanced Database Systems (Data-intensive Computing Systems) How MapReduce Works (in Hadoop) Shivnath Babu.
Improving MapReduce Performance Using Smart Speculative Execution Strategy Qi Chen, Cheng Liu, and Zhen Xiao Oct 2013 To appear in IEEE Transactions on.
End-to-End Delay Analysis for Fixed Priority Scheduling in WirelessHART Networks Abusayeed Saifullah, You Xu, Chenyang Lu, Yixin Chen.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Hadoop & Cheetah. Key words Cluster  data center – Lots of machines thousands Node  a server in a data center – Commodity device fails very easily Slot.
Investigation of Data Locality in MapReduce
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
A Dynamic MapReduce Scheduler for Heterogeneous Workloads Chao Tian, Haojie Zhou, Yongqiang He,Li Zha 簡報人:碩資工一甲 董耀文.
Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.
Presented By HaeJoon Lee Yanyan Shen, Beng Chin Ooi, Bogdan Marius Tudor National University of Singapore Wei Lu Renmin University Cang Chen Zhejiang University.
Storage in Big Data Systems
DisCo: Distributed Co-clustering with Map-Reduce S. Papadimitriou, J. Sun IBM T.J. Watson Research Center Speaker: 吳宏君 陳威遠 洪浩哲.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
Introduction to Hadoop Owen O’Malley Yahoo!, Grid Team
Eneryg Efficiency for MapReduce Workloads: An Indepth Study Boliang Feng Renmin University of China Dec 19.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
Batch Scheduling at LeSC with Sun Grid Engine David McBride Systems Programmer London e-Science Centre Department of Computing, Imperial College.
1 University of Maryland Linger-Longer: Fine-Grain Cycle Stealing in Networks of Workstations Kyung Dong Ryu © Copyright 2000, Kyung Dong Ryu, All Rights.
BALANCED DATA LAYOUT IN HADOOP CPS 216 Kyungmin (Jason) Lee Ke (Jessie) Xu Weiping Zhang.
GreenSched: An Energy-Aware Hadoop Workflow Scheduler
CARDIO: Cost-Aware Replication for Data-Intensive workflOws Presented by Chen He.
Hadoop System simulation with Mumak Fei Dong, Tianyu Feng, Hong Zhang Dec 8, 2010.
MC 2 : Map Concurrency Characterization for MapReduce on the Cloud Mohammad Hammoud and Majd Sakr 1.
Testing the dynamic per-query scheduling (with a FIFO queue) Jan Iwaszkiewicz.
1 Making MapReduce Scheduling Effective in Erasure-Coded Storage Clusters Runhui Li and Patrick P. C. Lee The Chinese University of Hong Kong LANMAN’15.
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
Using Map-reduce to Support MPMD Peng
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Fair.
Sparrow Distributed Low-Latency Spark Scheduling Kay Ousterhout, Patrick Wendell, Matei Zaharia, Ion Stoica.
1 Enabling Efficient and Reliable Transitions from Replication to Erasure Coding for Clustered File Systems Runhui Li, Yuchong Hu, Patrick P. C. Lee The.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Programming in Hadoop Guangda HU Huayang GUO
Dynamic Slot Allocation Technique for MapReduce Clusters School of Computer Engineering Nanyang Technological University 25 th Sept 2013 Shanjiang Tang,
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Massive Semantic Web data compression with MapReduce Jacopo Urbani, Jason Maassen, Henri Bal Vrije Universiteit, Amsterdam HPDC ( High Performance Distributed.
ApproxHadoop Bringing Approximations to MapReduce Frameworks
A Two-phase Execution Engine of Reduce Tasks In Hadoop MapReduce XiaohongZhang*GuoweiWang* ZijingYang*YangDing School of Computer Science and Technology.
Using Map-reduce to Support MPMD Peng
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Chris Cai, Shayan Saeed, Indranil Gupta, Roy Campbell, Franck Le
Tao Zhu1,2, Chengchun Shu1, Haiyan Yu1
Chapter 10 Data Analytics for IoT
Edinburgh Napier University
Hadoop Clusters Tess Fulkerson.
The Basics of Apache Hadoop
MapReduce: Data Distribution for Reduce
Admission Control and Request Scheduling in E-Commerce Web Sites
Presentation transcript:

Matchmaking: A New MapReduce Scheduling Technique Chen He Dr. Ying Lu Dr. David Swanson

Problem Statement MapReduce cluster scheduling algorithm becomes increasingly important Efficient MapReduce scheduler must avoid unnecessary data transmission We will focus on decreasing data transmission in a MapReduce cluster

Contributions Build a matchmaking algorithm to improve data locality of Hadoop MapReduce jobs MatchMaking algorithm lead to higher data locality rate and shorter map task response time We substitute Delay algorithm with MatchMaking algorithm in Fair-sharing scheduler and also obtain better performance

Outline Background Delay Algorithm MatchMaking algorithm Evaluation Conclusion Questions

Background Hadoop FIFO scheduler Scheduler searches local tasks in the first job and assign them If no local task in the first job, a non-local task of the first job will be assigned Strict FIFO job order is followed

Background Hadoop FIFO scheduler

Background Hadoop FIFO scheduler

Background Hadoop FIFO scheduler

Background Hadoop FIFO scheduler

Background Hadoop FIFO scheduler

Background Hadoop FIFO scheduler deficiencies On the node side, strict FIFO job order reduces data locality On the job side, FIFO can not provide a fair opportunity for each worker node

Delay Algorithm Driven by Facebook events log saved in their Hadoop data warehouse Hadoop default FIFO scheduler results in unnecessarily long job response time and lack of fairness in resource sharing Focus on two points: fair sharing and data locality

Delay Algorithm Workload* Bin #Maps %Jobs at Facebook #Maps in Benchmark # of jobs in Benchmark 1 39% 38 2 16% 16 3 3-20 14% 10 14 4 21-60 9% 50 8 5 61-150 6% 100 6 151-300 200 7 301-500 4% 400 501-1500 800 9 >1501 3% 4800 *Matei Zaharia et al “Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling”

Delay Algorithm Fairness: Data locality Task execution percentage between jobs groups users Data locality For Map stage, a map task is running on a node that contains its input data For Reduce stage?

Fairness VS. Data locality Delay Scheduling Fairness VS. Data locality

Delay Algorithm Fair-sharing principle-hierarchical principle

Delay Scheduling-including rack locality

Delay Algorithm Relax the strict job order Scheduler can search other jobs in the job queue to find a local task Maximum Delay Time (MDT) for a job to avoid starvation MDT is a user defined maximum time that the scheduler can delay a job from assigning its non-local map tasks

Delay Algorithm

Delay Algorithm

Delay Algorithm

Delay algorithm

Delay algorithm

Delay algorithm

Delay Algorithm Properties MDT decides data locality rate Rl is an increasing function of MDT but with a ceiling value “1” However, average response time

Delay Algorithm Deficiency To achieve best response time, we need to vary the MDT value different types of jobs different cluster sizes different job execution orders

Outline Background Delay Algorithm MatchMaking algorithm Evaluation Conclusion Questions

MatchMaking Algorithm Relax strict job order search all jobs in the queue for local tasks To give every node a fair chance to grab its local tasks when a node fails to find a local task for the first time in a row, no non-local task will be assigned to it when a node fails to find a local task for the second time in a row, a non-local task will be assigned to it A node can be assigned at most one non-local task in every heartbeat interval

MatchMaking Algorithm

MatchMaking Algorithm

MatchMaking Algorithm

MatchMaking Algorithm

MatchMaking Algorithm

MatchMaking Algorithm

Outline Background Delay Algorithm MatchMaking algorithm Evaluation Conclusion Questions

Evaluation Environment Test cases Metrics Hardware 1 head node with 2 AMD Optron 2.2GHz 64bit, 8GB Mem, 1Gbps Ethernet 30 worker nodes with same CPUs and network but 4GB Mem Software Hadoop 0.21 Redhat Linux CentOS 5.5 Test cases Loadgen Wordcount Metrics Locality Rate Average Response Time

Evaluation Hadoop Configuration HDFS MapReduce Block size is128MB 100 Blocks evenly distributed in 30 worker nodes Replication number is 2 MapReduce 2 map slots and 1 reduce slot for each worker node Facebook production workload* *Matei Zaharia et al “Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling”

Evaluation FIFO Scheduler Fair-sharing Scheduler Default locality policy Delay policy Matchmaking policy Fair-sharing Scheduler

Evaluation FIFO scheduler locality rate loadgen wordcount

Evaluation FIFO scheduler MTART loadgen wordcount

Evaluation Fair sharing scheduler locality rate

Evaluation Fair sharing scheduler response time

Conclusion We create MatchMaking algorithm to improve MapReduce scheduler’s data locality without tuning It obtains good performance in a middle size cluster with Facebook production workload It can be easily integrated with other scheduler like FIFO or Fair-sharing scheduler

Disscussion Data locality in the Reduce stage

Discussion Performance in a large cluster and uneven distributed environment Large cluster may have long hearbeat interval Large block size ResponseTime=QueuingTime+DataLoadingTime+DataProcessTime More replicas Data blocks may not be evenly distributed Hotspot

Discussion If the job queue is very long. Set a parameter MaxJobConsidered Priorities

Discussion Anything else?

Questions Back Page This picture is adopted from the Internet