Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center NSDI 11’ Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D.

Slides:



Advertisements
Similar presentations
MapReduce: Simplified Data Processing on Large Cluster Jeffrey Dean and Sanjay Ghemawat OSDI 2004 Presented by Long Kai and Philbert Lin.
Advertisements

The Datacenter Needs an Operating System Matei Zaharia, Benjamin Hindman, Andy Konwinski, Ali Ghodsi, Anthony Joseph, Randy Katz, Scott Shenker, Ion Stoica.
UC Berkeley a Spark in the cloud iterative and interactive cluster computing Matei Zaharia, Mosharaf Chowdhury, Michael Franklin, Scott Shenker, Ion Stoica.
SLA-Oriented Resource Provisioning for Cloud Computing
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
UC Berkeley Job Scheduling for MapReduce Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Scott Shenker, Ion Stoica 1 RAD Lab, * Facebook Inc.
No Agent Left Behind: Dynamic Fair Division of Multiple Resources Ian Kash 1 Ariel Procaccia 2 Nisarg Shah 2 (Speaker) 1 MSR Cambridge 2 Carnegie Mellon.
THE DATACENTER NEEDS AN OPERATING SYSTEM MATEI ZAHARIA, BENJAMIN HINDMAN, ANDY KONWINSKI, ALI GHODSI, ANTHONY JOSEPH, RANDY KATZ, SCOTT SHENKER, ION STOICA.
Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael Franklin, Scott Shenker, Ion Stoica Spark Fast, Interactive,
Hadoop YARN in the Cloud Junping Du Staff Engineer, VMware China Hadoop Summit, 2013.
Resource Management with YARN: YARN Past, Present and Future
The "Almost Perfect" Paper -Christopher Francis Hyma Chilukiri.
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds School of Computer Engineering Nanyang Technological University,
Why static is bad! Hadoop Pregel MPI Shared cluster Today: static partitioningWant dynamic sharing.
Making Sense of Spark Performance
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Mesos A Platform for Fine-Grained Resource Sharing in Data Centers Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Delay.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Cluster Scheduler Reference: Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center NSDI’2011 Multi-agent Cluster Scheduling for Scalability.
Outline | Motivation| Design | Results| Status| Future
A Platform for Fine-Grained Resource Sharing in the Data Center
Benjamin Hindman Apache Mesos Design Decisions
1 CS : Project Suggestions Ion Stoica ( September 14, 2011.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
Tachyon: memory-speed data sharing Haoyuan (HY) Li, Ali Ghodsi, Matei Zaharia, Scott Shenker, Ion Stoica Good morning everyone. My name is Haoyuan,
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
The Limitation of MapReduce: A Probing Case and a Lightweight Solution Zhiqiang Ma Lin Gu Department of Computer Science and Engineering The Hong Kong.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Fair.
Resilient Distributed Datasets: A Fault- Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Matei Zaharia, in collaboration with Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Haoyuan Li, Justin Ma, Murphy McCauley, Joshua Rosen, Reynold Xin,
A Platform for Fine-Grained Resource Sharing in the Data Center
Presented by Qifan Pu With many slides from Ali’s NSDI talk Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion Stoica.
Apache Mesos What is it ? Beyond Hadoop Resource Sharing Mesos Intentions Architecture Users
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
Part III BigData Analysis Tools (YARN) Yuan Xue
PACMan: Coordinated Memory Caching for Parallel Jobs Ganesh Ananthanarayanan, Ali Ghodsi, Andrew Wang, Dhruba Borthakur, Srikanth Kandula, Scott Shenker,
Omega: flexible, scalable schedulers for large compute clusters
Resilient Distributed Datasets A Fault-Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
COS 518: Advanced Computer Systems Lecture 13 Michael Freedman
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Mesos and Borg (Lecture 17, cs262a)
Scalable containers with Apache Mesos and DC/OS
Hierarchical Scheduling for Diverse Datacenter Workloads
COS 418: Distributed Systems Lecture 23 Michael Freedman
Yarn.
Introduction to Distributed Platforms
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
Lottery Scheduling and Dominant Resource Fairness (Lecture 24, cs262a)
Managing Data Transfer in Computer Clusters with Orchestra
CS 425 / ECE 428 Distributed Systems Fall 2016 Nov 10, 2016
Apache Hadoop YARN: Yet Another Resource Manager
CS 425 / ECE 428 Distributed Systems Fall 2017 Nov 16, 2017
PA an Coordinated Memory Caching for Parallel Jobs
INDIGO – DataCloud PaaS
Meng Cao, Xiangqing Sun, Ziyue Chen May 28th, 2014
Sajitha Naduvil-vadukootu
EECS 582 Final Review Mosharaf Chowdhury EECS 582 – F16.
湖南大学-信息科学与工程学院-计算机与科学系
Omega: flexible, scalable schedulers for large compute clusters
COS 518: Advanced Computer Systems Lecture 14 Michael Freedman
Introduction Apache Mesos is a type of open source software that is used to manage the computer clusters. This type of software has been developed by the.
Cloud Computing Large-scale Resource Management
Cloud Computing MapReduce in Heterogeneous Environments
Presentation transcript:

Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center NSDI 11’ Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy Katz, Scott Shenker, Ion Stoica Presented by Youngmoon Lee EECS 582 – W1613/14/16

Agenda 1.Introduction 2.Problem statement 3.Design 4.Results 5.Discussion 6.Real world Mesos 3/14/16EECS 582 – W162

A cluster manager that provides resource sharing and isolation across cluster frameworks, Hadoop, Spark, MPI. Datacenter OS, even shorter 3/14/16EECS 582 – W163

Background [2009] A Berkeley view of Cloud Computing [2009] Nexus: Common substrate for Cloud [2011] Mesos: Fine-grained resource sharing [2011] DRF: Dominant Resource Fairness [2013] YARN [2013] Omega [2015] Borg 3/14/16EECS 582 – W164

Introduction Want to run different frameworks in a single cluster Static partitioning: No sharing 3/14/16EECS 582 – W165

Problem Required resource is different for frameworks: under-utilization 3/14/16EECS 582 – W166 time Static assignment

Problem Required resource is different for frameworks: under-utilization 3/14/16EECS 582 – W167 time

Solution 3/14/16EECS 582 – W168

Solution 3/14/16EECS 582 – W169

Objective Dynamic sharing and management of resources Utilization( and scalability 3/14/16EECS 582 – W1610

Design Micro-kernel pushes scheduling logic to frameworks “Minimal resource multiplexer (two-level)” 3/14/16EECS 582 – W1611 Utilization! Scalability! Task-level Fine-grained sharing Resource offers “Let framework pick” Performance Isolation? Containers Fairness? Max-min fairness[DRF} Revoke resource Objectives Concern Implementation Design

Resource offer 3/14/16EECS 582 – W1612 Free!

Resource offer 3/14/16EECS 582 – W1613 Suit yourself

Resource offer 3/14/16EECS 582 – W1614

Resource offer 3/14/16EECS 582 – W1615

Resource offer 3/14/16EECS 582 – W1616 Suit yourself 5 So harmonious, yet…

Hypothesis Tasks are short-lived returning resources frequently All long running, no resource return, No sharing? Job sizes are small compared to the size of cluster 3/14/16EECS 582 – W1617 Most of time 0 CPU remains

Results 3/14/16EECS 582 – W1618 vs Static resource sharing

CPU/Memory Utilization CPU 10%, Memory 17% Improves 3/14/16EECS 582 – W1619

Resource offer Hadoop finds better data-locality with resource offers 3/14/16EECS 582 – W1620

Scalability Simple inter-framework scheduling micro-kernel 3/14/16EECS 582 – W1621 Note: 10 s task 10% 8% 6% 4% 2% 0% Number of Nodes

Discussion Max-min fairness allocation? Framework Starvation? Malicious Framework? Lottery/stride scheduling? Omega? 3/14/16EECS 582 – W CPU CPU

Discussion MPI becomes slower, how to handle resource contention? MPI interdependency affects performance? 3/14/16EECS 582 – W s ∆=112 s

Discussion Overhead of 8% for 50K nodes? For 10% utilization gain? 10 s tasks takes 11 s start time increases with resource requirements? 3/14/16EECS 582 – W1624 Note: 10 s task 10% 8% 6% 4% 2% 0%

3/14/16EECS 582 – W1625 enterprise Long running service Fault tolerant Cron-like system PaaS

3/14/16EECS 582 – W1626 Enterprise Consulting Long running service Fault tolerant Cron-like system PaaS

3/14/16EECS 582 – W1627 “Datacentre OS”

Original motivation Originally, Mesos built to run different version of Hadoop If it’s useful, also can be useful for many things 3/14/16EECS 582 – W1628 v1 v2 v3

Thank you 3/14/16EECS 582 – W1629

3/14/16EECS 582 – W1630

Resource offers 3/14/16EECS 582 – W1631

Resource offers 3/14/16EECS 582 – W1632

Resource offers 3/14/16EECS 582 – W1633