The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.

Slides:



Advertisements
Similar presentations
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
Advertisements

Virtual Network Diagnosis as a Service Wenfei Wu (UW-Madison) Guohui Wang (Facebook) Aditya Akella (UW-Madison) Anees Shaikh (IBM System Networking)
Towards Predictable Datacenter Networks
SDN + Storage.
LIBRA: Lightweight Data Skew Mitigation in MapReduce
SLA-Oriented Resource Provisioning for Cloud Computing
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Low-Cost Data Deduplication for Virtual Machine Backup in Cloud Storage Wei Zhang, Tao Yang, Gautham Narayanasamy University of California at Santa Barbara.
Scalable and Crash-Tolerant Load Balancing based on Switch Migration
60 GHz Flyways: Adding multi-Gbps wireless links to data centers
Reconfigurable Network Topologies at Rack Scale
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
End-to-End Analysis of Distributed Video-on-Demand Systems Padmavathi Mundur, Robert Simon, and Arun K. Sood IEEE Transactions on Multimedia, February.
ENFORCING PERFORMANCE ISOLATION ACROSS VIRTUAL MACHINES IN XEN Diwaker Gupta, Ludmila Cherkasova, Rob Gardner, Amin Vahdat Middleware '06 Proceedings of.
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Resource Virtualization Winter Quarter Project.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Peter Dinda Department of Computer Science Northwestern University.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
New Challenges in Cloud Datacenter Monitoring and Management
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
Practical TDMA for Datacenter Ethernet
Edge Based Cloud Computing as a Feasible Network Paradigm(1/27) Edge-Based Cloud Computing as a Feasible Network Paradigm Joe Elizondo and Sam Palmer.
Network Sharing Issues Lecture 15 Aditya Akella. Is this the biggest problem in cloud resource allocation? Why? Why not? How does the problem differ wrt.
Sharing the Data Center Network Alan Shieh, Srikanth Kandula, Albert Greenberg, Changhoon Kim, Bikas Saha Microsoft Research, Cornell University, Windows.
Network Aware Resource Allocation in Distributed Clouds.
David G. Andersen CMU Guohui Wang, T. S. Eugene Ng Rice Michael Kaminsky, Dina Papagiannaki, Michael A. Kozuch, Michael Ryan Intel Labs Pittsburgh 1 c-Through:
Improving Network I/O Virtualization for Cloud Computing.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.
Kevin Ross, UCSC, September Service Network Engineering Resource Allocation and Optimization Kevin Ross Information Systems & Technology Management.
C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection Marco Canini (UCL) with Lalith Suresh, Stefan Schmid, Anja Feldmann (TU Berlin)
CloudNaaS: A Cloud Networking Platform for Enterprise Applications Theophilus Benson*, Aditya Akella*, Anees Shaikh +, Sambit Sahu + (*University of Wisconsin,
Dynamic Resource Monitoring and Allocation in a virtualized environment.
Eneryg Efficiency for MapReduce Workloads: An Indepth Study Boliang Feng Renmin University of China Dec 19.
Quantifying and Improving I/O Predictability in Virtualized Systems Cheng Li, Inigo Goiri, Abhishek Bhattacharjee, Ricardo Bianchini, Thu D. Nguyen 1.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
MC 2 : Map Concurrency Characterization for MapReduce on the Cloud Mohammad Hammoud and Majd Sakr 1.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
A dynamic optimization model for power and performance management of virtualized clusters Vinicius Petrucci, Orlando Loques Univ. Federal Fluminense Niteroi,
Optimizing Live Migration of Virtual Machines across Wide Area Networks using Integrated Replication and Scheduling Sumit Kumar Bose, Unisys Scott Brock,
1 Making MapReduce Scheduling Effective in Erasure-Coded Storage Clusters Runhui Li and Patrick P. C. Lee The Chinese University of Hong Kong LANMAN’15.
1 Enabling Efficient and Reliable Transitions from Replication to Erasure Coding for Clustered File Systems Runhui Li, Yuchong Hu, Patrick P. C. Lee The.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Network-Aware Scheduling for Data-Parallel Jobs: Plan When You Can
Performance Analysis of Preemption-aware Scheduling in Multi-Cluster Grid Environments Mohsen Amini Salehi, Bahman Javadi, Rajkumar Buyya Cloud Computing.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Multi-Resource Packing for Cluster Schedulers Robert Grandl Aditya Akella Srikanth Kandula Ganesh Ananthanarayanan Sriram Rao.
Optimizing Live Migration of Virtual Machines across Wide Area Networks using Integrated Replication and Scheduling Sumit Kumar Bose, Unisys Scott Brock,
Full and Para Virtualization
Feifei Chen Swinburne University of Technology Melbourne, Australia
Technical Reading Report Virtual Power: Coordinated Power Management in Virtualized Enterprise Environment Paper by: Ripal Nathuji & Karsten Schwan from.
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
Towards Predictable Data Centers Why Johnny can’t use the cloud and what we can do about it? Hitesh Ballani, Paolo Costa, Thomas Karagiannis, Greg O’Shea.
Slide 1/20 "PerfSight: Performance Diagnosis for Software Dataplanes." Wu, Wenfei, Keqiang He, and Aditya Akella ACM ICM, Presented by: Ayush Patwari.
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Yiting Xia, T. S. Eugene Ng Rice University
Chris Cai, Shayan Saeed, Indranil Gupta, Roy Campbell, Franck Le
Cloud-Assisted VR.
Abstract Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for.
Cloud-Assisted VR.
Performance Evaluation of Adaptive MPI
VDN: Virtual Machine Image Distribution Network for Cloud Data Centers
Multi-hop Coflow Routing and Scheduling in Data Centers
Cloud Computing MapReduce in Heterogeneous Environments
Towards Predictable Datacenter Networks
Presentation transcript:

The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1

Cloud Computing is Hot 2 Private Cluster

Key Factors for Cloud Viability Cost Performance 3

Performance Variability in Cloud BW variation in cloud due to contention [Schad’10 VLDB] Causing unpredictable performance 4

Network performance variability Tenant Enterprise Map Reduce Job Results Data analytics on an isolated cluster Completion Time 4 hours

Network performance variability Tenant Enterprise Map Reduce Job Results Data analytics on an isolated cluster Completion Time 4 hours Data analytics in a multi-tenant datacenter Tenant Map Reduce Job Results Datacenter Completion Time hours

Network performance variability Tenant Enterprise Map Reduce Job Results  Data analytics on an isolated cluster Completion Time 4 hours  Data analytics in a multi-tenant datacenter Tenant Map Reduce Job Results Datacenter Completion Time hours Variable tenant costs Expected cost (based on 4 hour completion time) = $100 Actual cost = $

Network performance variability Tenant Enterprise Map Reduce Job Results Data analytics on an isolated cluster Completion Time 4 hours Data analytics in a multi-tenant datacenter Tenant Map Reduce Job Results Datacenter Completion Time hours Variable tenant costs Expected cost (based on 4 hour completion time) = $100 Actual cost = $ Unpredictability of application performance and tenant costs is a key hindrance to cloud adoption Key Contributor: Network performance variation

Reserving BW in Data Centers SecondNet [Guo’10] – Per VM-pair, per VM access bandwidth reservation Oktopus [Ballani’11] – Virtual Cluster (VC) – Virtual Oversubscribed Cluster (VOC) 9

How BW Reservation Works Virtual Cluster Model Time Bandwidth N VMs Virtual Switch 1. Determine the model 2. Allocate and enforce the model 0T B Only fixed-BW reservation Request

Network Usage for MapReduce Jobs Hadoop Sort, 4GB per VM 11

Network Usage for MapReduce Jobs Hadoop Sort, 4GB per VM Hadoop Word Count, 2GB per VM 12

Network Usage for MapReduce Jobs Hadoop Sort, 4GB per VM Hadoop Word Count, 2GB per VM Hive Join, 6GB per VM 13

Network Usage for MapReduce Jobs Hadoop Sort, 4GB per VM Hadoop Word Count, 2GB per VM Hive Join, 6GB per VM Hive Aggregation, 2GB per VM 14

Network Usage for MapReduce Jobs Hadoop Sort, 4GB per VM Hadoop Word Count, 2GB per VM Hive Join, 6GB per VM Hive Aggregation, 2GB per VM 15 Time-varying network usage

Motivating Example 4 machines, 2 VMs/machine, non-oversubscribed network Hadoop Sort – N: 4 VMs – B: 500Mbps/VM 1Gbps 500Mbps Not enough BW 16

Motivating Example 4 machines, 2 VMs/machine, non-oversubscribed network Hadoop Sort – N: 4 VMs – B: 500Mbps/VM 17 1Gbps 500Mbps

Under Fixed-BW Reservation Model 18 1Gbps 500Mbps Job3 Job2 Virtual Cluster Model Job1 Time Bandwidth

Under Time-Varying Reservation Model 19 1Gbps 500Mbps TIVC Model Job1 Time Job2 Job3 Job4 Job5 J1J2J3J4J5 Bandwidth Doubling VM, network utilization and the job throughput Hadoop Sort

Temporally-Interleaved Virtual Cluster (TIVC) Key idea: Time-Varying BW Reservations Compared to fixed-BW reservation – Improves utilization of data center Better network utilization Better VM utilization – Increases cloud provider’s revenue – Reduces cloud user’s cost – Without sacrificing job performance 20

Challenges in Realizing TIVC Virtual Cluster Model Time Bandwidth N VMs Virtual Switch 0T B Request Time Bandwidth 0T B Request Q1: What are right model functions? Q2: How to automatically derive the models? Q1: What are right model functions? Q2: How to automatically derive the models?

Challenges in Realizing TIVC 22 Q3: How to efficiently allocate TIVC? Q4: How to enforce TIVC? Q3: How to efficiently allocate TIVC? Q4: How to enforce TIVC?

Challenges in Realizing TIVC What are the right model functions? How to automatically derive the models? How to efficiently allocate TIVC? How to enforce TIVC? 23

How to Model Time-Varying BW? 24 Hadoop Hive Join

TIVC Models 25 Virtual Cluster T 11 T 32

Hadoop Sort 26

Hadoop Word Count 27 v

Hadoop Hive Join 28

Hadoop Hive Aggregation 29

Challenges in Realizing TIVC What are the right model functions? How to automatically derive the models? How to efficiently allocate TIVC? How to enforce TIVC? 30

Possible Approach “White-box” approach – Given source code and data of cloud application, analyze quantitative networking requirement – Very difficult in practice Observation: Many jobs are repeated many times – E.g., 40% jobs are recurring in Bing’s production data center [Agarwal’12] – Of course, data itself may change across runs, but size remains about the same 31

Our Approach Solution: “Black-box” profiling based approach 1.Collect traffic trace from profiling run 2.Derive TIVC model from traffic trace Profiling: Same configuration as production runs – Same number of VMs – Same input data size per VM – Same job/VM configuration 32 How much BW should we reserve to the application?

Impact of BW Capping 33 No-elongation BW threshold

Choosing BW Cap Tradeoff between performance and cost – Cap > threshold: same performance, costs more – Cap < threshold: lower performance, may cost less Our Approach: Expose tradeoff to user 1.Profile under different BW caps 2.Expose run times and cost to user 3.User picks the appropriate BW cap 34 Only below threshold ones

From Profiling to Model Generation Collect traffic trace from each VM – Instantaneous throughput of 10ms bin Generate models for individual VMs Combine to obtain overall job’s TIVC model – Simplify allocation by working with one model – Does not lose efficiency since per-VM models are roughly similar for MapReduce-like applications 35

Generate Model for Individual VM 1.Choose B b 2.Periods where B > B b, set to B cap 36 BW Time B cap BbBb

Challenges in Realizing TIVC What are the right model functions? How to automatically derive the models? How to efficiently allocate TIVC? How to enforce TIVC? 37

TIVC Allocation Algorithm Spatio-temporal allocation algorithm – Extends VC allocation algorithm to time dimension – Employs dynamic programming – Chooses lowest level subtree Properties – Locality aware – Efficient and scalable 99 th percentile 28ms on a 64,000-VM data center in scheduling 5,000 jobs 38

Challenges in Realizing TIVC What are the right model functions? How to automatically derive the models? How to efficiently allocate TIVC? How to enforce TIVC? 39

Enforcing TIVC Reservation Possible to enforce completely in hypervisor – Does not have control over upper level links – Requires online rate monitoring and feedback – Increases hypervisor overhead and complexity Enforcing BW reservation in switches – Most small jobs will fit into a rack – Only a few large jobs cross the core – Avoid complexity in hypervisors 40

Challenges in Realizing TIVC What are the right model functions? How to automatically derive the models? How to efficiently allocate TIVC? How to enforce TIVC? 41

Proteus: Implementing TIVC Models Determine the model 2. Allocate and enforce the model

Evaluation Large-scale simulation – Performance – Cost – Allocation algorithm Prototype implementation – Small-scale testbed 43

Simulation Setup 3-level tree topology – 16,000 Hosts x 4 VMs – 4:1 oversubscription Workload – N: exponential distribution around mean 49 – B(t): derive from real Hadoop apps 44 50Gbps 10Gbps … … … 1Gbps … 20 Aggr Switch 20 ToR Switch 40 Hosts ………

Batched Jobs Scenario: 5,000 time-insensitive jobs 45 42%21%23%35% 1/3 of each type Completion time reduction All rest results are for mixed

Varying Oversubscription and Job Size % reduction for non-oversubscribed network

Dynamically Arriving Jobs Scenario: Accommodate users’ requests in shared data center – 5,000 jobs arrives dynamically with varying loads 47 Rejected: VC: 9.5% TIVC: 3.4% Rejected: VC: 9.5% TIVC: 3.4%

Analysis: Higher Concurrency Under 80% load 48 7% higher job concurrency 28% higher VM utilization Rejected jobs are large 28% higher revenue Charge VMs VM

Testbed Experiment Setup – 18 machines Real 30 MapReduce jobs – 10 Sort – 10 Hive Join – 10 Hive Aggre. 49

Testbed Result 50 TIVC finishes job faster than VC, Baseline finishes the fastest TIVC finishes job faster than VC, Baseline finishes the fastest Baseline suffers at variability of completion time, TIVC achieves similar performance as VC

Conclusion Network reservations in cloud are important – Previous work proposed fixed-BW reservations – However, cloud apps exhibit time-varying BW usage They propose TIVC abstraction – Provides time-varying network reservations – Uses simple pulse functions – Automatically generates model – Efficiently allocates and enforces reservations Proteus shows TIVC benefits both cloud provider and users significantly 51