How Useful is Old Information? On the Management and Efficiency of Cloud Based Services seminar MICHAEL MITZENMACHER November 2010 Computer Science faculty.

Slides:



Advertisements
Similar presentations
Data and Computer Communications
Advertisements

Hadi Goudarzi and Massoud Pedram
Markov Decision Process
Cloud Control with Distributed Rate Limiting Raghaven et all Presented by: Brian Card CS Fall Kinicki 1.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Gossip Algorithms and Implementing a Cluster/Grid Information service MsSys Course Amar Lior and Barak Amnon.
Good afternoon everyone.
Load Balancing of Elastic Traffic in Heterogeneous Wireless Networks Abdulfetah Khalid, Samuli Aalto and Pasi Lassila
Cuckoo Hashing : Hardware Implementations Adam Kirsch Michael Mitzenmacher.
Simulation Where real stuff starts. ToC 1.What, transience, stationarity 2.How, discrete event, recurrence 3.Accuracy of output 4.Monte Carlo 5.Random.
Routing Strategies Fixed Routing
Scaling Distributed Machine Learning with the BASED ON THE PAPER AND PRESENTATION: SCALING DISTRIBUTED MACHINE LEARNING WITH THE PARAMETER SERVER – GOOGLE,
CSCE Review—Fortran. CSCE Review—I/O Patterns: Read until a sentinel value is found Read n, then read n things Read until EOF encountered.
Decentralised Structural Reorganisation in Agent Organisations Ramachandra Kota.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
DAST, Spring © L. Joskowicz 1 Data Structures – LECTURE 1 Introduction Motivation: algorithms and abstract data types Easy problems, hard problems.
Multiple server queues In particular, we look at M/M/k Need to find steady state probabilities.
Source-Destination Routing Optimal Strategies Eric Chi EE228a, Fall 2002 Dept. of EECS, U.C. Berkeley.
Teknik Routing Pertemuan 20 Matakuliah: H0484/Jaringan Komputer Tahun: 2007.
Adaptive Routing Proshanto Mukherji CSC 457: Computer Networks University of Rochester.
Data Communication and Networks Lecture 7 Networks: Part 2 Routing Algorithms October 27, 2005.
Structure and Phase Transition Phenomena in the VTC Problem C. P. Gomes, H. Kautz, B. Selman R. Bejar, and I. Vetsikas IISI Cornell University University.
Jan 6-10th, 2007VLSI Design A Reduced Complexity Algorithm for Minimizing N-Detect Tests Kalyana R. Kantipudi Vishwani D. Agrawal Department of Electrical.
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Sections 8.1 – 8.5)
1 Random Walks in WSN 1.Efficient and Robust Query Processing in Dynamic Environments using Random Walk Techniques, Chen Avin, Carlos Brito, IPSN 2004.
UNIVERSITY OF JYVÄSKYLÄ Resource Discovery Using NeuroSearch Presentation for the Agora Center InBCT-seminar Mikko Vapa, researcher InBCT 3.2.
S. Suri, M, Waldvogel, P. Warkhede CS University of Washington Profile-Based Routing: A New Framework for MPLS Traffic Engineering.
DAST, Spring © L. Joskowicz 1 Data Structures – LECTURE 1 Introduction Motivation: algorithms and abstract data types Easy problems, hard problems.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Slide 1 Comparison of Inter-Area Rekeying Algorithms for Secure Mobile Group Communication C. Zhang*, B. DeCleene +, J. Kurose*, D. Towsley* * Dept. Computer.
Data Communications & Computer Networks
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
22/07/ The MDS Scaling Problem for Cloud Storage Yu-chong Hu Institute of Network Coding.
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007 (TPDS 2007)
Link-state routing  each node knows network topology and cost of each link  quasi-centralized: each router periodically broadcasts costs of attached.
CPU Cache Prefetching Timing Evaluations of Hardware Implementation Ravikiran Channagire & Ramandeep Buttar ECE7995 : Presentation.
Analysis of Ethernet-like protocols Andrey Lukyanenko University of Kuopio.
Capacity analysis of complex materials handling systems.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
Load Balancing and Termination Detection Load balance : - statically before the execution of any processes - dynamic during the execution of the processes.
Graph Algorithms. Definitions and Representation An undirected graph G is a pair (V,E), where V is a finite set of points called vertices and E is a finite.
Stochastic DAG Scheduling using Monte Carlo Approach Heterogeneous Computing Workshop (at IPDPS) 2012 Extended version: Elsevier JPDC (accepted July 2013,
11 Experimental and Analytical Evaluation of Available Bandwidth Estimation Tools Cesar D. Guerrero and Miguel A. Labrador Department of Computer Science.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Mobile Agent Migration Problem Yingyue Xu. Energy efficiency requirement of sensor networks Mobile agent computing paradigm Data fusion, distributed processing.
1 - CS7701 – Fall 2004 Review of: Detecting Network Intrusions via Sampling: A Game Theoretic Approach Paper by: – Murali Kodialam (Bell Labs) – T.V. Lakshman.
TELE202 Lecture 6 Routing in WAN 1 Lecturer Dr Z. Huang Overview ¥Last Lecture »Packet switching in Wide Area Networks »Source: chapter 10 ¥This Lecture.
Rassul Ayani 1 Performance of parallel and distributed systems  What is the purpose of measurement?  To evaluate a system (or an architecture)  To compare.
CSCI1600: Embedded and Real Time Software Lecture 19: Queuing Theory Steven Reiss, Fall 2015.
NETE4631: Network Information System Capacity Planning (2) Suronapee Phoomvuthisarn, Ph.D. /
William Stallings Data and Computer Communications 7th Edition
Teknik Routing Pertemuan 10 Matakuliah: H0524/Jaringan Komputer Tahun: 2009.
Static Process Scheduling
2/14/2016  A. Orda, A. Segall, 1 Queueing Networks M nodes external arrival rate (Poisson) service rate in each node (exponential) upon service completion.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Spring Routing: Part I Section 4.2 Outline Algorithms Scalability.
Generalized Point Based Value Iteration for Interactive POMDPs Prashant Doshi Dept. of Computer Science and AI Institute University of Georgia
VADD: Vehicle-Assisted Data Delivery in Vehicular Ad Hoc Networks Zhao, J.; Cao, G. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 鄭宇辰
Distributed Systems Lecture 6 Global states and snapshots 1.
William Stallings Data and Computer Communications
Confluent vs. Splittable Flows
Simulation.
Load Balancing and Data centers
Serve Assignment Policies
A Study of Group-Tree Matching in Large Scale Group Communications
Dynamic Graph Partitioning Algorithm
In-Class Exercise: The Poisson Distribution
CASE − Cognitive Agents for Social Environments
Load Balancing/Sharing/Scheduling Part II
Memento: Making Sliding Windows Efficient for Heavy Hitters
Presentation transcript:

How Useful is Old Information? On the Management and Efficiency of Cloud Based Services seminar MICHAEL MITZENMACHER November 2010 Computer Science faculty Technion Alexander Zlotnik IEEE Transactions on Parallel and Distributed Systems, 1997

Amazon Elastic Load Balancing Share Incoming Traffic Scale Up Scale Down Detect & shutdown unhealthy EC2 instances Balance single/multiple Availability ZonesAvailability Zones Reports $0.025 per hour [$18/month] + Traffic

Background What is this paper not – Real system – Scale up/down system – Groups/AZ balancing

Background Supermarket model & benefits 1.On incoming task, poll d queues for their load 2.Send the task to the shorter queue -For d = 2 the service time is exponentially shorter -Further d = 3, 4, … the improvement is linear

Background Load Balancing Types – Centralized/Distributed – Static/Dynamic(adaptive)

Setting n nodes Tasks Arrival: Poisson (λn) Task is forwarded to a node for execution Execution rate is distributed Exp(µ) – Normalized to µ=1 Every T time units a bulletin board is updated with the current loads of all nodes

Policy Look at d random entries on the board, send the task to the shortest queue d=1 – M/M/1 d=n – Shortest queue  Static

Periodic updates 0T2T3TTtTt NodePosted LoadTrue Load … n-122 True Load Posted Load

Definitions P i,j (t) – Fraction of queues with posted load i, but have true load j q i (t) - Rate of arrivals at a queue of size i NodePosted LoadTrue Load … n …J … I Posted Load True Load P i,j (t)

Between board updates 01234…J … I Posted Load True Load µ qiqi µ qiqi P i,j (t)

On board update 01234…J … I Posted Load True Load P i,j (t) 0 0 πiπi bibi

01234…J … I Posted Load True Load P i,j (t) bibi Σ b i (t) – Fraction of queues with load i posted

b i (t) – Fraction of queues with posted load i Arrival rate to a queue with posted load i: Tasks arrival Chance that d selected queues have load i or more Chance that a queue of size i is selected Same chance for all queues of size i

Fixed Cycle Hope: Convergence to a fixed point – State that there is no “motivation” to exit from – Periodic Updates – “Jump” on bulletin board updates, t=kT – Fixed cycle: b i do not change Or, for k≥k 0, P(kT) = P(k 0 T)

Fixed Cycle 01234…J … I Posted Load True Load P i,j (t) bibi Σ πiπi Σ If π((k-1)T)= π(kT), the next phase will be the same

Fixed Cycle – finding vector π Method A Run the system until changes in π are small Method B m i,j (T) – Probability for M/M/1 queue to start with i tasks and after time T to have j tasks Solving a system of equations: Method A on truncated system of differential equations

Fixed Cycle – finding vector π Method B m i,j (T) – Probability for M/M/1 queue to start with i tasks and after time T to have j tasks – Solving a system of equations: Iterating 012…J … I π – Iteration start = x π – Iteration end Until small changes in π

Fixed Cycle – finding vector π Method B m i,j (T) – Probability for M/M/1 queue to start with i tasks and after time T to have j tasks – Iterating a system of equations: Bessel function of the first kind

Fixed Cycle – finding vector π Method A on truncated system of differential equations – Bound I, J – Iterate on – Until small changes in π

Results

More complex Centralized Strategies Time based – Split T into subintervals – At subinterval [t k,t k+1 ) tasks sent to random server with load at most k

More complex Centralized Strategies Record-insert – Upon task arrival: Sent to random server of servers with lowest load Server’s load is incremented on bulleting board Task ending not updated (until end of phase) – Uses real loads, not expectation Always better than time-based

More complex Centralized Strategies

Continuous Update

Continuous Updates Tasks benefit on occasional accurate info – Supported by other graphs too Partial accurate info – not helpful – If not known which data is up to date

Individual Updates

Conclusions Load Balancing is useful even with stale information Choosing least loaded of 2 nodes is better than shortest or 3 or more More complex policies can further help load balancing (like time-based or record-insert)

Open Questions Additional metrics Different theoretical frameworks More realistic arrival patterns (heavy-tailed)

Personal review Powerful theoretical framework Interesting and relevant results Cons – Too few loads scenarios – No comparison between the amount of needed bandwidth to load balancing benefit

Relevant Information More references – Collection of works: “The power of Two Random Choices: A Survey of Techniques and Results” M. Mitzenmacher et al 2001 Not covered in this presentation: – Deviation of simulating differential equations instead of full system [Sections 3.4, 3.5] – Theory of some systems, but their results are – Competitive Scenarios [Section 6] – Small number of servers