Load Balancing in Distributed Systems Nalini Venkatasubramanian

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

Scheduling in Web Server Clusters CS 260 LECTURE 3 From: IBM Technical Report.
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Database Architectures and the Web
Hadi Goudarzi and Massoud Pedram
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Chapter 20: Multimedia Systems Silberschatz, Galvin and Gagne ©2005 Operating System Concepts Chapter 20: Multimedia Systems What is Multimedia.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition Chapter 20: Multimedia Systems.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
1 Routing and Scheduling in Web Server Clusters. 2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano.
Video Staging: A Proxy-Server- Based Approach to End-to-End Video Delivery over Wide-Area Networks Zhi-Li Zhang, Yuewei Wang, David H.C Du, Dongli Su Άννα.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
OSMOSIS Final Presentation. Introduction Osmosis System Scalable, distributed system. Many-to-many publisher-subscriber real time sensor data streams,
Scalable Adaptive Data Dissemination Under Heterogeneous Environment Yan Chen, John Kubiatowicz and Ben Zhao UC Berkeley.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
Chapter 12 Distributed Database Management Systems
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
1 Distributed Systems: Distributed Process Management – Process Migration.
Distributed Process Management1 Learning Objectives Distributed Scheduling Algorithms Coordinator Elections Orphan Processes.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 34 – Media Server (Part 3) Klara Nahrstedt Spring 2012.
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Advisor: Professor.
Storage Allocation in Prefetching Techniques of Web Caches D. Zeng, F. Wang, S. Ram Appeared in proceedings of ACM conference in Electronic commerce (EC’03)
1 Distributed Operating Systems and Process Scheduling Brett O’Neill CSE 8343 – Group A6.
18 June 2001 Optimizing Distributed System Performance via Adaptive Middleware Load Balancing Ossama Othman Douglas C. Schmidt
CH2 System models.
XE33OSA Chapter 20: Multimedia Systems. 20.2XE33OSA Silberschatz, Galvin and Gagne ©2005 Chapter 20: Multimedia Systems What is Multimedia Compression.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
EFFECTIVE LOAD-BALANCING VIA MIGRATION AND REPLICATION IN SPATIAL GRIDS ANIRBAN MONDAL KAZUO GODA MASARU KITSUREGAWA INSTITUTE OF INDUSTRIAL SCIENCE UNIVERSITY.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
MapReduce and GFS. Introduction r To understand Google’s file system let us look at the sort of processing that needs to be done r We will look at MapReduce.
Serverless Network File Systems Overview by Joseph Thompson.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Distributed Real-Time Systems (contd.)
S-Paxos: Eliminating the Leader Bottleneck
Accommodating Bursts in Distributed Stream Processing Systems Yannis Drougas, ESRI Vana Kalogeraki, AUEB
Static Process Scheduling
The IEEE International Conference on Cluster Computing 2010
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 27 – Media Server (Part 2) Klara Nahrstedt Spring 2009.
COMP7330/7336 Advanced Parallel and Distributed Computing Task Partitioning Dynamic Mapping Dr. Xiao Qin Auburn University
Architecture for Resource Allocation Services Supporting Interactive Remote Desktop Sessions in Utility Grids Vanish Talwar, HP Labs Bikash Agarwalla,
Chapter 20: Multimedia Systems
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Web Server Load Balancing/Scheduling
Chapter 20: Multimedia Systems
Web Server Load Balancing/Scheduling
Introduction to Load Balancing:
Load Management in Distributed Video Servers
Transparent Adaptive Resource Management for Middleware Systems
Chapter 20: Multimedia Systems
Load Balancing in Distributed Systems
Specialized Cloud Architectures
Chapter 20: Multimedia Systems
Chapter 20: Multimedia Systems
Chapter 20: Multimedia Systems
Presentation transcript:

Load Balancing in Distributed Systems Nalini Venkatasubramanian

Global Distributed Systems and Multimedia2 Motivation zA given computer is overloaded yMust decrease load and maintain certain characteristics xScalability xPerformance Throughput yIdeally, this should be done transparently ySolution: Load Balancing

Global Distributed Systems and Multimedia3 Introduction to Load Balancing zDistributed resource allocation yCan be thought of as “distributed scheduling” xDeals with distribution of processes among processors connected by a network xHandles issues such as deciding which process should be handled by a given processor yCan also be influenced by “distributed placement” xImportant in data intensive environments and applications xData placement may force process placement

Global Distributed Systems and Multimedia4 Load Balancing Relationships zLoad Balancer yManages resources yResource assignment depends on policy or policies in effect zClient yRequests resources yRequests services

Global Distributed Systems and Multimedia5 Load Balancing Overhead zSatisfy client resource access without imposing large amounts of overhead yPerformance xHow well resources are managed yEfficiency xCost of accessing and using a resource obtained through a load balancer

Global Distributed Systems and Multimedia6 Load Balancing Issues zWhen to migrate processes or forward requests zWhich processor should be chosen to handle a given process or request zShould processes be moved off a computer zHow should searching for lightly loaded computer be performed

Global Distributed Systems and Multimedia7 Load Balancing Issues (cont.) zWhen should load balancing decisions be made zWhat should be taken into account when making the above decisions zHow should old data be handled zShould load balancing data be stored centrally, or in a distributed manner

Global Distributed Systems and Multimedia8 Load Balancing Issues (cont.) zShould computers make decisions together zWhat is the performance/overhead tradeoff incurred by load balancing zPrevention of overloading a lightly loaded computer

Global Distributed Systems and Multimedia9 Load Balancing Techniques zBasically two ways to perform load balancing yStatically xResource is allocated once yDynamically xResource is allocated and managed (possibly dynamically reallocated) to ensure balanced load

Global Distributed Systems and Multimedia10 Static Load Balancing zResource allocation is performed once yOnce resource is allocated it remains allocated (for what duration??) zScheduling decisions are made yDeterministically yProbabilistically

Global Distributed Systems and Multimedia11 Static Load Balancing – cont’d zAdvantages yState generally need not be stored xSimplifies implementation yLess network traffic due to load balancing related messages zDisadvantages yPoor resource utilization xA given resource may be used much more than others yDoes not adjust to fluctuations in the load xPossible for resource to become overloaded

Global Distributed Systems and Multimedia12 Static Load Balancing – cont’d zExample of Static Load Balancing yForwarding processes/requests to a given computer based on dynamically assigned addresses (e.g. via DNS) xWeb servers (e.g. CNN) xUCI host ea.uci.edu is load balanced

Global Distributed Systems and Multimedia13 Dynamic Load Balancing zAttempts to maintain a balanced load by managing resources while a resource is in use yMay involve the following xProcess migration xDisabling further access to a resource until a later time xAdding new resources “on-the-fly”

Global Distributed Systems and Multimedia14 Dynamic Load Balancing Strategies zDistributed versus non-distributed yShould load information be stored centrally or across several hosts xSimplicity versus overhead and reliability zCooperative versus non-cooperative yShould decisions be made by a single load balancer or several xGlobally managed resources versus locally managed resources

Global Distributed Systems and Multimedia15 Dynamic Load Balancing Strategies - cont’d zAdaptive versus non-adaptive yShould previous data effect scheduling decisions zPreemptive versus non-preemptive yShould a running process be preempted in favor of another process, or for migration to another resource

Global Distributed Systems and Multimedia16 Case Studies zLoad Balancing for Web Servers zLoad Balancing for Parallel Computers zLoad Balancing for Multimedia Applications

Global Distributed Systems and Multimedia17 Multimedia Applications Electronic Commerce Entertainment Distance Learning Tele-medicine Video Servers Graphics Processing Web Servers Requirements - Availability, Reliability, Quality-of-Service, Cost-effectiveness, Security

Global Distributed Systems and Multimedia18 Multimedia Load Management zPrimary focus yresource optimization across streams and resource management across servers. xQuality of Service continuous delivery requirement minor violations of performance requirements xAdmission Control - resource reservation/negotiation xMedia Delivery - Resource scheduling (CPU,Disk) yResource Mgmt. Implies Admission Control xCaching, VCR Control, Server Selection, Data Placement

Global Distributed Systems and Multimedia19 Distributed MM Servers zVideo Server Topology xPartitioned Server xExternally Switched xFully Switched zVideo File Placement yHeterogeneous Workload - large/small, hot/cold yOnline Placement yGood placement is important xdynamic replication time-consuming, dynamic load-bal complementary

Global Distributed Systems and Multimedia20 Dynamic Load Balancing yAdapts to statistical fluctuations and changing access patterns yDynamic Migration xDeals with poor initial placement yReplication xDynamic Segment Replication partial replication (quick response, less expensive) xTotal Replication on-demand vs. predictive

Global Distributed Systems and Multimedia21 Load Management of Distributed MM Servers zAdaptive Scheduling xAssigns requests to servers based on demand and load factors. xInvokes replication-on-demand, request migration zPredictive Placement xInvokes dereplication zOptimizations xEager Replication xLazy Dereplication

Global Distributed Systems and Multimedia22 A Scalable Video Server Architecture Distribution Network Distribution Controller Distribution Controller Data Source Data Source Data Source Data Source Data Source Data Source Tertiery Storage Tertiery Storage... control requests data

Global Distributed Systems and Multimedia23 Architectural View of a Networked MM System Qos Broker and Load Management System Node Manager Node Manager Node Manager Node Manager... Local Data Streaming

Global Distributed Systems and Multimedia24 Resources in a Video Server Network Client Communication Modules Data Manipulation Modules Storage Modules Processing Module

Global Distributed Systems and Multimedia25 Load Placement Scenario Data Source S2 Data Source S1 Access Network Clients... Storage: 8 objects Bandwidth: 3 requests Storage: 2 objects Bandwidth: 8 requests

Global Distributed Systems and Multimedia26 Characterizing Server Resource Usage yAbility to service a request on a server depends on: xresource available xcharacteristics of a request yLoad factor(LF) for a request: xrepresents how far a server is from request admission threshold. LF (Ri, Sj) = max (Dbi/DBj, Mi/Mj, CPUi/CPUj, Xi/Xj)

Global Distributed Systems and Multimedia27 Adaptive Scheduling zWhen the broker receives a request Ri for a video object Vi : xConsider only data sources that have a copy of Vi. xConsider only data sources tha have sufficient resources to support Ri. xChooser server for which LF (Ri, Sj) is a minimum. xIf no such server exists Reject request. Perform replication-on-demand. Perform request migration.

Global Distributed Systems and Multimedia28 Predictive Data Placement zDetermines when, where and how many replicas of a video object. zInitiated periodically. zResults in an assignment of replicas to data sources. zGreedy algorithm that uses revenue generated as a metric.

Global Distributed Systems and Multimedia29 The Greedy Cost Placement Matrix PM(Vi, Sj) is the maximum revenue that can accrue from allocating Vi to Sj. Greedy heuristic: Map(Vi,Sj) = 1 if PM(Vi,Sj) =  a  b max(PM(Va,Sb))

Global Distributed Systems and Multimedia30 Optimizations zTo minimize the overhead of replication yEager replication xReplication of video object in anticipation xPerformed when server resources are free yLazy Dereplication xCritical nature of storage resources xMark reusable resources, reclaim disk space later xIf disk blocks are not overwritten, can be reclaimed

Global Distributed Systems and Multimedia31 Life of a video object

Global Distributed Systems and Multimedia32 Performance Evaluation Policies

Global Distributed Systems and Multimedia33 Performance Evaluation - Startup Latencies

Global Distributed Systems and Multimedia34 Performance of the basic configuration p1 p2 p4 p3

Global Distributed Systems and Multimedia35 Performance Evaluation - Varying Replication BW

Global Distributed Systems and Multimedia36 Performance Evaluation Summary zP1: entails high startup latency, requires high storage and replication bandwidth. zP2: Unacceptably poor performance. zP3: Similar performance to P4 in many cases. xAt low transfer bandwidths, P4 outperforms P3. zP4: Performs well in all cases.