Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

Investigating Distributed Caching Mechanisms for Hadoop Gurmeet Singh Puneet Chandra Rashid Tahir.
1 Sizing the Streaming Media Cluster Solution for a Given Workload Lucy Cherkasova and Wenting Tang HPLabs.
Scheduling in Web Server Clusters CS 260 LECTURE 3 From: IBM Technical Report.
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Hadi Goudarzi and Massoud Pedram
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Evaluation of Data and Request Distribution Policies in Clustered Servers Adnan Khaleel and A. L. Narasimha Reddy Texas A&M University
What’s the Problem Web Server 1 Web Server N Web system played an essential role in Proving and Retrieve information. Cause Overloaded Status and Longer.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
Copyright 2007, Information Builders. Slide 1 Workload Distribution for the Enterprise Mark Nesson, Vashti Ragoonath June, 2008.
NETWORK LOAD BALANCING NLB.  Network Load Balancing (NLB) is a Clustering Technology.  Windows Based. (windows server).  To scale performance, Network.
1 Routing and Scheduling in Web Server Clusters. 2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano.
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Technical Architectures
Spring 2003CS 4611 Content Distribution Networks Outline Implementation Techniques Hashing Schemes Redirection Strategies.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Load Balancing in Web Clusters CS 213 LECTURE 15 From: IBM Technical Report.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
Cooperative Caching Middleware for Cluster-Based Servers Francisco Matias Cuenca-Acuna Thu D. Nguyen Panic Lab Department of Computer Science Rutgers University.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
SERVER LOAD BALANCING Presented By : Priya Palanivelu.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Strategies for Implementing Dynamic Load Sharing.
01/22/09ICDCS20061 Load Unbalancing to Improve Performance under Autocorrelated Traffic Ningfang Mi College of William and Mary Joint work with Qi Zhang.
Wide Web Load Balancing Algorithm Design Yingfang Zhang.
1/25/2000 Active Names: Flexible Location and Transport of Wide-Area Resources Luis Rivera.
Chapter 2 Client Server Architecture
Locality-Aware Request Distribution in Cluster-based Network Servers Presented by: Kevin Boos Authors: Vivek S. Pai, Mohit Aron, et al. Rice University.
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.
Server Load Balancing. Introduction Why is load balancing of servers needed? If there is only one web server responding to all the incoming HTTP requests.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
1 Lecture 20: Parallel and Distributed Systems n Classification of parallel/distributed architectures n SMPs n Distributed systems n Clusters.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
©G. Millbery 2001Communications and Networked SystemsSlide 1 Purpose of Network Components  Switches A device that controls routing and operation of a.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
Othman Othman M.M., Koji Okamura Kyushu University 1.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
1 Multiprocessor and Real-Time Scheduling Chapter 10 Real-Time scheduling will be covered in SYSC3303.
Frontiers in Massive Data Analysis Chapter 3.  Difficult to include data from multiple sources  Each organization develops a unique way of representing.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
Load Distribution among Replicated Web Servers: A QoS-based Approach Marco Conti, Enrico Gregori, Fabio Panzieri WISP KAIST EECSD CALab Hwang.
Ceph: A Scalable, High-Performance Distributed File System
DYNAMIC LOAD BALANCING ON WEB-SERVER SYSTEMS by Valeria Cardellini Michele Colajanni Philip S. Yu.
VMware vSphere Configuration and Management v6
Symbiotic Routing in Future Data Centers Hussam Abu-Libdeh Paolo Costa Antony Rowstron Greg O’Shea Austin Donnelly MICROSOFT RESEARCH Presented By Deng.
AMQP, Message Broker Babu Ram Dawadi. overview Why MOM architecture? Messaging broker like RabbitMQ in brief RabbitMQ AMQP – What is it ?
Managing Web Server Performance with AutoTune Agents by Y. Diao, J. L. Hellerstein, S. Parekh, J. P. Bigus Presented by Changha Lee.
Adaptive Load Sharing for Clustered Digital Library Servers Song, yong-joo System Software Lab, EECS, KAIST 9. 13, 2000.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
GPFS: A Shared-Disk File System for Large Computing Clusters Frank Schmuck & Roger Haskin IBM Almaden Research Center.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Distributed Server Scheduler Eyal Serero Alex Fishgate Supervisor : Vitaly Suchin.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Clustered Web Server Model
Web Server Load Balancing/Scheduling
Web Server Load Balancing/Scheduling
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
VIRTUAL SERVERS Presented By: Ravi Joshi IV Year (IT)
Distributed System Structures 16: Distributed Structures
Outline Midterm results summary Distributed file systems – continued
CS510 - Portland State University
Performance-Robust Parallel I/O
Presentation transcript:

Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland

Outline Web Server Introduction Information Management Basics Load Sharing Policies –FLEX –WARD –EquiLoad –AdaptLoad SummaryConclusions Future Work

Introduction to Web Server Load Balancing Request enters a router Load balancing server determines which web server should serve the request Sends the request to the appropriate web server Internet Router Load-Balancing Server Web Servers Request Response Traditional Web Cluster

How do we split up information? Content Server Farm ?

Information Strategies Replication Partition

Load Balancing Approaches File Distribution Routing Content/Locality Aware DNS Server Size Aware Centralized Router Workload Aware Distributed Dispatcher

Issues Efficiently processing requests with optimizations for load balancing –Send and process requests to a web server that has files in cache – Send and process requests to a web server with the least amount of requests – Send and process requests to a web server determined by the size of the request

FLEX Locality aware load-balancing strategy based on two factors: –Accessed files, memory requirements –Access rates (working set), load requirements Partitions all servers into equally balanced groups Each server transfers the response to the browser to reduce bottleneck through the router (TCP Handoff) File Distribution Routing Content/Locality Aware DNS Server Size Aware Centralized Router Workload Aware Distributed Dispatcher

Flex Diagram DNS Server Requests To Client Browser Forwards Request S1S2 S3 S4 S5 S6 W(S1) ≈ W(S2) ≈ W(S3) ≈ … ≈ W(S6) Ar(S1) ≈ Ar(S2) ≈ Ar(S3) ≈ … ≈ Ar(S6)

FLEX Cont. Advantages: –Highly scalable –Reduces bottleneck by the load balancer –No software is required –Reduces number of cache misses

FLEX Cont. II Disadvantages: –Not dynamic, routing tale must be recreated –Only compared to RR –Number of access logs required on each server could be tremendous –Responsibility of load-balancing and transferring response is given to web servers – unorganized responsibility –How often to update access rates and working sets? Monitor?

WARD Workload-Aware Request Distribution Strategy Server core are essential files that represent majority of expected requests Server core is replicated at every server Ward-analysis computes the nearly optimal core size determined by workload access patterns –Number of nodes –Node RAM –TCP handoff overhead –Disk access overhead File Distribution Routing Content/Locality Aware DNS Server Size Aware Centralized Router Workload Aware Distributed Dispatcher

WARD Cont. Three components: dispatcher (load balancer), distributor (router), web server Three progressive architectures: Dispatcher Distributor Front End LAN Server Dispatcher Distributor Front End LAN Server Distributor Switch Dispatcher Distributor Front End LAN Server Distributor Switch Dispatcher Single front-end distributor, centralized dispatcher Co-located distributor and server Co-located distributor, server, and dispatcher LARD CARD WARD

WARD Diagram Requests S1 S2 S3 S4 S5 S6 Queue: Switch Each computer is a distributor and a dispatcher

WARD Cont. II Similar to FLEX, sends response directly to client Minimizes forwarding overhead from handoffs for the most frequent files Optimizes the overall cluster RAM usage “by mapping a small set of most frequent files to be served by multiple number of nodes, we can improve both locality of accesses and the cluster performance significantly”

WARD Cont. III Advantages: –No decision making, core files are replicated on every server –Minimizes transfer of requests and disk reads, both are “equally bad” –Outperforms Round Robin –Efficient use of RAM –Performance gain with increased number of nodes

WARD Cont. IV Disadvantages: –Core files are created on past day’s data, could decrease performance up to 15% –Distributed dispatcher increases the number of TCP requests transfers –If core files not selected correctly, higher cache miss rate and increased disk accesses

WARD Results

EquiLoad Determines which server will process a request determined by the size of the requested file Splits the content on each server by file size, forcing the queues sizes to be consistent. File Distribution Routing Content/Locality Aware DNS Server Size Aware Centralized Router Workload Aware Distributed Dispatcher

EquiLoad Solves Queue Length Problems This is bad QueueQueue This is better QueueQueue 1k1k1000k 1k1k1k2k2k1k1k2k1k 100k100k 1k1k1k2k100k

EquiLoad Diagram Distributor Requests To Client Browser Forwards Request S1S2 S3 S4 S5 S6 1k-2k 2k-3k 3k-10k10k-20k 20k-100k>100kDispatcher (periodically calculates partitions)

EquiLoad Advantages –Dynamic repartitioning –Can be implemented at various levels DNSDispatcherServer –Minimum queue buildup –Performs well under variable workload and high system load

EquiLoad Disadvantages –Cache affinity is neglected –Requires a front end dispatcher –Distributor must communicate with servers –Thresholds of parameter adjustment

EquiLoad  AdaptLoad AdaptLoad improves upon EquiLoad using “fuzzy boundaries” –Allows for multiple servers to process a request –Behaves better in situations where server partitions are very close in size

AdaptLoad Diagram Dispatcher Requests To Client Browser Forwards Request S1S2 S3 S4 S5 S6 1k-3k 2k-4k 3k-10k8k-20k 15k-100k>80kDistributor (periodically calculates partitions)

AdaptLoad Results

Summary File Distribution Routing Content/Locality Aware DNS Server Size Aware Centralized Router Workload Aware Distributed Dispatcher FLEX EquiLoad, AdaptLoad WARD

Conclusions There is no “best” way to distribute content among servers. There is no optimal policy for all website applications. Certain strategies are geared towards a particular website application.

Future Work Compare and contrast the three policies Figure out how often nodes should be repartitioned Compare each policy to a standard benchmark Figure out which policy works in a particular environment

Questions? Anyone have one?