
Presentation transcript:

1 Routing and Scheduling in Web Server Clusters

2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano Casalicchio, Michele Colajanni and Philip S. Yu

3 Objectives
• Routing in Distributed Web-based Servers
  • DNS-based routing mechanisms
  • Web server routing mechanisms
• Dispatching Algorithms
  • Content-blind algorithms
    • Static
    • Client aware
    • Server aware
    • Client and Server aware
  • Content-aware algorithms
    • Client aware
    • Client and Server aware

4 Routing in Distributed Web-based Servers
• DNS-based routing mechanisms
  • Handle multiple web servers hosting a Web site
  • Conceived for locally distributed web servers, but also applicable to geographically distributed web servers
• Web server routing mechanisms
  • Routing mechanisms are implemented by the web server
  • Web servers can (re)direct a client request to another node

5 DNS-based routing
• DNS-based routing intervenes at the address lookup phase
• The authoritative DNS (A-DNS) server can select a different server for every address resolution
• The A-DNS replies with a pair (IP address, TTL)
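
The following is a minimal sketch of the (IP address, TTL) reply, not an actual DNS implementation. It assumes a round-robin selection policy; the `AuthoritativeDNS` class, the addresses, and the TTL value are all hypothetical.

```python
from itertools import cycle

class AuthoritativeDNS:
    """Toy A-DNS: answers each address resolution with (IP address, TTL)."""

    def __init__(self, server_ips, ttl_seconds):
        self._next_ip = cycle(server_ips)  # assumption: round-robin selection
        self.ttl_seconds = ttl_seconds

    def resolve(self, hostname):
        # hostname is ignored: this toy server hosts a single Web site.
        return next(self._next_ip), self.ttl_seconds

adns = AuthoritativeDNS(["192.0.2.1", "192.0.2.2"], ttl_seconds=60)
answers = [adns.resolve("www.example.com") for _ in range(3)]
```

The TTL bounds how long a resolver may cache the mapping, so a smaller TTL lets the A-DNS influence more requests at the cost of more lookups.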

6 Web server routing
• Triangulation
• HTTP redirection
• URL rewriting

7 Web server routing: Triangulation
• The client sends packets to the first contacted server
• The first node routes the packets to a second server
• Routing is done at the TCP/IP level
• Triangulation is based on tunneling!

8 Web server routing: HTTP redirection
• The HTTP protocol allows a Web server to respond to a client request with a redirection status code
• HTTP has a built-in redirection mechanism
• HTTP redirection is good for
  • Medium granularity, down to individual Web pages
  • Allowing content-aware routing
• Drawbacks
  • Extra round-trip time
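
As an illustration, the sketch below builds the redirect response a first-contacted server might send. The function name, host name, and the choice of status code 302 are assumptions made for the example.

```python
from http import HTTPStatus

def redirect_response(request_path, target_host):
    """Build the status line and headers of an HTTP redirect to another node."""
    status = HTTPStatus.FOUND  # 302; other 3xx codes could be used instead
    location = f"http://{target_host}{request_path}"
    return (
        f"HTTP/1.1 {status.value} {status.phrase}\r\n"
        f"Location: {location}\r\n"
        "Content-Length: 0\r\n"
        "\r\n"
    )

response = redirect_response("/images/logo.png", "server2.example.com")
```

The client must issue a second request to the host named in `Location`, which is where the extra round-trip time comes from.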

9 Web server routing: HTTP redirection

10 Web server routing: URL rewriting
• The first contacted server dynamically changes the links for the embedded objects within the Web page
• This approach is used by Akamai
• This approach introduces additional load on the redirecting Web server
• It also causes considerable DNS overhead
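
A rough sketch of rewriting embedded-object links follows. It assumes embedded objects are referenced through root-relative `src` attributes; the function and host names are hypothetical, and a real rewriter would need a proper HTML parser.

```python
import re

def rewrite_embedded_links(html, target_host):
    """Point src="/..." links of embedded objects at another server."""
    # Assumption: embedded objects use root-relative src attributes.
    return re.sub(
        r'src="/([^"]*)"',
        lambda m: f'src="http://{target_host}/{m.group(1)}"',
        html,
    )

page = '<img src="/img/a.png"><a href="/next.html">next</a>'
rewritten = rewrite_embedded_links(page, "server2.example.com")
```

Only the embedded image is redirected; the hyperlink is left alone. Each new host name placed in a rewritten link may require its own address lookup by the client.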

11 Dispatching Algorithms
Strategies to select the target server of the web cluster
• Static: the fastest solution for preventing a web switch bottleneck, but it does not consider the current state of the servers
• Dynamic: outperforms static algorithms by making informed decisions, but collecting and analyzing state information causes expensive overhead
Requirements:
(1) Low computational complexity
(2) Full compatibility with web standards
(3) State information must be readily available without much overhead

12 Content blind approach
• Static policies
  • Random: distributes the incoming requests uniformly, with equal probability of reaching any server
  • Round Robin (RR): uses a circular list and a pointer to the last selected server to make the decision
  • Static Weighted RR (for heterogeneous servers): a variation of RR in which each server is assigned a weight W_i depending on its capacity
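
The three static policies can be sketched as follows. Server names and weights are hypothetical, and the weighted variant uses the simplest possible scheme (repeating each server W_i times in the circular list).

```python
import random
from itertools import cycle

servers = ["s1", "s2", "s3"]  # hypothetical server names

def pick_random(rng):
    """Random: every server is equally likely."""
    return rng.choice(servers)

# Round Robin: walk a circular list of servers.
round_robin = cycle(servers)

# Static Weighted RR: repeat each server W_i times in the circular list.
weights = {"s1": 3, "s2": 1, "s3": 1}  # assumed capacities
weighted_rr = cycle([s for s in servers for _ in range(weights[s])])

rr_picks = [next(round_robin) for _ in range(4)]
wrr_picks = [next(weighted_rr) for _ in range(5)]
random_pick = pick_random(random.Random(0))
```

None of these policies reads any server state, which is why they are cheap for the web switch. This weighted variant sends consecutive requests to the heaviest server; interleaved schedules are also possible.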

13 Content blind approach (Cont.)
• Dynamic
  • Client state aware: statically partition the server nodes and assign each group of clients, identified through client information such as the source IP address, to one partition
  • Server state aware
    • Least Loaded: select the server with the lowest load. Issue: which server load index should be used?
    • Least Connection: select the server with the fewest active connections
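
The sketch below illustrates client-based partitioning and Least Connection. The two policies are combined only to keep the example short; the partitions, the grouping rule, and the connection counts are all assumptions.

```python
import ipaddress

# Client state aware: clients are grouped by source IP address and each
# group is statically bound to one partition of the servers.
partitions = [["s1", "s2"], ["s3", "s4"]]  # hypothetical partitions

def partition_for(client_ip):
    # Assumption: group clients by the numeric IP address modulo the
    # number of partitions.
    return partitions[int(ipaddress.ip_address(client_ip)) % len(partitions)]

# Server state aware: Least Connection picks the fewest active connections.
active_connections = {"s1": 12, "s2": 7, "s3": 9, "s4": 9}

def least_connection(candidates):
    return min(candidates, key=lambda s: active_connections[s])

target = least_connection(partition_for("192.0.2.10"))
```

Here the connection count plays the role of the load index; other indexes (CPU utilization, queue length) would slot into the same `min` call.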

14 Content blind approach (Cont.)
• Server state aware (Cont.)
  • Fastest Response: select the server that is responding fastest
  • Weighted Round Robin: a variation of static RR that associates each server with a dynamically evaluated weight that is proportional to the server load
• Client and server state aware
  • Client affinity: instead of assigning each new connection only on the basis of the server state, regardless of any past assignment, consecutive connections from the same client can be assigned to the same server
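
Client affinity can be sketched as a lookup table consulted before the server-state decision. The load values and client identifiers are hypothetical, and falling back to the least loaded server is one possible choice among several.

```python
server_load = {"s1": 0.7, "s2": 0.2, "s3": 0.5}  # hypothetical load index
assigned = {}  # client identifier -> server chosen for its first connection

def dispatch(client_id):
    """Reuse the client's previous server; otherwise pick the least loaded."""
    if client_id not in assigned:
        assigned[client_id] = min(server_load, key=server_load.get)
    return assigned[client_id]

first = dispatch("198.51.100.7")
server_load["s2"] = 0.9            # s2 becomes the most loaded server
second = dispatch("198.51.100.7")  # same client: affinity wins
other = dispatch("198.51.100.8")   # new client: server state decides
```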

15 Considerations of content blind
• The static approach is the fastest and is easy to implement, but it may make poor assignment decisions
• The dynamic approach has the potential to make better decisions, but it must collect and analyze state information, which may cause high overhead
• Overall, a simple server state aware algorithm is the best choice; the least loaded algorithm is commonly used in commercial products

17 Content aware approach
• Client state aware
  • Cache Affinity: the file space is partitioned among the server nodes
  • Load Sharing
    • SITA-E (Size Interval Task Assignment with Equal Load): the web switch determines the size of the requested file and selects the target server based on this information
    • CAP (Client-Aware Policy): the web switch classifies requests based on their impact on system resources, such as I/O bound or CPU bound
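
Size-interval assignment can be sketched with a sorted list of cutoffs. The cutoff values and server names below are made up; in SITA-E the cutoffs would be chosen so that each interval carries roughly equal load.

```python
from bisect import bisect_left

# Hypothetical size cutoffs in bytes. In SITA-E they would be chosen so that
# every interval carries about the same total load; these are made up.
size_cutoffs = [10_000, 1_000_000]
servers = ["small-files", "medium-files", "large-files"]

def select_server(file_size):
    """Map the requested file size to the server that owns its interval."""
    return servers[bisect_left(size_cutoffs, file_size)]

targets = [select_server(size) for size in (512, 10_000, 50_000, 5_000_000)]
```

A CAP-style switch would use the same shape of lookup, keyed on a request class (for example I/O bound or CPU bound) instead of a file size.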

18 Content aware approach (Cont.)
• Client state aware (Cont.)
  • Service Partitioning: employs specialized servers for certain types of requests
  • Client Affinity: uses a session identifier to assign all web transactions from the same client to the same server

19 Content aware approach (Cont.)
• Client and server state aware
  • LARD (Locality-Aware Request Distribution): directs all requests for the same web object to the same server node as long as its utilization is below a given threshold
  • Cache Manager: a cache manager that is aware of the cache content of all web servers
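
The sketch below follows only the one-line description of LARD given on this slide and is simpler than the published algorithm. The threshold, utilization values, and the reassignment rule (move to the least utilized node) are assumptions.

```python
THRESHOLD = 0.8  # hypothetical utilization threshold
utilization = {"s1": 0.3, "s2": 0.5}
owner = {}  # web object (URL path) -> server currently serving it

def lard_dispatch(url):
    """Keep an object on its server while that server is below THRESHOLD."""
    server = owner.get(url)
    if server is None or utilization[server] >= THRESHOLD:
        # First request, or owner overloaded: move to the least utilized node.
        server = min(utilization, key=utilization.get)
        owner[url] = server
    return server

a = lard_dispatch("/index.html")  # first request: least utilized node
utilization["s1"] = 0.7
b = lard_dispatch("/index.html")  # still below threshold: same node
utilization["s1"] = 0.9
c = lard_dispatch("/index.html")  # above threshold: reassigned
```

Keeping an object on one node makes it more likely that the object is already in that node's cache, which is the locality this policy aims for.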

21 Fair Scheduling in Web Servers

22 Objectives
• Create an arbitrary number of service quality classes and assign a priority weight to each class
• Provide service differentiation for different user classes in terms of the allocation of CPU and disk I/O capacities

30 Fair Scheduling in a Web Cluster: Objective
• Provide service differentiation (or QoS guarantee) for different user classes in terms of the allocation of CPU and disk I/O capacities => Scheduling
• Balance the load among the nodes in the cluster to ensure maximum utilization and minimum execution time => Load Balancing

31 Request Scheduling
Two decisions:
• Which request should be serviced next (Scheduling): decided according to each subscriber’s static resource reservation and dynamic resource usage
• Which RPN should service this request (Load Balancing): decided according to the load information on each RPN (Least Load First), while also exploiting access locality
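
The two decisions can be sketched as two small functions. The slides do not give the exact scheduling rule, so the usage-to-reservation ratio used here is an illustrative assumption, as are the subscriber names, queues, and load values. Access locality is not modeled.

```python
reservation = {"gold": 0.6, "silver": 0.4}  # static resource shares
usage = {"gold": 0.30, "silver": 0.10}      # resources consumed so far
queues = {"gold": ["g1", "g2"], "silver": ["s1"]}
rpn_load = {"rpn1": 0.4, "rpn2": 0.1}

def next_request():
    """Serve the backlogged subscriber furthest below its reservation."""
    backlogged = [s for s in queues if queues[s]]
    # Assumed rule: the lowest usage/reservation ratio goes first.
    subscriber = min(backlogged, key=lambda s: usage[s] / reservation[s])
    return subscriber, queues[subscriber].pop(0)

def pick_rpn():
    """Least Load First: choose the RPN with the lowest reported load."""
    return min(rpn_load, key=rpn_load.get)

subscriber, request = next_request()
node = pick_rpn()
```

In this example "silver" has used a quarter of its reservation while "gold" has used half, so the silver request is scheduled first and sent to the less loaded node.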