Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.


Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt, Derek L. Eager, Gregory M. Oster, Carey L. Williamson Department of Computer Science, University of Saskatchewan. Yuan Ze University, Institute of Computer Science and Engineering, 宮春富, 1999/07/28

Outline: ⊙ Introduction ⊙ Methodology ⊙ Load Balancing ⊙ Cache Performance Considerations ⊙ Relation to Other Work ⊙ Conclusions

Introduction: ⊙ We evaluate various load distribution policies with respect to both their ability to achieve good load balance and their impact on the effectiveness of per-machine caching. ⊙ One approach to improving Web performance is to cache copies of Web objects closer to the requesting clients. ⊙ Another approach is to use prefetching to reduce response times by hiding server and network latency. ⊙ A complementary approach is to make the Web server more powerful through the use of a clustered architecture.

Methodology: ⊙ Random distribution (RANDOM): no information is utilized ⊙ Round-Robin distribution (ROUND-ROBIN): only information on past routing decisions is utilized ⊙ Load-based distribution (LOAD): information on the current load at each server is utilized
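The three policies differ only in how much state the front end consults. The following is a minimal sketch, not the authors' simulator: the `Dispatcher` class, its method names, and the use of outstanding-request counts as the "load" are all assumptions made for illustration.

```python
import random


class Dispatcher:
    """Hypothetical front end that assigns each request to one of n servers."""

    def __init__(self, n_servers, policy, seed=0):
        self.n = n_servers
        self.policy = policy
        self.rng = random.Random(seed)   # fixed seed keeps runs repeatable
        self.next_rr = 0                 # ROUND-ROBIN: remembers the last routing decision
        self.load = [0] * n_servers      # LOAD: outstanding requests per server (assumed load measure)

    def route(self):
        """Choose a server for the next request and record it as outstanding."""
        if self.policy == "RANDOM":
            server = self.rng.randrange(self.n)          # uses no information
        elif self.policy == "ROUND-ROBIN":
            server = self.next_rr                        # uses past routing decisions only
            self.next_rr = (self.next_rr + 1) % self.n
        elif self.policy == "LOAD":
            # uses current server load; ties go to the lowest-numbered server
            server = min(range(self.n), key=lambda s: self.load[s])
        else:
            raise ValueError(f"unknown policy: {self.policy}")
        self.load[server] += 1
        return server

    def complete(self, server):
        """Mark one request on the given server as finished."""
        self.load[server] -= 1
```

A caller would invoke `route()` on each arrival and `complete(server)` when the response has been sent, so that only the LOAD policy's decisions change as servers fall behind.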

Introduction: ⊙ In particular, we examine the benefits of using current state information (both cache contents and server loads) in load distribution. ⊙ Use of current server state information is necessary for good load balance. ⊙ Use of current server state information is not necessary for good cache behaviour. ⊙ Achieving both good cache performance and good load balance is possible, but it requires policies that take both objectives into consideration and that make use of information concerning current server loads.

Methodology: Three choices are also considered with respect to use of information regarding cache contents: ⊙ no information is utilized ⊙ information on the "cache affinities" of the incoming requests is utilized ⊙ information on the cache affinities of the incoming requests is utilized ⊙ Trace-driven simulation is used to evaluate the performance that is achieved with each of the load distribution policies that we consider.

Table 1: Summary of Trace Characteristics

Load Balancing: ⊙ We consider only the objective of balancing the load across servers (with no consideration of the resulting caching performance). ⊙ The Load Balance Metric (LBM) is used to compare the load distribution policies; we consider two variants: 'LBM Request' and 'LBM Bandwidth'. ⊙ Small values of the LBM indicate better load balancing performance than large values (smaller peak-to-mean load ratios).
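The slides do not give the LBM formula, so the sketch below rests on an assumption: that the LBM is a load-weighted average of the peak-to-mean load ratio over sampling intervals, which reduces to the sum of per-interval peaks divided by the sum of per-interval means. The function name and the input layout are hypothetical. Under this reading, 'LBM Request' and 'LBM Bandwidth' would differ only in whether the per-server load is counted in requests or in bytes.

```python
def load_balance_metric(samples):
    """Peak-to-mean load ratio aggregated over sampling intervals.

    samples: list of intervals, each a list with one load value per server
             (request counts or bytes, depending on the variant).
    Returns 1.0 for perfect balance and up to len(servers) when one server
    carries all of the load. Intervals with no load contribute nothing.
    """
    peak_sum = 0.0
    mean_sum = 0.0
    for loads in samples:
        peak_sum += max(loads)
        mean_sum += sum(loads) / len(loads)
    if mean_sum == 0:
        return 1.0  # no load at all: treat as perfectly balanced
    return peak_sum / mean_sum
```

For example, two servers with loads `[4, 0]` in a single interval give a peak of 4 over a mean of 2, so the metric is 2.0, the worst case for two servers.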

Load Balancing: A number of observations are evident from the plots. ⊙ The Load distribution policy never does worse (on average) than the Random or Round-Robin distribution policies. ⊙ In an under-resourced server, all server nodes have large backlogs. ⊙ In an over-resourced server, all server nodes have short request queues. ⊙ The load-based request distribution policy can provide a significant (15-25%) improvement in the LBM.

Load Balancing:

Cache Performance Considerations: ⊙ Cache misses may occur because of first-time references, references to dynamic content, limited cache capacity, and invalidation due to object modification. ⊙ The total cache space is used more effectively when fewer replicas exist at once (and they exist in the right place). ⊙ Ignoring cache contents in the load distribution policy can have a considerable impact on cache performance. ⊙ Two kinds of configuration are considered: a fixed aggregate cache size and a fixed per-server cache size.
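The replication point can be illustrated with a toy experiment. This is a sketch under stated assumptions, not the trace-driven simulation from the talk: the LRU replacement policy, the deliberately contrived cyclic trace of 9 objects, the 2 servers with room for 5 objects each, and the static `obj % 2` affinity rule are all chosen here for illustration.

```python
from collections import OrderedDict


class LRUCache:
    """Per-server cache with least-recently-used replacement (assumed policy)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = OrderedDict()

    def access(self, key):
        """Return True on a hit; on a miss, insert key and evict the LRU entry if over capacity."""
        if key in self.items:
            self.items.move_to_end(key)
            return True
        self.items[key] = None
        if len(self.items) > self.capacity:
            self.items.popitem(last=False)
        return False


def hit_ratio(trace, n_servers, capacity, choose_server):
    """Replay trace through per-server caches; choose_server(i, obj) picks the server."""
    caches = [LRUCache(capacity) for _ in range(n_servers)]
    hits = sum(caches[choose_server(i, obj)].access(obj) for i, obj in enumerate(trace))
    return hits / len(trace)


# Contrived workload: 9 objects requested cyclically, 900 requests in total.
trace = [i % 9 for i in range(900)]

# Round-robin ignores cache contents, so every object ends up replicated on both servers.
round_robin = hit_ratio(trace, 2, 5, lambda i, obj: i % 2)

# Static affinity pins each object to one server, so no replicas exist.
affinity = hit_ratio(trace, 2, 5, lambda i, obj: obj % 2)
```

In this setup the aggregate cache holds 10 objects, enough for all 9 when none is replicated, so the affinity run misses only on first references. Under round-robin each server sees all 9 objects in turn and its 5-object cache is flushed before any object is requested again. Real traces are far less extreme, but the direction of the effect is the same one the slide describes.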

Cache Performance Considerations: ⊙ The effects of employing cache affinity information in distribution decisions are investigated first in the context of the Load policy. ⊙ The cache hit ratio is highest for ε = ∞, since the "pure affinity" policy does not allow document replication in the caches. ⊙ When ε is small, the 2-server configuration invariably outperforms the 4-server configuration with the same aggregate cache size. ⊙ When ε is large, the cache hit ratio of the 4-server configuration approaches that of the 2-server configuration.
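The slides do not define ε. One reading consistent with the bullets above is that ε is the amount of extra load the front end tolerates on a request's affinity server before it falls back to the least-loaded server. The sketch below assumes exactly that; the function name, the `affinity` dictionary, and the rule of re-pointing affinity to the most recent server are hypothetical.

```python
import math


def load_affinity_route(obj, loads, affinity, epsilon):
    """Pick a server for obj under an assumed Load/Affinity rule.

    loads:    current load of each server
    affinity: dict mapping an object to the server that last served it
    epsilon:  how much more loaded than the least-loaded server the
              affinity server may be and still receive the request
    """
    least = min(range(len(loads)), key=lambda s: loads[s])
    preferred = affinity.get(obj)
    if preferred is not None and loads[preferred] - loads[least] <= epsilon:
        chosen = preferred   # stay with the server likely to have obj cached
    else:
        chosen = least       # first reference, or affinity server too busy
    affinity[obj] = chosen   # future requests for obj prefer this server
    return chosen
```

With `epsilon = math.inf` an object never leaves the first server it was sent to, which matches the "pure affinity" case with no replication. With `epsilon = 0` affinity matters only when the affinity server is also among the least loaded, so the policy behaves almost like plain Load.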

Cache Performance Considerations: ⊙ We compare the relative improvement in load balancing performance of Load vs. Round-Robin with that of Load/Affinity vs. Round-Robin/Affinity, for varying per-request bandwidth constraints. ⊙ The Load/Affinity policy sends the request to a server with the lightest load. ⊙ The Round-Robin/Affinity policy is defined to send the request to a server to which it has sent the fewest requests. ⊙ Similar behaviour (although less dramatic) is observed when decisions are based on intermediate combinations of load balance and affinity considerations, rather than pure affinity.

Cache Performance Considerations:

Relation to Other Work: ⊙ A major focus in their work has been the design and implementation of a high-performance TCP connection router. ⊙ A secondary focus has been on resource utilization, load balancing performance, and end-user response time. ⊙ The Locality-Aware Request Distribution (LARD) policy is proposed as an example of a content-aware request distribution policy. ⊙ Our simulation model presents the workload to the server based on the timestamps in the trace; we do not adjust the request arrival rate to ensure a steady flow of requests.

Conclusions: ⊙ The results suggest that very simple policies such as Round-Robin may yield good load balance if the achievable per-request bandwidth is strongly network- or client-limited. ⊙ The relative importance of using information on current server loads for load balancing purposes increases with the number of servers. ⊙ This paper considers only the case of multiple machines cooperating to provide the Web server function at a single physical location.