Mechanisms for Quality of Service in Web Clusters V. Cardellini, E. Casalicchio, S.Tucci M. Colajanni University of Roma “Tor Vergata” University of Modena.

Slides:



Advertisements
Similar presentations
Scheduling in Web Server Clusters CS 260 LECTURE 3 From: IBM Technical Report.
Advertisements

QoS Aware Scheduling in a Cluster-Based Web Server Jiani Guo Architecture Lab Department of Computer Science and Engineering University of California,
CS 443 Advanced OS Fabián E. Bustamante, Spring 2005 Resource Containers: A new Facility for Resource Management in Server Systems G. Banga, P. Druschel,
A SLA Framework for QoS Provisioning and Dynamic Capacity Allocation Rahul Garg (IBM India Research Lab), R. S. Randhawa (Stanford University), Huzur Saran.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
1 Routing and Scheduling in Web Server Clusters. 2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano.
Computer Networks Performance Metrics Advanced Computer Networks.
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Networking Issues in LAN Telephony Brian Yang
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
Analysis and Simulation of a Fair Queuing Algorithm
Runtime Autonomous Component Management Systems. CMS Runtime Component Optimizer We have designed software APIs for CMS Runtime Optimizer Develop general-purpose.
Performance Evaluation
Fair Scheduling in Web Servers CS 213 Lecture 17 L.N. Bhuyan.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Influence of File Size Distribution on Legacy LAN QoS Parameters Nikolaus Färber Nov. 15, 2000.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
LDU Parametrized Discrete-Time Multivariable MRAC and Application to A Web Cache System Ying Lu, Gang Tao and Tarek Abdelzaher University of Virginia.
Operating Systems Operating System Support for Multimedia.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Peer-to-peer Multimedia Streaming and Caching Service by Won J. Jeon and Klara Nahrstedt University of Illinois at Urbana-Champaign, Urbana, USA.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Automated Workload Management in.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
Locality-Aware Request Distribution in Cluster-based Network Servers Presented by: Kevin Boos Authors: Vivek S. Pai, Mohit Aron, et al. Rice University.
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
Computer System Lifecycle Chapter 1. Introduction Computer System users, administrators, and designers are all interested in performance evaluation. Whether.
Active Network Applications Tom Anderson University of Washington.
Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
Tiziana Ferrari Quality of Service Support in Packet Networks1 Quality of Service Support in Packet Networks Tiziana Ferrari Italian.
A Novel Adaptive Distributed Load Balancing Strategy for Cluster CHENG Bin and JIN Hai Cluster.
NPS: A Non-interfering Web Prefetching System Ravi Kokku, Praveen Yalagandula, Arun Venkataramani, Mike Dahlin Laboratory for Advanced Systems Research.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Identifying Application Impacts on Network Design Designing and Supporting Computer.
Web Server Support for Tired Services Telecommunication Management Lab M.G. Choi.
2000/3/9QoS Introduction1 References: 1.Paul Ferguson and Geoff Huston, Quality of Service, John Wiley & Sons, Xipeng Xiao and Lionel M. Ni, “Internet.
Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco PublicITE I Chapter 6 1 Identifying Application Impacts on Network Design Designing and Supporting.
Wolfgang EffelsbergUniversity of Mannheim1 Differentiated Services for the Internet Wolfgang Effelsberg University of Mannheim September 2001.
Beyond Best-Effort Service Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot November 2010 November.
1 University of Maryland Linger-Longer: Fine-Grain Cycle Stealing in Networks of Workstations Kyung Dong Ryu © Copyright 2000, Kyung Dong Ryu, All Rights.
Policy Enabled Handoff Across Heterogeneous Wireless Networks Helen W and Jochen Giese Computer Science, Berkeley.
Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 8 Omar Meqdadi Department of Computer Science and Software Engineering University of.
A Cyclic-Executive-Based QoS Guarantee over USB Chih-Yuan Huang,Li-Pin Chang, and Tei-Wei Kuo Department of Computer Science and Information Engineering.
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
McGraw-Hill©The McGraw-Hill Companies, Inc., 2004 Chapter 23 Congestion Control and Quality of Service.
Request Distribution in Server Clusters Krithi Ramamritham Indian Institute of Technology Bombay.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Living in a Network Centric World Network Fundamentals – Chapter 1.
DYNAMIC LOAD BALANCING ON WEB-SERVER SYSTEMS by Valeria Cardellini Michele Colajanni Philip S. Yu.
Rassul Ayani 1 Performance of parallel and distributed systems  What is the purpose of measurement?  To evaluate a system (or an architecture)  To compare.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
DotSlash – or how to deal with 15 minutes of fame Weibin Zhao Henning Schulzrinne Columbia University CATT/WICAT Annual Research Review November 14, 2003.
03/03/051 Performance Engineering of Software and Distributed Systems Research Activities at IIT Bombay Varsha Apte March 3 rd, 2005.
1 PerfCenter and AutoPerf: Tools and Techniques for Modeling and Measurement of the Performance of Distributed Applications Varsha Apte Faculty Member,
1 / 21 Providing Differentiated Services from an Internet Server Xiangping Chen and Prasant Mohapatra Dept. of Computer Science and Engineering Michigan.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Architecture for Resource Allocation Services Supporting Interactive Remote Desktop Sessions in Utility Grids Vanish Talwar, HP Labs Bikash Agarwalla,
تجارت الکترونیک سیار جلسه پنجم مدرس : دکتررامین کریمی.
Web Server Load Balancing/Scheduling
Web Server Load Balancing/Scheduling
Introduction to Load Balancing:
Implementing a Load-balancing Web Server Using Red Hat Cluster Suite
Web switch support for differentiated services
Multiple-resource Request Scheduling. for Differentiated QoS
Presentation transcript:

Mechanisms for Quality of Service in Web Clusters V. Cardellini, E. Casalicchio, S.Tucci M. Colajanni University of Roma “Tor Vergata” University of Modena Speaker: Emiliano Casalicchio Speaker: Emiliano Casalicchio Terena Networking Conference - TNC2001 Additional Info:

May 2001Terena Networking Conference - TNC Outline Motivations –Quality of Service & Quality of Web Services System architecture QoS-blind vs. QoS-aware dispatching algorithms Results –System and workload model –Performance metrics –Simulation results Summary

May 2001Terena Networking Conference - TNC Why QoS in Web services? The second generation of Web sites –communication channel for critical information –fundamental technology for information systems of the most advanced companies and organizations –business-oriented media The new Web requires –differentiation of users and services –supports to heterogeneous applications and user expectations –differentiated pricing for content hosting and service providing

May 2001Terena Networking Conference - TNC A Web cluster architecture for high performance Web services IBM TCP Router CISCO Local Director IBM NetDispatcher Foundry Networks’ ServerIron family CISCO HP WebQoS Local Director

May 2001Terena Networking Conference - TNC Quality of Web Services (QoWS) QoS principles and mechanisms have been deeply studied in the computer network area, but –QoS principles are not immediately applicable to the server side of the Web System –Network QoS and server QoWS principles must be combined to provide a peer-to-peer QoS for Web services The focus of this talk will be on QoWS for Web clusters High performance systems  Systems for Quality of Service

May 2001Terena Networking Conference - TNC From QoS to QoWS in Web clusters Our idea: starting from main QoS principles –classification –performance isolation –high resource utilization –request admission control To define QoWS principles we need to find out –feasible mechanisms to achieve QoWS –Web Cluster components that can implement the QoWS principles and mechanisms QoWS

May 2001Terena Networking Conference - TNC QoWS principles Classification (at layer-4/7 Web Switch) –clients and services classification –users identification Performance isolation –queuing scheduling policies (at Web server: CPU,disk,…) –resource partitioning (at the Web server for fine-grain level, at the Web Switch for coarse-grain level) High resource utilization (at Web switch/server) –dynamic resource partitioning Request admission (at Web switch/server) –estimation of resource need (at Web switch) –access control mechanism (at Web switch/server)

May 2001Terena Networking Conference - TNC A QoWS-enabled Web cluster architecture

May 2001Terena Networking Conference - TNC QoS-blind dispatching algorithms Round-Robin (RR) Least-Loaded server (LL) Weighted Round-Robin (WRR) Server Admission with priority (SerADM-SerPRI)

May 2001Terena Networking Conference - TNC QoS-aware dispatching algorithms without partitioning of Web servers –Switch Admission (SwiADM) –Switch Admission with server priority (SwiADM-SerPRI) with partitioning of Web servers –Dynamic partition of servers (Partition)

May 2001Terena Networking Conference - TNC QoS-aware algorithms SerADM-SerPRI SwiADM SwiADM-SerPRI Partition YesYesNoYes YesNoNoYes YesYesNoYes YesYesYesYes ClassificationPerf.IsolationHigh Res.Util.Req.Adm

May 2001Terena Networking Conference - TNC System and workload model ParameterValue (default) Number of servers Disk parameter Memory transfer rate Intra-server bandwidth MBps, 7200 RPM, 0.05 msec c.d., 9 msec s.t. 100 MBps 100 Mbps switched LAN Arrival rate Requests per session User think time Embedded objects Hit size - body Hit size - tail clients per second (300cps) Inverse Gaussian (  = 3.86, = 9.46 ) Pareto (  = 1.4, k = 2) Pareto (  = 1.33, k = 1) Lognormal (  = 7.640,  = ) Pareto (  = 1.383, k = 2924) Service ClassesHigh (30%) - Medium (26.5%) - Low (43.5%)

May 2001Terena Networking Conference - TNC Performance metrics 90-percentile of page delay –measure the completion time of a Web page at the Web cluster side –the X-percentile is used to define the Service Level Agreement (SLA) for predictive services Percentage of dropped requests –measure the percentage of users that perceive a deny of service

May 2001Terena Networking Conference - TNC Simulation results (1)

May 2001Terena Networking Conference - TNC Simulation results (2)

May 2001Terena Networking Conference - TNC Conclusions Basic requirements for QoWS are satisfied by policies that integrate main mechanisms for network-QoS into main components of a Web cluster (i.e., Web servers and/or Web switch): –priority scheduling –dynamic server partitioning –admission control

May 2001Terena Networking Conference - TNC Work in progress Level-7 Web switch with QoS-enabled policies New policies for dynamic request partitioning Prototype implementation Additional information: