Providing Differentiated Levels of Service in Web Content Hosting Jussara Almeida, etc... First Workshop on Internet Server Performance, 1998 Computer.

Slides:



Advertisements
Similar presentations
Conserving Disk Energy in Network Servers ACM 17th annual international conference on Supercomputing Presented by Hsu Hao Chen.
Advertisements

Scheduling in Web Server Clusters CS 260 LECTURE 3 From: IBM Technical Report.
CS 443 Advanced OS Fabián E. Bustamante, Spring 2005 Resource Containers: A new Facility for Resource Management in Server Systems G. Banga, P. Druschel,
Copyright © 2005 Department of Computer Science CPSC 641 Winter PERFORMANCE EVALUATION Often in Computer Science you need to: – demonstrate that.
Flash: An efficient and portable Web server Authors: Vivek S. Pai, Peter Druschel, Willy Zwaenepoel Presented at the Usenix Technical Conference, June.
Design and Implementation of a Caching System for Streaming Media over the Internet Ethendranath Bommaiah, Katherine Guo, Markus Hofmann,and Sanjoy Paul.
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
Exploiting Gray-Box Knowledge of Buffer Cache Management Nathan C. Burnett, John Bent, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau University of.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part IV Capacity Planning Methodology.
Performance Interactions Between P-HTTP and TCP Implementations J. Heidemann ACM Computer Communication Review April 1997 김호중 CA Lab., KAIST.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
1 Part IV Capacity Planning Methodology © 1998 Menascé & Almeida. All Rights Reserved.
Threads Irfan Khan Myo Thein What Are Threads ? a light, fine, string like length of material made up of two or more fibers or strands of spun cotton,
Threads 1 CS502 Spring 2006 Threads CS-502 Spring 2006.
Fair Scheduling in Web Servers CS 213 Lecture 17 L.N. Bhuyan.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
1 PERFORMANCE EVALUATION H Often in Computer Science you need to: – demonstrate that a new concept, technique, or algorithm is feasible –demonstrate that.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
LDU Parametrized Discrete-Time Multivariable MRAC and Application to A Web Cache System Ying Lu, Gang Tao and Tarek Abdelzaher University of Virginia.
Dynamic Process Allocation in Apache Server Yu Cai.
Locality-Aware Request Distribution in Cluster-based Network Servers Presented by: Kevin Boos Authors: Vivek S. Pai, Mohit Aron, et al. Rice University.
Xen and the Art of Virtualization. Introduction  Challenges to build virtual machines Performance isolation  Scheduling priority  Memory demand  Network.
Module 8: Monitoring SQL Server for Performance. Overview Why to Monitor SQL Server Performance Monitoring and Tuning Tools for Monitoring SQL Server.
1Chapter 05, Fall 2008 CPU Scheduling The CPU scheduler (sometimes called the dispatcher or short-term scheduler): Selects a process from the ready queue.
A Scalable Content Distribution Service for Dynamic Web Content Seejo Sebastine Department of Computer Science University of Virginia.
Using Standard Industry Benchmarks Chapter 7 CSE807.
Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.
Virtual Machine Scheduling for Parallel Soft Real-Time Applications
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
Storage Management in Virtualized Cloud Environments Sankaran Sivathanu, Ling Liu, Mei Yiduo and Xing Pu Student Workshop on Frontiers of Cloud Computing,
1 A Feedback Control Architecture and Design Methodology for Service Delay Guarantees in Web Servers Presentation by Amitayu Das.
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Threads and Processes.
CHEN Ge CSIS, HKU March 9, Jigsaw W3C’s Java Web Server.
1 RTOS Design Some of the content of this set of slides is taken from the documentation existing on the FreeRTOS website
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Thread Scheduling.
1 University of Maryland Linger-Longer: Fine-Grain Cycle Stealing in Networks of Workstations Kyung Dong Ryu © Copyright 2000, Kyung Dong Ryu, All Rights.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
Simulating a $2M Commercial Server on a $2K PC Alaa R. Alameldeen, Milo M.K. Martin, Carl J. Mauer, Kevin E. Moore, Min Xu, Daniel J. Sorin, Mark D. Hill.
On the Performance of TCP Splicing for URL-aware Redirection Ariel Cohen, Sampath Rangarajan, and Hamilton Slye The 2 nd USENIX Symposium on Internet Technologies.
1 Scheduling The part of the OS that makes the choice of which process to run next is called the scheduler and the algorithm it uses is called the scheduling.
Scalable Kernel Performance for Internet Servers under Realistic Loads. Gaurav Banga, etc... Western Research Lab : Research Report 1998/06 (Proceedings.
Module 9: Implementing Caching. Overview Caching Overview Configuring General Cache Properties Configuring Cache Rules Configuring Content Download Jobs.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
System Software Lab 1 Enhancement and Validation of Squid ’ s Cache Replacement Policy John Delley Martin Arlitt Stephane Perret WCW99 김 재 섭 EECS System.
VTurbo: Accelerating Virtual Machine I/O Processing Using Designated Turbo-Sliced Core Embedded Lab. Kim Sewoog Cong Xu, Sahan Gamage, Hui Lu, Ramana Kompella,
Nikolaos Vasiliou and Hanan Lutfiyya The University of Western Ontario London, Ontario, Canada Differentiated Quality of Service for Web Servers for Electronic.
1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
ITFN 2601 Introduction to Operating Systems Lecture 4 Scheduling.
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
Department of Computer Science and Software Engineering
Processes & Threads Introduction to Operating Systems: Module 5.
CMP/CMT Scaling of SPECjbb2005 on UltraSPARC T1 (Niagara) Dimitris Kaseridis and Lizy K. John The University of Texas at Austin Laboratory for Computer.
Managing Web Server Performance with AutoTune Agents by Y. Diao, J. L. Hellerstein, S. Parekh, J. P. Bigus Presented by Changha Lee.
Communication and Information Systems Lab. Application-Level Differentiated Services for Web Servers Tue. KAIST Dept. EECS CISLAB 송지영
Providing Differentiated Levels of Service in Web Content Hosting J ussara Almeida, Mihaela Dabu, Anand Manikutty and Pei Cao First Workshop on Internet.
-1- System Software Research Lab. Resource Management Policies for E-commerce Servers Daniel A. Menasce, Virgilio A. F. Almeida 2 nd Workshop on Internet.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
KAIST CORE LAB. Chul Lee Performance Issues in WWW Servers Erich Nahum, Tsipora Barzilai, and Dilip Kandlur IBM T.J Watson Research Center SIGMETRICS Feb.
Experiences with VI Communication for Database Storage Yuanyuan Zhou, Angelos Bilas, Suresh Jagannathan, Cezary Dubnicki, Jammes F. Philbin, Kai Li.
Taeho Kgil, Trevor Mudge Advanced Computer Architecture Laboratory The University of Michigan Ann Arbor, USA CASES’06.
Abhinav Kamra, Vishal Misra CS Department Columbia University
Dynamic Process Allocation in Apache Server
Computer Systems Performance Evaluation
Computer Systems Performance Evaluation
Xing Pu21 Ling Liu1 Yiduo Mei31 Sankaran Sivathanu1 Younggyun Koh1
Presentation transcript:

Providing Differentiated Levels of Service in Web Content Hosting Jussara Almeida, etc... First Workshop on Internet Server Performance, 1998 Computer Architecture Lab. CS Dept. KAIST 2000/9/18 Kim, Sung-Wan

1/16 Contents Introduction Methodology Design & Implementation Experiment & Results Limitations of the study Conclusions

2/16 Introduction (1/2) Increased Web content hosting –Many servers charge fees for the Web service –Customers expect quality of service proportional to the fee Apache –Most used Web server –FCFS(first-com first-served) doesn’t support for differentiated quality of service Objective –To provide differentiated levels of service by priority-based request scheduling

3/16 Introduction (2/2) Priority-based request scheduling –Performance metric QoS - Response time to the request –Method of improving performance Restrict the maximum number of concurrent processes at each priority –Approaches User-level & kernel-level

4/16 Methodology Priority of requests –Based on customer who pay, not documents Approaches –User-level Add a scheduler process to Apache –Kernel-level Both Apache and Linux kernel by adding new system calls –Mapping from request priorities into process priorities –Keeping track of which processes are running at each priority level Performance metric –Response time The average latency time taken by the server After accepting a connection, until closing the connection

5/16 Design & Implementation (1/3) Scheduling policies –Sleep policy : Upon receiving a request –Wakeup policy : In place of a completed request –Implementation Maximum thresholds : A fixed number of slots for each priority level Queue for blocked requests Conserving policies –Non-work conserving Allow requests to occupy only slots of the same type –Work conserving Does NOT allow above High-priority Low-priority Queue request or

6/16 Design & Implementation (2/3) User-level approach –A master process spawn a child process for each request and a Scheduler process –The child process determines its priority from URL Maps the customer name into a priority value Master process Scheduler spawn Child process #1 Child process #2 Child process #3 spawn requests request scheduling sleep or wakeup policy

7/16 Design & Implementation (3/3) Kernel-level approach –Parameters The number of priority levels The number of concurrent processes at each level The priority value assigned to a blocked process –SLEEPING_PRIORITY –Roles of kernel Maps request priority to a process priority Scheduling (sleep & wake-up policy) Wake-up –Decides the priority level of the processes to be unblocked –Choose the oldest process New system calls –initialize_priority_scheme, my_set_priority, my_release_priority

8/16 Experimental Setup For user-level approach –Sun SparcStation Two 66MHz CPUs, 64 MB RAM, Solaris 2.4, 100 Mbps Ethernet For kernel-level approach –DEC 90MHz Pentium 32 MB RAM, Linux , 10 Mbps Ethernet HTTP server : Apache 1.3b2, KeepAlive off Client : WebStone benchmark –6 machines, 5 client processes per machine –2 different workloads

9/16 Results (1/6) - User-level approach (1/3) Non-work conserving

10/16 Results (2/6) - User-level approach (2/3) Non-work conserving

11/16 Results (3/6) - User-level approach (3/3) Work conserving

12/16 Results (4/6) - Kernel-level approach (1/3) Average latency for requests of type A & B for both workload with no policy The configurations used in the experiments

13/16 Results (5/6) - Kernel-level approach (2/3) Average latency for workload WA Average latency for workload WB

14/16 Results (6/6) - Kernel-level approach (3/3) Average latency for workload WB using non-work conserving and SLEEPING_PRIORITY = -1

15/16 Limitations of the Study For truly differentiated QoS –CPU scheduling –Replacement policy for buffer cache –Disk I/O scheduling to favor high-priority –Networking QoS –But, focused on only CPU scheduling in this study Various mix of high-priority and low-priority requests

16/16 Conclusions Implement the priority-based scheduling Restricting the number of concurrent processes is a simple and effective strategy Work conserving policy is not adequate when the thresholds are large –Non-work conserving is better for multiple levels of priority Critique –Kernel modification –Too small benefit for high-priority, too much loss for low-priority –Thresholds parameter –Request ratio - High-priority : low-priority = 1 : 1 ? –Separated server for high-priority requests