KAIST CORE LAB. Chul Lee Performance Issues in WWW Servers Erich Nahum, Tsipora Barzilai, and Dilip Kandlur IBM T.J Watson Research Center SIGMETRICS Feb.

Slides:



Advertisements
Similar presentations
Distributed Packet Rewriting and its Application to Scalable Server Architectures The 6 th IEEE International Conference on Network Protocol, Oct
Advertisements

The Effects of Wide-Area Conditions on WWW Server Performance Erich Nahum, Marcel Rosu, Srini Seshan, Jussara Almeida IBM T.J. Watson Research Center,
Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.
IO-Lite: A Unified Buffering and Caching System By Pai, Druschel, and Zwaenepoel (1999) Presented by Justin Kliger for CS780: Advanced Techniques in Caching.
Flash: An efficient and portable Web server Authors: Vivek S. Pai, Peter Druschel, Willy Zwaenepoel Presented at the Usenix Technical Conference, June.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
1 Network Measurements of a Wireless Classroom Network Carey Williamson Nuha Kamaluddeen Department of Computer Science University of Calgary.
2006/07/12ICPADS Secure and High-performance Web Server System for Shared Hosting Service Daisuke Hara and Yasuichi Nakayama The University.
Oct. 27, 2010BROADNETS Greece1 Hi-sap: Secure and Scalable Web Server System for Shared Hosting Services Daisuke Hara, Ryohei Fukuda, Kazuki.
CStream: Neighborhood Bandwidth Aggregation For Better Video Streaming Thangam Vedagiri Seenivasan Advisor: Mark Claypool Reader: Robert Kinicki 1 M.S.
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
M. S. Thesis Defense, 09/24/20011 Migratory TCP (MTCP) Transport Layer Support for Highly- Available Network Services Kiran Srinivasan DisCoLab Division.
1 Design and Implementation of A Content-aware Switch using A Network Processor Li Zhao, Yan Luo, Laxmi Bhuyan University of California, Riverside Ravi.
Project objectives Side issues Main issues MileGate ? Embedded HTTP server HTML Generation Client tools Frameworks functionalities Evaluated frameworks.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
Yaksha: A Self-Tuning Controller for Managing the Performance of 3-Tiered Web Sites Abhinav Kamra, Vishal Misra CS Department Columbia University Erich.
Protocol Implementation An Engineering Approach to Computer Networking.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Automated Workload Management in.
Xen and the Art of Virtualization. Introduction  Challenges to build virtual machines Performance isolation  Scheduling priority  Memory demand  Network.
Department of Computer Science Southern Illinois University Edwardsville Dr. Hiroshi Fujinoki and Kiran Gollamudi {hfujino,
©2001 Pål HalvorsenINFOCOM 2001, Anchorage, April 2001 Integrated Error Management in MoD Services Pål Halvorsen, Thomas Plagemann, and Vera Goebel University.
Performance Tradeoffs for Static Allocation of Zero-Copy Buffers Pål Halvorsen, Espen Jorde, Karl-André Skevik, Vera Goebel, and Thomas Plagemann Institute.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
Performance of Web Applications Introduction One of the success-critical quality characteristics of Web applications is system performance. What.
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services
Automatic Identification of Concurrency in Handel-C Joseph C Libby, Kenneth B Kent, Farnaz Gharibian Faculty of Computer Science University of New Brunswick.
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
CORE KAIST EECS Computer Engineering Research Lab A General Purpose Proxy Filtering Mechanism Applied to the Mobile Environment Bruce Zenel Jupyung Lee.
A Novel Adaptive Distributed Load Balancing Strategy for Cluster CHENG Bin and JIN Hai Cluster.
High Performance User-Level Sockets over Gigabit Ethernet Pavan Balaji Ohio State University Piyush Shivam Ohio State University.
张俊 BTLab Embedded Virtualization Group Outline  Introduction  Performance Analysis  PerformanceTuning Methods.
1 Specification and Implementation of Dynamic Web Site Benchmarks Sameh Elnikety Department of Computer Science Rice University.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
On the Performance of TCP Splicing for URL-aware Redirection Ariel Cohen, Sampath Rangarajan, and Hamilton Slye The 2 nd USENIX Symposium on Internet Technologies.
Srihari Makineni & Ravi Iyer Communications Technology Lab
Scalable Kernel Performance for Internet Servers under Realistic Loads. Gaurav Banga, etc... Western Research Lab : Research Report 1998/06 (Proceedings.
Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.
CS 164: Slide Set 2: Chapter 1 -- Introduction (continued).
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
TCP behavior of a Busy Internet Server: Analysis and Improvements Y2K Oct.10 Joo Young Hwang Computer Engineering Research Laboratory KAIST. EECS.
Autonomic scheduling of tasks from data parallel patterns to CPU/GPU core mixes Published in: High Performance Computing and Simulation (HPCS), 2013 International.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Providing Differentiated Levels of Service in Web Content Hosting Jussara Almeida, etc... First Workshop on Internet Server Performance, 1998 Computer.
LEGS: A WSRF Service to Estimate Latency between Arbitrary Hosts on the Internet R.Vijayprasanth 1, R. Kavithaa 2,3 and Raj Kettimuthu 2,3 1 Coimbatore.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
TCP Offload Through Connection Handoff Hyong-youb Kim and Scott Rixner Rice University April 20, 2006.
A record and replay mechanism using programmable network interface cards Laurent Lefèvre INRIA / LIP (UMR CNRS, INRIA, ENS, UCB)
What is gaming console & platform? A game console is a device which outputs video signal into TV screen to display the video game. A platform is in which.
Sockets Direct Protocol Over InfiniBand in Clusters: Is it Beneficial? P. Balaji, S. Narravula, K. Vaidyanathan, S. Krishnamoorthy, J. Wu and D. K. Panda.
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
 This is a very good software for Only £39.99 from curry's I recommend this website because you need very little requirements for it. REQUIREMENTS Operating.
Survey and evaluate different client Frameworks and describe their compatibilities with the web services.
LAIO: Lazy Asynchronous I/O For Event Driven Servers Khaled Elmeleegy Alan L. Cox.
Ethernet Bomber Ethernet Packet Generator for network analysis
UNIT-3 Performance Evaluation UNIT-3 IT2031. Web Server Hardware and Performance Evaluation Key question is whether a company should host their own Web.
Computer Organization Instruction Set Architecture (ISA) Instruction Set Architecture (ISA), or simply Architecture, of a computer is the.
An Efficient Threading Model to Boost Server Performance Anupam Chanda.
Communication and Information Systems Lab. Application-Level Differentiated Services for Web Servers Tue. KAIST Dept. EECS CISLAB 송지영
Providing Differentiated Levels of Service in Web Content Hosting J ussara Almeida, Mihaela Dabu, Anand Manikutty and Pei Cao First Workshop on Internet.
Cachebox 200 Series Performance Caching, Outstanding Value CONDUIT TECHNOLOGIES PVT.LTD.
Taeho Kgil, Trevor Mudge Advanced Computer Architecture Laboratory The University of Michigan Ann Arbor, USA CASES’06.
Expertsfromindia for Joomla Development. Introduction Joomla is an open source and free content management system (CMS) for publishing content on the.
GridOS: Operating System Services for Grid Architectures
Abhinav Kamra, Vishal Misra CS Department Columbia University
Diskpool and cloud storage benchmarks used in IT-DSS
Overview Introduction VPS Understanding VPS Architecture
Admission Control and Request Scheduling in E-Commerce Web Sites
Performance Issues in WWW Servers
Cluster Computers.
Presentation transcript:

KAIST CORE LAB. Chul Lee Performance Issues in WWW Servers Erich Nahum, Tsipora Barzilai, and Dilip Kandlur IBM T.J Watson Research Center SIGMETRICS Feb Chul Lee

KAIST CORE LAB. Chul Lee Contents Introduction Background Experimental Setup/Testbed Evaluation Result Conclusion and Critique

KAIST CORE LAB. Chul Lee Introduction Performance issues in WWW server on UNIX-style platforms Issues –new socket function –per-byte optimizations –per-connection optimizations

KAIST CORE LAB. Chul Lee Background

KAIST CORE LAB. Chul Lee Experimental Setup/Testbed Hardware –4 IBM 3P RS/6000 (128MB of RAM and 200MHz PowerPC604e) –100 mbps ethernet Workload Generator –WebStone as a microbenchmark –SURGE as a macrobenchmark

KAIST CORE LAB. Chul Lee

Experimental Setup/Testbed OS –AIX with several modifications Web server software –Flash POLL Single threaded event-driven server Fastest, well optimized, exploit almost all optimization Use poll() rather than select()

KAIST CORE LAB. Chul Lee WebStone benchmark Throughput in operation/sec

KAIST CORE LAB. Chul Lee Evaluation Result proposed socket function –acceptex() Combines accept(),getsockname(),recv() –send_file() over mmap()/writev() send_file() copy data from the file system mmap()/writev() copy data from user space send_file() perform poorly

KAIST CORE LAB. Chul Lee Evaluation Result Throughput in HTTP ops/sec

KAIST CORE LAB. Chul Lee Evaluation Result Per-Byte Optimization –send_file() with mbuf cache a close approximation of a zero-copy –Disabling the internet checksum certain network interface support Host CPU does not touch the data at all

KAIST CORE LAB. Chul Lee Evaluation Result Throughput in HTTP ops/sec

KAIST CORE LAB. Chul Lee Evaluation Result Per-Connection Optimization –send_file() with close() –Piggybacking the FIN –Delaying ACK of FIN –Delaying ACK of SYN-ACK

KAIST CORE LAB. Chul Lee

Evaluation Result

KAIST CORE LAB. Chul Lee Evaluation Result

KAIST CORE LAB. Chul Lee Evaluation Result Overall performance with SURGE

KAIST CORE LAB. Chul Lee Conclusion new socket function –little increase in performance using acceptex() per-byte optimizations –observed an increase in throughput up to 51% per-connection optimizations –Raising server throughput by up to 20% Aggregate benefits –Improved aggregate server performance by 25%

KAIST CORE LAB. Chul Lee Conclusion IBM ’ s AIX division released these features in AIX For future work, evaluation of these mechanisms with HTTP 1.1 workloads Critique –Good observation/evaluation –Contributed to the newly released OS –Considered only throughput as a metric