Design, Implementation and Evaluation of a Client Characterization Driven Web Server Balachander Krishnamurthy – AT&T Research Labs Balachander Krishnamurthy.

Slides:



Advertisements
Similar presentations
Inktomi Confidential and Proprietary The Inktomi Climate Lab: An Integrated Environment for Analyzing and Simulating Customer Network Traffic Stephane.
Advertisements

Distributed Multimedia Systems Tarek Elshaarani Vahid Rafiei.
Design and Implementation of HTTP-Gnutella Gateway Baoning Wu (baw4) Wei Zhang (wez5) CSE Department Lehigh University.
Dynamic Adaptive Streaming over HTTP2.0. What’s in store ▪ All about – MPEG DASH, pipelining, persistent connections and caching ▪ Google SPDY - Past,
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
Outline Web measurement motivation Challenges of web measurement Web measurement tools Current web measurements Web properties Web traffic data gathering.
Toolbox Mirror -Overview Effective Distributed Learning.
Dynamic Process Allocation in Apache Server Yu Cai.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Capacity planning for web sites. Promoting a web site Thoughts on increasing web site traffic but… Two possible scenarios…
What is adaptive web technology?  There is an increasingly large demand for software systems which are able to operate effectively in dynamic environments.
Investigating Forms of Simulating Web Traffic Yixin Hua Eswin Anzueto Computer Science Department Worcester Polytechnic Institute Worcester, MA.
Content Networking - CON Content Overlay Network Vishal Kumar Singh Eilon Yardeni April, 28 th 2005.
 Proxy Servers are software that act as intermediaries between client and servers on the Internet.  They help users on private networks get information.
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
CS 218 F 2003 Nov 3 lecture:  Streaming video/audio  Adaptive encoding (eg, layered encoding)  TCP friendliness References: r J. Padhye, V.Firoiu, D.
CIS679: RTP and RTCP r Review of Last Lecture r Streaming from Web Server r RTP and RTCP.
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
 Zhichun Li  The Robust and Secure Systems group at NEC Research Labs  Northwestern University  Tsinghua University 2.
Towards a Safe Playground for HTTPS and Middle-Boxes with QoS2 Zhenyu Zhou CS Dept., Duke University.
Network-Enabled Digital Video Recorder (NED) LANMAN’01, March 2001 C. Cranor *, C Kalmanek *, D. Shur *, S. Sibal *, C. Sreenan +, J. van der Merwe * *
Performance of Web Applications Introduction One of the success-critical quality characteristics of Web applications is system performance. What.
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services
Application Measurements: Web Measurement. Motivation Web is the single most popular Internet application. Measurement can be very useful.
1 Web Server Administration Chapter 1 The Basics of Server and Web Server Administration.
1 An SLA-Oriented Capacity Planning Tool for Streaming Media Services Lucy Cherkasova, Wenting Tang, and Sharad Singhal HPLabs,USA.
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
Adaptive Overload Control for Busy Internet Servers Matt Welsh and David Culler USENIX Symposium on Internet Technologies and Systems (USITS) 2003 Alex.
Othman Othman M.M., Koji Okamura Kyushu University 1.
_______________________________________________________________________________________________________________ E-Commerce: Fundamentals and Applications1.
© Lindsay Bradford1 Scaling Dynamic Web Content Provision Using Elapsed-Time- Based Content Degradation Lindsay Bradford, Stephen Milliner and.
1 Specification and Implementation of Dynamic Web Site Benchmarks Sameh Elnikety Department of Computer Science Rice University.
Performance of HTTP Application in Mobile Ad Hoc Networks Asifuddin Mohammad.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
Othman Othman M.M., Koji Okamura Kyushu University 1.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.
On Network-Aware Clustering of Web Clients Balachander Krishnamurthy AT&T Labs-Research, Florham Park, NJ, USA Jia Wang
A Method for Transparent Admission Control and Request Scheduling in E-Commerce Web Sites S. Elnikety, E. Nahum, J. Tracey and W. Zwaenpoel Presented By.
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
CHUL LEE, CORE Lab. E.E. 1 Web Server QoS Management by Adaptive Content Delivery September Chul Lee Tarek F. Abdelzaher and Nina Bhatti Quality.
DotSlash – or how to deal with 15 minutes of fame Weibin Zhao Henning Schulzrinne Columbia University CATT/WICAT Annual Research Review November 14, 2003.
Web Server.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
UNIT-3 Performance Evaluation UNIT-3 IT2031. Web Server Hardware and Performance Evaluation Key question is whether a company should host their own Web.
Apache Web Server Architecture Chaitanya Kulkarni MSCS rd April /23/20081Apache Web Server Architecture.
An Efficient Threading Model to Boost Server Performance Anupam Chanda.
On the Effect of Server Adaptation for Web Content Delivery IMW ’ 02, Marseille, Nov Joint work with Balachander Krishnamurthy (AT&T) Craig Wills.
Information-Centric Networks Section # 5.1: Content Distribution Instructor: George Xylomenos Department: Informatics.
By Ruizhe Ma, Avinash Madineni Sidoine Lafleur Kamgang Nov,
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
General Architecture of Retrieval Systems 1Adrienn Skrop.
Chapter 8 Environments, Alternatives, and Decisions.
WWW and HTTP King Fahd University of Petroleum & Minerals
Processes The most important processes used in Web-based systems and their internal organization.
Early Measurements of a Cluster-based Architecture for P2P Systems
On the Use and Performance of Content Distribution Networks
Admission Control and Request Scheduling in E-Commerce Web Sites
Group 3: Olena Hunsicker and Divya Josyula
Presentation transcript:

Design, Implementation and Evaluation of a Client Characterization Driven Web Server Balachander Krishnamurthy – AT&T Research Labs Balachander Krishnamurthy – AT&T Research Labs Craig Wills – WPI Yin Zhang – AT&T Research Labs Kashi Vishwanath – Duke University

Users with varying connectivity request a variety of files Web site wants no user to get disinterestedWeb site wants no user to get disinterested Handle requests based on connectivityHandle requests based on connectivity Motivation End Users Web server

Outline of our solution Figure out connectivity : use web server logs Figure out connectivity : use web server logs Classify clients(IPs) :store in shared memory Classify clients(IPs) :store in shared memory  Cluster: group of IPs [KW00] When a page is requested, the Web server: When a page is requested, the Web server:  Identifies client connectivity  Takes action if required: alternate content etc [12/Apr/2003:01:40:22 –0400] “GET /foo.html HTTP/1.0” [12/Apr/2003:01:40:22 –0400] “GET /foo.html HTTP/1.0”

System Architecture Server Log Classification Table (Shared Memory) Classifier User Requests Apache Processes

What is a page access? User clicks: container page page eg. /foo.html User clicks: container page page eg. /foo.html Subsequent requests : embedded objects eg. /img1.gif Subsequent requests : embedded objects eg. /img1.gif At server : how long before the user At server : how long before the user  starts getting the first object ?:E_first  gets the entire page ?: E_last Smoothed means for popular pages Smoothed means for popular pages Classes (poor, rich, normal) – [KW02] Classes (poor, rich, normal) – [KW02]  Poor : E_first >3 and E_last >5  Rich : E_first <=1 and E_last <=2

Modified Apache Server Modification to Apache Modification to Apache  three files, less than 200 lines  http_main.c, http_protocol.c, http_core.c One time overhead One time overhead  Open cluster library  Read in configuration file  /foo.html.gz.p_lc ;.r_mc  Initialize shared memory

Modified Apache Server : Per Request Changes Per Request Changes Per Request Changes  Is it a popular page ( URI ) ?  Lookup class (in shared memory data)  Cluster lookup if IP unavailable  Tailored action if appropriate and log it

Testing: How to choose test pages Proxy logs from a large manufacturing company (Dec 2001) Proxy logs from a large manufacturing company (Dec 2001) Over 100,000 users Over 100,000 users Select 1000 most popular pages Select 1000 most popular pages Download these : April 2002 Download these : April successful pages reconstructed 641 successful pages reconstructed 33% and 66% percentile value for characteristics 33% and 66% percentile value for characteristics  Container bytes, # of Embedded object, embedded bytes  3 x 3 x 3 = 27 buckets of pages

Testing: Distribution for test pages Percentage of pages in each bucket

Server Overhead : Latency increase at client Generate IPs (X-IP header)

Plot of average increase in processing time for the modified server Plot of average increase in processing time for the modified server Stress Test

Placing Clients and Modified Servers Prototype Apache Server with our test site Prototype Apache Server with our test site  Linux – att.com in NJ, USA  Linux – wpi.com MA, USA  FreeBSD – icir.org CA,USA Clients Clients  att:AT&T Labs-Research, NJ,USA  de: Saarbruecken University, Germany  cable: cable modem user, NJ,USA  modem: 56Kbps dialup modem user, NJ,USA  uk:London, U.K via a dedicated 56Kbps line.

Clients: Observed network characteristics uk-icir uk-att cable-icir uk-wpi de-att att-icir att-wpi Spans a wide spectrum

Experiments :Httperf Clients Request similar mix of pages as described earlier Request similar mix of pages as described earlier 200 random requests each 200 random requests each New Headers: X-Server-Actions, X-Class New Headers: X-Server-Actions, X-Class Baseline measure Baseline measure  parallel-1.0 – up to 4 parallel HTTP/1.0 requests Server actions Server actions  Manner of delivery  Compress, serial-1.1, pipeline-1.1, bundle  Amount served  Base-only, half-object,half-resolution

Classifier Stability and Correctness Overall consistent with expectation 90% Rich Moderate Poor

Results : Low throughput client Uk-wpi

Results : medium throughput de-att de-att

Results:Summary

Conclusions What all do we do ? What all do we do ?  Online client classification  Deliver modified server actions  Measure latency reduction to different clients  Compare various actions First to do this in a unified framework.

Conclusions: Server and Classifier Overhead at server Overhead at server  Average:75 usec. Negligible for end-user  Turn off classification during overload  Server poor-content during overload Classification Classification  Close to expectation  Stable over the duration of experiment  Improve by using select pages and better thresholds

Future Other server actions Other server actions  Delta encoding  Policies regarding cacheablility of objects Create a test for clustering Create a test for clustering

Acknowledgement Client testing Client testing  Saarbruecken, Germany  ICIR, USA Proxy logs - Manufacturing company Proxy logs - Manufacturing company

Thanks! Questions ? Slides:

Related Work Mark network packets [NT02] Mark network packets [NT02]  Improve performance of all clients Adapt content based on server load [AB99] Adapt content based on server load [AB99] User’s expectation [BBK00] User’s expectation [BBK00] Alternate admission control and server scheduling policies [CMC01, CR02] Alternate admission control and server scheduling policies [CMC01, CR02] Dynamically altering multimedia content [MB01] Dynamically altering multimedia content [MB01] Explicit client notification e.g. SPAND [ SSK97] Explicit client notification e.g. SPAND [ SSK97]