1 Caching Characteristics of Internet and Intranet Web Proxy Traces Arthur Goldberg Ilya Pevzner Robert Buff Courant Institute of Mathematical Sciences.

Slides:



Advertisements
Similar presentations
Internet Applications INTERNET APPLICATIONS. Internet Applications Domain Name Service Proxy Service Mail Service Web Service.
Advertisements

Conserving Disk Energy in Network Servers ACM 17th annual international conference on Supercomputing Presented by Hsu Hao Chen.
Latency-sensitive hashing for collaborative Web caching Presented by: Xin Qi Yong Yang 09/04/2002.
Cloud Download : Using Cloud Utilities to Achieve High-quality Content Distribution for Unpopular Videos Yan Huang, Tencent Research, Shanghai, China Zhenhua.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Multimedia Proxy Caching Mechanism for Quality Adaptive Streaming Applications in the Internet R. Rejaie, H. Yu, M. Handley, D. Estrin.
October 14, 2002MASCOTS Workload Characterization in Web Caching Hierarchies Guangwei Bai Carey Williamson Department of Computer Science University.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
1 A Framework for Lazy Replication in P2P VoD Bin Cheng 1, Lex Stein 2, Hai Jin 1, Zheng Zhang 2 1 Huazhong University of Science & Technology (HUST) 2.
1 Web Performance Modeling Chapter New Phenomena in the Internet and WWW Self-similarity - a self-similar process looks bursty across several time.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Energy Efficient Prefetching – from models to Implementation 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software Engineering.
1 Simultaneous Distribution Control and Privacy Protection for Proxy based Media Distribution George Mason University Songqing Chen (George Mason University)
Analysis of Web Caching Architectures: Hierarchical and Distributed Caching Pablo Rodriguez, Christian Spanner, and Ernst W. Biersack IEEE/ACM TRANSACTIONS.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies John Dilley and Martin Arlitt IEEE internet computing volume3 Nov-Dec 1999 Chun-Fu.
Web Caching Robert Grimm New York University. Before We Get Started  Interoperability testing  Type theory 101.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Web-Conscious Storage Management for Web Proxies Evangelos P. Markatos, Dionisios N. Pnevmatikatos, Member, IEEE, Michail D. Flouris, and Manolis G. H.
A Hybrid Caching Strategy for Streaming Media Files Jussara M. Almeida Derek L. Eager Mary K. Vernon University of Wisconsin-Madison University of Saskatchewan.
A Case for Delay-conscious Caching of Web Documents Peter Scheuermann, Junho Shim, Radek Vingralek Department of Electrical and Computer Engineering Northwestern.
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
Web Proxy Server Anagh Pathak Jesus Cervantes Henry Tjhen Luis Luna.
Locality-Aware Request Distribution in Cluster-based Network Servers Presented by: Kevin Boos Authors: Vivek S. Pai, Mohit Aron, et al. Rice University.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
1 Analysis of Push Initiator Tool used for Wireless Application Protocol Taotao Huang Helsinki University of Technology Department of Electrical and Communication.
1 Towards Cinematic Internet Video-on-Demand Bin Cheng, Lex Stein, Hai Jin and Zheng Zhang HUST and MSRA Huazhong University of Science & Technology Microsoft.
« Performance of Compressed Inverted List Caching in Search Engines » Proceedings of the International World Wide Web Conference Commitee, Beijing 2008)
Aditya Akella The Performance Benefits of Multihoming Aditya Akella CMU With Bruce Maggs, Srini Seshan, Anees Shaikh and Ramesh Sitaraman.
Understanding the Performance of Web Caching System with an Analysis Model and Simulation Xiaosong Hu Nur Zincir-Heywood Sep
Internet Information Server Name : Yao Gu Date : 10-June-2000 COSC : 573.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Dr. Yingwu Zhu Summary Cache : A Scalable Wide- Area Web Cache Sharing Protocol.
1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason.
Module 9: Implementing Caching. Overview Caching Overview Configuring General Cache Properties Configuring Cache Rules Configuring Content Download Jobs.
《 Hierarchical Caching Management for Software Defined Content Network based on Node Value 》 Reporter : Jing Liu , China Affiliation : University of Science.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.
Improving Disk Throughput in Data-Intensive Servers Enrique V. Carrera and Ricardo Bianchini Department of Computer Science Rutgers University.
Implementing ISA Server Caching
Delta Encoding for HTTP/1.1 Sloutsky Alexander Fink Dmitry Supervised by Lavy Libman.
1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
August 23, 2001ITCom2001 Proxy Caching Mechanisms with Video Quality Adjustment Masahiro Sasabe Graduate School of Engineering Science Osaka University.
1 Chapter Seven. 2 Users want large and fast memories! SRAM access times are ns at cost of $100 to $250 per Mbyte. DRAM access times are ns.
Memory Hierarchy: Terminology Hit: data appears in some block in the upper level (example: Block X)  Hit Rate : the fraction of memory access found in.
1 Chapter Seven CACHE MEMORY AND VIRTUAL MEMORY. 2 SRAM: –value is stored on a pair of inverting gates –very fast but takes up more space than DRAM (4.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Evaluating Content Management Technique for Web Proxy Cache M. Arlitt, L. Cherkasova, J. Dilley, R. Friedrich and T. Jin MinSu Shin.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
1 COMP 431 Internet Services & Protocols HTTP Persistence & Web Caching Jasleen Kaur February 11, 2016.
Web Proxy Caching: The Devil is in the Details Ramon Caceres, Fred Douglis, Anja Feldmann Young-Ho Suh Network Computing Lab. KAIST Proceedings of the.
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
On the scale and performance of cooperative Web proxy caching 2/3/06.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Virtual Memory By CS147 Maheshpriya Venkata. Agenda Review Cache Memory Virtual Memory Paging Segmentation Configuration Of Virtual Memory Cache Memory.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
Presented by Peter Oldreive ICT Consultant in English Language Art, Social Studies, Fine Arts Eric Therrien ICT Consultant in Mathematics & Sciences Getting.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Web Performance Modeling (Book, Chapter 10)
W3 Status Analyzer.
Internet Applications
A Presentation on online voting system
INTERNET APPLICATIONS
Your computer is the client
Presentation transcript:

1 Caching Characteristics of Internet and Intranet Web Proxy Traces Arthur Goldberg Ilya Pevzner Robert Buff Courant Institute of Mathematical Sciences New York University

2 Clients, Servers and Proxy

3 HTTP Through a Proxy BrowserProxyServer Miss Hit

4 Potential Web Caching Benefits Reduce response time by delivering document from a closer and/or less loaded server than the origin serverReduce response time by delivering document from a closer and/or less loaded server than the origin server Save bandwidth costs between proxy and origin serverSave bandwidth costs between proxy and origin server

5 Goals Study large internet and intranet tracesStudy large internet and intranet traces Evaluate caching opportunities and problemsEvaluate caching opportunities and problems Examine cache size needs and document residence timesExamine cache size needs and document residence times

6 Part 1 Proxy trace sources and proxy configurations

7 Data Sources

8 ISP Usage 450,000 users450,000 users LoadLoad –Peak 500 unique clients500 unique clients 30 requests per second30 requests per second –Average 1M requests per day1M requests per day

9 ISP hardware details IBM RS/6000 systemIBM RS/6000 system 256 MB RAM256 MB RAM Three 4 GB disksThree 4 GB disks

10 ISP proxy configuration details 8 proxies nationwide8 proxies nationwide Netscape 2.5 proxyNetscape 2.5 proxy 5.5 GB cache size5.5 GB cache size Netscape extended-2 log formatNetscape extended-2 log format ParametersParameters –max-uncheck - 6 hours –lm-factor –term-percent - 80%

11 Intranet Usage 8,000 employees8,000 employees LoadLoad –Peak VariesVaries –Average 500K requests per day, over 10 hours500K requests per day, over 10 hours

12 Intranet hardware details Sun Microsystems Ultra 1 serverSun Microsystems Ultra 1 server 1 GB RAM1 GB RAM Seven 4 GB disksSeven 4 GB disks

13 Intranet proxy configuration details 2 proxies2 proxies Squid proxySquid proxy 12 GB disk cache size12 GB disk cache size 750MB memory cache size750MB memory cache size Extended log formatExtended log format

14 Part 2 Analysis of ISP and Intranet traces assuming unlimited cache storage

15 Key Cache Metrics Hit Ratio (HR )Hit Ratio (HR ) Fractional Bandwidth Savings (BT)Fractional Bandwidth Savings (BT)

16 Analyzing Caching Properties

17 ISP documents that cannot be cached, as per HTTP specification

18 Comment about “cookies” For Prodigy, RFC figures assume that Netscape proxy follows RFCFor Prodigy, RFC figures assume that Netscape proxy follows RFC In reality, Netscape proxy does not cache documents with cookiesIn reality, Netscape proxy does not cache documents with cookies Documents with cookies, account for 2% of responses in Prodigy traceDocuments with cookies, account for 2% of responses in Prodigy trace It follows that RFC figures for Prodigy may be up to 2% higher than shownIt follows that RFC figures for Prodigy may be up to 2% higher than shown

19 ISP Hit Ratio vs. Trace Length

20 ISP BT vs. Trace Length

21 Intranet HR vs. Trace Length

22 Intranet BT vs. Trace Length

23 Part 3 Analysis of ISP trace with finite cache sizes.

24 Prophetic Cache Replacement Algorithm A Prophetic cache stores exactly the set of documents that will be referenced in the futureA Prophetic cache stores exactly the set of documents that will be referenced in the future An on-line prophetic cache algorithm cannot be builtAn on-line prophetic cache algorithm cannot be built However, given a trace, prophetic caching decisions can be determined off- lineHowever, given a trace, prophetic caching decisions can be determined off- line

25 Prophetic Cache Replacement Algorithm (continued) Cache space used by a prophetic cache is the minimum size needed to avoid cache missesCache space used by a prophetic cache is the minimum size needed to avoid cache misses –notes: true for any maximum residence timetrue for any maximum residence time analyses make cyclical tracesanalyses make cyclical traces

26 Maximum Hit Rate as a function of residence time

27 Maximum Hit Rate as a function of residence time, by document size

28 Conclusions We analyze very long Web proxy traces from an ISP and an intranetWe analyze very long Web proxy traces from an ISP and an intranet We propose a new method to evaluate a proxy by comparing the actual hit rate with potential hit rateWe propose a new method to evaluate a proxy by comparing the actual hit rate with potential hit rate We show that it is important to keep the cache residence time above one dayWe show that it is important to keep the cache residence time above one day

29 Addresses WWW: Paper and presentation is available at and presentation is available at