Web Caching Robert Grimm New York University. Before We Get Started  Interoperability testing  Type theory 101.

Slides:



Advertisements
Similar presentations
Dissemination-based Data Delivery Using Broadcast Disks.
Advertisements

A Survey of Web Cache Replacement Strategies Stefan Podlipnig, Laszlo Boszormenyl University Klagenfurt ACM Computing Surveys, December 2003 Presenter:
Workloads Experimental environment prototype real sys exec- driven sim trace- driven sim stochastic sim Live workload Benchmark applications Micro- benchmark.
Memory System Characterization of Big Data Workloads
1 School of Computing Science Simon Fraser University, Canada Modeling and Caching of P2P Traffic Mohamed Hefeeda Osama Saleh ICNP’06 15 November 2006.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Multimedia Robert Grimm New York University. Before We Get Started…  Digest access authentication  What is the basic idea?  What is the encoding? 
1 A Comparison of Load Balancing Techniques for Scalable Web Servers Haakon Bryhni, University of Oslo Espen Klovning and Øivind Kure, Telenor Reserch.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
1 The Content and Access Dynamics of a Busy Web Server: Findings and Implications Venkata N. Padmanabhan Microsoft Research Lili Qiu Cornell University.
What’s a Web Cache? Why do people use them? Web cache location Web cache purpose There are two main reasons that Web cache are used:  to reduce latency.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies John Dilley and Martin Arlitt IEEE internet computing volume3 Nov-Dec 1999 Chun-Fu.
Submitting: Barak Pinhas Gil Fiss Laurent Levy
1 Probabilistic Models for Web Caching David Starobinski, David Tse UC Berkeley Conference and Workshop on Stochastic Networks Madison, Wisconsin, June.
Internet Cache Pollution Attacks and Countermeasures Yan Gao, Leiwen Deng, Aleksandar Kuzmanovic, and Yan Chen Electrical Engineering and Computer Science.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
SAIU: An Efficient Cache Replacement Policy for Wireless On-demand Broadcasts Jianliang Xu, Qinglong Hu, Dik Lun Department of Computer Science in HK University.
Web Caching Schemes For The Internet – cont. By Jia Wang.
Least Popularity-per-Byte Replacement Algorithm for a Proxy Cache Kyungbaek Kim and Daeyeon Park. Korea Advances Institute of Science and Technology (KAIST)
Large-Scale Web Caching and Content Delivery Jeff Chase CPS 212: Distributed Information Systems Fall 2000.
Web Caching and Content Delivery. Caching for a Better Web Performance is a major concern in the Web Proxy caching is the most widely used method to improve.
E-insights, LLC © 2000 All rights reserved. Understanding Web Traffic Michael Whelan part - 2.
By Ravi Shankar Dubasi Sivani Kavuri A Popularity-Based Prediction Model for Web Prefetching.
Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.
ICN Considerations for ISP’s Existing Networks Lichun Li, Xin Xu, Jun Wang, Zhenwu Hao {xu.xin18, wang.jun17,
1 Cache Me If You Can. NUS.SOC.CS5248 OOI WEI TSANG 2 You Are Here Network Encoder Sender Middlebox Receiver Decoder.
Minimal Test Collections for Retrieval Evaluation B. Carterette, J. Allan, R. Sitaraman University of Massachusetts Amherst SIGIR2006.
On the Scale and Performance of Cooperative Web Proxy Caching University of Washington Alec Wolman, Geoff Voelker, Nitin Sharma, Neal Cardwell, Anna Karlin,
A Dynamic Caching Algorithm Based on Internal Popularity Distribution of Streaming Media 資料來源 : Multimedia Systems (2006) 12:135–149 DOI /s x.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
Web Prefetching Between Low-Bandwidth Clients and Proxies : Potential and Performance Li Fan, Pei Cao and Wei Lin Quinn Jacobson (University of Wisconsin-Madsion)
Segment-Based Proxy Caching of Multimedia Streams Authors: Kun-Lung Wu, Philip S. Yu, and Joel L. Wolf IBM T.J. Watson Research Center Proceedings of The.
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
Understanding the Performance of Web Caching System with an Analysis Model and Simulation Xiaosong Hu Nur Zincir-Heywood Sep
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Web Cache Consistency. “Requirements of performance, availability, and disconnected operation require us to relax the goal of semantic transparency.”
On The Cooperation of Web Clients and Proxy Caches Yiu Fai Sit, Francis C.M. Lau, Cho-Li Wang Department of Computer Science The University of Hong Kong.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Practical LFU implementation for Web Caching George KarakostasTelcordia Dimitrios N. Serpanos University of Patras.
Multimedia Information System Lab. Network Architecture Res. Group Cooperative Video Streaming Mechanisms with Video Quality Adjustment Naoki Wakamiya.
Evaluating Content Management Techniques for Web Proxy Caches Martin Arlitt, Ludmila Cherkasova, John Dilley, Rich Friedrich and Tai Jin Proceeding on.
Network Protocols: Design and Analysis Polly Huang EE NTU
Hot Systems, Volkmar Uhlig
NUS.SOC.CS5248 Ooi Wei Tsang 1 Proxy Caching for Streaming Media.
Parallel Crawlers Efficient URL Caching for World Wide Web Crawling Presenter Sawood Alam AND.
The Measured Access Characteristics of World-Wide-Web Client Proxy Caches Bradley M. Duska, David Marwood, and Michael J. Feeley Department of Computer.
An Overview of Proxy Caching Algorithms Haifeng Wang.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Evaluating Content Management Technique for Web Proxy Cache M. Arlitt, L. Cherkasova, J. Dilley, R. Friedrich and T. Jin MinSu Shin.
Web Prefetching Lili Qiu Microsoft Research March 27, 2003.
1 COMP 431 Internet Services & Protocols HTTP Persistence & Web Caching Jasleen Kaur February 11, 2016.
Web Proxy Caching: The Devil is in the Details Ramon Caceres, Fred Douglis, Anja Feldmann Young-Ho Suh Network Computing Lab. KAIST Proceedings of the.
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
On the scale and performance of cooperative Web proxy caching 2/3/06.
© 2003, Carla Ellis Strong Inference J. Pratt Progress in science advances by excluding among alternate hypotheses. Experiments should be designed to disprove.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Whole Page Performance Leeann Bent and Geoffrey M. Voelker University of California, San Diego.
Adaptive Configuration of a Web Caching Hierarchy Pranav A. Desai Jaspal Subhlok Presented by: Pranav A. Desai.
The Impact of Replacement Granularity on Video Caching
Replacement Policy Replacement policy:
Multilevel Memories (Improving performance using alittle “cash”)
On the Scale and Performance of Cooperative Web Proxy Caching
Distributed Systems CS
Distributed Systems CS
Zipf-Distributions & Caching
Group Based Management of Distributed File Caches
Web Proxy Caching Model
Presentation transcript:

Web Caching Robert Grimm New York University

Before We Get Started  Interoperability testing  Type theory 101

Interoperability Testing  Four groups  Mangoes: Mrudang, Sri Prasad, Zeno, MaoJen, Juan  Loki: Ken, Peter, Jonathan, Sajid, Jian  Optimus: Dmitriy, Alexandre, Oleg, Natalia  Nemo: Amos, Ravi, Chris, Nikolai  Round robin testing  “X  Y” means “group X tests group Y’s server”  Mangoes  Loki  Optimus  Nemo  Mangoes

Type Theory 101  What is a type?  “Qualities common to a number of individuals that distinguish them as an identifiable class” [Merriam-Webster]  Why do we care?  Help us reason about the meaning of programs  How can we do this formally?  One approach: rewrite rules  Axioms (e.g., “() matches ()” )  Inference rules  Value matches Type Value matches Type1 | Type2

Web Caching

What’s in a Model?  Some mathematical formulation about reality  Why do we care?  Predict the future  Evaluate algorithms  Effectiveness  Limitations  Project systems behavior  Very large client populations  What’s hard about models?  Identifying a model  Verifying a model

Breslau et al. Reality  Six web proxy traces  Digital Equipment (nee Compaq nee HP)  University of California at Berkeley (Home IP service)  Questnet (Australian ISP)  National Lab for Applied Networking Research  FuNet (academic ISP in Finland)

Breslau et al. Analysis

Breslau et al. Observations  Request distribution is indeed Zipf-like  “10/90” rule does not hold  25-40% of documents draw 70% of web accesses  Low statistical correlation between  Document access frequency  Document size  Hardly any statistical correlation between  Document access frequency  Document update rate

Breslau et al. Model  Stream of requests for N web pages, ranked by popularity  Probability request is for page I  Each request is independent from others  No cache invalidations where

Breslau et al. Implications  Hit ratio grows logarithmically or like a small power with number of requests  Consistent with data, other researchers’ observations  Independent reference model suggests least- frequently-used cache replacement policy  But, GD-Size performs better for small cache sizes and LRU has decent byte hit ratios  What about temporal effects?

Cooperative Caching  Basic idea  Several caches work together to provide a larger cache  Why do we care?  We hope that a larger cache gives us better hit rates  Possible organizations  Hierarchical  Hash-based  Directory-based

Wolman et al. Questions to Ask  What is the best performance one could achieve with “perfect” cooperative caching?  For what range of client populations can cooperative caching work effectively?  Does the way in which clients are assigned to caches matter?  What cache hit rates are necessary to achieve worthwhile decreases in document access latency?

Wolman et al. Traces  From University of Washington and Microsoft ParameterUWMicrosoft HTTP Requests82.8 million107.7 million HTTP Objects18.4 million15.3 million Total Bytes677 GB(N/A) Ave. Requests/s Clients22,98460,233 Servers244,211360,586 Duration7 days6 days 6 hours

Wolman et al. Simulation Methodology  Infinite-size caches  No capacity misses, but compulsory misses  Two types of caches  Ideal  Everything is cacheable  Practical  HTTP/1.1 cache control headers, no-cache pragmas  Cookies  Object names with suffixes mapping dynamic objects  Uncacheable methods  Authorization, Vary header fields

Wolman et al. Hit Rate vs. Population  Why is Microsoft’s ideal rate higher than UW’s?  How many caches should we deploy?

Wolman et al. Latency vs. Population  What is the impact of population size on latency?

Wolman et al. How to Save Bandwidth  How do shared objects compare to other objects in size?  How does population size impact bandwidth consumption?

Wolman et al. Hit Rate vs. Organizations  What is the effect of organizations?  Real  Random  What is the effect of cooperative caching between organizations?

Wolman et al. Hit Rate vs. Large Population  What is the correlation between sharing and cacheability?  Are there population limits?

Wolman et al. Hit Rate vs. Cooperation  What is the degree of sharing between organizations?  What is the case for unpopular documents?

Wolman et al. Model  Just like Breslau et al., but  Steady-state performance rather than finite sequence  Incorporates document rate of change  Exponential distribution  Independent of document size and latency  Dependent on popularity  What’s the intuition here?

Wolman et al. Rate of Change (in Days) Scenario Popular Mean Popular Median Unpopular Mean Unpopular Median Normal14< Always change 3< Never change 27< Cacheable5< Uncacheable<1 22<1

Wolman et al. Implications on Hit Rate  What is the impact of rate of change on hit rate?

Wolman et al. Implications on Hit Rate (cont.)  Again, what is the impact of rate of change on hit rate? 250,000 clients20 million clients

Wolman et al. Cooperative Caching City State West Coast  What about latency? Request rate?

Wolman et al. Conclusions  Little need for more work on cooperative caching  Largest benefit achieved with small populations  Performance limited by cacheability  Mutual interest does not provide advantages  What about the effects of  Dynamic documents?  Streaming multimedia?  What about protocols?