Web Cache Characterizing Roles of Front-end Servers in End-to-End Performance of Dynamic Content Distribution  46842197Li ZHANG  78884704 Dakuo WANG.

Slides:



Advertisements
Similar presentations
Network Aware Forward Caching Presenter: Alexandre Gerber Jeffrey Erman, Mohammad T. Hajiaghayi, Dan Pei, Oliver Spatscheck AT&T Labs Research April 24.
Advertisements

Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol Li Fan, Pei Cao and Jussara Almeida University of Wisconsin-Madison Andrei Broder Compaq/DEC.
Networking Problems in Cloud Computing Projects. 2 Kickass: Implementation PROJECT 1.
Multicasting in Mobile Ad-Hoc Networks (MANET)
A Taxonomy and Survey of Content Delivery Networks Meng-Huan Wu 2011/10/26 1.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol By Abuzafor Rasal and Vinoth Rayappan.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
1 Web Proxies Dr. Rocky K. C. Chang 6 November 2005.
Flash Crowds And Denial of Service Attacks: Characterization and Implications for CDNs and Web Sites Aaron Beach Cs395 network security.
Internet Networking Spring 2002 Tutorial 13 Web Caching Protocols ICP, CARP.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Web Caching Schemes For The Internet – cont. By Jia Wang.
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
The Medusa Proxy A Tool For Exploring User- Perceived Web Performance Mimika Koletsou and Geoffrey M. Voelker University of California, San Diego Proceeding.
Content Networking - CON Content Overlay Network Vishal Kumar Singh Eilon Yardeni April, 28 th 2005.
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
Web Cache. Introduction what is web cache?  Introducing proxy servers at certain points in the network that serve in caching Web documents for faster.
Towards Understanding Modern Web Traffic
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
Christopher M. Pascucci Basic Structural Concepts of.NET Browser – Server Interaction.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
1. 1.Charting the CDNs(locating all their content and DNS servers). 2.Assessing their server availability. 3.Quantifying their world-wide delay performance.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
1 Chapter 6: Proxy Server in Internet and Intranet Designs Designs That Include Proxy Server Essential Proxy Server Design Concepts Data Protection in.
Chapter 1: Introduction to Web Applications. This chapter gives an overview of the Internet, and where the World Wide Web fits in. It then outlines the.
DISTRIBUTED COMPUTING
Networks QUME 185 Introduction to Computer Applications.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
Exploiting Proxy-Based Transcoding to Increase the User Quality of Experience in Networked Applications Maarten Wijnants Patrick Monsieurs Peter Quax Wim.
1 Chapter 1 Web Components (Introduction) Web Protocols and Practice.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
The Inter-network is a big network of networks.. The five-layer networking model for the internet.
NetCache Architecture and Deployment Peter Danzig Network Appliance, Santa Clara, CA 元智大學 系統實驗室 陳桂慧
Web Performance 성민영 SNU Computer Systems lab.. 2 차례 4 Modeling the Performance of HTTP Over Several Transport Protocols. 4 Summary Cache : A Scaleable.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Dr. Yingwu Zhu Summary Cache : A Scalable Wide- Area Web Cache Sharing Protocol.
Investigating the Performance of Audio/Video Service Architecture II: Broker Network Ahmet Uyar & Geoffrey Fox Tuesday, May 17th, 2005 The 2005 International.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
ICP and the Squid Web Cache Duane Wessels and K. Claffy 산업공학과 조희권.
Network Protocols: Design and Analysis Polly Huang EE NTU
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
REST By: Vishwanath Vineet.
Firewalls A brief introduction to firewalls. What does a Firewall do? Firewalls are essential tools in managing and controlling network traffic Firewalls.
MCSE Guide to Microsoft Exchange Server 2003 Administration Chapter One Introduction to Exchange Server 2003.
Library Online Resource Analysis (LORA) System Introduction Electronic information resources and databases have become an essential part of library collections.
Hiearchial Caching in Traffic Server. Hiearchial Caching  A set of techniques and mechanisms to increase the size and performance of network caches.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
Performance Comparison of Ad Hoc Network Routing Protocols Presented by Venkata Suresh Tamminiedi Computer Science Department Georgia State University.
DISTRIBUTED FILE SYSTEM- ENHANCEMENT AND FURTHER DEVELOPMENT BY:- PALLAWI(10BIT0033)
Coral: A Peer-to-peer Content Distribution Network
Content Distribution Networks
Caching Temporary storage of frequently accessed data (duplicating original data stored somewhere else) Reduces access time/latency for clients Reduces.
July 3, 2015 MuSIC (co-located with ICME) 2015, Torino, Italy
Web Caching? Web Caching:.
Internet Networking recitation #12
Edge computing (1) Content Distribution Networks
Content Delivery and Remote DNS services
Presentation transcript:

Web Cache Characterizing Roles of Front-end Servers in End-to-End Performance of Dynamic Content Distribution  Li ZHANG  Dakuo WANG  Xuejie SUN  Yang LIU

1.Introduction of Web Cache 1.Related Paper Overview 2.1 Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol 2.2 Going Viral: Flash Crowds in an Open CDN 3. Characterizing Roles of Front-end Servers in End-to- End Performance of Dynamic Content Distribution 3.1 Problem Definition 3.2 Motivation 3.3 Model 3.4 Result 3.5 Conclusion 3.6 Pro & Con 4. Q & A

Introduction to Web Caching(Proxy Server) CONCEPT Web cache is a mechanism for the temporary caching of web documents to reduce bandwidth usage, server load, and perceived lag. TYPES OF PROXY SERVER Forward proxies, Open proxies, Reverse proxies, Performance Enhancing Proxies USES OF PROXY SERVER To speed up access to resources. To control access to internal resources. To filter content. To hide the real IP. To circumvent Internet filtering to access content otherwise blocked by governments. To breakthrough own IP access restrictions.

Summary Cache: A Scalable Wide- Area Web Cache Sharing Protocol Li Fan, Member, IEEE, Pei Cao, Jussara Almeida, and Andrei Z. Broder

Internet Cache Protocol (ICP) - Simple Cache Sharing: fetch and store locally - No load balancing - Overhead: UDP messages (factor of 73 to 90), network traffic (8% - 13%), client HTTP request latency (8%-12%) Summary Cache - Each proxy store a summary of its directory of cached document in every other proxy - Cache miss, check the summaries to see if it exist in other proxies Summary Cache Enhanced ICP (SC-ICP) - Add new opcode in ICP version 2 - Introduce additional header follows regular ICP header - Modify Squid software to implement the protocol

Going Viral: Flash Crowds in an Open CDN Patrick Wendell, Michael J. Freedman Flash Crowds on CoralCDN - CoralCDN: an open Content Distribution Network (CDN) running at several hundred POPs - Flash Crowds: a period over which request rates for a particular fully-qualified domain name are increasing exponentially average per minute request rate over a particular period ti - 4 years CDN traffic, 33 billion HTTP requests Analysis conclusion: - Potential benefits of cooperative vs. independent caching by CDN node - The efficacy of elastic redirection and resource provisioning - The ecosystem of portals, aggregators and social networks

Going Viral: Flash Crowds in an Open CDN Patrick Wendell, Michael J. Freedman Flash Crowd Cacheability - The degree of caches coordination in fetching origin content - CoralCDN uses a distributed hash table for global content discovery - Commercial CDNs, Akamai, use non-cooperative caching, where each remote proxy independently fetches content from the origin site - Fewer requests to origin site, higher complexity and additional overhead

Going Viral: Flash Crowds in an Open CDN Patrick Wendell, Michael J. Freedman Flash Crowd Cacheability

The Motivations  Most content on the Internet is stored at data centers in the cloud, and they are dynamic for user’s request.  The scale and cost of building and operating large-scale powerful data centers are increasing.  The way to improve the overall response time is to deploy “proxy” servers closer to users.  FE servers can be exploited to improve the user-perceived performance due to : 1) A portion of the dynamic content may be static; thus can be cached and delivered immediately from the FE servers. 2) Via split TCP connections, a FE server can establish a persistent TCP connection with the data center which not only eliminates the effect of TCP slow-start between the FE and BE, but also reduce the RTT between the user and the server.

The Problem  Authors conduct an active measurement-based comparative study of Google and Microsoft Bing web search services.  Use the PlanetLab nodes to perform extensive measurements of Google and Bing search services using a variety of keyword search, and collect dynamically generated content and application-layer measurement data.  Use these collected data to analysis the role of FE.

How to solve it  They develop an in-house user search query emulator, which performs exactly the same functionality as the web- based search box.  They conduct extensive measurements by submitting the same search queries to both Bing and Google search engines, and collect detailed TCPdump with full application-layer payloads.  Perform two sets of experiments: 1) In the first set, search queries are launched from all measurement nodes to their default 3 FE servers every 10 seconds. 2) In the second set, they fix one FE server (of Bing or Google respectively) at a time, and launch queries from all measurement nodes to this server.

Content distribution Content includes static and dynamic (i.e., search results) Static portion: HTTP header, HTML header, CSS style files and the static menu bar. Dynamic portion: keyword- dependent menu bar, search results and ads. Static portion is cached and directly delivered by FE servers. Dynamic portion is generated by BE data centers and them passed onto the FE servers for delivery. The experiment shows T dynamic varies significantly with the types of search keywords used, whereas T static is mostly insensitive.

Several parameters: Tb: start of TCP three-way handshake T1: HTTP GET request T2: receive packets from server T3/T4: receive first/last packet containing the static portion T5/T6: receive first/last packet containing the dynamic portion

T static depends mostly on the time to generate and deliver the static content portion at the FE server. When RTT is small, T dynamic is roughly a constant while T delta decreases as a function of RTT. When RTT increases beyond a certain threshold, the dynamic content portion will be received by the FE server before the static content portion is entirely delivered to the client. Hence T dynamic increases as a function of RTT, while T delta becomes zero. Observation:

Performance First cluster represents the three-way TCP handshake between the client and the FE server. The second and third cluster represent the delivery of static and dynamic contents. As the RTT increases, the gap between the end of the second and the beginning of the third clusters decreases, and eventually the two are lumped together.

Google has slightly farther FE servers from the clients, but has significantly lower Tstatic and Tdynamic. These results illustrate that placing FE servers closer to clients does not necessarily reduce Tstatic and Tdynamic. The x-axis represents the PlanetLab nodes, and the yaxis represents the box-plot for the distribution for different samples. The results show that comparing Google, users using the Bing search service tend to experience slightly longer and more variable overall response times.

Comparing Bing & Google Performance and discuss The fetch time between Google FE servers and BE data centers tends to be smaller and more stable. In contrast, fetch time between Akamai FE servers and Bing data centers tends to be larger and shows higher variability. Although Bing place FE servers closer to client, it has significantly higher T static and T dynamic compare to Google. The reason for this may be due to the higher and more variable loads at Akamai FE server, as Bing shared with other servicers. The end to end performance is determined solely by the FE-BE fetch time. T fetch consists of two key components: T proc and RTT be

Several Results of This Paper FE severs do not cache any dynamically generated search result. It only cache the static information, such as Http header, Html header. Placing FE closer to users can improve user-perceived performance. There is a trade-off between placement of FE severs and the FE-BE fetch time. There is a threshold within which placing FE further closer to users is no longer helpful. While placing FE severs closer to users can help reduce latency, other key factors, such as processing times, loads at FE/BE data centers, and the quality of connections between them also play a critical role in determining the overall user-perceived performance. Improving and optimizing these factors are important for overall user-perceived performance in dynamic content distribution such as dynamic generation of search results in response to user requires.

Strong point of the paper This paper investigated the role of FE sever in improving user-perceived performance of dynamic content distribution, which is emerging as the next big business for CDN. This paper developed a good and simple model-based inference framework to measure and quantify the frontend- to-backend fetching time, which contains the query processing time at BE and delivery time between BE and FE. They used Bing and Google search services, and performed extensive network measurement and analysis, based on several sets of experiments. This paper also took into consideration about the difference between the FE of Bing and FE of Google.

Weakness of the Paper In this paper, they focused on standard search functions of search engines. However, more recently, some search engines introduced more advanced search features such as the interactive feature. By using this feature, after each letter user typed, a separate query is sent to the FE sever. And subsequent queries are highly correlated. Most nodes they used for test may introduce some unfairness between Bing and Google (because they are placed closer to Bing FE sever). No significant packet loss during the measurements. In a high loss rate environment, placing FE closer to users may significantly improve the user-perceived end-to- end performance.

Thanks Any Questions?