Summary of WWW Characterizations James E. Pitkow Xerox Palo Alto Research Center WWW Journal 99 발표자 : 노양우.

Slides:



Advertisements
Similar presentations
Inktomi Confidential and Proprietary The Inktomi Climate Lab: An Integrated Environment for Analyzing and Simulating Customer Network Traffic Stephane.
Advertisements

The Internet and the Web
Copyright © 2012 Certification Partners, LLC -- All Rights Reserved Lesson 4: Web Browsing.
DT228/3 Web Development WWW and Client server model.
Lesson 4: Web Browsing.
Computer Science Generating Streaming Access Workload for Performance Evaluation Shudong Jin 3nd Year Ph.D. Student (Advisor: Azer Bestavros)
Adapted from Menascé & Almeida.1 Workload Characterization for the Web.
1 10 Web Workload Characterization Web Protocols and Practice.
1 Network Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
Empirical Investigations of WWW Surfing Paths Jim Pitkow User Interface Research Xerox Palo Alto Research Center.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
October 14, 2002MASCOTS Workload Characterization in Web Caching Hierarchies Guangwei Bai Carey Williamson Department of Computer Science University.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Jump to first page Web Facts and Fantasy presented by Andreas Anagnostatos CSE 291 Feb. 29, 2000 Stephen Manley, Network Appliance Margo Seltzer, Harvard.
1 Web Performance Modeling Chapter New Phenomena in the Internet and WWW Self-similarity - a self-similar process looks bursty across several time.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
1 The Content and Access Dynamics of a Busy Web Server: Findings and Implications Venkata N. Padmanabhan Microsoft Research Lili Qiu Cornell University.
1 Simulation Evaluation of a Heterogeneous Web Proxy Caching Hierarchy Mudashiru Busari Carey Williamson University of Saskatchewan University of Calgary.
Hardware-based Load Generation for Testing Servers Lorenzo Orecchia Madhur Tulsiani CS 252 Spring 2006 Final Project Presentation May 1, 2006.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
Multimedia & the WWW Week 1 Introduction To….. Today’s Agenda Who I am Who I am Who you are survey & discussion Who you are survey & discussion Course.
Progress Report 11/1/01 Matt Bridges. Overview Data collection and analysis tool for web site traffic Lets website administrators know who is on their.
Internet Cache Pollution Attacks and Countermeasures Yan Gao, Leiwen Deng, Aleksandar Kuzmanovic, and Yan Chen Electrical Engineering and Computer Science.
Tracking the Evolution of Web Traffic: Felix Hernandez-Campos, Kevin Jeffay F. Donelson Smith IEEE/ACM International Symposium on Modeling, Analysis.
Web Usage Mining - W hat, W hy, ho W Presented by:Roopa Datla Jinguang Liu.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
1 YouTube Traffic Characterization: A View From the Edge Phillipa Gill¹, Martin Arlitt²¹, Zongpeng Li¹, Anirban Mahanti³ ¹ Dept. of Computer Science, University.
 Proxy Servers are software that act as intermediaries between client and servers on the Internet.  They help users on private networks get information.
P2P Architecture Case Study: Gnutella Network
1 3 Web Proxies Web Protocols and Practice. 2 Topics Web Protocols and Practice WEB PROXIES  Web Proxy Definition  Three of the Most Common Intermediaries.
Web Characterization: What Does the Web Look Like?
1 Computer Communication & Networks Lecture 28 Application Layer: HTTP & WWW p Waleed Ejaz
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
How did the internet develop?. What is Internet? The internet is a network of computers linking many different types of computers all over the world.
THE INTERNET Chapter 13. Internet- Interconnection and Networks “the Net” Computers have played a significant role in our everyday life Growth in popularity.
Web Prefetching Between Low-Bandwidth Clients and Proxies : Potential and Performance Li Fan, Pei Cao and Wei Lin Quinn Jacobson (University of Wisconsin-Madsion)
MIS 424 Professor Sandvig. Overview  Why Analytics?  Two major approaches:  Server logs  Google Analytics.
Understanding the Performance of Web Caching System with an Analysis Model and Simulation Xiaosong Hu Nur Zincir-Heywood Sep
Web Performance 성민영 SNU Computer Systems lab.. 2 차례 4 Modeling the Performance of HTTP Over Several Transport Protocols. 4 Summary Cache : A Scaleable.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
Microsoft Research1 Characterizing Alert and Browse Services for Mobile Clients Atul Adya, Victor Bahl, Lili Qiu Microsoft Research USENIX Annual Technical.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web (Book, chap. 6)
1 Lifetime Behavior and its Impact on Web Caching X. Chen and P. Mohapatra, IEEE Workshop on Internet Applications (WIAPP), 김호중, CA Lab. Site 별,
The Intranet.
Characterizing User Access To Videos On The World Wide Web MMCN 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Peter Parnes.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.
1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Lesson No:12 Introduction to Internet CHBT-01 Basic Micro process & Computer Operatio.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
1 Web Performance Modeling Issues Daniel A. Menascé Department of Computer Science George Mason University 
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
MSc Publishing on the Web Week 4 Image Maps. Aims and Objectives Discover what are image maps To understand the different types of image map To understand.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
1 Chapter 22 World Wide Web (HTTP) Chapter 22 World Wide Web (HTTP) Mi-Jung Choi Dept. of Computer Science and Engineering
The Distributed Object Consistency Protocol Version 1.0 John Dilley, Martin Arlitt, Stephane Perret, Tai Jin Hewlett-Packard Laboratories Palo Alto, CA.
On Caching Search Engine Query Results Evangelos Markatos Evangelos Markatoshttp://archvlsi.ics.forth.gr/OS/os.html Computer Architecture and VLSI Systems.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Web Performance Modeling (Book, Chapter 10)
Does Internet media traffic really follow the Zipf-like distribution? Lei Guo 1, Enhua Tan 1, Songqing Chen 2, Zhen Xiao 3, and Xiaodong Zhang 1 1 Ohio.
Web Development Web Servers.
Lesson 4: Web Browsing.
Sec (4.3) The World Wide Web.
E-commerce | WWW World Wide Web - Concepts
E-commerce | WWW World Wide Web - Concepts
Computer Communication & Networks
Evaluation of Load Balancing Algorithms and Internet Traffic Modeling for Performance Analysis By Arthur L. Blais.
Lesson 4: Web Browsing.
Network Traffic Modeling
Web Proxy Caching Model
Presentation transcript:

Summary of WWW Characterizations James E. Pitkow Xerox Palo Alto Research Center WWW Journal 99 발표자 : 노양우

System Software Research Lab. 2 Contents mIntroduction /Client, Proxies and Gateways, Server /Traces and Analysis (distribution) mSummary /1994 /1995 /1996 /1997 /1998 mConclusion

System Software Research Lab. 3 Introduction mGrowth of Web Usages /representative characterization --> enjoyable Web surfing /various data sets at various points ( clients, proxy and gateways, servers ) 4 several invariants mClients /informative but rare --> browser implementation, sufficient APIs mProxy and Gateways /greater availability /less concentration on characteristics --> caching algorithm mServers /traffic analysis

System Software Research Lab mA Caching Relay for WWW 4 DEC proxy, 4000/day from 100 users 4 document popularity --> Zipf 4 cache : hit (1/3), miss (1/3), invalidation (1/3) mMosaic Will Kill Me 4 Intel Intranet Proxies --> Images traffic mA Simple Yet Robust Caching Algorithm 4 Georgia IT server : recency > frequency 4 LRU : Server side cache-hit rate (80%) mInvalidation in Large Scale Object Cache 4 Harvest Cache, Xmosaic 4 HTML : frequently modified ( 75 days), Image (107 days)

System Software Research Lab mCharcteristics of WWW Client-Based Traces 4 Xmosaic, 600 users, 6 months 4 transmission times, doc. size, doc. size versus # of requests (Pareto) 4 unix file systems : more small and large file exists mApplication Level Document Caching mExplaining WWW Traffic - Self - Similarity 4 1 second seconds : self-similar 4 Busiest periods : self-similar, idle periods: non self-similar mCaching Proxies : Limitations and Potentials 4 # of requests per sever (Zipf), CGI (0.5%) mNetwork Behaviour of a Busy Web Servers /DEC, Congressional Election Server 4 images --> major traffic, inter-arrival time --> not Poission

System Software Research Lab mWWW Cache Consistency 4 Microsoft, BU, Harvard 4 popularity : inverse with frequency of change 4 image : 65 %, CGI : 9 % 4 HTML : 50 days, GIF : 85 days mWeb Server Workload Characterization 4 University of Waterloog, Calgary, NASA, NCSA 4 10% documents --> 90% requests 4 10% domains --> 75% usags mEvaluating History Mechanism 4 Xmosaic, 6 weeks 4 new URL : 42%, revisting URL: 58%

System Software Research Lab mStrong Regularites in Web Surfing 4 click per sites --> inverse Gaussian 4 average clicks 8.32, typical case : 1 click mShared User Behaviour 4 DEC, Korean National Proxy, Virginia Tech, AOL 4 Median file size : 2KB, Mean file size : 27 KB 4 25 % server : 80-95% requests, 90% bytes : 25% servers mCharacterizing WWW Queries 4 CGI : 4 % (KNP), 9 % (AOL), 12 % (VT) 4 99% queries : simple mWeb Facts and Fantasy 4 Educational (Harvard, Rice), Business (BUS, ISP, FSS, AE), Info (GOV, PROF) 4 Characterization of Sites ( size of the site, diversity of users, user access patterns) q Renovational growth ( Business )

System Software Research Lab q Size growth ( Eudcational sites) q Visit growth by the same user ( Information sites ) q Attraction ( Adult Entertainment : Search Engine) 4 CGI : low requests, low traffic : counter, login, search engine 4 Peak Activity : network bottlneck mGenerating representative Web workloads 4 SURGE q file size : body (lognormal), tail (Pareto), popularity : Zipf, request size: Pareto, reading times : Pareto, …. q realistic benchmark : HTTP-NG

System Software Research Lab. 9 Conclusion mDynamic Web --> Several Invariants /file popularity, file size, # of request per user, /site popularity, life span, request type…. mFuture Research /Relation between file popularity and reoccurence rate /User’s navigation paths