Computer Science Department
1 Studying the Impact of More Complete Server Information on Web Caching
Craig E. Wills and Mikhail Mikhailov
Worcester Polytechnic Institute
Presented by Mikhail Mikhailov
May 23, 2000

2 Outline of Talk
Observations
Proposed approach
Experiments
–Methodology
–Test sets
–Results
Conclusions
Future work

3 Observations
Heterogeneous dynamic content
Monolithic pages, loss of information
Changes are predictable, can be localized
Heuristic approaches to caching (many validations)

5 Proposed Approach
Object classification by type and change characteristics
Preserve object identities
Object Composition (vs. monolithic approach)
Object Relationships
Piggybacking

6 Exp1: Methodology (content reuse)
Popular sites (100hot.com) and popular URLs (NLANR proxy logs)
Unconditionally GET HTML and embedded images each day at the same time for 11 days
Catalogue resources, compute MD5
Analyze changes with Chunking Tool
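The cataloguing step above can be sketched roughly as follows. This is a minimal illustration, not the authors' tooling: the snapshot layout (one dict of URL to body bytes per day) and the function names are assumptions, and `line_reuse_fraction` is only a crude line-based stand-in for whatever the Chunking Tool actually computes.

```python
import hashlib


def md5_hex(body: bytes) -> str:
    """Return the MD5 digest of a resource body as a hex string."""
    return hashlib.md5(body).hexdigest()


def catalogue(snapshots):
    """Map each daily snapshot {url: body} to {url: md5 digest}.

    `snapshots` is a list with one dict per retrieval day
    (hypothetical layout, chosen for illustration).
    """
    return [{url: md5_hex(body) for url, body in day.items()}
            for day in snapshots]


def changed_urls(prev, curr):
    """URLs present on both days whose digest differs."""
    return sorted(u for u in prev.keys() & curr.keys() if prev[u] != curr[u])


def line_reuse_fraction(old: bytes, new: bytes) -> float:
    """Fraction of lines in `new` that also appear in `old`.

    A simple stand-in for chunk-level reuse analysis (assumption).
    """
    old_lines = set(old.splitlines())
    new_lines = new.splitlines()
    if not new_lines:
        return 1.0
    return sum(1 for ln in new_lines if ln in old_lines) / len(new_lines)
```

A whole-object MD5 only says that something changed; a chunk-level comparison like the last function is what indicates how much of a changed page could still be reused.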

7 Exp1: Test Sets (content reuse)
Cnt300 (7 NLANR logs)
Top50 (50 most popular sites, 100hot.com)
ECom (50 largest b-2-c shopping sites, 100hot.com)
Srcheng (11 top search engines)
EComQ (2 queries, top 10 of the ECom set)
SrchengQ (2 queries, Srcheng set)

8 Exp1: Results (content reuse)

9 Exp2: Methodology (eliminating validation requests)
NLANR proxy logs
For each 304 response look for a 200 response from the same server within a given window (10 sec on each side)
Focus on 304 responses for images
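The log scan described above can be sketched as follows. This is a hedged illustration only: the tuple layout `(timestamp_sec, server, status, is_image)` is an assumed pre-parsed form rather than the NLANR log format itself, and the function name is hypothetical.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


def count_avoidable_304s(entries, window=10.0):
    """Count image 304 responses that have a 200 response from the
    same server within `window` seconds on either side.

    `entries` is a list of (timestamp_sec, server, status, is_image)
    tuples (assumed layout). Returns (avoidable, total_image_304s).
    """
    # Collect and sort the timestamps of 200 responses per server.
    ok_times = defaultdict(list)
    for ts, server, status, _ in entries:
        if status == 200:
            ok_times[server].append(ts)
    for times in ok_times.values():
        times.sort()

    total = avoidable = 0
    for ts, server, status, is_image in entries:
        if status != 304 or not is_image:
            continue
        total += 1
        times = ok_times.get(server, [])
        # Any 200 response with timestamp in [ts - window, ts + window]?
        lo = bisect_left(times, ts - window)
        hi = bisect_right(times, ts + window)
        if hi > lo:
            avoidable += 1
    return avoidable, total
```

A 304 that falls inside such a window is a candidate for elimination, since the validation information could in principle have travelled with the nearby 200 response from the same server.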

10 Exp2: Results (eliminating validation requests)

11 Exp3.1: Methodology / Results (object change characteristics)
Dynamic, Access Dependent objects (Top50, R,R,15min,R)
Most short-term changes occur immediately
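One way to read the "R,R,15min,R" pattern is: retrieve, retrieve again immediately, wait 15 minutes, retrieve once more. Under that assumption (the slide does not spell it out), the comparison could be sketched as below; the function names and the three labels are hypothetical.

```python
import hashlib


def digest(body: bytes) -> str:
    """MD5 digest of a retrieved body, as a hex string."""
    return hashlib.md5(body).hexdigest()


def short_term_change(r1: bytes, r2: bytes, r3: bytes) -> str:
    """Classify when an object first changes across three retrievals.

    r1, r2: back-to-back retrievals; r3: retrieval 15 minutes later
    (assumed reading of "R,R,15min,R").
    """
    d1, d2, d3 = digest(r1), digest(r2), digest(r3)
    if d1 != d2:
        return "immediate"       # differs on consecutive accesses
    if d2 != d3:
        return "within-15-min"   # stable back-to-back, changed later
    return "none"                # no short-term change observed
```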

12 Exp3.2: Methodology / Results (object change characteristics)
Dependency-based objects (SrchengQ, EComQ, same query, retrieved daily)
Some changes may be attributed to dynamic/access dependent objects; further study needed

13 Exp3.3.1: Methodology / Results (object change characteristics)
Input Dependent objects (SrchengQ, EComQ, different queries, retrieved daily)

14 Exp3.3.2: Methodology / Results (object change characteristics)
Input Dependent objects (objects with cookies from Cnt300, Top50, ECom; obtain 2 cookies for each object; R-cookie1, R-cookie2)

15 Conclusions
Proposed techniques have potential to:
–increase content reuse
–reduce number of validation requests

16 Future Work
Combine object types and change characteristics with object relationships
Extend web server and proxy caching software to support proposed techniques

17 Object classification by change characteristics
Periodic (changes at regular intervals: hour, day, etc.)
Dependency-based (depends on a file or DB changing)
Dynamic (different on every access, can't be prefetched)
Access Dependent (different on every access, can be prefetched)
Input Dependent (query, cookies)
Relatively Dynamic (changes frequently)
Static (never changes)
Relatively Static (changes infrequently)
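The classification above could be represented in caching software along these lines. This is only a sketch: the enum and the `can_prefetch` helper are hypothetical names, and the helper encodes just the two prefetching statements made on the slide, returning `None` where the slide says nothing.

```python
from enum import Enum
from typing import Optional


class ChangeClass(Enum):
    """Change characteristics, with descriptions taken from the slide."""
    PERIODIC = "changes at regular intervals (hour, day, etc.)"
    DEPENDENCY_BASED = "depends on a file or DB changing"
    DYNAMIC = "different on every access, can't be prefetched"
    ACCESS_DEPENDENT = "different on every access, can be prefetched"
    INPUT_DEPENDENT = "depends on query or cookies"
    RELATIVELY_DYNAMIC = "changes frequently"
    STATIC = "never changes"
    RELATIVELY_STATIC = "changes infrequently"


def can_prefetch(cls: ChangeClass) -> Optional[bool]:
    """Return the prefetching hint stated on the slide, if any."""
    if cls is ChangeClass.DYNAMIC:
        return False
    if cls is ChangeClass.ACCESS_DEPENDENT:
        return True
    return None  # the slide makes no prefetching claim for this class
```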

18 Figure 1. Current News Composite Object