
1 Functions of a Web Warehouse
Kai Cheng, Yahiko Kambayashi, Seok Tae Lee (Graduate School of Informatics, Kyoto University, Japan) and Mukesh Mohania (Western Michigan University, USA)

2 Table of Contents
13-16 November 2000, ICDL 2000
• Survival from “Information Explosion”
• Warehouse-Mediated Content Delivery
• Community-Oriented Web Warehouses
• Technical Issues
• Warehouse Enhanced Web Caching
• Related Work
• Concluding Remarks

3 Survival from “Information Explosion”
• Web Traffic Doubled Every 3-6 Months
• Exponential Growth of the Web
– 1 Billion Pages, January 2000
– 2 Billion Pages, June 2000
– 100 Times Increase in the Next 2 Years
Information Overload for both Nets and Users

4 Scale up the Web and Internet
• More Bandwidth
– Never Keeps Pace with the Traffic Growth
• More Server Capacity
– How to Deal with “Hot-Spots”?
• Site Replication
– Only Benefits Replicated Servers?

5 Our Approach
• Tame the Chaotic Info. Streams
– Saving Redundant Data Transfers
• Unite the Individual Users
– Sharing Findings and Efforts of Each Other

6 Warehouse-Mediated Content Delivery
• Direct Delivery
– QoS: Server, Network → Overloaded
– Personalized Services → Unrealistic
– Information Hunting → Difficult
[Figure label: Internet]

7 Indirect Content Delivery
[Figure: Web Warehouse diagram. Labels: WWW, Input, Resource Discovery, Buffering, Transformation, Analysis, Storage, Notification, Output, Clustering, Searching, Navigation, Filtering]

8 Community-Oriented Web Warehousing
Sharing, Contribution
The Community of Users*
* People with Special Information Needs/Interests

9 Examples of User Communities
Sports Fans, Patients, Businessmen, Researchers

10 Real/Cyber Communities
(a) Real Communities: Dependent on Location
(b) Cyber Communities: Independent of Location

11 Technical Issues
• Functions of a Web Warehouse
• Web Caching vs. Web Warehousing
• Data Warehousing vs. Web Warehousing
• Dynamic Hierarchical Web Warehouses

12 Functions of a Web Warehouse
• Buffering
• Transformation
1. Transcoding
2. Summarizing
• Content Analysis
• Notification
[Figure labels: Resource Discovery, Storage, Reusing; Format A, Transform, Format B; Content A, Transform, Content B; Data/Information, Analysis, Knowledge]
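
The four functions listed on this slide can be pictured as methods on one small object. The sketch below is only an illustration under stated assumptions: the class name `WebWarehouse`, its method names, the "first few words" summarizer, and the keyword-based notification are all hypothetical and do not come from the slides.

```python
from collections import Counter


class WebWarehouse:
    """Toy warehouse: buffering, transformation, content analysis, notification."""

    def __init__(self):
        self.store = {}        # url -> buffered document text
        self.subscribers = {}  # keyword -> list of user names

    def buffer(self, url, text):
        """Buffering: keep a fetched document so later requests can reuse it."""
        self.store[url] = text

    def summarize(self, url, max_words=5):
        """Transformation (summarizing): here, simply the first few words."""
        return " ".join(self.store[url].split()[:max_words])

    def analyze(self, url, top=3):
        """Content analysis: the most frequent lowercase terms in the document."""
        words = self.store[url].lower().split()
        return [w for w, _ in Counter(words).most_common(top)]

    def subscribe(self, user, keyword):
        """Register a user's interest in a keyword (assumed interest model)."""
        self.subscribers.setdefault(keyword.lower(), []).append(user)

    def notify(self, url):
        """Notification: users whose keyword occurs in the buffered document."""
        words = set(self.store[url].lower().split())
        hit = {u for k, users in self.subscribers.items() if k in words for u in users}
        return sorted(hit)
```

For example, after buffering a page about tennis and subscribing user "A" to the keyword "tennis", `notify` returns `["A"]`, while a user subscribed only to "skiing" is not notified.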

13 Web Caching Research Program Content Analysis Transformation Warehousing

14 From Web Caching to Web Warehousing
              Web Caching     Web Warehousing
Object        Data            Information
Objective     Reusing         Sharing
Storage       Bounded         Bound-Free
Population    Responses       Web View
Model         FS Dependent    Hypermedia

15 From Data Warehousing to Web Warehousing
Items            Data WH                 Web WH
1 Objective      Decision Support        Information Sharing
2 Model          RDB/OORDB               Hypermedia
3 Population     View Materialization    Resource Discovery, Content Localization
4 Resource       Operational Data        Web Documents
5 Data Type      Structured              Semi-/Un-structured
6 Tie to Web     DWH  Web                WWH  Web

16 Warehouse as Shared Information Repository
• Real Communities
– Centralized Management of Warehouses
– Unicast Data Transfer
• Cyber Communities
– Distributed Management of Warehouses
– Multicast Data Transfer

17 Hierarchy of Web Warehouses
[Figure: hierarchy of warehouses. Node labels: HP Design, Sports, Skiing, Tennis. Member lists: "Mr. A, Ms. C, Mrs. D, …" and "Mr. A, Mr. D, …"]

18 Dynamic Formation of Web Warehouses (Split)
[Figure labels: Sports, Tennis, Skiing; members A, B]

19 Dynamic Formation of Web Warehouses (Union)
[Figure labels: Painting, Drawing, Painting & Drawing; members A, B]
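
The split and union operations on the two preceding slides can be sketched with plain sets. This is a minimal illustration, assuming a warehouse is just a topic name mapped to a set of member names; the functions `split` and `union` and the `interests` argument are hypothetical, since the slides show only the before and after pictures.

```python
def split(warehouses, parent, interests):
    """Split `parent` into sub-warehouses; `interests` maps sub-topic -> members."""
    members = warehouses.pop(parent)
    for topic, users in interests.items():
        # Only users who belonged to the parent move into a sub-warehouse.
        warehouses[topic] = set(users) & members
    return warehouses


def union(warehouses, topics, merged_name):
    """Merge several warehouses into one whose members are the union of all."""
    merged = set()
    for topic in topics:
        merged |= warehouses.pop(topic)
    warehouses[merged_name] = merged
    return warehouses


# Example mirroring the slide labels: Sports splits into Tennis and Skiing,
# and Painting and Drawing merge into "Painting & Drawing".
sports = split({"Sports": {"A", "B"}}, "Sports", {"Tennis": ["A", "B"], "Skiing": ["A", "B"]})
arts = union({"Painting": {"A"}, "Drawing": {"B"}}, ["Painting", "Drawing"], "Painting & Drawing")
```

Both functions modify the mapping in place and return it, which keeps the example short; a real system would also have to move the stored documents, which this sketch leaves out.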

20 Current Status: Content-Sensitive Caching
[Figure labels: Web Caching, Content-Sensitive Caching, Warehousing]

21 Content-Sensitive Cache Replacement Policy
• Cache Replacement: Keep? Replace?
• Traditional Caching: Long-Time Observation → Replacement Decision
• 60% One-Access Objects → How to Differentiate?
• Content-Sensitive Caching: LRU-SP+

22 LRU-SP+: Content-Sensitive Size-Adjusted & Popularity-Aware LRU
• Daily Indexing: Cache Content → Indices
• Indices → Popular Topics
• How Similar? New Document ↔ Popular Topics
• Benefit/Size Model: “Observed” Pop. + “Inherent” Pop.
• Implement this Model
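
The benefit/size idea on this slide can be illustrated with a small cache simulator. This is a sketch under stated assumptions, not the LRU-SP+ algorithm itself: the slides give no formula, so the score `(hits + inherent) / size`, the Jaccard similarity used for "How Similar?", and the class name `ContentSensitiveCache` are all assumptions made for illustration.

```python
def jaccard(a, b):
    """Similarity of two term sets (an assumed stand-in for 'How Similar?')."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0


class ContentSensitiveCache:
    def __init__(self, capacity, popular_topics):
        self.capacity = capacity              # total bytes allowed
        self.popular_topics = popular_topics  # term sets, e.g. from daily indexing
        self.entries = {}                     # url -> size, hits, inherent, last
        self.clock = 0                        # logical time for LRU tie-breaking

    def _score(self, e):
        # Assumed benefit/size model: (observed + inherent popularity) / size.
        return (e["hits"] + e["inherent"]) / e["size"]

    def access(self, url, size, terms):
        """Return True on a hit; on a miss, admit the document and evict if needed."""
        self.clock += 1
        if url in self.entries:
            e = self.entries[url]
            e["hits"] += 1                    # observed popularity grows with use
            e["last"] = self.clock
            return True
        # Inherent popularity: best similarity to any currently popular topic.
        inherent = max((jaccard(terms, t) for t in self.popular_topics), default=0.0)
        self.entries[url] = {"size": size, "hits": 1, "inherent": inherent, "last": self.clock}
        self._evict()
        return False

    def _evict(self):
        while self.entries and sum(e["size"] for e in self.entries.values()) > self.capacity:
            # Lowest benefit/size goes first; ties fall back to least recently used.
            victim = min(self.entries,
                         key=lambda u: (self._score(self.entries[u]), self.entries[u]["last"]))
            del self.entries[victim]
```

With one popular topic about tennis and room for two equal-sized documents, a third request evicts an off-topic page rather than the oldest one, which is the difference from plain LRU: a document seen only once can still be kept if its content resembles what the community already reads.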

23 Related Work
• LSAM’s Proxy Cache (Push)
– Multicast-Based Virtual Cache
– Affinity Groups and Push Channels
• INTELSAT’s Wormhole Content Delivery
– Warehouse-Kiosk Model
– Satellite-Based Delivery Platform

24 Concluding Remarks
• Proposed to Cope with the Scaling Problems by Web Warehouse-Mediated Content Delivery
• Discussed the Basic Functions of a Web Warehouse: Buffering, Transformation, Notification and Content Analysis
• Introduced our Current Work: Warehouse-Enhanced Web Caching

