Adaptive Push-Pull: Disseminating Dynamic Web Data Pavan Deolasee, Amol Katkar, Krithi,Ramamritham Indian Institute of Technology Bombay Dept. of CS University.

Slides:



Advertisements
Similar presentations
Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.
Advertisements

Dissemination-based Data Delivery Using Broadcast Disks.
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Fuzzy Logic and its Application to Web Caching
Cooperative Overlay Networking for Streaming Media Content Feng Wang 1, Jiangchuan Liu 1, Kui Wu 2 1 School of Computing Science, Simon Fraser University.
Distributed Multimedia Systems
Dynamic Adaptive Streaming over HTTP2.0. What’s in store ▪ All about – MPEG DASH, pipelining, persistent connections and caching ▪ Google SPDY - Past,
Institute of Networking and Multimedia, National Taiwan University, Jun-14, 2014.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
NETWORK LOAD BALANCING NLB.  Network Load Balancing (NLB) is a Clustering Technology.  Windows Based. (windows server).  To scale performance, Network.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 OSI Transport Layer Network Fundamentals – Chapter 4.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
1 Experimental Methodology H Experimental methods can be used to: – demonstrate that a new concept, technique, or algorithm is feasible –demonstrate that.
Dept. of Computer Science & Engineering, CUHK1 Trust- and Clustering-Based Authentication Services in Mobile Ad Hoc Networks Edith Ngai and Michael R.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Caching Neighborhood Protocol(CNP): a Foundation for Building Dynamic Web Caching Hierarchies with Proxy Servers Cho-Yu Chiang, Ming T.Liu, Mervin E.Muller.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
Collaborative Web Caching Based on Proxy Affinities Jiong Yang, Wei Wang in T. J.Watson Research Center Richard Muntz in Computer Science Department of.
Reliability on Web Services Pat Chan 31 Oct 2006.
Computer Science Lecture 2, page 1 CS677: Distributed OS Last Class: Introduction Distributed Systems – A collection of independent computers that appears.
Differentiated Multimedia Web Services Using Quality Aware Transcoding S. Chandra, C.Schlatter Ellis and A.Vahdat InfoCom 2000, IEEE Journal on Selected.
Hands-On Microsoft Windows Server 2003 Networking Chapter 7 Windows Internet Naming Service.
Efficiently Maintaining Stock Portfolios Up-To-Date On The Web Prashant Shenoy Manish Bhide Krithi Ramamritham 2002 IEEE E-Commerce System Proceedings.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.
Web Caching Schemes For The Internet – cont. By Jia Wang.
BA 427 – Assurance and Attestation Services
Exploiting Virtualization for Delivering Cloud based IPTV Services Speaker : 吳靖緯 MA0G IEEE Conference on Computer Communications Workshops.
Sensor Networks Storage Sanket Totala Sudarshan Jagannathan.
Cache Updates in a Peer-to-Peer Network of Mobile Agents Elias Leontiadis Vassilios V. Dimakopoulos Evaggelia Pitoura Department of Computer Science University.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
Exploring VoD in P2P Swarming Systems By Siddhartha Annapureddy, Saikat Guha, Christos Gkantsidis, Dinan Gunawardena, Pablo Rodriguez Presented by Svetlana.
Performance of Web Applications Introduction One of the success-critical quality characteristics of Web applications is system performance. What.
Boğaziçi University – Computer Engineering Dept. CMPE 521 An Efficient and Resilient Approach to Filtering &Disseminating Streaming Data PAPER PRESENTATION.
POWER CONTROL IN COGNITIVE RADIO SYSTEMS BASED ON SPECTRUM SENSING SIDE INFORMATION Karama Hamdi, Wei Zhang, and Khaled Ben Letaief The Hong Kong University.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
1 Resilient and Coherence Preserving Dissemination of Dynamic Data Using Cooperating Peers Shetal Shah, IIT Bombay Kirthi Ramamritham, IIT Bombay Prashant.
A Distributed Clustering Framework for MANETS Mohit Garg, IIT Bombay RK Shyamasundar School of Tech. & Computer Science Tata Institute of Fundamental Research.
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
CS 5204 (FALL 2005)1 Leases: An Efficient Fault Tolerant Mechanism for Distributed File Cache Consistency Gray and Cheriton By Farid Merchant Date: 9/21/05.
Distributed Maintenance of Cache Freshness in Opportunistic Mobile Networks Wei Gao and Guohong Cao Dept. of Computer Science and Engineering Pennsylvania.
Routing In Socially Selfish Delay Tolerant Networks Chan-Myung Kim
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Cache Updates in a Peer-to-Peer Network of Mobile Agents Ilias Leontiadis Master’s Thesis University of Ioannina,Greece Supervisors: Vassilios V. Dimakopoulos.
Computer Science Lecture 14, page 1 CS677: Distributed OS Last Class: Concurrency Control Concurrency control –Two phase locks –Time stamps Intro to Replication.
Re-Configurable Byzantine Quorum System Lei Kong S. Arun Mustaque Ahamad Doug Blough.
Energy-Efficient Data Caching and Prefetching for Mobile Devices Based on Utility Huaping Shen, Mohan Kumar, Sajal K. Das, and Zhijun Wang P 邱仁傑.
Representational State Transfer (REST). What is REST? Network Architectural style Overview: –Resources are defined and addressed –Transmits domain-specific.
Data Scheduling for Multi-item and transactional Requests in On-demand Broadcast Nitin Pabhu Vijay Kumar MDM 2005.
1 Shetal Shah, IITB Dissemination of Dynamic Data: Semantics, Algorithms, and Performance.
1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component.
Client Assignment in Content Dissemination Networks for Dynamic Data Shetal Shah Krithi Ramamritham Indian Institute of Technology Bombay Chinya Ravishankar.
An Adaptive Zone-based Storage Architecture for Wireless Sensor Networks Thang Nam Le, Dong Xuan and *Wei Yu Department of Computer Science and Engineering,
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
1/39 Motion Adaptive Search for Fast Motion Estimation 授課老師:王立洋老師 製作學生: M 蔡鐘葳.
Last Class: Introduction
Corelite Architecture: Achieving Rated Weight Fairness
Controlling the Cost of Reliability in Peer-to-Peer Overlays
Internet Networking recitation #12
Understanding and Exploiting Amazon EC2 Spot Instances
Improving the Freshness of NDN Forwarding States
Dissemination of Dynamic Data on the Internet
Dynamically Maintaining Frequent Items Over A Data Stream
Presentation transcript:

Adaptive Push-Pull: Disseminating Dynamic Web Data Pavan Deolasee, Amol Katkar, Krithi,Ramamritham Indian Institute of Technology Bombay Dept. of CS University of Massachusetts

Outline Introduction Background description Push vs. Pull PAP algorithm POP algorithm Performance Conclusion

Introduction An crucial issue in the dissemination of time- varying web data is the maintenance of temporal coherency. Clients Pull the data based on the dynamics of the data and a user’s coherency requirements. Servers with Push capability maintain state information pertaining to clients and push only those changes that are of interest to a user.

Introduction(cont.) Complementary properties of these two techniques: temporal coherency, communication overheads, state space overheads, loss of coherency due to server failure. This paper depicts how to combine push and pull –based techniques to achieve the best features of both. The experimental results demonstrate that such adaptive data dissemination is essential to meet diverse temporal coherency requirements, to be resilient to failures.

Background description Time-varying : changes frequently. Coherency requirements associated with user tolerence. Eg: a user may desire to have stronger coherency requirements for data items such as stock prices than news information. Maintaining Temporal coherency on the web –User specify a temporal coherency requirements (tcr) for each data item of interest. –tcr : the maximum permissible deviation of the cached value from the value at the server. tcr can be specified in units of time or value(5mins or 1dollar).

Background description –|U(t) – S(t)| ≦ c –fidelity f : observed by a user to be the total length of time that the above inequality holds. –We focus on two issues: user specified coherency, fidelity requirements.

Background description The need for combining Push and Pull –Push-based: suitable when a client requires its coherency requirements to be satisfied with a high fidelity, or when the communication overheads are the bottleneck. –Pull-based: suited to less frequently changing data or less stringent coherency requirements, and when resilient to failures is important. –The goal is combining push and pull in an adaptive manner while offering good resiliency and scalability.

Push vs. Pull Pull –Time To Refresh (TTR) attributes with each data item. –TTR denotes the next time the proxy should poll the server to refresh the data which has changed in the interim. –Rapidly changing data results in a smaller TTR. –Proxy only pull those changes that are of interest to the user, need not pull every single change.

Pull Adaptive TTR approach –[TTR min,TTR max ] : range of TTR value. –TTR mr : smallest TTR value used so far. –TTR dyn :according to the last two recent changes to reflect the changes in the future. TTR dyn =(w × TTR estimate )+((1-w) × TTR latest ) TTR estimate = –Weight w: 0.5 < w < 1, initially 0.5

Pull –Computing TTR adaptive : Max(TTR min, Min(TTR max, a× TTR mr +(1-a) ×TTR dyn )) 0 ≦ a ≦ 1,with a higher fidelity, a higher value of a.

Push The proxy registers with a server, identifying the data of interest and the associated tcr, i.e.,the value c. Whenever the value of the data changes, the server uses the tcr value c to determine if the new value should be pushed to the proxy. The current value Dι is pushed if and only if | Dι - Dκ| ≧ c, 0< κ < ι. Dκwas the last value that was pushed to the proxy. The server needs to maintain state information of a list of proxies and data items, the tcr of each proxy and the last update sent to that proxy.

Push(cont.) Key advantage: it can meet stringent coherency requirements.Since the server is aware of every change.

Performance of Pull vs. Push Experimental model –Pull: vanilla HTTP web server with prototype proxy. –Push: prototype server uses unicast and connection- oriented sockets to push data to proxies. –On a local intranet. Traces used –Using real world stock price streams as dynamic data. –A trace that is 2 hours long,approximately data values.

Performance of Pull vs. Push –The Pull approach was evaluated using the Adaptive TTR algorithm with a = 0.9,TTR min =1 sec,and three TTR max values of 10,30 and 60 seconds.

Performance of Pull vs. Push Maintenance of Temporal Coherency –A push-based server is well-suited to achieve a fidelity value of 1. –As to pull-based server, the frequency of the pulls(the assignment of TTR values) determines the degree to which client needs are met. –We quantify the fidelity of pull-based approach in the probability that user’s tcr will be met. –Measuring the durations when |U(t)-S(t)| >c. –Let δ 1, δ 2,…,δ n denotes the total time for which data was observed by a user.

Performance of Pull vs. Push –The fidelity is expressed as a percentage –Observed_period is the total time for which data was observed by a user.

Pull-based algorithm with adaptive TTRs

Performance of Pull vs. Push Communication Overheads –In a push approach, number of messages transferred over the network is equal to the number of times the user is informed of data changes. –A pull approach requires two messages: HTTP request and response per poll. –We quantify communication overheads of the number of messages exchanged between server and proxy.

Coherency requirement $0.05 ≦ c ≦ $0.4

Performance of Pull vs. Push Resiliency –By virtue of being stateless, a pull-based server is resilient to failures. –In contrast, a push-based server maintains crucial state information about the needs of its clients; this state is lost when the server fails.

PAP approach PAP(Push and Pull): simultaneously employs both push and pull. Client Server Client register and informs its coherency requirements tcr. Pull with alg. A to decide its TTR value Server pushes the new data value, if server fails, client is served as well as alg. A

PAP approach Approximation to compute the client’s next TTR. Let diff = T(i) - T(i-1) Server predicts the next client polling time as t predict =T(i) + diff. In practice, server should allow the client to pull data if the changes of interest to the client occur close to the client’s expected pulling time. Server waits, for a duration of ε,for client to pull. If a client doesn’t pull when server expect it to, the server extends the push duration by adding (diff - ε) to t predict

PAP approach A series of rapid changes occur during (t i , t i+1 ), the probability that some violations may occur in (t i , t i+1 ) is very high and thus these changes will also be pushed by the server. This occurrence further forcing a decrease in the TTR at the proxy and causing frequent polls from the proxy. So the TTR value at the proxy will tend towards TTR min and diff will also approach zero. Thus making the durations of possible pushes from the server close to zero.

POP approach POP: Dynamically choosing between Push or Pull. A server can categorize its clients either as push or pull one and this categorization can change with system dynamics. Basic ideas: –allow failures at the server to be detected early so that,if possible,clients can switch to pulls. – servers are designed to push data values when one of two conditions is met. –Server can be designed to provide push service as the default to all clients provided it has sufficient resources.

POP approach –If the request desires 100% fidelity and the server doesn’t have sufficient resources to satisfy it, then the server takes steps to convert some push clients to pull. If this conversion is not possible, then the new request is denied.

Conclusion Combining Push and Pull approaches Goal :maintaining user specified coherency and fidelity requirements. Intelligent : ability to dynamically choose the most efficient set of mechanisms to service each application. Adaptive : adapt a particular mechanism to changing network and workload characteristics.