FeedEx: Collaborative Exchange of News Feeds Seung Jun and Mustaque Ahamad Georgia Tech Information Security Center Georgia Institute of Technology.

Slides:



Advertisements
Similar presentations
A Media Subscription Service By Peter Kriens CEO aQute OSGi Technology Officer
Advertisements

GeoFeed: A Location-Aware News Feed System
Peer-to-Peer and Social Networks An overview of Gnutella.
© Telcordia Technologies 2004 – All Rights Reserved AETG Web Service Tutorial AETG is a service mark of Telcordia Technologies. Telcordia Technologies.
NGAS – The Next Generation Archive System Jens Knudstrup NGAS The Next Generation Archive System.
Research Issues in Web Services CS 4244 Lecture Zaki Malik Department of Computer Science Virginia Tech
Yunling Wang VoIP Security COMS 4995 Nov 24, 2008 XCAP The Extensible Markup Language (XML) Configuration Access Protocol (XCAP)
Supporting Cooperative Caching in Disruption Tolerant Networks
November 2006 TECO-WIS, Seoul 1 Definition of the Internet Network of networks –millions of smaller domestic, academic, business, and government networks.
A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Social Media.
Peer-to-Peer (P2P) Distributed Storage 1Dennis Kafura – CS5204 – Operating Systems.
SIMPLE Presence Traffic Optimization and Server Scalability Vishal Kumar Singh Henning Schulzrinne Markus Isomaki Piotr Boni IETF 67, San Diego.
A 2 -MAC: An Adaptive, Anycast MAC Protocol for Wireless Sensor Networks Hwee-Xian TAN and Mun Choon CHAN Department of Computer Science, School of Computing.
HIERARCHY REFERENCING TIME SYNCHRONIZATION PROTOCOL Prepared by : Sunny Kr. Lohani, Roll – 16 Sem – 7, Dept. of Comp. Sc. & Engg.
Corona: A High Performance Publish-Subscribe System for the World Wide Web Authors: V. Ramasubramanian, R. Peterson and E.G. Sirer Cornell University Presenter:
Ranveer Chandra , Kenneth P. Birman Department of Computer Science
Web Content Filter: technology for social safe browsing Ilya Tikhomirov Institute for Systems Analysis of the Russian Academy of Sciences
Cobra: Content-based Filtering and Aggregation of Blogs and RSS Feeds Ian Rose 1, Rohan Murty 1, Peter Pietzuch 2, Jonathan Ledlie 1, Mema Roussopoulos.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
Multicasting in Mobile Ad-Hoc Networks (MANET)
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
FeedTree: Sharing Web Micronews with Peer-to-Peer Event Notification D. Sandler, A. Mislove, A. Post, P. Druschel Presented by: Andrew Sutton.
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Dept. of Computer Science & Engineering, CUHK1 Trust- and Clustering-Based Authentication Services in Mobile Ad Hoc Networks Edith Ngai and Michael R.
Corona: A High Performance Publish-Subscribe System for the World Wide Web Cornell University, Ithaca, NY Networked System Design and Implementation (NSDI),
An Authentication Service Against Dishonest Users in Mobile Ad Hoc Networks Edith Ngai, Michael R. Lyu, and Roland T. Chin IEEE Aerospace Conference, Big.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Web Caching Schemes For The Internet – cont. By Jia Wang.
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
Content Networking - CON Content Overlay Network Vishal Kumar Singh Eilon Yardeni April, 28 th 2005.
Peer-to-peer file-sharing over mobile ad hoc networks Gang Ding and Bharat Bhargava Department of Computer Sciences Purdue University Pervasive Computing.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Web Cache. Introduction what is web cache?  Introducing proxy servers at certain points in the network that serve in caching Web documents for faster.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Privacy-Preserving P2P Data Sharing with OneSwarm -Piggy.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
On P2P Collaboration Infrastructures Manfred Hauswirth, Ivana Podnar, Stefan Decker Infrastructure for Collaborative Enterprise, th IEEE International.
GeoGrid: A scalable Location Service Network Authors: J.Zhang, G.Zhang, L.Liu Georgia Institute of Technology presented by Olga Weiss Com S 587x, Fall.
Gil EinzigerRoy Friedman Computer Science Department Technion.
Wireless Ad Hoc Podcasting. Ad hoc ad hoc network typically refers to a system of network requiring little or no planning a decentralized type of wireless.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Trust- and Clustering-Based Authentication Service in Mobile Ad Hoc Networks Presented by Edith Ngai 28 October 2003.
Network Computing Laboratory FeedEx: Collaborative Exchange of News Feeds Seung Jun, Mustaque Ahamad Georgia Institute of Technology WWW 2006.
PPWEB: A Peer-to-Peer Approach for Web Surfing On the Go Ling-Jyh Chen, Ting-Kai Huang Institute of Information Science, Academia Sinica, Taiwan Guang.
ECO-DNS: Expected Consistency Optimization for DNS Chen Stephanos Matsumoto Adrian Perrig © 2013 Stephanos Matsumoto1.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
Communication Support for Location- Centric Collaborative Signal Processing in Sensor Networks Parmesh Ramanathan University of Wisconsin, Madison Acknowledgements:K.-C.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Eclipse Attacks on Overlay Networks: Threats and Defenses By Atul Singh, et. al Presented by Samuel Petreski March 31, 2009.
VLDB2005 CMS-ToPSS: Efficient Dissemination of RSS Documents Milenko Petrovic Haifeng Liu Hans-Arno Jacobsen University of Toronto.
S imple O bject A ccess P rotocol Karthikeyan Chandrasekaran & Nandakumar Padmanabhan.
ICP and the Squid Web Cache Duane Wessels and K. Claffy 산업공학과 조희권.
Push Technology Humie Leung Annabelle Huo. Introduction Push technology is a set of technologies used to send information to a client without the client.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Connect. Communicate. Collaborate Deploying Authorization Mechanisms for Federated Services in the eduroam architecture (DAMe)* Antonio F. Gómez-Skarmeta.
CoopNet: Cooperative Networking
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
1 FairOM: Enforcing Proportional Contributions among Peers in Internet-Scale Distributed Systems Yijun Lu †, Hong Jiang †, and Dan Feng * † University.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Presentation on Distributed Web Based Systems Submitted by WWW
Client Behavior and Feed Characteristics of RSS
Early Measurements of a Cluster-based Architecture for P2P Systems
Data-Centric Networking
PPWEB: A Peer-to-Peer Approach for Web Surfing On the Go
Presentation transcript:

FeedEx: Collaborative Exchange of News Feeds Seung Jun and Mustaque Ahamad Georgia Tech Information Security Center Georgia Institute of Technology

Motivation RSS/Atom feeds have become increasingly popular –Published by most traditional media and blogs Scalability of feed servers –Frequent pull requests create high load –Infrequent requests increase latency and may lead to missed items Our Approach –Use resources at peer nodes to deliver feed items –Scalable growth in resources with service demand Challenges –Peers may not fully cooperate and execute the agreed protocols

FeedEx Overview Feeds have different update and usage patterns. –A new hybrid transport mechanism –Pull from servers –Push among peer nodes Peers in FeedEx –Form a distribution mesh, –Fetch feeds from web servers occasionally, and –Exchange new entries among each other –Peer incentives for exchanging entries

RSS/Atom Primer Feed format NYT Technology Basics: Going Wireless on... Wi-Fi has revolutionized the... Current way of reading feeds –Stand-alone applications (e.g., Mozilla Thunderbird) –Web-based service (e.g., Bloglines and My Yahoo!)

Analysis of Feed Publishing Purpose –Interesting by itself and helpful in designing FeedEx Methodology –245 popular feeds monitored for 10 days –Feeds fetched every 2 minutes

Publishing Rate by Rank

Entry Count

Publishing Rate by Time

Entry Lifetime

Architecture of FeedEx

Bootstrapping Obtain a list of peers –Dedicated list server (Gnutella and BitTorrent) –Embedding (Pseudoserving [Kong and Ghosal 1999] and CoopNet [Padmanabhan and Sripanidkulchai 2002] ) –Local cache Connect to peers 1.Establish connection 2.Exchange subscription sets: {(url,hop),...}

Neighbor Selection Metrics for good neighbors –Subscription set match –Topological proximity –Duration of relationship

Adaptive Fetching from Servers Coordinated fetching by peers –High coordination overhead –Lots of nodes with high churn rate Solution: Adaptive fetching –Freshness rate f : Fraction of new entries in a fetched document –Set a target freshness rate f t –Fetching interval is doubled or halved, bounded by T min and T max

Entry Exchange Among Peers New entries obtained –By fetching from web servers –From neighbors Entry bundle –A set of new entries –Document identifier (did): Assigned by SHA-1 digest –Flooded to matching neighbors Two-phase flooding –check_did(did) call: 344 bytes including HTTP request header –put_entries(bundle)

Incentive Mechanism Pairwise fairness is simple and effective –Uses local information only –Easy to implement and enforce the mechanism Contribution metric c j,i : c j,i += w fhf Deficit of contribution d i,j : d i,j = c i,j c j,i Node i ensures d i,j < D for every neighbor j and a parameter D.

Prototype Implementation Python: python.org XML-RPC: xmlrpc.com/spec Twisted: twistedmatrix.com SQLite: sqlite.org Universal Feed Parser: feedparser.org

Experimental Setup Two modes –Stand-alone applications: sln –FeedEx: xch Metrics –Time lag –Missing entries –Communication cost Experiments –Use 189 PlanetLab nodes –Run 22 hours on a weekday –Primary factor: 6 fetching intervals –Let each node subscribe 20 out of 70 feeds

Results: Time Lag

Results: Missing Entries

Results: Communication Cost

Advantages Server scalability Archivability Controllability Filtering and recommendation Privacy

Related Work News feed delivery –Corona (Cornell) –FeedTree (Rice) Web caching and CDN [Freedman et al. 2004, Wang et al. 2004] Gossip-based protocols [Birman et al. 1999, Ganesh et al. 2003, Eugster et al. 2003]

Conclusions A new transport mechanism for news feeds –Pull by and exchange among peers FeedEx encourages cooperation by enforcing pair-wise fairness, while achieving –Reduced feed server load –Low latency –High coverage –Low communication overhead