1 Efficient Massive Sharing of Content among Peers by Peter Triantafillou, Chryssani Xiruhaki and Manolis Koubarakis Dept. of Electronics and Computer.

Slides:



Advertisements
Similar presentations
February 20, Spatio-Temporal Bandwidth Reuse: A Centralized Scheduling Mechanism for Wireless Mesh Networks Mahbub Alam Prof. Choong Seon Hong.
Advertisements

A Cloud Data Center Optimization Approach using Dynamic Data Interchanges Prof. Stephan Robert University of Applied Sciences.
Supporting Cooperative Caching in Disruption Tolerant Networks
P2PR-tree: An R-tree-based Spatial Index for P2P Environments ANIRBAN MONDAL YI LIFU MASARU KITSUREGAWA University of Tokyo.
Replication Strategies in Unstructured Peer-to-Peer Networks Edith Cohen Scott Shenker This is a modified version of the original presentation by the authors.
Fast Algorithms For Hierarchical Range Histogram Constructions
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Small-world Overlay P2P Network
Technical Architectures
1 Complexity of Network Synchronization Raeda Naamnieh.
Deterministic Wavelet Thresholding for Maximum-Error Metrics Minos Garofalakis Bell Laboratories Lucent Technologies 600 Mountain Avenue Murray Hill, NJ.
2/25/2004 The Google Cluster Architecture February 25, 2004.
A Hybrid Approach of Failed Disk Recovery Using RAID-6 Codes: Algorithms and Performance Evaluation Yinlong Xu University of Science and Technology of.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
1 of 14 1/15 Schedulability Analysis and Optimization for the Synthesis of Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
Adaptive Content Management in Structured P2P Communities Jussi Kangasharju Keith W. Ross David A. Turner.
Zoë Abrams, Ashish Goel, Serge Plotkin Stanford University Set K-Cover Algorithms for Energy Efficient Monitoring in Wireless Sensor Networks.
Performance Evaluation of Peer-to-Peer Video Streaming Systems Wilson, W.F. Poon The Chinese University of Hong Kong.
Object Naming & Content based Object Search 2/3/2003.
Efficient agent-based selection of DiffServ SLAs over MPLS networks Thanasis G. Papaioannou a,b, Stelios Sartzetakis a, and George D. Stamoulis a,b presented.
A Grid-enabled Branch and Bound Algorithm for Solving Challenging Combinatorial Optimization Problems Authors: M. Mezmaz, N. Melab and E-G. Talbi Presented.
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007 (TPDS 2007)
Paraskevi Raftopoulou 1,2 Paraskevi Raftopoulou 1,2 and Euripides G.M. Petrakis 2 1 Max-Planck Institute for Informatics, Saarbruecken, Germany
PNear Combining Content Clustering and Distributed Hash-Tables Ronny Siebes Vrije Universiteit, Amsterdam The netherlands
Distributed Load Balancing for Key-Value Storage Systems Imranul Hoque Michael Spreitzer Malgorzata Steinder.
Load Balancing in Distributed Computing Systems Using Fuzzy Expert Systems Author Dept. Comput. Eng., Alexandria Inst. of Technol. Content Type Conferences.
Min Xu1, Yunfeng Zhu2, Patrick P. C. Lee1, Yinlong Xu2
Network Aware Resource Allocation in Distributed Clouds.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Presented By HaeJoon Lee Yanyan Shen, Beng Chin Ooi, Bogdan Marius Tudor National University of Singapore Wei Lu Renmin University Cang Chen Zhejiang University.
Optimal Client-Server Assignment for Internet Distributed Systems.
A Dynamic Data Grid Replication Strategy to Minimize the Data Missed Ming Lei, Susan Vrbsky, Xiaoyan Hong University of Alabama.
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
O PTIMAL SERVICE TASK PARTITION AND DISTRIBUTION IN GRID SYSTEM WITH STAR TOPOLOGY G REGORY L EVITIN, Y UAN -S HUN D AI Adviser: Frank, Yeong-Sung Lin.
QoS Supported Clustered Query Processing in Large Collaboration of Heterogeneous Sensor Networks Debraj De and Lifeng Sang Ohio State University Workshop.
1 Push-to-Peer Video-on-Demand System. 2 Abstract Content is proactively push to peers, and persistently stored before the actual peer-to-peer transfers.
1 Statistical Modeling and Analysis of P2P Replication to Support Vod Service zyp Infocom, 2011, Shanghai.
ASSIGNMENT, DISTRIBUTION AND QOS PROVISIONING IN COMMUNICATION NETWORKS.
An Effective Method to Improve the Resistance to Frangibility in Scale-free Networks Kaihua Xu HuaZhong Normal University.
Vasilis Syrgkanis Cornell University
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Peer-to-Peer Video Systems: Storage Management CS587x Lecture Department of Computer Science Iowa State University.
P2P Content Search: Give the Web Back to the People Matthias Bender Sebastin Michel Peter Triantafillou Gerhard Weikum Christian Zimmer Mariam John CSE.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
GPFS: A Shared-Disk File System for Large Computing Clusters Frank Schmuck & Roger Haskin IBM Almaden Research Center.
COMMUNICATING VIA FIREFLIES: GEOGRAPHIC ROUTING ON DUTY-CYCLED SENSORS S. NATH, P. B. GIBBONS IPSN 2007.
On Improving the Performance Dependability of Unstructured P2P Systems via Replication ANIRBAN MONDAL YI LIFU MASARU KITSUREGAWA Institute of Industrial.
CMSC 691B Multi-Agent System A Scalable Architecture for Peer to Peer Agent by Naveen Srinivasan.
P2P Networking: Freenet Adriane Lau November 9, 2004 MIE456F.
Distributed Control and Autonomous Systems Lab. Sang-Hyuk Yun and Hyo-Sung Ahn Distributed Control and Autonomous Systems Laboratory (DCASL ) Department.
Web Server Load Balancing/Scheduling
Golrezaei, N. ; Molisch, A.F. ; Dimakis, A.G.
Web Server Load Balancing/Scheduling
Trustworthiness Management in the Social Internet of Things
Parallel Density-based Hybrid Clustering
CHAPTER 3 Architectures for Distributed Systems
CSE 591: Energy-Efficient Computing Lecture 15 SCALING: storage
Comparison of LAN, MAN, WAN
Paraskevi Raftopoulou, Euripides G.M. Petrakis
Peer-to-Peer Video Services
H-store: A high-performance, distributed main memory transaction processing system Robert Kallman, Hideaki Kimura, Jonathan Natkins, Andrew Pavlo, Alex.
Group Based Management of Distributed File Caches
How Yahoo! use to serve millions of videos from its video library.
ContinuStreaming: Achieving High Playback Continuity of Gossip-based Peer-to-Peer Streaming IPDPS 2008 LI Zhenhua Dept. Computer, Nanjing University.
Presentation transcript:

1 Efficient Massive Sharing of Content among Peers by Peter Triantafillou, Chryssani Xiruhaki and Manolis Koubarakis Dept. of Electronics and Computer Engineering Technical University of Crete Chania, Greece

2 Outline Background Aim System Architecture Experimental Result

3 Background Internet content sharing system has become very popular recently(e.g. Napster) Content sharing systems could consists of very large number of nodes, offering resources and content(documents).

4 Background The central control of information at special nodes is undesirable – Central points of failure – Peformance bottleneck Need to create a P2P system Architecture in which nodes can collaborate with each other.

5 Aim Ensuring high performance in large-scale P2P content sharing systems. This paper focus on two particular sub-goals: – Ensuring load balancing across all nodes of such a system. – Ensuring short response times to user requests.

6 System Architecture Divide documents into a number of semantic categories. It uniquely maps each document to a semantic category Category 1Category 2 Document 1 Document 2 Document 3 Document 4 Document 5Document 6 ……..

7 System Architecture The nodes will be divided into a number of clusters. Node 2Node 1 Node 7Node 6 Node 5 Node 4 Node 3 Cluster 1Cluster 2

8 System Architecture Each semantic category can only be found in one and only one clusters Cluster 1 Cluster 2 Category 1 Category 3 Category 2 Category 4

9 System Architecture Each node store the cluster metadata describing which category of documents are stored by which cluster nodes Global load balance if – Inter-cluster load balance – Intra-cluster load balance

10 User Request Processing Request can be either to publish(contribute) or to retrieve(download) documents. – Identified the semantic category of the document. – The request will be forwarded to the proper clusters – the request will be forwarded to one randomly chosen node of the cluster – If the node does/could not store the requested documents, it will forward the request to some other nodes of the same cluster.

11 User Request Processing With this approach, the paper claims that – The load is balanced within a cluster – The response time is low and it is bounded by the number of nodes in the cluster.

12 Inter-cluster Load Balance Assumption: – Each node contribute documents belonging to a single semantic category – The content has known popularity, with document popularities following the Zipf distribution – Each node have the same storage and processing power The system contain N nodes

13 Inter-cluster Load Balance Question: Is there a partition of N nodes into k(given) clusters N 1, N 2,…, N k such that the following two constraint are satisfied? – If two documents belong to the same semantic category, then the nodes that contributed/store these documents belongs to the same cluster – Clusters have equal normalized popularities Formally, p(Ni)/| Ni | = p(Nj)/| Nj |,for all 1<=i,j <=k

14 Inter-cluster Load Balance Cluster 3 Cluster 1 Cluster 2

15 Inter-cluster Load Balance Cluster 1Cluster 2 Cluster 3

16 Inter-cluster Load Balance The problem is NP-complete Consider partitioning the given set of nodes N into clusters with nearly equal normalized popularities Minimizing the following quantity:

17 Inter-cluster Load Balance The paper proposed a greedy algorithm, called MinDiff – Initially all cluster are empty – The MinDiff considers each semantic category in turn – It assign the category to the cluster of nodes which minimize the quantity of the above equation

18 Inter-cluster Load Balance MinDiff is incomplete(it might miss the optimal solution) MinDiff runs in polynomial time and achieves very good results.

19 Experimental Results Documents: Semantic categories: 300 Clusters: 50 Popularity distribution of documents are Zipf-like with parameter O = 0.3 and O = 0.7

20 Experimental Results O=0.3 (more skewed)

21 Experimental Results O=0.6 (less skewed)