LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.

Slides:



Advertisements
Similar presentations
A Construction of Locality-Aware Overlay Network: mOverlay and Its Performance Found in: IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 22, NO.
Advertisements

Quality-of-Service Routing in IP Networks Donna Ghosh, Venkatesh Sarangan, and Raj Acharya IEEE TRANSACTIONS ON MULTIMEDIA JUNE 2001.
Advisor : Prof. Yu-Chee Tseng Student : Yi-Chen Lu 12009/06/26.
Multicast in Wireless Mesh Network Xuan (William) Zhang Xun Shi.
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Network Layer Routing Issues (I). Infrastructure vs. multi-hop Infrastructure networks: Infrastructure networks: ◦ One or several Access-Points (AP) connected.
Efficient Search - Overview Improving Search In Peer-to-Peer Systems Presented By Jon Hess cs294-4 Fall 2003.
Improving Search in Peer-to-Peer Networks Beverly Yang Hector Garcia-Molina Presented by Shreeram Sahasrabudhe
An Innovative Approach to Content Search Across P2P Inter-Networks Potharaju S.R.P Saradhi Mohmed Nazuruddin Shaik Potharaju S R Aditya Under The Guidance.
Structuring Unstructured Peer-to-Peer Networks Stefan Schmid Roger Wattenhofer Distributed Computing Group HiPC 2007 Goa, India.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
Gossip Scheduling for Periodic Streams in Ad-hoc WSNs Ercan Ucan, Nathanael Thompson, Indranil Gupta Department of Computer Science University of Illinois.
Farnoush Banaei-Kashani and Cyrus Shahabi Criticality-based Analysis and Design of Unstructured P2P Networks as “ Complex Systems ” Mohammad Al-Rifai.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Denial-of-Service Resilience in Peer-to-Peer Systems D. Dumitriu, E. Knightly, A. Kuzmanovic, I. Stoica and W. Zwaenepoel Presenter: Yan Gao.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Expediting Searching Processes via Long Paths in P2P Systems 05/30 IDEA Lab.
Peer-to-Peer Networks João Guerreiro Truong Cong Thanh Department of Information Technology Uppsala University.
P2p, Spring 05 1 Topics in Database Systems: Data Management in Peer-to-Peer Systems March 29, 2005.
Eddie Bortnikov/Aran Bergman, Principles of Reliable Distributed Systems, Technion EE, Spring Principles of Reliable Distributed Systems Recitation.
1 CSE 591-S04 (lect 14) Interconnection Networks (notes by Ken Ryu of Arizona State) l Measure –How quickly it can deliver how much of what’s needed to.
Exploring Tradeoffs in Failure Detection in P2P Networks Shelley Zhuang, Ion Stoica, Randy Katz HIIT Short Course August 18-20, 2003.
1 Denial-of-Service Resilience in P2P File Sharing Systems Dan Dumitriu (EPFL) Ed Knightly (Rice) Aleksandar Kuzmanovic (Northwestern) Ion Stoica (Berkeley)
UNIVERSITY OF JYVÄSKYLÄ Topology Management in Unstructured P2P Networks Using Neural Networks Presentation for IEEE Congress on Evolutionary Computing.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao Cisco Systems, Inc. (Joint work with Christine Lv, Edith Cohen, Kai Li and Scott Shenker)
1 A Distributed Delay-Constrained Dynamic Multicast Routing Algorithm Quan Sun and Horst Langendorfer Telecommunication Systems Journal, vol.11, p.47~58,
 Idit Keidar, Technion Intel Academic Seminars, February Octopus A Fault-Tolerant and Efficient Ad-hoc Routing Protocol Idit Keidar, Technion Joint.
presented by Hasan SÖZER1 Scalable P2P Search Daniel A. Menascé George Mason University.
Vassilios V. Dimakopoulos and Evaggelia Pitoura Distributed Data Management Lab Dept. of Computer Science, Univ. of Ioannina, Greece
LPT for Data Aggregation in Wireless Sensor networks Marc Lee and Vincent W.S Wong Department of Electrical and Computer Engineering, University of British.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Searching in Unstructured Networks Joining Theory with P-P2P.
UNIVERSITY OF JYVÄSKYLÄ Resource Discovery in Unstructured P2P Networks Distributed Systems Research Seminar on Mikko Vapa, research student.
Nearcast: A Locality-Aware P2P Live Streaming Approach for Distance Education XUPING TU, HAI JIN, and XIAOFEI LIAO Huazhong University of Science and Technology.
Focus on Distributed Hash Tables Distributed hash tables (DHT) provide resource locating and routing in peer-to-peer networks –But, more than object locating.
Network Topologies.
P2P Architecture Case Study: Gnutella Network
1 Reading Report 4 Yin Chen 26 Feb 2004 Reference: Peer-to-Peer Architecture Case Study: Gnutella Network, Matei Ruoeanu, In Int. Conf. on Peer-to-Peer.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
Routing Indices For P-to-P Systems ICDCS Introduction Search in a P2P system –Mechanisms without an index –Mechanisms with specialized index nodes.
Structuring P2P networks for efficient searching Rishi Kant and Abderrahim Laabid Abderrahim Laabid.
RELAX : An Energy Efficient Multipath Routing Protocol for Wireless Sensor Networks Bashir Yahya, Jalel Ben-Othman University of Versailles, France ICC.
Benjamin AraiUniversity of California, Riverside Reliable Hierarchical Data Storage in Sensor Networks Song Lin – Benjamin.
Super-peer Network. Motivation: Search in P2P Centralised (Napster) Flooding (Gnutella)  Essentially a breadth-first search using TTLs Distributed Hash.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
1 Detecting and Reducing Partition Nodes in Limited-routing-hop Overlay Networks Zhenhua Li and Guihai Chen State Key Laboratory for Novel Software Technology.
Computer Networks Dr. Jorge A. Cobb The Performance of Query Control Schemes for the Zone Routing Protocol.
The Performance of Query Control Schemes for the Zone Routing Protocol Zygmunt J. Haas Marc R. Pearlman.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.
A Multicast Mechanism in WiMax Mesh Network Jianfeng Chen, Wenhua Jiao, Pin Jiang, Qian Guo Asia-Pacific Conference on Communications, (APCC '06)
P2p, Fall 06 1 Topics in Database Systems: Data Management in Peer-to-Peer Systems Search in Unstructured P2p.
KAIS T On the problem of placing Mobility Anchor Points in Wireless Mesh Networks Lei Wu & Bjorn Lanfeldt, Wireless Mesh Community Networks Workshop, 2006.
On Reducing Mesh Delay for Peer- to-Peer Live Streaming Dongni Ren, Y.-T. Hillman Li, S.-H. Gary Chan Department of Computer Science and Engineering The.
Paper by Song Guo and Oliver Yang; supporting images and definitions from Wikipedia Presentation prepared by Al Funk, VT CS 6204, 10/30/07.
Peer to Peer Network Design Discovery and Routing algorithms
1 Scalable AOI-cast for Peer-to-Peer Networked Virtual Environments Jehn-Ruey Jiang, Yu-Li Huang, and Shun-Yun Hu CSIE, National Central University, Taiwan.
Self-stabilizing energy-efficient multicast for MANETs.
Spring 2000CS 4611 Routing Outline Algorithms Scalability.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
School of Electrical Engineering &Telecommunications UNSW Cost-effective Broadcast for Fully Decentralized Peer-to-peer Networks Marius Portmann & Aruna.
1 Roie Melamed, Technion AT&T Labs Araneola: A Scalable Reliable Multicast System for Dynamic Wide Area Environments Roie Melamed, Idit Keidar Technion.
4. Interconnecting Networks: Routers. © Tallal Elshabrawy 2 Bridges Vs Routers BRIDGES DO WELL IN SMALL (FEW HUNDRED HOSTS) WHILE ROUTERS USED IN LARGE.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Distributed Caching and Adaptive Search in Multilayer P2P Networks Chen Wang, Li Xiao, Yunhao Liu, Pei Zheng The 24th International Conference on Distributed.
SCOPE: Scalable Consistency in Structured P2P Systems
COS 561: Advanced Computer Networks
Exploiting Routing Redundancy via Structured Peer-to-Peer Overlays
Topology Mapping Bo Sheng Sept. 15.
Presentation transcript:

LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary

Unstructured P2P Overlay P2P overlay –Application level network over physical network –Self-organized by peers voluntarily Characteristics –Power-law distribution: a small number of peers have high connectivity –Dynamic population: peers come and go frequently –Resilient to random node failures

Search in P2P Overlay Flooding Expanding ring (ICS’02) Random walk (ICS’02, SIGCOM’03) Iterative deepening (ICDCS’02) Directed BFS (ICDCS’02) Super peer (ICDE’03) Interested-based Locality (INFOCOM’03)

Flooding Simple and robust –No state maintenance needed –High tolerance to node failures Effective and of low latency –Always find the shortest / fastest routing paths Fundamental operation for –Broadcasting in distributed systems –P2P communications

Problems of Flooding Loops in Gnutella networks –Caused by redundant links –Result in endless message routing Current solutions by Gnutella –Detect and discard redundant messages –Limit TTL (time-to-live) of messages Unnecessary traffic is still too much – The redundant links are still there

Traffic Minimization: Spanning Tree Reduce traffic without changing P2P overlay How much bandwidth can we save? –Average degree of Gnutella nodes: about 3 ~ 5 –N-node spanning tree N-1 links N-1 messages for a broadcast –Estimated traffic reduction: about 67% ~ 80% Bandwidth efficiency is not the only objective

Problems of Spanning Tree Long latency for flooding –More than 30 hops to cover 95% of nodes –Only 7 hops to cover 95% of nodes by Gnutella flooding Weak reliability due to node failures –A node failure can disconnect a large portion of network

P2P Overlay

Flooding in Spanning Tree HOPS = 7HOPS = 8HOPS = 9HOPS = 10HOPS = 11HOPS = 0HOPS = 1HOPS = 2HOPS = 3HOPS = 4HOPS = 5HOPS = 6 Spanning Tree

HOPS = 0HOPS = 1HOPS = 2HOPS = 3HOPS = 4HOPS = 5HOPS = 6 Flooding in P2P Overlay

Node Failure

Trade-offs Traffic efficiency and routing latency Redundancy and robustness Let us take a look at pure flooding in Gnutella … NEED to update

Observations of Pure Flooding

Motivations Pure flooding is efficient in the initial hops –Node coverage grows quickly, while –Only account for a small portion of redundant msgs Most redundant messages are generated in high hops

Our Solution Combining both merits of pure flooding and spanning tree Constructing FloodNet: a tree-like structure over P2P network Flooding over P2P network in initial hops Flooding over FloodNet in rest hops

FloodNet: a Tree-like Sub-overlay States maintained in each node –Number of neighbors –The node degree of each neighbor Topology construction –Father node: the neighbor with the highest degree –Tie breaking –Dynamic updating: very low overhead A tree-like structure over Gnutella overlay

Constructing FloodNet

FloodNet: Loops? At most one loop in the structure Nodes in a loop have the same degree –Root candidates

LOOP

FloodNet: Loops? At most one loop in the structure Nodes in a loop have the same degree –Root candidates Endless routing –Easy to detect and avoid: same as Gnutella Redundant messages –At most one redundant message per flooding

FloodNet: Multiple Trees? Possible but the number is very small

FloodNet: Multiple Trees? Possible but the number is very small –Only high degree nodes can be tree roots –Only a few nodes have high connectivity (recall the power law distribution) –These high degree nodes may connect each other Normally less than 10 trees in Gnutella overlay according to our simulation

Broadcast in FloodNet Aims –Reach all FloodNet trees –Cover most nodes in a few hops Problems –How can a node initiating a broadcast know other FloodNet trees? –Flooding from a leaf node needs long hops to cover most portion of the tree

LightFlood Basic idea –Broadcasting messages in parallel from multiple nodes of different trees 2-stage broadcasting –Low hops: the initial M flooding hops –High hops: the rest N flooding hops Denoted as (M, N) policy (7, 0) is same as Gnutella flooding

LightFlood Low hops: utilizing redundant links –Flooding in P2P overlay –Reach many nodes of different trees with small overheads High hops: keep away from redundant links –Flooding in FloodNet –Flooding from multiple nodes in parallel

Low hops

High hops

Performance Evaluation

Coverage vs. Latency (4,*) takes only additional 3 hops to reach same coverage as (7,0) (7, 0) (4, *) 3 hops

Traffic Efficiency (7, 0): 28.1% (4, *): 90.8%

Degradation Due to Node Failures Nearly same coverage

Expanding Ring: Traffic LightFlood Pure Flood

Conclusion FloodNet is easy to construct and maintain –Using local and neighboring information –Dynamically updated with little overhead LightFlood is both broadcast effective and bandwidth efficient –Large coverage –Small routing hops –Small amount of redundant messages An efficient and effective flooding scheme