Count / Top-k Continuous Queries on P2P Networks 01/11/2006.

Slides:



Advertisements
Similar presentations
SDN Controller Challenges
Advertisements

Cognitive Publish/Subscribe for Heterogeneous Clouds Šarūnas Girdzijauskas, Swedish Institute of Computer Science (SICS) Joint work with:
Adaptive Frequency Counting over Bursty Data Streams Bill Lin, Wai-Shing Ho, Ben Kao and Chun-Kit Chui Form CIDM07.
An Innovative Approach to Content Search Across P2P Inter-Networks Potharaju S.R.P Saradhi Mohmed Nazuruddin Shaik Potharaju S R Aditya Under The Guidance.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
Routing Indices For Peer-to-Peer Systems Arturo Crespo, Hector Garcia-Molina Stanford ICDCS 2002.
2005/11/09 Continuous Queries in P2P Networks. Motivation.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Expediting Searching Processes via Long Paths in P2P Systems 05/30 IDEA Lab.
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
P2p, Spring 05 1 Topics in Database Systems: Data Management in Peer-to-Peer Systems March 29, 2005.
Peer-to-peer Multimedia Streaming and Caching Service Jie WEI, Zhen MA May. 29.
An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.
Spotlighting Decentralized P2P File Sharing Archie Kuo and Ethan Le Department of Computer Science San Jose State University.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
Communication-Efficient Distributed Monitoring of Thresholded Counts Ram Keralapura, UC-Davis Graham Cormode, Bell Labs Jai Ramamirtham, Bell Labs.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
LSDS-IR’08, October 30, Peer-to-Peer Similarity Search over Widely Distributed Document Collections Christos Doulkeridis 1, Kjetil Nørvåg 2, Michalis.
Object Naming & Content based Object Search 2/3/2003.
CS218 – Final Project A “Small-Scale” Application- Level Multicast Tree Protocol Jason Lee, Lih Chen & Prabash Nanayakkara Tutor: Li Lao.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Amir Rasti Daniel Stutzbach Reza Rejaie The ION P2P Project University of Oregon On the Long-term Evolution of the Two-Tier.
Supporting Ranked Search in Parallel Search Cluster Networks Fang XiongQiong LuoDyce Jing Zhao {xfang, luo, Hong Kong University of.
Nearcast: A Locality-Aware P2P Live Streaming Approach for Distance Education XUPING TU, HAI JIN, and XIAOFEI LIAO Huazhong University of Science and Technology.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Models of Influence in Online Social Networks
P2P File Sharing Systems
Detecting Distance-Based Outliers in Streams of Data Fabrizio Angiulli and Fabio Fassetti DEIS, Universit `a della Calabria CIKM 07.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
P2P Architecture Case Study: Gnutella Network
Evaluating Centralized, Hierarchical, and Networked Architectures for Rule Systems Benjamin Craig University of New Brunswick Faculty of Computer Science.
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
By Shobana Padmanabhan Sep 12, 2007 CSE 473 Class #4: P2P Section 2.6 of textbook (some pictures here are from the book)
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
APPLICATIONS OF DATA MINING IN INFORMATION RETRIEVAL.
HERO: Online Real-time Vehicle Tracking in Shanghai Xuejia Lu 11/17/2008.
1 Exploiting locality for scalable information retrieval in peer-to-peer networks D. Zeinalipour-Yazti, Vana Kalogeraki, Dimitrios Gunopulos Manos Moschous.
ROUTING ALGORITHMS IN AD HOC NETWORKS
1 Heterogeneity in Multi-Hop Wireless Networks Nitin H. Vaidya University of Illinois at Urbana-Champaign © 2003 Vaidya.
Full-Text Search in P2P Networks Christof Leng Databases and Distributed Systems Group TU Darmstadt.
Super-peer Network. Motivation: Search in P2P Centralised (Napster) Flooding (Gnutella)  Essentially a breadth-first search using TTLs Distributed Hash.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
Enabling Peer-to-Peer SDP in an Agent Environment University of Maryland Baltimore County USA.
Peer Pressure: Distributed Recovery in Gnutella Pedram Keyani Brian Larson Muthukumar Senthil Computer Science Department Stanford University.
Towards Efficient Large-Scale VPN Monitoring and Diagnosis under Operational Constraints Yao Zhao, Zhaosheng Zhu, Yan Chen, Northwestern University Dan.
Peer Centrality in Socially-Informed P2P Topologies Nicolas Kourtellis, Adriana Iamnitchi Department of Computer Science & Engineering University of South.
Dynamic P2P Indexing and Search based on Compact Clustering Mauricio Marin Veronica Gil-Costa Cecilia Hernandez UNSL, Argentina Universidad de Chile Yahoo!
Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.
Analyzing the Vulnerability of Superpeer Networks Against Attack Niloy Ganguly Department of Computer Science & Engineering Indian Institute of Technology,
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
By Jonathan Drake.  The Gnutella protocol is simply not scalable  This is due to the flooding approach it currently utilizes  As the nodes increase.
LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
ADVANCED COMPUTER NETWORKS Peer-Peer (P2P) Networks 1.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Peer to Peer Network Design Discovery and Routing algorithms
Network-Aware Query Processing for Stream- based Application Yanif Ahmad, Ugur Cetintemel - Brown University VLDB 2004.
Brief Announcement : Measuring Robustness of Superpeer Topologies Niloy Ganguly Department of Computer Science & Engineering Indian Institute of Technology,
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 37 – Introduction to P2P (Part 1) Klara Nahrstedt.
An overlay for latency gradated multicasting Anwitaman Datta SCE, NTU Singapore Ion Stoica, Mike Franklin EECS, UC Berkeley
03/19/02Scalab Seminar Series1 Finding Good Peers in Peer-to-Peer Networks Ramaswamy N.Vadivelu Scalab, ASU.
CMSC 691B Multi-Agent System A Scalable Architecture for Peer to Peer Agent by Naveen Srinivasan.
CLUSTERING GRID-BASED METHODS Elsayed Hemayed Data Mining Course.
Kalman Filter and Data Streaming Presented By :- Ankur Jain Department of Computer Science 7/21/03.
Distributed Caching and Adaptive Search in Multilayer P2P Networks Chen Wang, Li Xiao, Yunhao Liu, Pei Zheng The 24th International Conference on Distributed.
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 24 – Introduction to Peer-to-Peer (P2P) Systems Klara Nahrstedt (presented by Long Vu)
Early Measurements of a Cluster-based Architecture for P2P Systems
Presentation transcript:

Count / Top-k Continuous Queries on P2P Networks 01/11/2006

Outline Problem Definition P2P Architecture Count Top-K Experiment Setup Future Work

Streaming Data in P2P P2P  Dynamic changing topology, large scale, … Streaming data  Continuous, unbounded, rapid, time-varying, noise P2P + Streaming data  Dynamic in both data and topology

Objective and Goal Objective  Issue a continuous query to estimate count and top-K Goal  Lower down the communication cost  Lightweight maintenance  Approximated answers  An adaptive and progressive approach

Naïve approach Flooding the overlay continuous  Pros Closer to the exact answer  Cons Network congestion Still non-real time

The State-of-the-Art Count  Focus on one-time answer in P2P  Deal with streaming data only Top-K  P2P environment without streaming data  Distributed environment not P2P

P2P architecture Assumption  Hierarchical P2P (Focused) Super-peer hierarchical structure Query issuer is a super-peer Super peer connect with other super peers Each peer belongs to only one super peer  Pure unstructured P2P

Big picture Group Accumulate information within a group based on the constraint and statistics Set Constraint Report changes Approximated answer

Group in hierarchical P2P Issuer Coordinator Peer

Group in hierarchical P2P

After partition Group1 Group3 Group2 Assume we have N objects and K Groups after partition

User-specified Epsilon Group1 Group3 Group2 User-specified ε(Precision)

Consider a group P4P4 P1P1 P3P3 P2P2 Coordinator Node Objects O1O1 O2O2 O3O3

Each node maintain the distribution information of owning objects P2P2 P4P4 P1P1 P3P3 object Rate # R1R1 R2R2 R3R3 R4R4

At initial - Polling P4P4 P1P1 P3P3 P2P2 Coordinator Node

At initial - Polling P4P4 P1P1 P3P3 P2P2 Coordinator Node

Information at coordinator after polling object # P4P4 P3P3 P2P2 P1P1

Statistics information object # P 1 P 2 P 3 P 4 Δ O 1 1/1 6/6 10/10 5/5 22 O 2 11/11 13/13 5/5 9/7 36 O 3 15/15 6/6 3/3 9/9 33 R T Updated time stamp Maximum changing rate(+/-) of objects in each peer Change value for each object Latest real value Estimated value

Update to Coordinator ( Δ 11, Δ 21, Δ 31) T2T2 ( Δ 12, Δ 22, Δ 32) ( Δ 13, Δ 23, Δ 33)

Calculate Count

Redistribute Epsilon w i =Max(Δ i )/C x,0 where x is the i-index of Max(Δ i ) δ i =w i εC x,0 / ∑w i

Visiting sequence P4P4 P3P3 P2P2 P1P1 Pick those peers would violate δ

Update information Group P 1 P 2 P 3 P 4 Δ O 1 1/1 6/6 10/10 8/8 - O 2 11/11 11/11 5/5 6/6 - O 3 15/15 5/5 3/3 11/11 - R T

For those nodes not being visited Group P 1 P 2 P 3 P 4 Δ O 1 1/2 6/6 10/9 8/8 25 O 2 11/13 11/11 5/4 6/6 34 O 3 15/18 5/5 3/2 11/11 36 R T

Un-notified Leave P1P1 Ping P 1 is dead Remove P 1 ’s information P4P4 P3P3 P2P2

Experiment Setup Generate synthetic data set by statistics distribution for  Streaming data  Life time of peers Metrics  Message size  Communication cost  Response latency  Result accuracy

Top-K Use Regression to predicate the reasonable trend of changes  Once a updated result is required, Super Peer only need to ask those doubtful peers for doubtful objects  Update its counting list, and return the top k objects

Future Work Connect and recommend latent good friends for each user  Good friends: the ones with the same interests (behaviors) Exploiting current connecting peers to discover good friends bit by bit Design a system that could make clusters reflecting current interests of individual peers and connecting them together based on their similarity by using user’s social network

Advantages Reduce search time and diminish query traffic by using friends list By utilizing their different strength of arcs/edges/ties = friendshipness, social networks exceed random-walk networks in quickly finding target objects

Example Level 1 Level 2

Example has larger weight than Score(N i ) Similarity