Handling Spatial Data In P2P Systems Verena Kantere, Timos Sellis, Yannis Kouvaras.

Slides:



Advertisements
Similar presentations
CAN 1.Distributed Hash Tables a)DHT recap b)Uses c)Example – CAN.
Advertisements

Nearest Neighbor Search
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK
Scalable Content-Addressable Network Lintao Liu
Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.
Searching on Multi-Dimensional Data
PDPTA03, Las Vegas, June S-Chord: Using Symmetry to Improve Lookup Efficiency in Chord Valentin Mesaros 1, Bruno Carton 2, and Peter Van Roy 1 1.
Technische Universität Chemnitz Kurt Tutschku Vertretung - Professur Rechner- netze und verteilte Systeme Chord - A Distributed Hash Table Yimei Liao.
Massively Distributed Database Systems Distributed Hash Spring 2014 Ki-Joune Li Pusan National University.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Denial-of-Service Resilience in Peer-to-Peer Systems D. Dumitriu, E. Knightly, A. Kuzmanovic, I. Stoica and W. Zwaenepoel Presenter: Yan Gao.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Attribute-based Indexing Overlay Apr Outline Introduction Basic Idea Advantage Challenge Conclusion.
Thomas ZahnCST1 Seminar: Information Management in the Web Query Processing Over Peer- to-Peer Data Sharing Systems (UC Santa Barbara)
Spatial Indexing I Point Access Methods. PAMs Point Access Methods Multidimensional Hashing: Grid File Exponential growth of the directory Hierarchical.
A Scalable Content Addressable Network (CAN)
--Presented By Sudheer Chelluboina. Professor: Dr.Maggie Dunham.
Spatial Information Systems (SIS) COMP Raster-based structures (2) Data conversion.
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
Looking Up Data in P2P Systems Hari Balakrishnan M.Frans Kaashoek David Karger Robert Morris Ion Stoica.
Spatial Indexing I Point Access Methods.
XtreemOS IP project is funded by the European Commission under contract IST-FP XtreemOS WP3.2 - T3.2.3 Scalable Directory Service Design State.
SkipNet: A Scaleable Overlay Network With Practical Locality Properties Presented by Rachel Rubin CS294-4: Peer-to-Peer Systems By Nicholas Harvey, Michael.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
Searching in Unstructured Networks Joining Theory with P-P2P.
P2P Course, Structured systems 1 Introduction (26/10/05)
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
P-Grid Presentation by Thierry Lopez P-Grid: A Self-organizing Structured P2P System Karl Aberer, Philippe Cudré-Mauroux, Anwitaman Datta, Zoran Despotovic,
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Introduction to Routing. The Routing Problem Apply after placement Input: –Netlist –Timing budget for, typically, critical nets –Locations of blocks and.
1 A scalable Content- Addressable Network Sylvia Rathnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker Pirammanayagam Manickavasagam.
Other Structured P2P Systems CAN, BATON Lecture 4 1.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
Document retrieval Similarity –Vector space model –Multi dimension Search –Range query –KNN query Query processing example.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Multi-dimensional Search Trees
Node Lookup in P2P Networks. Node lookup in p2p networks In a p2p network, each node may provide some kind of service for other nodes and also will ask.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Answering Similar Region Search Queries Chang Sheng, Yu Zheng.
Spatial DBMS Spatial Database Management Systems.
Similarity Searching in High Dimensions via Hashing Paper by: Aristides Gionis, Poitr Indyk, Rajeev Motwani.
P2P Group Meeting (ICS/FORTH) Monday, 28 March, 2005 A Scalable Content-Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp,
1 Distributed Hash Table CS780-3 Lecture Notes In courtesy of Heng Yin.
Chord Advanced issues. Analysis Search takes O(log(N)) time –Proof 1 (intuition): At each step, distance between query and peer hosting the object reduces.
1. Efficient Peer-to-Peer Lookup Based on a Distributed Trie 2. Complex Queries in DHT-based Peer-to-Peer Networks Lintao Liu 5/21/2002.
PRIN WOMEN PROJECT Research Unit: University of Naples Federico II G. Ferraiuolo
1 30 November 2006 An Efficient Nearest Neighbor (NN) Algorithm for Peer-to-Peer (P2P) Settings Ahmed Sabbir Arif Graduate Student, York University.
Multi-dimensional Search Trees CS302 Data Structures Modified from Dr George Bebis.
BATON A Balanced Tree Structure for Peer-to-Peer Networks H. V. Jagadish, Beng Chin Ooi, Quang Hieu Vu.
R-trees: An Average Case Analysis. R-trees - performance analysis How many disk (=node) accesses we ’ ll need for range nn spatial joins why does it matter?
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
Peer-to-Peer Networks 03 CAN (Content Addressable Network) Christian Schindelhauer Technical Faculty Computer-Networks and Telematics University of Freiburg.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
Two Peer-to-Peer Networking Approaches Ken Calvert Net Seminar, 23 October 2001 Note: Many slides “borrowed” from S. Ratnasamy’s Qualifying Exam talk.
Efficient Semantic Web Service Discovery in Centralized and P2P Environments Dimitrios Skoutas 1,2 Dimitris Sacharidis.
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
VLSI Physical Design Automation
COS 461: Computer Networks
Improving and Generalizing Chord
S-Chord: Using Symmetry to Improve Lookup Efficiency in Chord
Spatial Indexing I Point Access Methods.
Zhichen Xu, Mallik Mahalingam, Magnus Karlsson
A Scalable content-addressable network
Multidimensional Indexes
COS 461: Computer Networks
R-trees: An Average Case Analysis
Presentation transcript:

Handling Spatial Data In P2P Systems Verena Kantere, Timos Sellis, Yannis Kouvaras

Problem Definition Assumptions: Structure P2P Overlays Spatial Data in Peers Problem: Necessity for indexing and routing techniques for such an environment

Technique Requirements We need a technique that: Guarantees the retrieval of any existing spatial information in the system Achieves satisfying index size for any node Achieves a satisfying length for any search path Provides easier access to popular information Preserves proximity of areas

Related Work P2PR-tree (Workshop of EDBT ‘04) Quad-tree approach (Poster in ICDE’05) kd-tree approach (WebDB’04)

Partitioning Space Spatial Coding

Partitioning Space cont’d The code of an area A is: A  a z a x a y a z : is the granularity level, i.e. the size in terms of cells. a x : is the average of the digit d 1 of all the area cells. a y : is the average of the digit d 2 of all the area cells. Distance Metrics: d 1 (A 1, A 2 ) = |a 1x -a 2x |+|a 1y -a 2y | d 2 (A 1, A 2 ) = ||a 1x -a 2x |-|a 1y -a 2y || D(A 1, A 2 )  d 1 d 2 D zoom (A 1, A 2 )  D(A 1, A 2 )/(0.5*|A 1.a z +A 2.a z |)

Distributed Indexing for Spatial Areas Distributed Indexing Algorithm Step 1: Indexing the grids of each level Phase A: Indexing areas of the same grid Phase B: Indexing areas of other grids of the same level Step 2: Indexing the grids of other levels

Grid Example

Direct Neighbors & Data Hashing Peers have 2 neighbors on each dimension: Horizontal Vertical Perpendicular Neighbors are the closest peers on each direction. Areas are stored to the closest neighbor following a priorities of dimensions

Direct Neighbors: Examples

Routing of Spatial Queries Basic Routing Algorithm Step 1: Find a peer corresponding to the same size as the sought area. Step 2: Find a peer hosting the sought area or an overlapping area of the latter Step 3: Find the peer hosting the sought area

Routing Example

Details…

Complexity Characteristics Complexity for both pure Chord and our method is O(3log 2 n). But with our method: Peers with big areas are favored with small indexes Shorter search paths for bigger areas The search path grows with the relative distance

Assignment of Peer IDs The peer ID plays a significant role in our approach  joining peers can ask for specific ids: it is adequate to assign an id that belongs to the same or similar level as the requested id The above constraint loosens with the size of the area (higher levels are more connected)

Preliminary Experiments

Preliminary Experiments (2)