Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SERVICE AND.

Slides:



Advertisements
Similar presentations
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Advertisements

Peer to Peer and Distributed Hash Tables
Scalable Content-Addressable Network Lintao Liu
A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci A P2P REcommender system based on Gossip Overlays.
Building of P2P Overlay Networks via Voronoi and Gossip Ranieri Baraglia.
Kademlia: A Peer-to-peer Information System Based on the XOR Metric Petar Mayamounkov David Mazières A few slides are taken from the authors’ original.
PDPTA03, Las Vegas, June S-Chord: Using Symmetry to Improve Lookup Efficiency in Chord Valentin Mesaros 1, Bruno Carton 2, and Peter Van Roy 1 1.
Self-Organizing Hierarchical Routing for Scalable Ad Hoc Networking David B. Johnson Department of Computer Science Rice University Monarch.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Small-world Overlay P2P Network
Scaling Distributed Machine Learning with the BASED ON THE PAPER AND PRESENTATION: SCALING DISTRIBUTED MACHINE LEARNING WITH THE PARAMETER SERVER – GOOGLE,
JaDE: A JXTA Support for Distributed Virtual Environments Luca Genovali Laura Ricci Luca Genovali, Laura Ricci Università degli Studi di Pisa JaDE: A JXTA.
Applications over P2P Structured Overlays Antonino Virgillito.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Scalable Resource Information Service for Computational Grids Nian-Feng Tzeng Center for Advanced Computer Studies University of Louisiana at Lafayette.
1 AINA 2006 Wien, April th 2006 DiVES: A DISTRIBUTED SUPPORT FOR NETWORKED VIRTUAL ENVIRONMENTS The IEEE 20th International Conference on Advanced.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
A Scalable Content-Addressable Network Authors: S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker University of California, Berkeley Presenter:
XtreemOS IP project is funded by the European Commission under contract IST-FP XtreemOS WP3.2 - T3.2.3 Scalable Directory Service Design State.
Object Naming & Content based Object Search 2/3/2003.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
Hierarchical P2P Overlays for DVE: An Additively Weighted Voronoi Based Approach Michele Albano Luca Genovali Laura Ricci HIERARCHICAL P2P OVERLAYS FOR.
Ecole Polytechnique Fédérale de Lausanne, Switzerland Efficient processing of XPath queries with structured overlay networks Gleb Skobeltsyn, Manfred Hauswirth,
Peer-to-peer file-sharing over mobile ad hoc networks Gang Ding and Bharat Bhargava Department of Computer Sciences Purdue University Pervasive Computing.
SIMULATING A MOBILE PEER-TO-PEER NETWORK Simo Sibakov Department of Communications and Networking (Comnet) Helsinki University of Technology Supervisor:
Mobile Ad-hoc Pastry (MADPastry) Niloy Ganguly. Problem of normal DHT in MANET No co-relation between overlay logical hop and physical hop – Low bandwidth,
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Tapestry GTK Devaroy (07CS1012) Kintali Bala Kishan (07CS1024) G Rahul (07CS3009)
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
A Distributed Architecture for Multi-dimensional Indexing and Data Retrieval in Grid Environments Athanasia Asiki, Katerina Doka, Ioannis Konstantinou,
CROSS-ROAD: CROSS-layer Ring Overlay for AD Hoc Networks Franca Delmastro IIT-CNR Pisa Cambridge, March 23 rd 2004.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Lyon, June 26th 2006 ICPS'06: IEEE International Conference on Pervasive Services 2006 Routing and Localization Services in Self-Organizing Wireless Ad-Hoc.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Automatic Cache Update Control for Scalable Resource Information Service with WS-Management September 23, 2009 Kumiko Tadano, Fumio Machida, Masahiro Kawato,
Resource Addressable Network (RAN) An Adaptive Peer-to-Peer Substrate for Internet-Scale Service Platforms RAN Concept & Design  Adaptive, self-organizing,
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications.
Reporter : Yu Shing Li 1.  Introduction  Querying and update in the cloud  Multi-dimensional index R-Tree and KD-tree Basic Structure Pruning Irrelevant.
National Institute of Advanced Industrial Science and Technology Query Processing for Distributed RDF Databases Using a Three-dimensional Hash Index Akiyoshi.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
DHT-based unicast for mobile ad hoc networks Thomas Zahn, Jochen Schiller Institute of Computer Science Freie Universitat Berlin 報告 : 羅世豪.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Idit Keidar, Principles of Reliable Distributed Systems, Technion EE, Spring Principles of Reliable Distributed Systems Lecture 2: Distributed Hash.
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
High-Speed Policy-Based Packet Forwarding Using Efficient Multi-dimensional Range Matching Lakshman and Stiliadis ACM SIGCOMM 98.
1. Efficient Peer-to-Peer Lookup Based on a Distributed Trie 2. Complex Queries in DHT-based Peer-to-Peer Networks Lintao Liu 5/21/2002.
PRIN WOMEN PROJECT Research Unit: University of Naples Federico II G. Ferraiuolo
Peer to Peer Network Design Discovery and Routing algorithms
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
Middleware issues: From P2P systems to Ad Hoc Networks
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
NCLAB 1 Supporting complex queries in a distributed manner without using DHT NodeWiz: Peer-to-Peer Resource Discovery for Grids Sujoy Basu, Sujata Banerjee,
Large Scale Sharing Marco F. Duarte COMP 520: Distributed Systems September 19, 2004.
Malugo – a scalable peer-to-peer storage system..
CS694 - DHT1 Distributed Hash Table Systems Hui Zhang University of Southern California.
Chapter Five Distributed file systems. 2 Contents Distributed file system design Distributed file system implementation Trends in distributed file systems.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
CHAPTER 3 Architectures for Distributed Systems
Zhichen Xu, Mallik Mahalingam, Magnus Karlsson
Replication Middleware for Cloud Based Storage Service
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT -Sumanth Kandagatla Instructor: Prof. Yanqing Zhang Advanced Operating Systems (CSC 8320)
Chapter 17: Database System Architectures
Database System Architectures
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Kademlia: A Peer-to-peer Information System Based on the XOR Metric
Presentation transcript:

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SERVICE AND RESOURCE DISCOVERY SUPPORTS OVER P2P OVERLAYS EMANUELE CARLINI, MASSIMO COPPOLA, DOMENICO LAFORENZA, PATRIZIO DAZZI, LAURA RICCI International Conference on Ultra Modern Telecommunications, ICUMT Saint Petersburg, October 12-14th, 2009 Università degli Studi di Pisa Dipartimento di Informatica

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci INTRODUCTION Grid environments exploit a huge amount of geographically scattered computing resources Main features of large computational grids – Dynamic environment – Huge amount of heterogeneous resources – Complex middlewares for accessing the resources XtreemOS: a research project funded by the European Commission – main goal: definition of an Open Source, Grid enabled Operating System – scalable and transparent management of large computational platforms – federation of several virtual organizations – users exploit the distributed system through a standard operating system interface

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SRDS: SERVICE AND RESOURCE DISCOVERY SRDS: a basic service of XtreemOS providing a highly distributed directory service SRDS main features – enables resource look-up and exploitation in a multi-VO environment – hides the effect of scale when exploiting individual systems – may be exploited by different clients other modules of XtreemOS applications – supports different kind of queries key-based multi-attribute range queries over dynamic attributes

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SRDS ARCHITECTURE SRDS exploits a set of P2P overlays where each overlay includes nodes from different virtual organizations The choice of the P2P model enables – scalability – low overhead – fault tolerance – management of information in a dynamic environment SRDS services are exploited by different clients, each one with different requirements. – to cope with the diversity of these requirements, several P2P overlays characterized by different features have been defined (Distributed Hash Tables, structured overlays,...) ‏

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SRDS: THE ARCHITECTURE Facade: an easy-to-extend multiple interface protocols Query Provider (QP): set of modules for client query translation Information Management Layer(IML): common interface to DHT-like overlays ADS(Application Directory Service) = Facade+ QP + IML RSS Resource Selection Service a P2P overlayallowing scalable resource location in large overlays Scalaris, Overlay Weaver: DHT with different characteristics

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SRDS MAIN MODULES: ADS AND RSS RSS (Resource Selection Service) supports resource discovery through queries on constant value attributes CPU = IA32, MEM 2  [4GB;  ), BANDWIDTH  [512Kb=s;  ), DISK  [128GB;  ),OS  fLinux ,..., Linux } ADS (Application Directory Service) supports complex queries over dynamic attributes Example: the RSS selects a set of resources matching whose static attributes match the query constraints. the descriptors of these resources are stored in the ADS. the dynamic state of the resources (for instance, current free memory) is monitored through the ADS RSS acts a machete, while ADS acts like a 'bistury'

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci RSS: RESOURCE SELECTION SERVICE Supports resource discovery through multi attribute range queries over a set of static attributes, i.e.constant-valued attributes, known at inizialization time. RSS main features – each node represents its own attributes in the overlay – no delegation of the resource information to other nodes, like in DHT- based approaches – speed up resource location

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci ADS: THE QUERY PROVIDER (QP) ‏ Query Provider Layer: provides a set of modules devoted to query translation Implements a set of algorithms for the interpretation of the queries of different SRDS clients For instance, a job directory service is required to monitor the state of the jobs of an application/VO – when a new job is created, the client submits an AddJob to the SRDS – the AddJob operation is interpreted by a QP modules which translates it into a sequence of operations on the underlying DHT Check of the existence of a proper job directory service, if it does not exist, it requires its creation Insertion of the job ID into the DHT Insertion further information about the jobs under proper keys to suppor inverse queries – The QP makes all these steps transparent for the user

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci ADS: THE INFORMATION MANAGEMENT LAYER (IML) ‏ Namespaces defines the context where the key is used. For instance different name space for different job directories ADS (Application Directory Service) – provides an implementation of namespaces over DHT – receives from a QP module an abstract operation: OP QP = { op, key M, value M, NSpace, ClientType, ClientID } – provides an implementation of namespaces – generates an operation for the underlying DHT in the proper namespace OP DHT ={ op, key D, value D, auxinfo } where value D: generally equals value M key D: may differ from key M because of namespace implementation auxinfo: data expiration timeouts, user-defined secrets,....

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci EXPLOITING NAMESPACES: AN EXAMPLE Network coordinates (NC) embedding system embed latency such as round trip times among nodes into some geometric space Each node is assigned network coordinates in the geometric space Unmisured round trip times is estimated by computing the distance between two nodes in the geometric space To support direct queries, i.e. given the IP of the nodes return its network coordinates inverse queries given the X/Y coordinate of the node, find the the IP of the 'nearest' neighbours' the ADS exploits three different namespaces: IP, X, Y each namespace may be mapped on a different DHT or on the same DHT and may have different characteristics

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci NAMESPACE IMPLEMENTATION Different choices for the implementation of the namepsaces: a different DHT for each namespace a set of namespaces on the same DHT

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci NAMESPACE IMPLEMENTATION Single Ring Approach: DHT key is prefixed by the an identifier of the name space main drawback: DHT features, like replication strategy, fault repair strategy,... cannot be tuned according to the name space Multiple Ring Approach On demand ring creation Parameters and policies of the DHT ring are customized at ring set-up time Some rings may always remain active – include essential key space, for instance resource directories Smaller rings may have a shorter lifespan – application rings, for instance job directory for a given application,....

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci NAMESPACE IMPLEMENTATION The Current version of the ADS exploits two different rings, based on two different DHT, Scalaris, Overlay Weaver Scalaris – A transactional based DHT – Provides consistent replication of data Overlay Weaver – implements different DHT Chord, Pastry, CAN,... – define a routing layer common to all the DHTs. The Overlay Weaver Architecture

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci COMPLEX QUERIES ON DHT DHT supports only basic key-value queries More complex queries may be submitted by the SRDS clients Multidimensional range queries on dynamic attributes Examples exact match query: Arch.='x86' and CPU-Speed='3 Ghz' and RAM='256MB' partial match queries: CPU-Speed='3 Ghz' and RAM='256MB' (and Arch.=*) ‏ range queries 1Ghz<CPU-Speed<'3Ghz' and 512MB<RAM<1Gb similarity queries (o nearest neighbour queries) ‏ require the definition of a metric in the attribute space the user submits an exact match query, which defines a point P in the attribute space. P may not correspond to any resource. output: k resources nearest to P, according to the defined metric

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci RANGE QUERY SUPPORT – an approach based on the MAAN proposal – exploits the Chord DHT Resource pubblication – Each resource is described by k pairs (a i, v i ) ‏ – A locality preserving hashing function maps the value of each attribute onto the DHT H(v i ) = (v i - v imin ) x (2 m -1) / (v imax – v imin ) ‏ 2 m : dimension of the key space The descriptor of each resource is published onto k DHT nodes SRDS supports multiattribute range queries

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci RANGE QUERY SUPPORT Consider a multi attribute range query a 1  [v 1l, v 1u ],...a k  [v kl,...v ku ] The hashing function maps the range of each attribute onto a DHT range Selectivity of an attribute S i = 2 m / H(v iu ) – H(v il ) The dominant attribute a i = [v il,..v iu ] with the highest selectivity is choosen. The query is sent to H(v il ) and is propagated on a DHT arc A till it reaches H(v iu ) ‏ Each node on the A checks if the query satisfies all the query constraints The results are collected along A and sent by the H(v iu ) ‏ to the querying node

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci PUBLICATION OPTIMIZATION SRDS optimizes the publication process of the resources defined by MAAN Publication optimization: exploits soft state cache to store the routing results obtained during the publication process Routing on the DHT is avoided if the routing path to a node is stored in the cache

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci PUBLICATION OPTIMIZATION A second optimization is defined to avoid the publication of 'unpopular' attributes Popularity of an attribute A = number of times A is chosen as dominant in a query – depends on the query distribution Descriptors associated with low popularity attributes are updated with lower frequency Popularity is – dinamically refined in a distributed fashion by the nodes receiving the queries – estimated at target nodes receiving the query and sent back to publishing nodes by put-reply messages

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci SRDS EVALUATION testing environment: Grid 5000 Platform, nodes belons to different Grid 5000 clusters all nodes publish information every 30s a large fraction of nodes run queries every 100 ms.

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci JOB DIRECTORY SERVICE EVALUATION nodes belonging to two clusters of the Grid 5K platform each node performs publications over the DHT at fixed 30 seconds rate time interval between different requests 200 milliseconds Latency of different operations are measured AddJob requires a set of put/get operations RequestJob: a single DHT get

Sericce and Resource Discovery Supports over P2P Overlays Emanuele Carlini, Massimo Coppola Patrizio Dazzi, Domenico Laforenza, Laura Ricci CONCLUSIONS SRDS: a service and resourse discovery support developed for the XtreemOs distributed operating system Provides scalable and customisable information query support over large platforms Future works: – testing SRDS on a large computing platform – dynamic definition of namespaces on different DHTs – definition of hierarchical name spaces – investigation of further strategies for range queries (multi attribute range and neighbours query) ‏