EGEE is a project funded by the European Union under contract IST-2003-508833 Distributed Superscheduling Matteo Mordacchini INFN – Padova Università Ca’

Slides:



Advertisements
Similar presentations
Peer to Peer and Distributed Hash Tables
Advertisements

Kademlia: A Peer-to-peer Information System Based on the XOR Metric.
1Department of Electrical Engineering and Computer Science, University of Michigan, USA. 2Department of Computer Science, National University of Singapore,
CHORD – peer to peer lookup protocol Shankar Karthik Vaithianathan & Aravind Sivaraman University of Central Florida.
Technische Universität Yimei Liao Chemnitz Kurt Tutschku Vertretung - Professur Rechner- netze und verteilte Systeme Chord - A Distributed Hash Table Yimei.
Technische Universität Chemnitz Kurt Tutschku Vertretung - Professur Rechner- netze und verteilte Systeme Chord - A Distributed Hash Table Yimei Liao.
Ion Stoica, Robert Morris, David Liben-Nowell, David R. Karger, M
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications Ion StoicaRobert Morris David Liben-NowellDavid R. Karger M. Frans KaashoekFrank.
Massively Distributed Database Systems Distributed Hash Spring 2014 Ki-Joune Li Pusan National University.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Presented by Elisavet Kozyri. A distributed application architecture that partitions tasks or work loads between peers Main actions: Find the owner of.
Topics in Reliable Distributed Systems Lecture 2, Fall Dr. Idit Keidar.
1 Peer-To-Peer-Based Resource Discovery In Global Grids: A Tutorial Rajiv Ranjan, Aaron Harwood And Rajkumar Buyya, The University Of Melbbourne IEEE Communications.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
1 A Scalable Content- Addressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Proceedings of ACM SIGCOMM ’01 Sections: 3.5 & 3.7.
Structure Overlay Networks and Chord Presentation by Todd Gardner Figures from: Ion Stoica, Robert Morris, David Liben- Nowell, David R. Karger, M. Frans.
XtreemOS IP project is funded by the European Commission under contract IST-FP XtreemOS WP3.2 - T3.2.3 Scalable Directory Service Design State.
Secure Overlay Services Adam Hathcock Information Assurance Lab Auburn University.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 CS 194: Distributed Systems Distributed Hash Tables Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer.
Improving Data Access in P2P Systems Karl Aberer and Magdalena Punceva Swiss Federal Institute of Technology Manfred Hauswirth and Roman Schmidt Technical.
P-Grid Presentation by Thierry Lopez P-Grid: A Self-organizing Structured P2P System Karl Aberer, Philippe Cudré-Mauroux, Anwitaman Datta, Zoran Despotovic,
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
P2P File Sharing Systems
Freenet. Anonymity  Napster, Gnutella, Kazaa do not provide anonymity  Users know who they are downloading from  Others know who sent a query  Freenet.
Security Considerations for Structured p2p Peng Wang 6/04/2003.
Lyon, June 26th 2006 ICPS'06: IEEE International Conference on Pervasive Services 2006 Routing and Localization Services in Self-Organizing Wireless Ad-Hoc.
1 Reading Report 5 Yin Chen 2 Mar 2004 Reference: Chord: A Scalable Peer-To-Peer Lookup Service for Internet Applications, Ion Stoica, Robert Morris, david.
1 Distributed Hash Tables (DHTs) Lars Jørgen Lillehovde Jo Grimstad Bang Distributed Hash Tables (DHTs)
Vincent Matossian September 21st 2001 ECE 579 An Overview of Decentralized Discovery mechanisms.
Handling Spatial Data In P2P Systems Verena Kantere, Timos Sellis, Yannis Kouvaras.
Grid Workload Management Massimo Sgaravatto INFN Padova.
Resource Addressable Network (RAN) An Adaptive Peer-to-Peer Substrate for Internet-Scale Service Platforms RAN Concept & Design  Adaptive, self-organizing,
Presentation 1 By: Hitesh Chheda 2/2/2010. Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT Laboratory for Computer Science.
Fast Searching in Peer-to-Peer Networks Self-Organizing Parallel Search Clusters Rocky Dunlap.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Peer-to-Peer Supported Cache System for File Transfer Joonbok Lee
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan Presented.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
Dynamic Networks for Peer-to-Peer Systems Pierre Fraigniaud CNRS LRI, Univ. Paris Sud Joint work with Philippe Gauron.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract INFSO-RI Grid Accounting.
ADVANCED COMPUTER NETWORKS Peer-Peer (P2P) Networks 1.
EGEE is a project funded by the European Union under contract IST Data Management Gaps Krzysztof Nienartowicz Gavin McCance EGEE JRA1 Data.
Kademlia: A Peer-to-peer Information System Based on the XOR Metric
EGEE is a project funded by the European Union under contract IST WS-Based Advance Reservation and Co-allocation Architecture Proposal T.Ferrari,
BATON A Balanced Tree Structure for Peer-to-Peer Networks H. V. Jagadish, Beng Chin Ooi, Quang Hieu Vu.
Peer-to-Peer Networks 03 CAN (Content Addressable Network) Christian Schindelhauer Technical Faculty Computer-Networks and Telematics University of Freiburg.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
Freenet: Anonymous Storage and Retrieval of Information
CS 347Notes081 CS 347: Parallel and Distributed Data Management Notes 08: P2P Systems.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
NCLAB 1 Supporting complex queries in a distributed manner without using DHT NodeWiz: Peer-to-Peer Resource Discovery for Grids Sujoy Basu, Sujata Banerjee,
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
EGEE is a project funded by the European Union under contract IST End-user requirements for network monitoring Paul Mealor JRA4 EGEE Kick-Off.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
EGEE is a project funded by the European Union under contract IST LCG open issues Massimo Sgaravatto INFN Padova JRA1 IT-CZ cluster meeting,
EGEE is a project funded by the European Union under contract IST Report from the PTF Fabrizio Pacini Datamat S.p.a. Milan, IT-CZ JRA1 meeting,
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract IST Report from.
EGEE is a project funded by the European Union under contract IST Padova report Massimo Sgaravatto On behalf of the INFN Padova JRA1 Group.
Brocade: Landmark Routing on Overlay Networks
Ion Stoica, Robert Morris, David Liben-Nowell, David R. Karger, M
Magdalena Balazinska, Hari Balakrishnan, and David Karger
EE 122: Peer-to-Peer (P2P) Networks
DHT Routing Geometries and Chord
Building Peer-to-Peer Systems with Chord, a Distributed Lookup Service
Reading Report 11 Yin Chen 1 Apr 2004
MIT LCS Proceedings of the 2001 ACM SIGCOMM Conference
Presentation transcript:

EGEE is a project funded by the European Union under contract IST Distributed Superscheduling Matteo Mordacchini INFN – Padova Università Ca’ Foscari di Venezia JRA1 IT-CZ Meeting, 14-15/12/2004

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling Problem When submitting a job, the UI uses a list of known RBs If the first RB fails to satisfy the request, the UI will re- submit the request to the second RB on the list, and so on. Problems Maybe not all the RBs with resources useful for the user are included in the list. If the RBs capable of satisfying the request are not on top of the list, there is a great waste of time and resources before reaching the right site. Some brokers could be overloaded (while other brokers could be under-loaded)

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling Problem Aim Find a (fast) way to allow RBs to collaborate in order to satisfy a user’s request. Don’t affect the current scheduling methods in the case the first RB already has the resources capable of processing the job. Forwarding of job requests to another WMS must be analysed in detail, but seems feasible

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling Some Considerations This “collaboration” among brokers should be transparent to the end-user Having *all* resources (CEs) available on the Grid seen by a single broker (or by all brokers) can have scalability problems Solutions in literature Construct a distributed index of the resources and organize them into a overlay network  Different brokers considering different subsets of the existing Grid resources

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling Indexing Some solutions suggest to organize the resources into a multidimensional space, where every attribute represents a dimension of the space. The space will be divided into zones. Every index entry points to one of this zone and then it is used to access all the resources that lay in that region of space.

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling Overlay Networks Given the above space subdivision, each region is assigned to a node in the network. Every node is responsible of  giving the information about the resources that belong to its region  routing request that affect other regions, passing it to the neighbour that is the closest to the interested regions. The network is self-organizing: the presence of a resource is communicated to only one node; then it will collaborate with the other nodes in order to find the right region (and associated node) where the resource has to be indexed

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling - Proposal Use some of the nodes in the Grid to construct an overlay network similar to that described above. Instead of indexing the single resources (CEs), we suggest to index RBs, in order to find the best set of RBs that are (with high probability) able to solve a given user request. RB s

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling - Proposal In order to index RBs, we need to create a sort of “RB profile”  Considering in some way the content of the ISM The creation of the “RB profile” is done as follows:  The resources of the RB are grouped into sets based on the similarities of the characteristics of the CEs.  A representative of every set is created.  The RB profile is formed by the representatives of the sets. The RB will be indexed in as much regions of the space as the number of different representative he has in its profile

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling - Proposal Search operations If a RB cannot satisfy a user request, it asks to a node in the overlay network to find a set of RB that can process the given request. Once it has this set, it could ask to the RBs in the set to check if they really can process the request (they really have the proper CEs available); then, he will transfer the request to one of the RBs that have answered positively.

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling - Proposal Advantages For registration and query operations, every RB needs only to know the address of one node in the overlay network. RB profiles could be updated less frequently than the descriptions of CEs. The process is completely transparent to the user. Disadvantages In order to improve performances, it is better to replicate the nodes in the overlay network. Staging of Input- and Output-Sandbox Communications to the LB.

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling References S. Ratnasamy, P. Francis, M. Handley, R. Karp, “A Scalable Content Addressable Network”, ACM SIGCOMM’01, Aug , 2001 I. Stoica, R. Morris, D. Karger, M. Frans Kaashoek, H. Balakrishnan, “Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications”, ACM SIGCOMM’01, Aug , 2001 A. Andrzejak, Z. Xu, “Scalable Efficient Range Queries for Grid Information Services”, 2nd IEEE International Conference on Peer-to-Peer Computing, Sept. 5-7, 2002, Linköping, Sweden P. Ganesan, B. Yang, H. Garcia-Molina, “One Torus to Rule Them All: Multi-dimensional Queries in P2P Systems”, 7th International Workshop on the Web and Databases (WebDB2004), June 17-18, 2004, Paris, France

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling C. Schmidt, M. Parashar, “Flexible Information Discovery in Decentralized Distributed Systems”, 12th IEEE International Symposium on High Performance Distributed Computing (HPDC'03), June , 2003, Seattle, Washington M. Cai, M. Frank, J. Chen, P. Szekely, “MAAN: A Multi-Attribute Addressable Network for Grid Information Services”, 4th International Workshop on Grid Computing, Phoenix, Arizona, 2003 K. Aberer, P. Cudré-Mauroux, A. Datta, Z. Despotovic, M. Hauswirth, M. Punceva, R. Schmidt, “P-Grid: A Self-organizing Structured P2P System”, ACM SIGMOD Record, Vol. 32, No. 3, September 2003 A. M. Ouksel, G. Moro, “G-Grid: A Class of Scalable and Self- organizing Data Structures for Multi-dimensional Querying and Content Routing in P2P Networks”, Technical Report no. DEIS-LIA , February 2004, Univ. of Bologna, Univ. of Illinois at Chicago

JRA1 IT-CZ Meeting, 14-15/12/ Distributed Superscheduling E. Tanin, A. Harwood, H. Samet, “Indexing Distributed Complex Data for Complex Queries”, Proceedings of the National Conference on Digital Government Research, pp , Seattle, Washington, 2004