EMI INFSO-RI-261611 Catalogue synchronization & ACL propagation Fabrizio Furano (CERN IT-GT)

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

NGAS – The Next Generation Archive System Jens Knudstrup NGAS The Next Generation Archive System.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
One.box Distributed home service interface. Core Components Pop3 client Router Storage Pop3 Server.
ECEN5053 SW Eng of Dist Systems, Arch Des Part 3, Univ of Colorado, Boulder1 Architectural Design of Distributed Systems, Part 3 ECEN5053 SW.
Software Engineering and Middleware: a Roadmap by Wolfgang Emmerich Ebru Dincel Sahitya Gupta.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Software Frameworks for Acquisition and Control European PhD – 2009 Horácio Fernandes.
O. Keeble, F. Furano, A. Manzi, A. Ayllon, I. Calvet, M. Hellmich DPM Workshop 2014 DPM Monitoring.
Chapter 30 Electronic Mail Representation & Transfer
Service Broker Lesson 11. Skills Matrix Service Broker Service Broker, provides a solution to common problems with message delivery and consistency that.
» Explain the way that electronic mail ( ) works » Configure an client » Identify message components » Create and send messages.
An Architecture for Online Information Integration on Concurrent Resource Access on a Z39.50 Environment Michalis Sfakakis 1 and Sarantos Kapidakis 2 An.
1 Software Development Configuration management. \ 2 Software Configuration  Items that comprise all information produced as part of the software development.
Meet with the AppEngine Márk Gergely eu.edge. What is AppEngine? It’s a tool, that lets you run your web applications on Google's infrastructure. –Google's.
SSDS: Data for Science A Walkthrough of Proposed SSDS Capabilities 4 April 2002 John Graybeal.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
 Communication Tasks  Protocols  Protocol Architecture  Characteristics of a Protocol.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
SOA-14: Deploying your SOA Application David Cleary Principal Software Engineer.
INFNGrid Constanza Project: Status Report A.Domenici, F.Donno, L.Iannone, G.Pucciani, H.Stockinger CNAF, 6 December 2004 WP3-WP5 FIRB meeting.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved RPC Tanenbaum.
Towards a Global Service Registry for the World-Wide LHC Computing Grid Maria ALANDES, Laurence FIELD, Alessandro DI GIROLAMO CERN IT Department CHEP 2013.
6 th dCache WS | Daniel Becker| 18 April 2012 | 1 Daniel Becker 6 th dCache workshop, Zeuthen, April 18, 2012 The HTTP Federation.
Stephen Burke – Data Management - 3/9/02 Partner Logo Data Management Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF.
EMI is partially funded by the European Commission under Grant Agreement RI Discovering Infrastructures with EMI Registry (EMIR) Emidio Giorgio.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
EMI is partially funded by the European Commission under Grant Agreement RI SA2 – Development Tools Andres Abad Rodriguez SA2.4 Tools Activity Leader.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
IETF 67 – SIMPLE WG SIMPLE Problem Statement Draft-rang-simple-problem-statement-01 Tim Rang - Microsoft Avshalom Houri – IBM Edwin Aoki – AOL.
AMQP, Message Broker Babu Ram Dawadi. overview Why MOM architecture? Messaging broker like RabbitMQ in brief RabbitMQ AMQP – What is it ?
EMI INFSO-RI ARC tools for revision and nightly functional tests Jozef Cernak, Marek Kocan, Eva Cernakova (P. J. Safarik University in Kosice, Kosice,
Group Communication Theresa Nguyen ICS243f Spring 2001.
The new FTS – proposal FTS status. EMI INFSO-RI /05/ FTS /05/ /05/ Bugs fixed – Support an SE publishing more than.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
Lecture 7 Page 1 CS 111 Online Process Communications and Concurrency CS 111 On-Line MS Program Operating Systems Peter Reiher.
EMI INFSO-RI Catalogue synchronization & ACL propagation Fabrizio Furano (CERN IT-GT)
CS223: Software Engineering Lecture 13: Software Architecture.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Overview of DMLite Ricardo Rocha ( on behalf of the LCGDM team.
EMI is partially funded by the European Commission under Grant Agreement RI Common Authentication Library Daniel Kouril, for the CaNL PT EGI TF.
Distributed Data Access Control Mechanisms and the SRM Peter Kunszt Manager Swiss Grid Initiative Swiss National Supercomputing Centre CSCS GGF Grid Data.
Globus Data Storage Interface (DSI) - Enabling Easy Access to Grid Datasets Raj Kettimuthu, ANL and U. Chicago DIALOGUE Workshop August 2, 2005.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Monitoring Pedro Andrade CERN – IT/GT HEPiX Spring 2012.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
EMI is partially funded by the European Commission under Grant Agreement RI Common Authentication Library Daniel Kouril, for the CaNL PT EGI CF.
Lecture 1 Page 1 CS 111 Summer 2013 Important OS Properties For real operating systems built and used by real people Differs depending on who you are talking.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
EMI INFSO-RI Catalogue synchronization & ACL propagation Fabrizio Furano (CERN IT-GT-DMS)
EMI INFSO-RI Catalogue synchronization & ACL propagation Fabrizio Furano (CERN IT-GT)
EMI is partially funded by the European Commission under Grant Agreement RI Future Proof Storage with DPM Oliver Keeble (on behalf of the CERN IT-GT-DMS.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Powerpoint Templates Data Communication Muhammad Waseem Iqbal Lecture # 07 Spring-2016.
UNICORE and Argus integration Krzysztof Benedyczak ICM / UNICORE Security PT.
Federating Data in the ALICE Experiment
Argus EMI Authorization Integration
Jean-Philippe Baud, IT-GD, CERN November 2007
The DPIaaS Controller Prototype
Integrating HA Legacy Products into OpenSAF based system
Self Healing and Dynamic Construction Framework:
Peer-to-peer networking
A Messaging Infrastructure for WLCG
Express Spring Integration
THE STEPS TO MANAGE THE GRID
Taxonomy of network applications
Java Messaging Service (JMS)
POP: Building Automation Around Secure Server Deployment
Jonathan Rosenberg dynamicsoft
Message Queuing.
Presentation transcript:

EMI INFSO-RI Catalogue synchronization & ACL propagation Fabrizio Furano (CERN IT-GT)

2 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Various catalogues keep information that is related – E.g. LFC keeps info about the content of remote Storage Elements, each one with its own catalogue A change in the permissions of a file in LFC is not automatically reflected by the peripheric catalogue If a SE looses a file, the LFC does not know If a new file is not correctly registered -> dark data Keeping them in sync is a very hard problem Namespace scanning for diffs is an expensive workaround The problem

3 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Make the various catalogues/SE able to talk to each other – In order to exchange messages that keep them synchronized – 2 directions: Central Catalogue->SE (downstream) – e.g. to propagate changes in the permissions SE->Central Catalogue (upstream) – e.g. to propagate info about lost and missing files The idea

4 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Looking for good ways to reliably communicate and cooperate Communication SE1 SEn SE2 SE or exp. catalogue subscribes to the relevant topics (e.g. “Changes”) SE sends to the appropriate topics (e.g. “Lost”) LFC Subscribes to the relevant topics (e.g. “Lost”) SE Sends to the appropriate topics (e.g. “Changes”) LFC Other catalogue/SE e.g. ATLAS Broker(s)

5 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Types of interactions What can we do with this? – Fix inconsistencies as they are found “SE1 apparently lost file X” – Prevent inconsistencies by sending messages when something happens “File X has new access permissions” “SE1 has a new file Y” – Allow a central system to query the others to synchronize itself “Who has file Z”? “Do you still have file W?”

6 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Message content This will be enhanced/agreed in the WG The message content/semantics is the building block of the functionalities We are using a very simple structure – Header Proto version Sender hostname – Body A set of (tag, value), just very basic types These are message semantics - related, hence need to use the same set of tags (things like “Filename”)

7 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Proposed demonstrators to use reliable message (i.e. industry standard MQ) as backbone of the reliability – All interested catalogues can “subscribe” for permissions that changed in the LFC – Lost files can be broadcast on the “lost” topic to interested catalogues Note: in general we are talking about synchronizing catalogues Somehow possible also to fix a local catalogue with respect to the content of the local disks – Trickier but with an obvious added value Milestone

8 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Milestone This means – Defining the architecture – Develop the system Libraries+executables that implements the guidelines Usable for LFC/DPM and anybody else – Integrate it into an early LFC/DPM prototype – Deploy it for evaluation/testing – We have now (Mid Dec 2010) a working prototype

9 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation The architecture LFC Msg brokers Upstream topic “SEMsg_upstream” NotAvailable(sfn) [ FileCreated(sfn) ] Adapter Downstream topic “SEMsg_downstream” Chmod(sfn) SE or other Catalogue A file can be N/A if: - it was requested to a DB that does not know it - OR if it was requested to a GridFTP that does not find it (trickier) Fix info! Adapter Fix info! Uses a virtual destination, e.g. : Consumer.DPNS1.SEMsg_downstream the broker queues messages for this endpoint if it disconnects momentarily. Uses a virtual destination, e.g. : Consumer.LFC1.SEMsg_upstream the broker queues messages for this endpoint if it disconnects momentarily.

10 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation The adapter The adapter is a component called SEMsg – Built to be robust, efficient and easy to integrate – Plugin-based (ev. with “null” plugins), loaded at runtime A plugin that performs actions (in the catalogue) when a message comes A plugin that performs SE(Catalogue)-specific actions when a message has to be sent through the API – Provides commands to manually send messages As well as a simple C/C++ API to send our messages (hides message composition and the security stuff) – The same library is used for the LFC and DPM prototype, but with different sets of plugins Hence, more sets of plugins can be added, to talk to other systems

11 Security DPM consumer plugin DPM consumer plugin Default producer plugin Default producer plugin Producer API, e.g. NotifyChmod(sfn) NotifyNotAvailable(sfn) Producer API, e.g. NotifyChmod(sfn) NotifyNotAvailable(sfn) LFC EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Detail - SEMsg plugins Security LFC consumer plugin LFC consumer plugin LFC producer plugin LFC producer plugin Producer API, e.g. NotifyChmod(sfn) NotifyNotAvailable(sfn) Producer API, e.g. NotifyChmod(sfn) NotifyNotAvailable(sfn) DPM Msg brokers These implement some kind of realtime storage aggregation Must be robust, fast and low latency To be able to trust them and build an evolving thing

12 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation SEMsg Completely asynchronous, multithreaded design Does everything (in background) to keep the consumer connection UP Never hiccups in case of conn troubles/broker restart The consumer may live as a small daemon or be started by another daemon (LFC/DPM use the external daemon by now) Also provides commands to send the messages – E.g. to manually notify that a set of files is not available Natively supports bulk operations – Automatic creation of bulk messages from the internal queues. – No need for weird APIs or complex implementations.

13 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Security Preliminary requirements: – Guarantee the identity of the senders – Guarantee the correctness of the content – Simplicity of use and deployment It seems that the best way to do this is to sign the messages at app level (i.e. in SEMsg) – Like we do with PGP for s – This puts an additional difficulty in the dev Not a tragic one if it’s done once and for all

14 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation What’s next Write the first version of the protocol spec Message format, conventions etc... Prepare for official builds & deployment Still without security by now Implement a more serious configuration subsystem Non-trivial packaging exercise Add less trivial notifications (e.g. ACLs) Sync with the other SE developers Start implementing the security part Being able to deploy LFC/DPM+SEMsg for prod test Robustness tests + ev. fixes/additions in SEMsg/DPM/LFC At each step, keep an eye on the applicability to the computing models + ev. fixes/additions

15 EMI INFSO-RI Nov 2010 F.Furano - Catalogue Synchronization & ACL propagation Several aspects have still to be sorted out, but at least we started well. Making catalogues and SEs interact seems a good way to attack the consistency problem – It’s a form of realtime interaction between SEs and catalogues The live demonstrator of SEMsg/LFC/DPM is available – The messaging (test) infrastructure and the tools seem really OK Conclusions

16 EMI INFSO-RI Oct 2010 F.Furano - Catalogue Synchronization & ACL propagation Thank you EMI is partially funded by the European Commission under Grant Agreement INFSO-RI