WP3 The status of the EU DataGrid's R-GMA system Steve Fisher / RAL 24/4/2003.

Slides:



Advertisements
Similar presentations
EGEE is a project funded by the European Union under contract IST R-GMA status and plans Abdeslem DJAOUI / RAL GRIDPP10 meeting at CERN, 3.
Advertisements

21 Sep 2005LCG's R-GMA Applications R-GMA and LCG Steve Fisher & Antony Wilson.
WP3 WP3 17/9/2002 Steve Fisher / RAL. WP3 Steve Fisher 17/9/2002WP32 Summary Quality Current status 1.2 R-GMA in release 2.0 Recent Requirements Work.
EGEE is a project funded by the European Union under contract IST R-GMA: Status and Plans Antony Wilson / RAL GridPP 12 - Brunel
Steve Fisher/RAL - 12/6/2002R-GMA and WP71 R-GMA Use the GMA from GGF A relational implementation Applied to both information and monitoring Creates impression.
Grid Application Builders Teach In31/01/02Antony Wilson Information & Monitoring Services WP3.
Information and Monitoring The European DataGrid Project Team
The role of a Mediator in R-GMA Manfred Oevers IBM Andrew Cooke Heriot Watt Laurence Field RAL Steve Fisher RAL James Magowan IBM Werner Nutt Heriot Watt.
CrossGrid WP3 Task Non-invasive Monitoring Trinity College Dublin Brian Coghlan, Stuart Kenny, David O’Callaghan Santiago FEB-2003.
CrossGrid WP3 Task 3.3 Grid Monitoring Trinity College Dublin (TCD, AC14 - CR11) Brian Coghlan, Stuart Kenny CYFRONET Academic Computer Centre, Krakow.
WP3 R-GMA Revisited 23/7/2002 Werner Nutt / Heriot-Watt University.
Canonical Producer CP API User Code CP Servlet Files CreateTable, Port, Protocol, Security, SQL Support, Multiple Query Support Security Insert Query Port.
WP3 R-GMA & OGSA 23/7/2002 James Magowan / IBM. WP3 James Magowan - 23/7/2002R-GMA & OGSA2 Contributors Brian CoghlanTCD Andy CookeHeriot-Watt Ari DattaQMUL.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to R-GMA: Relational Grid Monitoring Architecture.
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
WP3 RGMA Deployment Laurence Field / RAL Steve Fisher / RAL.
Republishing Mechanisms for R-GMA Benefits and Approaches. Talk by: Alasdair Gray Collaborators: Andy Cooke, Lisha Ma, and Werner Nutt Heriot-Watt University.
Poznan July-2003 CrossGrid Task 3.3 CrossGrid Task 3.3 Grid Monitoring Trinity College Dublin (TCD, AC14 – CR11) Brian Coghlan, Stuart Kenny, David O’Callaghan.
DataGrid is a project funded by the European Commission under contract IST GridPP-2 Middleware 4 th -5 th Mar 2004 Information and Monitoring.
Introduction on R-GMA Shi Jingyan Computing Center IHEP.
Computer and Automation Research Institute Hungarian Academy of Sciences Presentation and Analysis of Grid Performance Data Norbert Podhorszki and Peter.
DataGrid is a project funded by the European Union CHEP March 2003 R-GMA 1 R-GMA: First results after deployment Steve Fisher (EDG - WP3)
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Information System (IS) Valeria Ardizzone.
GRID IIII D UK Particle Physics GridPP Collaboration meeting - R.P.Middleton (RAL/PPD) 23-25th May Grid Monitoring Services Robin Middleton RAL/PPD24-May-01.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
GLite Information System(s) Antonio Juan Rubio Montero CIEMAT 10 th EELA Tutorial. Madrid, May 7 th -11 th,2007.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks R-GMA Now With Added Authorization Steve.
13 May 2004EB/TB Middleware meeting Use of R-GMA in BOSS for CMS Peter Hobson & Henry Nebrensky Brunel University, UK Some slides stolen from various talks.
WP3 Information and Monitoring Steve Fisher / RAL 23/9/2003.
An information and monitoring system for static and dynamic information about grid resources, applications, networks … RDBMS Servlet aware of API during.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
E-infrastructure shared between Europe and Latin America 12th EELA Tutorial for Users and System Administrators gLite Information System.
WP3 R-GMA: Likely status New Years Eve Steve Fisher / RAL 24/2/2003.
Computer and Automation Research Institute Hungarian Academy of Sciences SZTAKI’s work in DataGrid WP September Norbert Podhorszki Laboratory of.
EGEE is a project funded by the European Union under contract IST R-GMA: Production Services for Information and Monitoring in the Grid John.
WP3 RGMA Deployment Laurence Field / RAL Steve Fisher / RAL.
WP3 R-GMA: A Relational Grid information and monitoring system Steve Fisher / RAL 13/12/2002.
INFSO-RI Enabling Grids for E-sciencE
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
Website: Answering Continuous Queries Using Views Over Data Streams Alasdair J G Gray Werner.
WP3 Werner Nutt (Heriot-Watt University) R-GMA – DataGrid’s Monitoring System 1/7/2003.
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
R-GMA – an Update A reminder of R-GMA The need for a mediator Work with WP7 Release 1.2 and beyond Some Implications of OGSA.
INFSO-RI Enabling Grids for E-sciencE Building a robust distributed system: some lessons from R-GMA WLCG Service Reliability.
 CMS data challenges. The nature of the problem.  What is GMA ?  And what is R-GMA ?  Performance test description  Performance test results  Conclusions.
WP3 Information and Monitoring Rob Byrom / WP3
E-infrastructure shared between Europe and Latin America gLite Information System(s) Manuel Rubio del Solar CETA-CIEMAT EELA Tutorial, Mérida,
The impact of R-GMA (upon WP1 and WP4). EDG (Paris) 6 Mar James MagowanImpact of R-GMA Grid Monitoring Architecture (GMA) We use it not only for.
INFSO-RI Enabling Grids for E-sciencE Information System Valeria Ardizzone INFN EGEE NA4 Generic Applications Meeting Catania,
An Active Security Infrastructure for Grids Stuart Kenny*, Brian Coghlan Trinity College Dublin.
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
INFSO-RI Enabling Grids for E-sciencE R-GMA Gergely Sipos and Péter Kacsuk MTA SZTAKI Credit to Valeria Ardizzone.
Supporting Join Queries Talk by: Andy Cooke Collaborators: Alasdair Gray, Lisha Ma, and Werner Nutt Heriot-Watt University.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using R-GMA.
INFSO-RI Enabling Grids for E-sciencE gLite Information System: R-GMA Tony Calanducci INFN Catania gLite tutorial at the EGEE User.
CERN 21 January 2005Piotr Nyczyk, CERN1 R-GMA Basics and key concepts Monitoring framework for computing Grids – developed by EGEE-JRA1-UK, currently used.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to R-GMA: Relational Grid Monitoring Architecture.
EGEE is a project funded by the European Union under contract IST The UK Cluster Steve Fisher / RAL JRA1 meeting at Cork, 19/ April
WP3 Relational Work Progress Report 11/5/2001 Steve Fisher / RAL.
The Mediator: What Next? Talk by: Andy Cooke Collaborators: Alasdair Gray, Lisha Ma, and Werner Nutt Heriot-Watt University.
Grid Event Management Using R-GMA Monitoring Framework
gLite Information System(s)
R-GMA as an example of a generic framework for information exchange
The EU DataGrid – Information and Monitoring Services
Practicals on R-GMA Valeria Ardizzone INFN
gLite Information System(s)
R-GMA (Relational Grid Monitoring Architecture) for monitoring applications “s” gLite and LCG.
RELATIONAL GRID MONITORING ARCHITECHTURE
gLite Information System
Canonical Producer CP API CP Servlet User Code Files
Presentation transcript:

WP3 The status of the EU DataGrid's R-GMA system Steve Fisher / RAL 24/4/2003

WP3 Steve Fisher/RAL - 24/4/2003R-GMA2 Who we are Heriot-Watt, Edinburgh –Andrew Cooke, Werner Nutt IBM-UK –James Magowan, (Manfred Oevers), Paul Taylor INFN –Roberto Barbera, Giuseppe Save, Gennaro Tortone Queen Mary, University of London –Roney Cordenonsi, (Ari Datta) CCLRC/PPARC –Rob Byrom, Laurence Field, Steve Hicks, Manish Soni, Antony Wilson, (Xiaomei Zhu), Jason Leake –Linda Cornwall, Abdeslem Djaoui, Steve Fisher, Robin Middleton SZTAKI, Hungary –Peter Kacsuk, Norbert Podhorszki Trinity College Dublin –Brian Coghlan, Stuart Kenny, David O’Callaghan, (John Ryan)

WP3 Steve Fisher/RAL - 24/4/2003R-GMA3 GMA From GGF Very simple model Does not define: –Data model –How data are moved from Producer to Consumer –What registry looks like Producer Consumer Registry Store location Lookup location execute or stream

WP3 Steve Fisher/RAL - 24/4/2003R-GMA4 Use the GMA from GGF A relational implementation –Powerful data model and query language Applied to both information and monitoring Creates impression that you have one RDBMS per VO Producer Consumer Registry Store location Lookup location execute or stream

WP3 Steve Fisher/RAL - 24/4/2003R-GMA5 Relational Data Model Not a general distributed RDBMS system, but a way to use the relational model in a distributed environment where global consistency is not important. Producers announce:SQL “CREATE TABLE” publish:SQL “INSERT” Consumers collect:SQL “SELECT” Some producers, the Registry and Schema make use of RDBMS as appropriate – but what is central is the relational model.

WP3 Steve Fisher/RAL - 24/4/2003R-GMA6 Producer  Consumer Consumer can issue one-off queries –Similar to normal database query Consumer can also start a continuous query –Requests all data published which matches the query Can be seen as an alert mechanism

WP3 Steve Fisher/RAL - 24/4/2003R-GMA7 Registry choices Decided early to keep them separate In fact they have different requirements for distribution/replication Each implemented with one RDBMS per instance Registry (of Producers and Consumers) Schema (descriptions of tables)

WP3 Steve Fisher/RAL - 24/4/2003R-GMA8 Virtual RDBMS Creates impression that you have one RDBMS per VO –This makes it very easy to use –1 integrated system –1 query language Users like it But how will it fit in with GridServices?

WP3 Steve Fisher/RAL - 24/4/2003R-GMA9 Producers DataBaseProducer – Supports History Queries –Information not lost –Supports joins –Clean up strategy StreamProducer – Supports Continuous Queries –In memory data structure –Can define minimum retention period ResilientStreamProducer – Supports Continuous Queries –Like the StreamProducer but won’t lose data if system crashes –So slightly slower LatestProducer – Supports Latest Queries –Just holds the latest information for any “primaryish” key –Supports joins CanonicalProducer – Supports anything –Offers anything as relations

WP3 Steve Fisher/RAL - 24/4/2003R-GMA10 Archiver (Re-publisher) It is a combined Consumer-Producer You just have to tell it what to collect and it does so on your behalf Re-publishes to any kind of “Insertable” (i.e. not to the CanonicalProducer)

WP3 Steve Fisher/RAL - 24/4/2003R-GMA11 Canonical Producer Allows user defined code to be invoked to respond to SQL query Developed in collaboration with CrossGrid CP API User Code Canonical Producer Servlet Files CreateTable, Port, Protocol, Security, SQL Support, Multiple Query Support Security Insert Query Port Register

WP3 Steve Fisher/RAL - 24/4/2003R-GMA12 Functionality - mediator Queries posed against a virtual data base The Mediator must: –find the right Producers –combine information from them Hidden component – but vital to R-GMA Can now merge information from several producers The final mediator will take “any” SQL statement and do the right thing See Werner Nutt’s talk

WP3 Steve Fisher/RAL - 24/4/2003R-GMA13 Topologies Normally publish via SP Archivers instantiated with a Producer and a Predicate Must avoid cycles in the graph A SP A A LP A HP SP

WP3 Steve Fisher/RAL - 24/4/2003R-GMA14 Schema & Contributions CPULoad (Global Schema) CountrySiteFacilityLoadTimestamp UKRALCDF UKRALATLAS UKGLACDF UKGLAALICE CHCERNALICE CHCERNCDF CPULoad (Producer 3) CHCERNATLAS CHCERNCDF CPULoad (Producer 1) UKRALCDF UKRALATLAS CPULoad (Producer 2) UKGLACDF UKGLAALICE

WP3 Steve Fisher/RAL - 24/4/2003R-GMA15 Contributions are Views CPULoad (Producer 1) UKRALCDF UKRALATLAS CPULoad (Producer 2) UKGLACDF UKGLAALICE SELECT * FROM cpuLoad WHERE country = ’UK’ AND site = ’RAL’ SELECT * FROM cpuLoad WHERE country = ’UK’ AND site = ’GLA’

WP3 Steve Fisher/RAL - 24/4/2003R-GMA16 R-GMA Tools R-GMA CLI –Command Line Interface (similar to MySQL) –Supports single query and interactive modes –Can perform simple operations with Consumers, Producers and Archivers R-GMA Browser –JSP application dynamically generating web pages –Supports pre-defined and user-defined queries Pulse –R-GMA Java client-based GUI –Supports streaming and simple graphical displays

WP3 Steve Fisher/RAL - 24/4/2003R-GMA17 GIN and GOUT (Gadget IN and Gadget OUT) R-GMA Consumers LDAP InfoProvider GIN LDAP Server LDAP InfoProvider CircularBuffer Producer GIN Consumer (CE) Consumer (SE) Consumer (SiteInfo) RDBMS DataBase Producer GOUT ConsumerA PI Archiver CircularBuffer Producer R-GMA GLUE Schema

WP3 Steve Fisher/RAL - 24/4/2003R-GMA18 R-GMA – How? Currently based on servlet technology –Behind every API there is a Servlet –Multiple hand crafted APIs Java, C++, C, Python and Perl –Tomcat –Soft state registration –Uniform exception handling To ensure that useful messages and stack traces are preserved.

WP3 Steve Fisher/RAL - 24/4/2003R-GMA19 OGSIfication Have recently started the migration to web and grid services –Apache axis –WSDL generated APIs –Will provide a wrapper for backwards compatibility

WP3 Steve Fisher/RAL - 24/4/2003R-GMA20 All Grid Services OGSA Factories, GSH, GSR Registry includes HandleMapper SQL as Service Data Element Query Language Consumer Factory Producer Instance OGSIfied R-GMA Sensor Producer API Application Consumer API Schema Registry Consumer Instance Producer Factory

WP3 Steve Fisher/RAL - 24/4/2003R-GMA21 OGSIfication issues Consider XML as internal representation of service data elements –Depends on other developments Consider XQuery as service data elements query language –Depends on how XQuery develops X-GMA ?? –Will this be distinguishable from what is in GT3

WP3 Steve Fisher/RAL - 24/4/2003R-GMA22 Resilience - Registry Will have one logical registry and schema per VO Each logical registry will have multiple physical “copies” Each entry in registry has 3 possible states Transmit new records and deleted records and checksum after records deleted locally Self healing even supports new registry instances Consumer uses any instance Fail over mechanism not yet implemented Schema more tricky Producer1 Producer2 Registry2 Info mastered by Registry2 Copy of info from Registry1 Copy of info from Registry3 Registry3 Info mastered by Registry3 Copy of info from Registry1 Copy of info from Registry2 Registry1 Info mastered by Registry1 Copy of info from Registry2 Copy of info from Registry3 See poster

WP3 Steve Fisher/RAL - 24/4/2003R-GMA23 Soft-state Registration and the Registry Registry records existence of Producers and Consumers Registry holds last contact time and ‘expiry’ time Producers and Consumers periodically refresh their time stamps Producer and Consumer servlets avoid unnecessary traffic to Registry Scheduled removal of entries that have timed-out

WP3 Steve Fisher/RAL - 24/4/2003R-GMA24 Resilience Testing Taking 7 components –Schema –2 registry instances –Producer API –Consumer API –Producer Servlet with other APIs –Consumer Servlet with other APIs Consider each component in turn –Break the network and bring it back –Close the component down and bring it back –Crash the component and bring it back Will also consider real life scenarios

WP3 Steve Fisher/RAL - 24/4/2003R-GMA25 Performance By design: –Very flexible - to avoid bottlenecks –Powerful queries allow a single query to be made Performance and Optimisation –Will use NetLogger and profiling tools to identify possible bottlenecks Internally not high speed because of XML etc

WP3 Steve Fisher/RAL - 24/4/2003R-GMA26 Summary R-GMA is a combined Grid information and monitoring system Supports notion of Virtual Database Recently deployed in the EDG development testbed Now focusing on reliability, stability and performance Thanks to the EU and our national funding agencies for their support of this work

WP3 Steve Fisher/RAL - 24/4/2003R-GMA27 And finally GGF8… RGIS-RG –The two short sessions will be held: Session 1: Database use cases and best practices in the grid environment (outside the traditional data areas) »Using databases to store application metadata »Using databases to store monitoring information »Using databases as a grid registry »Creating grid registries for locating relational and XML databases Session 2: Data discovery in the grid environment –We will also discuss our milestones and future directions. (e.g should we include XML as well as Relational models.) –See A GMA BOF is planned for GGF8