GOCDB new model for EGEE-III and beyond Gilles Mathieu – STFC GAG meeting, Abingdon 4 December 2008.

Slides:



Advertisements
Similar presentations
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
Advertisements

GOC resilience John Gordon, STFC GridPP 22 microtalk.
Information Resources Management January 23, 2001.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
1 Introduction The Database Environment. 2 Web Links Google General Database Search Database News Access Forums Google Database Books O’Reilly Books Oracle.
Chapter 1: The Database Environment
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Operations Dashboard Workplan Cyril.
Chapter 1 1 © Prentice Hall, 2002 Database Design Dr. Bijoy Bordoloi Introduction to Database Processing.
Chapter 1 1 © Prentice Hall, 2002 Database Design Dr. Bijoy Bordoloi Introduction to Database Processing.
MAHI Research Database Project Status Report August 9, 2001.
Systems Design. Systems Design Skills People skill (25%) - Listening, understanding others, understanding between two lines, conflict resolution, handling.
OASIS V2+ Next Generation Open Access Server CSD 2006 / Team 12.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
1 The Problem Do you have: A legacy ABL system with millions of Lines of ABL Code? Years and years of modifications to your ABL code? System documentation.
GOCDB A repository for a worldwide grid infrastructure G. Mathieu, A. Richards, J. Gordon, C. Del Cano Novales, P. Colclough, M. Viljoen CHEP09, Prague,
Module Title? DBMS Introduction to Database Management System.
CSC271 Database Systems Lecture # 4.
ITEC 3220M Using and Designing Database Systems
Architecture for a Database System
Introduction to Database Management. 1-2 Outline  Database characteristics  DBMS features  Architectures  Organizational roles.
CS480 Computer Science Seminar Introduction to Microsoft Solutions Framework (MSF)
© 2007 by Prentice Hall 1 Introduction to databases.
File Systems and Databases Lecture 1. Files and Databases File: A collection of records or documents dealing with one organization, person, area or subject.
Category #2 Bundling and Unbundling Workgroup September 24, 2007.
Database Management System Prepared by Dr. Ahmed El-Ragal Reviewed & Presented By Mr. Mahmoud Rafeek Alfarra College Of Science & Technology- Khan younis.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
INFSO-RI Enabling Grids for E-sciencE 1 Downtime Process Author : Osman AIDEL Hélène Cordier.
1 CS 430 Database Theory Winter 2005 Lecture 2: General Concepts.
Aspect-Oriented Requirements Engineering David Schaefer, Joao Araujo, Isabel Brito, Awais Rashid, Claudia Mesquita.
THE DATABASE ENVIRONMENT Definitions: Data, Information, Database, MetadataData, Information File Processing Systems The Database Approach Components of.
GOCDB evolution plans GAG, 22/09/2008 Gilles Mathieu.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
DAY 15: ACCESS CHAPTER 1 Rahul Kavi October 6,
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
1 Database Systems Instructor: Nasir Minhas Assistant Professor UIIT PMAS-AAUR
Copyright © The OWASP Foundation Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 4 Slide 1 Slide 1 Use Case Packets.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE GStat Work Plans for EGEE-III Joanna Huang, ASGC/OPS EGEE SA1 F2F Meetings, Abingdon.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
SAM Database and relation with GridView Piotr Nyczyk SAM Review CERN, 2007.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
CERN 21 January 2005Piotr Nyczyk, CERN1 R-GMA Basics and key concepts Monitoring framework for computing Grids – developed by EGEE-JRA1-UK, currently used.
Hoi Le. Why database? Spreadsheet is not good to: Store very large information Efficiently update data Use in multi-user mode Hoi Le2.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid is a Bazaar of Resource Providers and.
INFSO-RI Enabling Grids for E-sciencE GOCDB Requirements John Gordon, STFC.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
Managing Data Resources File Organization and databases for business information systems.
Databases and DBMSs Todd S. Bacastow January 2005.
Gridpp37 – 31/08/2016 George Ryall David Meredith
Chapter (12) – Old Version
Database Development (8 May 2017).
An Introduction to database system
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
GOCDB current status and plans
Maite Barroso, SA1 activity leader CERN 27th January 2009
Data, Databases, and DBMSs
Teaching slides Chapter 8.
The Globus Toolkit™: Information Services
Lecture 1 File Systems and Databases.
Chapter 1: The Database Environment
The Database Environment
(team representative name here)
The Database Environment
Practical Database Design and Tuning Objectives
Presentation transcript:

GOCDB new model for EGEE-III and beyond Gilles Mathieu – STFC GAG meeting, Abingdon 4 December 2008

Current limitations GOCDB3 schema is fine as long as: –we don’t modify it too much –we don’t distribute GOCDB, even partially –regions don’t have specific needs But this will happen… and we may face: –an increased complexity of the relational model –scalability problems –Some regions wanting more, and “leaving the ship” to implement their own solution –interoperability problems

Current limitations These limitations are mainly due to: –The high complexity of the information –The unclear definition of future needs –The low flexibility of relational DB models We must then propose something that: –can cope with any level of complexity –is flexible enough to be easily modified –is customisable to answer regional needs

proposed evolutions Keep a central service, not necessarily a central DB –There is a need for a central access point, but: –the fact that regional DB are distributed or not must not be an issue Build a sustainable architecture that allows regionalisation but doesn’t force it –Not all regions are at the same level Propose an implementation where nothing exists, work with existing solutions otherwise –Some regions have their own solution and don’t want to be forced to use another one

“Imagine no relations, It’s easy if you try…” Proposed DB schema - ID - Sitename - SITES -ID -hostname -IP_addr NODES - ID - path - start_date - end_date DOWNTIMES - ID - name - desc SERVICES - path -SiteID -NodeID -ServiceID PATHS -Type - Name -Database -Grid objectTypes -Type -objectID objects - parent_type -Child_type -Allowed objectLinkTypes -ID -Link_type -Parent_objectID -Child_objectID objectLinks -GridID -ObjectID - Sitename - TABLE1 -GridID -ObjectID - hostname - IP_addr TABLE2 -GridID -ObjectID - serv_name - desc TABLE3 -GridID -ObjectID - start_date - end_date TABLE4 -GridID -ObjectID - Sitename - TABLE1.2 -GridID -ObjectID - hostname - IP_addr TABLE2.2 -GridID -ObjectID - serv_name - desc TABLE3.2 -GridID -ObjectID - start_date - end_date TABLE4.2 -GridID -ObjectID - Sitename - TABLE1.3 -GridID -ObjectID - hostname - IP_addr TABLE2.3 -GridID -ObjectID - serv_name - desc TABLE3.3 -GridID -ObjectID - start_date - end_date TABLE4.3 Current relational modelProposed object model Grid 1Grid 2Grid 3 Core tables (relationships) - Physical Data Tables - Hard coded relationships and constraints Data tables

Proposed architecture Logically separated Grids. We don’t care where they are physically. Central/local access No data duplication Grid n Grid 2 Grid 1 Main core dictionary Local access Central access Core object tables Core GOCDB tables Local GOCDB tables Common schema Custom schema Local Dictionary Shared use Local use A logical grid entity (“regional GOCDB”) The whole system Existing local DB Local entity used directly Use of an adapter if a different local solution exists

Region n Region 1 RAL Custom tables Regionalisation scenario GOCDB Region 1 Region 2 Region … Region n GOC Central portal Region 1 Region n (image) Region 1 local portal Region n local DB Region n local portal Adapter Query interface 3 rd party tools WS interface

Considerations Administration of local systems –responsibility (and maintenance) comes with hosting –GOCDB will: Define the structure of the information to publish Propose an implemented solution Provide tools, APIs, adapters, documentation and support Compatibility with existing tools –No change required once we have moved to WS Hardware/software –Primary implementation of the model on Oracle –Other solutions possible, but that needs investigations

Risks and limitations Multi-server architecture –What if one of the regional DBs is down? –What if some regions change the model? Developers not getting the idea –The model cannot be taken without good understanding

Workplan and timelines By January 09 –Start prototyping new model –2 regional use-cases definitions (NGS and Grid-Ireland) By May 09 –sustainable prototype implementation of the new model –regional use-cases working in parallel with a central DB. –More use-cases study –External adapters prototyped By October 09 –New model operated and in production –More distributed instances, depending on regions choice and/or readiness

For more details… GOCDB regionalisation – New architecture and model description – “A pseudo object database model and its applications on a highly complex distributed architecture” –IARA/IEEE Conference on Advances in Databases (DB 2009) March 1-6, Gosier, Guadeloupe/France