EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks A GRID based platform to host multiple repositories.

Slides:



Advertisements
Similar presentations
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Advertisements

Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
ACAT 2008 Erice, Sicily WebDat: Bridging the Gap between Unstructured and Structured Data Jerzy M. Nogiec, Kelley Trombly-Freytag, Ruben Carcagno Fermilab,
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE-III INFSO-RI Enabling Grids for E-sciencE The Medical Data Manager : the components Johan Montagnat, Romain Texier, Tristan.
Microsoft SharePoint Server 2010 for the Microsoft ASP.NET Developer Yaroslav Pentsarskyy
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite IPv6 compliance project tests Further.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
UNIONE EUROPEA Jorge Sevilla Cedillo Istituto Nazionale di Fisica Nucleare – Catania 2.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks R-GMA Now With Added Authorization Steve.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Using gLite API Vladimir Dimitrov IPP-BAS “gLite middleware Application Developers.
EGEE-Forum – May 11, 2007 Enabling Grids for E-sciencE EGEE and gLite are registered trademarks A gateway platform for Grid Nicolas.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks NA3 Resources Robin McConnell.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios for Grid Services E. Imamagic, SRCE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks AMGA PHP API Claudio Cherubino INFN - Catania.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE EGEE training community.
EGEE-III INFSO-RI Enabling Grids for E-sciencE I. Blanquer (1), V. Hernandez (1), L. Martí (2), D. Quilis (1), J. Salavert (1) (1)
EGEE-II INFSO-RI Enabling Grids for E-sciencE Hands on session: the AMGA Metadata Catalogue.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE User Forum, Manchester, 10 May ‘07 Nicola Venuti
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Web Portal for Chemists M. Sterzel,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Medical Data Manager 1 Dicom retrieval : overview of the DPM One command line to retrieve a file:
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Data management in LCG and EGEE David Smith.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Evaluating Metadata access strategies with.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Storage Accounting for Grid Environments Fabio Scibilia INFN - Catania.
INFSO-RI Enabling Grids for E-sciencE Information System Valeria Ardizzone INFN EGEE NA4 Generic Applications Meeting Catania,
INFSO-RI Enabling Grids for E-sciencE VOMS & MyProxy interaction Emidio Giorgio INFN NA4 Generic Applications Meeting 10 January.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra GSAF Grid Storage Access Framework Salvatore Scifo INFN of Catania EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Interfacing gLite services with the Kepler.
INFSO-RI Enabling Grids for E-sciencE Grid based telemedicine application for GATE Monte Carlo dosimetric studies using HOPE (Hospital Platform.
INFSO-RI Enabling Grids for E-sciencE gLite Information System: R-GMA Tony Calanducci INFN Catania gLite tutorial at the EGEE User.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks OpenSAML extension library and API to support.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data management in EGEE.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra gLite 1.4 Data Management System Salvatore Scifo, Riccardo Bruno Test.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
Miguel Ángel Saúl Soto INFN - Sezione di Catania Supervisor: Antonio Calanducci
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Integration of China Relics and gLite with gLibrary You MENG
Enabling Grids for E-sciencE EGEE-II INFSO-RI Status of SRB/SRM interface development Fu-Ming Tsai Academia Sinica Grid Computing.
UNIONE EUROPEA Jorge Sevilla Cedillo Istituto Nazionale di Fisica Nucleare – Catania 2.
The eCSG Mobile App Mario Torrisi INFN – Division of Catania 24 June 2013 Webinar on the eCSG 1.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion.
Web and mobile access to digital repositories Mario Torrisi National Institute of Nuclear Physics – Division of
Portlet Development Konrad Rokicki (SAIC) Manav Kher (SemanticBits) Joshua Phillips (SemanticBits) Arch/VCDE F2F November 28, 2008.
Enabling Grids for E-sciencE EGEE-II INFSO-RI The Development of SRM interface for SRB Fu-Ming Tsai Academia Sinica Grid Computing.
INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann,
AMGA Web Interface Salvatore Scifo INFN sez. Catania
GSAF Grid Storage Access Framework
GSAF Grid Storage Access Framework
AMGA Web Interface Vincenzo Milazzo
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks A GRID based platform to host multiple repositories for digital content Antonio Calanducci 1 J.M. González 3, R. Ramos 2, M. Rubio 2, D.Tcaci 3 1 INFN Catania, 2 CETA-CIEMAT, 3 MAAT-G Knowledge 3rd EGEE User Forum Febrary 2008 – Clermont-Ferrand (France)

2 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Introduction Need to offer a GRID based platform to host arbitrary repositories A digital repository is a set of annotated digitalized data offered to users in a structured manner. Both digitalized data and annotations can vary greatly from one rep to another but the following commonalties are acknoledged: − There is a basic informational unit of digitalized data (a mammogram, a page of an ancient manuscript, a 3D model..) − There is metadata around each unit of digitalized data (patient info, diagnoses, translation, historical context, physical properties …) − Specific algorithms process the data (search microcalcifications, automatic translation…) − Users browse, search and update the repository, launch algorithms (GRID WMS) − Data is stored in a federated way: each institution owns and manages its content − Metadata to DB, Digitalized data to archive (GRID SE) 2

3 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Goals of gLibrary/DRI To host multiple repositories of arbitrary structure On a GRID infrastructure (security, federation, …) Reduce the “cost-to-deploy”, reach new communities Open architecture Easy to use platform, web based interface Collaboration between INFN and CETA-CIEMAT Builds on INFN gLibrary 3

4 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI INFN gLibrary Created by GILDA team at INFN Catania Secure, robust, easy to use interface to handle digital assets stored in GRID SE Interface to browse entries and finding files in SE –“à la iTunes” browsing allows mouse-clicks searches Built on top of gLite GRID services: any SRM SE, LFC, AMGA, VOMS authorization Authentication/Authorization − Via applet, creating a proxy cert on the user’s PC − Proxy used to interact directly with GRID elements (LFC, SE, AMGA) Files transferred directly from SE to applet and viceversa. 4

5 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary screenshots 5

6 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI Extends gLibrary by: − Making it multirepository − No predefined repository content structure: each repository describes itself. − Decoupling navigation + management from repository specifics − DRI: Digital Repositories Infrastructure A repository must provide: − A description of its navigational structures (trees, filters) and a viewer − A description of its data model − An storage engine (for data model persistence) − The DRI API specification describes HOW this is provided A repository provider can − Make its own implementation of the specification − Use (or extend) the default one provided 6

7 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI web interface 7

8 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI DICOM viewer 8

9 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI API specification A repository has to provide: Data Model: –XML format description of the repository’s data –Relational data model supported –Indication of which part of the data model is saved on the federated DB and which on the Storage System Storage Module: –it takes care of data persistency –Load() and Saves() method have to be provided for loading and saving instances of the data model User Interface Module: –definition of the navigational trees and filters –viewer for the specific repository 9

10 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI API specification Data Model: The repository provider describes in XML format the data of their repository. It supports relational data models, so a parent node with dependent entries can be specified. The data model also defines which parts of the data are stored in the federated database or in the storage elements. gLibrary/DRI has a defined specification for the XML data model that the provider must abide by. Storage Module: This part takes care of the data persistence. The provider gives a set of classes for loading and saving instances of the data model. The save function would inspect a given instance to decide which part of it will be stored in an SE or in the federated database. The load function will also take care of this issue. User Interface Module: In this module the provider will specify the way of building the navigation trees and the filters that the web portal presents for the quick location of any element. In this module, the repository viewer is also contained, so the viewer function will receive an instance of the data model and the data will be properly represented. 10

11 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI API specification Contract between gLibrary/DRI platform and specific repository implementations Each application must provide three Java modules implementing the following interfaces: − DRIUIInterface for describing trees, filters and viewers − DRIStorageInterface for storing and retrieving data − DRINodeInterface for defining repository data model gLibrary/DRI engine orchestrates API calls to different interface implentations 11

12 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI UI API extract P public interface DRIUIInterface { public Vector getRepositoryTrees (String reposititoryName); public TreeHierarchy getTreeHierarchy (String treeName); public Vector getFilterNameInstances (); public Vector getFilterEntries (String filterName); public void loadViewer (String viewerClass); } public class MyRepositoryUI implements DRIUIInterface { public Vector getRepositoryTrees (String repositoryName) { // access repository config file/db/etc to get tree data … return new Vector( new Tree(“By author”), new Tree(“By date”)); } … } 12

13 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI Engine Orchestration Registered repositories MGUI.getRepositoryTrees() : what are your navigation trees? MGUI.getFilterNameInstances() what are your filters? MGUI.LoadViewer() : return an applet with the viewer application to display and manipulate the selected repository item MGUI.getFilterEntries() what are the possible values for the selected filter? 13

14 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI Storage API public interface DRIStorageInterface { public DRIGenericNode Load(String Id); public void Remove (String Id); public void CreateNew (DRIGenericNode Node); public void Save (DRIGenericNode Node); } public class MyRepositoryStorage implements DRIStorageInterface { public MyRepositoryNode Load (String id) { // access db, GRID SE, etc.. Assemble one instance of data model … MyRepositoryNode node = new MyRepositoryNode (db, data, …); return node; } … } 14

15 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI default implementation We provide a default implementation for UI and Storage APIs public class DRIUIModule implements DRIUIInteface public class DRIStorageModule implements DRIStorageInterface UI default implementation: − Loads repository trees from AMGA − Loads filter definitions from AMGA − Field display definitions from AMGA Storage − Reads repository data model from XML file − Stores/Loads data model in AMGA and marked items in SEs 15

16 PatientID int PatientName String Varchar(80) PatientAge Int studies Entity StorageStudy 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI XML Data model def example 16 StorageID int Diagnose String Varchar(255) Mammogram LFN Varchar(255) DRIStorageModule stores specially marked fields in a GRID Storage Element e register them in the File Catalog DRIStorageModule stores regular fields in AMGA public class MyRepStorageModule inherits DRIStorageModule {} public class MyRepNode inherits DRIGenericNode DRI Storage module reads data model from XML files:

17 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Using UI default implementation public class MyRepUIModule inherits DRIUIModule {} (not implements DRIUIInterface)‏ AMGA dump Collection: /ceta/mgplus/config/trees Content: /ceta/mgplus/config/trees/alphabetical (Collection) > ls Query> getattr 0 tag parentid path filter fields >> FromAtoD >> FromEtoJ >> FromKtoO >> FromPtoU >> FromVtoZ /ceta/mgplus/config/trees/pathologies (Collection) > ls >> 0 Query>getattr 0 tag parentid path filter fields PathologyId >> Benign >> TumorMorphology >> Spread >> Microcalcifications >> study >> ‘/ceta/mgplus/data/patient/study:PathologyId=0 and /ceta/mgplus/data/patient:MGPlusPatientId=/ceta/mgplus/data/patient/study:MGPlusStudyId’ /ceta/mgplus/data/patient:MGPlusPatientId,PatientId,PatientName,Gender,AgeAtMenarche,AgeAtMenopause Note the EMPY implementation Where MGPLUS trees are stored Alphabetical patient tree definition Contents of the alphabetical tree Pathologies tree definition Contents of pathologies tree Filter definition for Microcalcification branch 17

18 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Mammography repository example Goals: a GRID based repository for mammograms, patient history and collaborative diagnoses Uses UI and Storage default implementations Provides its own viewer which accepts a MGPlusNode: − Based on Open Source TUDOR DICOM viewer − Adapted it to comply with the DRI API − Converted it into an applet − Extended functionality (display specific patient data, annotations directly on the mammograms, etc.) − Save() method retrieve directly data files from SEs using direct GridFTP transfers 18

19 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Repository specific viewer 19

20 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI gLibrary/DRI architecture 20

21 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Technologies Web 2.0 Web interface (AJAX) PHP 5 for the front-end engine Java Servlets for the back-end DRI engine Usage of Java-PHP bridge Applets − For user authentication with their VO certificate − For viewers implementation Java Introspection XML gLite Java APIs: AMGA, LFC wrappers, JGlobus GridFTPclient 21

22 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Where we are Engine deployed and working, API and default implementation working MGPlus repository implemented on DRI Current work: − Interface to launch and manage jobs on Grid WMS − Generic uploader 22

23 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Conclusions and future work Effectively reduced cost by APIs and default implementation. New repository providers must: − Provide empty implementations of UI and Storage (very easy) − Describe their data model in XML (very easy) − Adapt/make viewer (difficult) Provides: − Generic multirepository platform, making GRID facilities easily accessible − attract new communities, ease of hosting Future work: − Having a SOA and JSR170 compliant − Generic viewer and tree management interface (almost ZERO cost for rep providers)‏ − EELA-II Official Digital Library product 23

24 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Contacts Mailing list: Authors: Prototypes: – (INFN gLibrary platform) – (gLibrary/DRI platform) 24

25 3rd EGEE User Forum, 13 Feb 08, Clermont-Ferrand Enabling Grids for E-sciencE EGEE-II INFSO-RI Questions? 25