GSAF Grid Storage Access Framework


Salvatore Scifo, INFN, Catania section
JRA1 All Hands meeting, Catania, 07-09.03.2007

GSAF: Partnership and Resources

The GSAF Project is carried out by INFN - Catania in cooperation with IR&T engineering s.r.l. (an SME based in Catania). The work takes place in the context of the TriGrid VL Project and the ADAT Project (“Archivio Digitale Antichi Testi”, Digital Archive of Ancient Texts). The TriGrid VL Project aims to port several industrial use cases to the Grid infrastructure. The ADAT Project aims to design and implement a Digital Archive for Cultural Heritage that adopts the Grid as a Content Management System (CMS).

Resources
- INFN: S. Scifo (s.scifo@ct.infn.it), Gilda Team
- IR&T engineering (http://www.irt-engineering.it): V. Milazzo (v.milazzo@irt-engineering.it), A. Magrì (a.magri@irt-engineering.it)

Catania, 07-09 March 2007

Web integration with the Grid

Designing and developing web applications on the Grid is not easy. There is no simple system that lets users manage dynamic content for generic applications (e.g. web portals, digital libraries, …).

Main objectives of a web application:
- Infrastructure side
  - Organize and handle large amounts of information
  - Share documents among several organizations
  - Security: manage access control policies
- Development side
  - Build and maintain dynamic web content
  - Build applications without specific technical knowledge
- User side
  - Manage groups and users
  - Manage digital resources

What the Grid offers

Storage virtualization
- A single, uniform interface for managing DATA, provided by the grid middleware
- A single, uniform interface for managing METADATA, provided by the grid middleware
- Handling of large files and large numbers of files, even in a geographically distributed environment
- Ubiquity: data can be accessed independently of its location

Security capabilities
- A centralized access control mechanism based on X.509 certificates and user roles, applied according to the policies of the Virtual Organizations that users belong to

Availability, scalability, fault tolerance

Classic Web Application

- The Data Presentation Layer consists of all the graphical interfaces that let the user interact with the application.
- The Data Business Layer collects all the software components that implement the behaviour of the application.
- The Data Access Layer is made up of the software components that allow the application to manage data (ASCII files, XML files, digital objects, metadata, SQL data).

Data Access Layer components usually interact with several types of data source through the appropriate APIs. Typical data sources are the file system (for data stored in files) and a Relational Database Management System (for data organized in SQL tables).
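The three layers described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class and function names (`DataAccessLayer`, `InMemoryDataAccess`, `DocumentService`, `render`) are hypothetical and do not come from the presentation, and an in-memory dictionary stands in for a real file system or RDBMS.

```python
from abc import ABC, abstractmethod


class DataAccessLayer(ABC):
    """Data Access Layer: the only layer that knows where data lives."""

    @abstractmethod
    def load(self, name):
        """Return the content stored under `name`."""

    @abstractmethod
    def save(self, name, content):
        """Store `content` under `name`."""


class InMemoryDataAccess(DataAccessLayer):
    """Hypothetical stand-in for a file system or RDBMS backend."""

    def __init__(self):
        self._records = {}

    def load(self, name):
        return self._records[name]

    def save(self, name, content):
        self._records[name] = content


class DocumentService:
    """Data Business Layer: application behaviour, unaware of the backend."""

    def __init__(self, dal):
        self._dal = dal

    def publish(self, name, text):
        # Business rule (illustrative): normalise whitespace before storing.
        self._dal.save(name, text.strip())

    def summary(self, name, width=20):
        return self._dal.load(name)[:width]


def render(service, name):
    """Data Presentation Layer: formats the result for the user."""
    return f"[{name}] {service.summary(name)}"
```

Because `DocumentService` depends only on the abstract interface, a different backend can be passed in without touching the business or presentation code.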

Grid Web Application

Inside the Grid environment:
- files are stored in a Storage Element (SE);
- files can be replicated on several SEs for ubiquity, security and sharing;
- the relationship between the locations of files and replicas and their identifiers is kept in a dedicated File Catalogue Service;
- each file can be associated with descriptive metadata, organized by a dedicated Metadata Catalogue Service.

Developing applications for the Grid simply means replacing the traditional Data Access Layer with an appropriate interface that lets business components manage data stored in the Grid Data Management System (DMS) and lets presentation objects search and retrieve data from the DMS.
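The relationship between Storage Elements, the File Catalogue and the Metadata Catalogue can be pictured with a toy in-memory model. This is a sketch, not gLite code: `GridStorageModel`, its methods, the SE names and the sample logical file name are all hypothetical and only mirror the roles listed above.

```python
class GridStorageModel:
    """Toy in-memory model of Storage Elements plus the two catalogues."""

    def __init__(self, se_names):
        # Each Storage Element maps a logical file name to file content.
        self.storage_elements = {name: {} for name in se_names}
        # File Catalogue: logical file name -> SEs that hold a replica.
        self.file_catalogue = {}
        # Metadata Catalogue: logical file name -> descriptive attributes.
        self.metadata_catalogue = {}
        # SEs that are currently unreachable (used to simulate failures).
        self.offline = set()

    def store(self, lfn, data, se, **metadata):
        """Store a new file on one SE and register it in both catalogues."""
        self.storage_elements[se][lfn] = data
        self.file_catalogue[lfn] = [se]
        self.metadata_catalogue[lfn] = dict(metadata)

    def replicate(self, lfn, target_se):
        """Copy the file to another SE and record the new replica."""
        source_se = self.file_catalogue[lfn][0]
        self.storage_elements[target_se][lfn] = self.storage_elements[source_se][lfn]
        self.file_catalogue[lfn].append(target_se)

    def read(self, lfn):
        """Read from the first reachable replica, wherever it is located."""
        for se in self.file_catalogue[lfn]:
            if se not in self.offline:
                return self.storage_elements[se][lfn]
        raise IOError(f"no reachable replica for {lfn}")
```

The caller of `read` only ever uses the logical name, which is the sense in which replicas give ubiquity: the data stays readable when one SE is unreachable, as long as another replica is registered.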

GSAF: building blocks

GSAF stands for Grid Storage Access Framework. It is a development toolkit designed to help developers build applications that use the Grid Storage Services to manage files and data. The most important requirement of GSAF is to hide the complexity and fragmentation of the several APIs that the gLite 3.0 middleware provides for interfacing with the three main Grid Data Services (storage, file catalogue and metadata catalogue).

GSAF: goals

Implement the main framework capabilities:
- Managing metadata schemas for data collections
- Managing the groups and users that access metadata
- Uploading a file to the SE, registering its Logical File Name (LFN) in the LFC (LCG File Catalog) and saving its metadata in AMGA (the metadata catalogue) in a coherent and atomic way
- Browsing the Metadata Catalogue to download files and/or access attribute schemas and values
- Searching files by metadata to download files and/or access attribute schemas and values
- Deleting a file atomically from the SE, LFC and AMGA

Develop a web application as a demonstrator:
- The application demonstrates the framework's behaviour by allowing Grid users to manage files and metadata remotely through a web user interface.
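One common way to obtain the "coherent and atomic" upload described above is to undo the steps that already succeeded when a later step fails. The sketch below illustrates that idea only; it is not the GSAF implementation, whose internals the slides do not show. `InMemoryService`, `StepFailed`, `atomic_upload` and `search` are hypothetical names, and each service is simulated with a dictionary.

```python
class StepFailed(Exception):
    """Raised by a simulated service when it rejects an operation."""


class InMemoryService:
    """Hypothetical stand-in for one grid service (SE, LFC or AMGA)."""

    def __init__(self, name, fail_on_put=False):
        self.name = name
        self.entries = {}
        self.fail_on_put = fail_on_put  # lets a test simulate an outage

    def put(self, key, value):
        if self.fail_on_put:
            raise StepFailed(f"{self.name} rejected {key}")
        self.entries[key] = value

    def remove(self, key):
        self.entries.pop(key, None)


def atomic_upload(lfn, data, metadata, se, lfc, amga):
    """Write to all three services, or leave none of them modified."""
    steps = [
        (se, data),                    # 1. file content on the SE
        (lfc, f"{se.name}:{lfn}"),     # 2. replica location in the catalogue
        (amga, metadata),              # 3. descriptive metadata
    ]
    done = []
    try:
        for service, value in steps:
            service.put(lfn, value)
            done.append(service)
    except StepFailed:
        # Compensate: undo completed steps in reverse order.
        for service in reversed(done):
            service.remove(lfn)
        return False
    return True


def search(amga, **criteria):
    """Return the LFNs whose metadata match every given attribute."""
    return sorted(
        lfn for lfn, attrs in amga.entries.items()
        if all(attrs.get(key) == value for key, value in criteria.items())
    )
```

An atomic delete could follow the same pattern with the steps inverted, although a real implementation would also need a way to restore an entry if a later removal fails.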

High Level Architecture

File Upload

File Download

File Delete

File Search

File Browse

Conclusions

Sharing information that belongs to different organizations in a secure, scalable and efficient way is a frequent and current need in the ICT context.

The Grid offers:
- Reliable organization of resources
- Distributed storage virtualization
- Uniform data access
- Security and data preservation

GSAF provides:
- A useful API for developing storage-based applications
- A useful and simple web interface for accessing Data Management Services remotely

References

- GSAF wiki pages: https://grid.ct.infn.it/twiki/bin/view/TRIGRID/GSAF
- AMGA Web Interface wiki pages: https://grid.ct.infn.it/twiki/bin/view/TRIGRID/AMGAWI
- AMGA Service and Java API: http://project-arda-dev.web.cern.ch/project-arda-dev/metadata/index.html
- GFAL Java API: http://grid-deployment.web.cern.ch/grid-deployment/gis/GFAL/gfal.3.html and https://grid.ct.infn.it/twiki/bin/view/GILDA/APIGFAL
- LFC Java API: http://wiki.egee-see.org/index.php/SEE-GRID_File_Management_Java_API