FESR www.trigrid.it Trinacria Grid Virtual Laboratory University of Coimbra AMGA Use cases: gLibrary & gMOD Tony Calanducci INFN Catania, NA3 & NA4 First.

Slides:



Advertisements
Similar presentations
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE AMGA Metadata Access on the Grid Mike Mineter.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America The AMGA metadata catalog with use cases.
SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
E-science grid facility for Europe and Latin America A Data Access Policy based on VOMS attributes in the Secure Storage Service Diego Scardaci.
The AMGA metadata catalog Riccardo Bruno - INFN Madrid, 07-11/05/2007.
Asterios Katsifodimos Saturday, May 23, 2015 High Performance Computing systems Lab University of Cyprus The AMGA metadata catalog – An Overview Slides.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America The AMGA metadata catalog with use cases.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America AMGA metadata catalog with use cases Tony.
Greenstone Digital Library Usage and Implementation By: Paul Raymond A. Afroilan Network Applications Team Preginet, ASTI-DOST.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra AMGA Use Cases Tony Calanducci NA4 Generic Applications Meeting January.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America The AMGA metadata catalog with use cases.
INFSO-RI Enabling Grids for E-sciencE The GENIUS Grid portal Tony Calanducci INFN Catania - Italy First Latin American Workshop.
IST E-infrastructure shared between Europe and Latin America The AMGA metadata catalog with use cases Domenico Vicinanza, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE The Medical Data Manager : the components Johan Montagnat, Romain Texier, Tristan.
INFSO-RI Enabling Grids for E-sciencE gLibrary: a Multimedia Contents Management System on the grid Tony Calanducci INFN Catania,
FESR Trinacria Grid Virtual Laboratory The AMGA metadata catalog with use cases Riccardo Bruno - INFN gLite Tutorial Istanbul, July.
EGEE-II INFSO-RI Enabling Grids for E-sciencE AMGA Metadata Access on the Grid Mike Mineter.
James Akrigg Microsoft Ltd Integrating InfoPath Forms Into Workflow Solutions And Business Processes.
EGRIS-1 E-infrastructure shared between Europe and Latin America AMGA Metadata Services: examples and usage scenarios Tony Calanducci INFN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks AMGA PHP API Claudio Cherubino INFN - Catania.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Hands on session: the AMGA Metadata Catalogue.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Recent improvements in HLRmon, an accounting portal suitable for national Grids Enrico Fattibene (speaker), Andrea Cristofori, Luciano Gaido, Paolo Veronesi.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Data management in LCG and EGEE David Smith.
INFSO-RI Enabling Grids for E-sciencE VOMS & MyProxy interaction Emidio Giorgio INFN NA4 Generic Applications Meeting 10 January.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks A GRID based platform to host multiple repositories.
1 Egrid portal Stefano Cozzini and Angelo Leto. 2 Egrid portal Based on P-GRADE Portal 2.3 –LCG-2 middleware support: broker, CEs, SEs, BDII –MyProxy.
INFSO-RI Enabling Grids for E-sciencE Summary of the data access session EGEE User Forum, March 3 rd, 2006 Johan Montagnat Birger.
FP6−2004−Infrastructures−6-SSA Enabling Grids for E-sciencE The AMGA Metadata Catalog Introduction and hands-on exercises Nuno Santos.
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra GSAF Grid Storage Access Framework Salvatore Scifo INFN of Catania EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE AMGA Metadata Service Gergely Sipos.
EGEE-II INFSO-RI Enabling Grids for E-sciencE More on gLite: 2 services you have not seen! Mike Mineter.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America The GILDA t-Infrastructure Roberto Barbera.
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
FESR Consorzio COMETA - Progetto PI2S2 The AMGA Metadata Catalog with use cases Salvatore Scifo, Tony Calanducci INFN Catania Grid.
First South Africa Grid Training June 2008, Catania (Italy) GILDA t-Infrastructure Valeria Ardizzone INFN Catania.
Miguel Ángel Saúl Soto INFN - Sezione di Catania Supervisor: Antonio Calanducci
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
FESR Consorzio COMETA - Progetto PI2S2 AMGA Official Metadata Service for EGEE Salvatore Scifo – Consorzio Cometa - Catania, ITALY.
FESR Consorzio COMETA - Progetto PI2S2 AMGA Official Metadata Service for EGEE Salvatore Scifo – Consorzio Cometa - Catania, ITALY.
EGEE-II INFSO-RI Enabling Grids for E-sciencE AMGA Metadata Service Mike Mineter.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Web and mobile access to digital repositories Mario Torrisi National Institute of Nuclear Physics – Division of
FESR Trinacria Grid Virtual Laboratory Practical using WMProxy advanced job submission Emidio Giorgio INFN Catania.
Grid based telemedicine application
NA4/medical imaging. Medical Data Manager Installation
AMGA - Official Metadata Service for EGEE
Medical Data Manager use case: 3D medical images analysis workflow.
GILDA Project Valeria Ardizzone INFN Catania Italy
Grid2Win: Porting of gLite middleware to Windows XP platform
AMGA Web Interface Salvatore Scifo INFN sez. Catania
Grid2Win: Porting of gLite middleware to Windows XP platform
Short update on the latest gLite status
GSAF Grid Storage Access Framework
Grid2Win: Porting of gLite middleware to Windows XP platform
GSAF Grid Storage Access Framework
Introduction to DSpace
AMGA Web Interface Vincenzo Milazzo
The AMGA metadata catalog
EGEE Middleware: gLite Information Systems (IS)
Presentation transcript:

FESR Trinacria Grid Virtual Laboratory University of Coimbra AMGA Use cases: gLibrary & gMOD Tony Calanducci INFN Catania, NA3 & NA4 First Trigrid Tutorial 15 March 2006, Catania

First Grid Tutorial, Catania, Outline gLibrary: a Multimedia Contents Management System on the grid –Motivations –gLibrary features –Implementation details –Security features –Future planned improvements –Conclusions gMOD: grid Movie on Demand

First Grid Tutorial, Catania, gLibrary Motivations Huge amounts of data can be saved on SEs (did we forget about the existence of Data Grids?) But how can we easily find later a file that we need? –(if you have good memory, its GUID could be a solution ) –File Catalogues just let us to arrange files in folders and subfolders, no way to query on their contents –Metadata Catalogues are a possible solution, but not always “affordable” especially for non expert users (powerful but complex to use) Our solution: a higher level application built on top of many gLite grid services: a Metadata Catalogue + File Catalogues + Storage Elements  gLibrary Requirements: easy to use, fast, secure, extensible

First Grid Tutorial, Catania, gLibrary goals Attempt to create a Multimedia Management System on the Grid –Examples of Multimedia Contents handled by gLibrary:  Images  Movies  Audio Files  Office Documents (Powerpoint, Word, Excel, OpenOffice)  s, PDFs, HTMLs  Customized versions of well-know document type (ex. EGEE PPTs)  …. Keep track and organize in a uniform way all the additional details (metadata) of files saved in Storage Elements and registered in File Catalogues Provide users an easy way to locate and retrieve files based on their contents

First Grid Tutorial, Catania, Usage scenarios Example 1: –Locate all theoretical (PPTType) PowerPoint (Type) presentations about FireMan (Keywords) given in 2005 (Date) by Uncle Sam (Speaker); –Find all the movies (Type) in which Julia Roberts (Cast) performed together with Hugh Grant (Cast) produced in USA (Country) in 2004 (ReleaseDate); or all the acoustic (Genre) mp3 (Format) audio files (Type) of Alanis Morissette (Singer) that last more than 3 minutes (Runtime). Example 2: –A doctor is looking for brain (keyword) DICOM (Type) images of male (Gender) patients older than 65 (Age). Example 3: –A job can behave as a storage crawler: it scans pre-existing files in Storage Elements to extract relevant metadata that will be published on gLibrary for further data mining.

First Grid Tutorial, Catania, gLibrary prototype implementation Files are saved on SEs and registered into file catalogues (LFC and/or FiReMan) The AMGA Metadata Catalogue is used to archive and organize metadata and to answer users’ queries. gLibrary is built using the following AMGA collections: –/gLibrary contains generic metadata for each entry –/gLAudio, /gLImage, /gLVideo, /gLPPT, /EGEEPPT, /gLDoc, … are examples of collections of “additional features” (shown later) –/gLTypes  keeps the associations between document types and the names of the collection that contains the “additional features”  is used by gLibrary to find out where it has to look when new document types are added into the system (extensibility) –/gLKeys is used to store Decryption Keys

First Grid Tutorial, Catania, Example of entries Collection/gLibrary Entry Names Attributes FileNamePathNameTypeSubmitter 4ffaffc8-26e b460-3d5bf08081a4 DedicatoAte.mp3/grid/gilda/calanducciAudioTony Calanducci 00454dca-a269-4b93-8a45-c4012af05600 ardizzonelarocca_is_ ppt.gpg/grid/gilda/calanducci/ EGEE EGEEDOCTony Calanducci /gLibrary (continuum) Attributes SubmissionDateEncryptionDescriptionKeywordsCreationDate :00:00falseCanzone delle vibrazioni che ha ricevuto un enorme successo tra i teenagers nel 2003 Vibrazioni :00: :44:22truegLite Information SystemR-GMA, RGMA, BDII, IS :40

First Grid Tutorial, Catania, Example of gLibrary collections Collection/gLTypes Entry names Attributes Path (refers to a collection) Audio /gLAudio Image /gLImage Video /gLVideo Documents /gLDOC PowerPoint /gLPPT EGEEDOC /EGEEPPT Collection/EGEEPPT Entry names Attributes TitleRuntimeAuthorTypeDateEventSpeakerTopic 00454dca-a269- 4b93-8a45- c4012af05600 Information Systems 00:30:00Valeria Ardizzione, Giuseppe La Rocca Theorical th EGEE Conferen ce Giuseppe La Rocca, Valeria Ardizzone R-GMA, BDII Collection/gLAudio Entry names Attributes SongTitleDurationAlbumGenreSingerFormat 4ffaffc8-26e b460-3d5bf08081a4 Dedicato A Te00:03:27Dedicato A TePopLe VibrazioniMP3 Collection/gLKeys Entry names Attributes Passphrase 00454dca-a269-4b93-8a45- c4012af05600 ardizzo “additional features”

First Grid Tutorial, Catania, gLibrary query examples Query> selectattr /gLibrary:FILE /gLibrary:FileName /gLibrary:Description /EGEEPPT:Author /EGEEPPT:Title /EGEEPPT:Event '/gLibrary:FILE=/EGEEPPT:FILE and like(/gLibrary:Keywords, "%VOMS%")‘ >> 1f6e9ac6-5c b03b-560e0e7ea38a >> VOMS_server_Installation.ppt.gpg >> VOMS Server installation tutorial done in Venezuela >> ziggy, Giorgio >> Installing a gLite VOMS Server >> First Latin American Workshop for Grid Administrators Query> selectattr /gLibrary:FileName SubmissionDate Submitter /gLAudio:SongTitle Singer Duration Genre '/gLibrary:FILE=/gLAudio:FILE and /gLAudio:Format=“MP3”' >> DedicatoAte.mp3 >> :00:00 >> Tony Calanducci >> Dedicato A Te >> Le Vibrazioni >> 00:03:27 >> Pop

First Grid Tutorial, Catania, gLibrary Security User Requirements: –a valid proxy with VOMS extensions –VOMS Role and Group needed to be recognized by gLibrary as a contents manager. 3 kinds of users: –gLibraryManager: (s)he can create new content type and allows a generic VO user to become gLibrarySubmitter –gLibrarySubmitters: they can add new entries and define access rights on the entries they create.  Fine-grained permission (reading, writing, listing, decrypting) settings on each entry: whole VO members, VO groups, list of DNs –generic VO users: browse and make queries (on entries they have access to) Basic level of cryptography: –New files saved on SEs can be encrypted beforehand with a symmetric passphrase that will be saved in /gLKeys. Only selected users (that have a specific DN in the subject of their VOMS proxy) can access the passphrase and decrypt the file.

First Grid Tutorial, Catania, Security example (I) Connecting to amga.ct.infn.it: ARDA Metadata Server Query> whoami >> tony Query> user_listcred tony >> 'C = IT, O = GILDA, OU = Personal Certificate, L = INFN Catania, CN = Tony Calanducci, Address = Query> grp_member >> gilda:users >> gLibraryManager:glibrarysubmitters Query> addentry /gLibrary/1f6e9ac6-5c b03b-560e0e7ea38a FileName VOMS_server_Installation.ppt.gpg PathName /grid/gilda/calanducci/EGEE Type EGEEDOC Submitter 'Tony Calanducci' SubmissionDate ' :44' DecryptKeyDir '/DLKeys/gildateam' Description 'VOMS Server installation tutorial done in Venezuela' Keywords 'VOMS Server' CreationDate ' :28‘ Query> acl_show /gLibrary/1f6e9ac6-5c b03b-560e0e7ea38a >> tony rwxr-x >> gLibraryManager:glibrarysubmitters rwx

First Grid Tutorial, Catania, Security example (II) Query> dir /gLibrary >> /gLibrary/00454dca-a269-4b93-8a45-c4012af05600 >> entry >> /gLibrary/abd52d35-1bee-4de9-b234-a9abd920307e >> entry >> /gLibrary/1f6e9ac6-5c b03b-560e0e7ea38a >> entry Let’s logout and login again using a VOMS proxy with just VO Gilda membership (No Role or group) ARDA Metadata Server Query> whoami >> gilda Query> grp_member >> gilda:users Query> dir /gLibrary >> /gLibrary/00454dca-a269-4b93-8a45-c4012af05600 >> entry Query> acl_show /gLibrary/00454dca-a269-4b93-8a45-c4012af05600 >> gLibraryManager rwxr-x >> gilda:users rx The entry previously created does not even appear to non authorized users

First Grid Tutorial, Catania, Implementation Heavy exploitation of AMGA features –support for VOMS proxy authentication –fine-grained authorization capabilities to set ACLs per entry basis to restrict access to the decryption keys.  Allow gLibrarySubmitters to control which users (based on DNs, VOMS Roles and Groups) can list and get the attributes’ value for the submitted entries GUI Front-ends (to achieve the “easy of use” promise): –Java SWING GUI to be run on a Grid UserInterface (JVM required) -- prototype is under way –Portlet based front-end will be deployed in GENIUSPHERE and made available for any other JSR168 compliant portlets cointainer  Both use AMGA Java APIs

First Grid Tutorial, Catania, gLibrary Deployment scenario Authenticate with X509 Certificate VOMS Proxy with Group & Role Information AMGA Server PostGreSQL VOMS (gLibraryManager, gLibrarySubmitter, VO user) LFC (or Fireman) Catalog VOMS Proxy w/Role & Group SE VOMS Proxy UI

First Grid Tutorial, Catania, gLibrary JAVA GUI screenshot Alpha Prototype

First Grid Tutorial, Catania, gLibrary JAVA GUI Screenshot (II) Alpha Prototype

First Grid Tutorial, Catania, Future planned improvements Splitting of big files among several SEs (different chunks stored in different SEs): –Enforce security of data: even if a chunk is intercepted it has no meaning alone. –Increase upload/download bandwidth –Possible implementation:  one more NumberOfChunks attribute in /gLibrary collection.  /gLChunks collection keeps track of FirstChunkGUID-Chunk#-ChunkGUID Automatic extraction and population of metadata for well known document types –use of GNU libextractor to extract metadata from HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, MP3 (ID3v1 and ID3v2), OGG, WAV, EXIV2, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, REAL, RIFF (AVI), MPEG, QT and ASF –use of Lucenne algorithm for indexing document types containing text Evaluation of gLite Hydra Key Store to save decryptions keys

First Grid Tutorial, Catania, Splitting Implementation UI SE EGEE_Movie.mpg EGEE_Movie.mpg_gpg_1 EGEE_Movie.mpg_gpg_2 EGEE_Movie_mpg_gpg_3 EGEE_Movie.mpg_gpg_4

First Grid Tutorial, Catania, gLibrary Conclusion Born as an use case to demonstrate AMGA features Built on top of many gLite services Considering collaboration and integration with NA3 Document Digital Library System Fast → thanks to AMGA Secure → ACLs, encryption, and splitting Easy to use → User friendly Java GUI and portal soon available Easily extensible to support any document types (Medical Images and files, Invoices, Proceedings, Scientific Publications, Newspapers clips, …)

First Grid Tutorial, Catania, gMOD: grid Movie On Demand

First Grid Tutorial, Catania, gMOD: grid Movie On Demand gMOD provides a Video-On-Demand service User chooses among a list of video and the chosen one is streamed in real time to the video client of the user’s workstation For each movie a lot of details (Title, Runtime, Country, Release Date, Genre, Director, Case, Plot Outline) are stored and users can search a particular movie querying on one or more attributes Two kind of users can interact with gMOD: TrailersManagers that can administer the db of movies (uploading new ones and attaching metadata to them); GILDA VO users (guest) can browse, search and choose a movie to be streamed.

First Grid Tutorial, Catania, gMOD under the hood Built on top of gLite services: Storage Elements, sited in different place, physically contain the movie files FireMan, the File Catalogue, keeps track in which Storage Element a particular movie is located AMGA is the repository of the detailed information for each movie, and makes possible queries on them The Virtual Organization Membership Service (VOMS) is used to assign the right role to the different users The Workload Management System (WMS) is responsible to retrieve the chosen movie from the right Storage Element and stream it over the network down to the user’s desktop or laptop

First Grid Tutorial, Catania, gMOD interactions VOMS FireMan Catalogue Metadata Catalogue WNWN WN CE Storage Elements User Genius Portal Workload Management System get Role AMGA

First Grid Tutorial, Catania, gMOD screenshot gMOD is accesible through the Genius Portal (

First Grid Tutorial, Catania, Any questions? Thanks for the attention