Karolina Sarnowska, University of Virginia Andrew Grimshaw, University of Virginia Mark Morgan, University of Virginia Akos Frohner, CERN Erwin Laure,

Slides:



Advertisements
Similar presentations
Genesis II Open Source, OGSA Implementation Genesis II: Mapping Grids into the Local File System: Access, RNS, and ByteIO Andrew Grimshaw Genesis II Team.
Advertisements

Genesis II Open Source, OGSA Implementation Genesis II: Easy-to-use, Standards Based Grid Middleware Andrew Grimshaw Genesis II Team University of Virginia.
Adoption. Premise We have a sufficient corpus of specifications and profiles to realize (implement) identified use cases in computing and data. Customer.
© 2007 Open Grid Forum Data Management Challenge - The View from OGF OGF22 – February 28, 2008 Cambridge, MA, USA Erwin Laure David E. Martin Data Area.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
JSAGA2 Overview job desc. gLite plug-ins Globus plug-ins JSAGA hidemiddlewareheterogeneity (e.g. gLite, Globus, Unicore) JDLRSL.
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
© 2008 Open Grid Forum Grid Standards Realizing Basic Grid Use Cases Using Existing Standards and Profiles.
© 2006 Open Grid Forum Grid – path to pervasive adoption Mark Linesch - President, Open Grid Forum Hewlett Packard.
Web Service Standards, Security & Management Chris Peiris
The Data Replication Service Ann Chervenak Robert Schuler USC Information Sciences Institute.
Web Services and HL7v3 in IHE profiles Vassil Peytchev Epic.
Session 1194 JBuilder 2005 Web Services Designer Ravi Kumar Principal Engineer Borland.
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
Why do we need PGI? Shahbaz Memon Jülich Supercomputing Centre (JSC)
File and Object Replication in Data Grids Chin-Yi Tsai.
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware Data Management in gLite.
Accelerating business innovation; a Technology Strategy Board programme The Standards Landscape Dave Berry Standards for Interoperable.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Nov. 18, EGEE and gLite are registered trademarks gLite Middleware Usage Dusan.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Genesis II “Open Source, OGSA Implementation” Genesis II: From Specification to Implementation Andrew Grimshaw, Mark Morgan Global Bio Grid Team University.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
FP6−2004−Infrastructures−6-SSA CNGrid Middleware GOSv2 Yongjian Wang BUAA – Beijing, China Interoperability workshop of euchinagrid Beijing,
Grid Services I - Concepts
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
Metadata Mòrag Burgon-Lyon University of Glasgow.
Overview of grid activities in France in relation to FKPPL FKPPL Workshop Thursday February 26th, 2009 Dominique Boutigny.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
INFSO-RI Enabling Grids for E-sciencE Experiences with LFC and comparison with RNS Erwin Laure Jean-Philippe.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
© 2008 Open Grid Forum File Catalog Development in Japan e-Science Project GFS-WG, OGF24 Singapore Hideo Matsuda Osaka University.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
Gridshell Security Master Project Akylbek Zhumabayev Rochester Institute of Technology.
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
NorduGrid plans and questions for gLite Marko Niinimaki, NorduGrid 3 rd EGEE meeting Athens, April 2005.
EGEE is a project funded by the European Union under contract IST VO box: Experiment requirements and LCG prototype Operations.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Data management in LCG and EGEE David Smith.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
Open Grid Services for Earth Observation Pedro Gonçalves.
Data Breakout. OGSA Architecture – databases Eldas, OGSA-DAI and GridMiner implement a slightly old version of OGSA / DAIS –Architecture doc describes.
DataGrid is a project funded by the European Commission EDG Conference, Heidelberg, Sep 26 – Oct under contract IST OGSI and GT3 Initial.
The Institute of High Energy of Physics, Chinese Academy of Sciences Sharing LCG files across different platforms Cheng Yaodong, Wang Lu, Liu Aigui, Chen.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
2 nd EGEE/OSG Workshop Data Management in Production Grids 2 nd of series of EGEE/OSG workshops – 1 st on security at HPDC 2006 (Paris) Goal: open discussion.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
OGF24 15 September 2008 Data Area Overview Erwin Laure David E. Martin Data Area Directors.
EGEE Data Management Services
StoRM: a SRM solution for disk based storage systems
Java API del Logical File Catalog (LFC)
GSAF Grid Storage Access Framework
Data Management cluster summary
Why does EDGeS need OGF PGI ?
gLite The EGEE Middleware Distribution
Presentation transcript:

Karolina Sarnowska, University of Virginia Andrew Grimshaw, University of Virginia Mark Morgan, University of Virginia Akos Frohner, CERN Erwin Laure, KTH

Introduction Development Interoperability Performance Conclusion 2

Interoperability and Grid Standards 3

Create a standards-based interface for accessing data resources in EGEE grids Read data from LFCs via standardized interface Enable interoperability Validate specifications 4

Catalog of data files on storage elements Each logical file entry can correspond to multiple physical files (replicas) Provides mapping between logical file names and storage URLs 5 LFC /grid /atlas /alice /cms SE Storage Elements EGEE Grid

Infrastructure WS-Addressing, Resource Namespace Service (RNS), WS-Naming Compute JSDL, OGSA-BES, HPC Basic Profile Data RNS, OGSA-ByteIO, gridFTP, WS-DAI APIs SAGA, DRMAA, GridRPC Security WSI-BSP, WS-Security, WS-Trust, WS-Federation, WSSecurityPolicy, WS-SecureNaming, WS-Secure Communication 6

Need means of referring to resources What’s in a name? human-readable maps to unique resource 7

RNS: Resource Namespace Service Describes standard way of mapping names to grid endpoints RNS 1.0 Operations: Add, List, Remove, Query, Move 8

Standards-based Naming for Accessing Resources in an LFC 9 SNARL Service (1) RNS Requests (3) RNS Responses LFC /grid /atlas /alice /cms /dteam Grid Client (2) LFC API Calls SE EGEE Grid Another Grid Storage Elements

Need means of accessing data associated with resources 10

Describes standard way of handling transfer of data associated with grid entities POSIX-like interface Random, session-less or stateful session access Operations: Read, Write, Append, TruncAppend, SeekRead, SeekWrite 11

Standards-based Access to Bytes of LFC Entries 12 (3) LCG & GFAL API Calls SABLE Service (1) ByteIO Requests (3) ByteIO Responses LFC /grid /atlas /alice /cms /dteam Client SE EGEE Grid Another Grid Storage Elements (2) GFAL API Calls

Creating web services with Axis2/C 13

1.Setup web services framework with Axis2/C 2.Create skeleton services with WSDL2C 3.Create clients for testing 4.Implement RNS and ByteIO Operations using gLite APIs 14

Web services engine implemented in C Handles SOAP message processing Built-in support includes: WS-Addressing WS-Policy WS-SecurityPolicy 15

1.Specify web service in WSDL file 2.Use tool to generate C web service skeleton Resulting Service Skeleton Size: SNARL: 89k LOC SABLE :77k LOC 16

Example: RNS Add Request 17 foo

Translating RNS operations to API calls Should replicas be visible to users? 18 RNS OperationLFC API Call Addlfc_creat Listlfc_readdir Removelfc_unlink Querylfc_stat Movelfc_unlink + lfc_creat

19 LFC /grid/foo SNARL Service (1) RNS List (3) RNS List Response Grid Client (2) LFC API Calls SE fooReplicaA SE fooReplicaB

Translating ByteIO operations to API calls Should writes be allowed? 20 ByteIO OperationAPI Call Allgfal_open Readgfal_lseek + gfal_read Write/Appendgfal_lseek + gfal_write TruncAppendgfal_creat + Append

21

Project/SpecRNSByteIO Genesis IIXX Unicore 6X Fujitsu USMTX Microsoft CCS Globus OMII-UKX GridSAM Crown Platform OGSA-DAIXX NAREGI gFarmX 22

1.Using RNS list request, choose LFC entry 2.Using ByteIO read request, read data resource associated with LFC entry 23

24 (3) RNS List Response LFC /grid /atlas /alice /cms /dteam Grid Client (2) LFC API Calls SE (1) RNS List Request SNARL Service Storage Elements EGEE Grid GenesisII Grid

25 (3) ByteIO Read Response LFC /grid /atlas /alice /cms /dteam Grid Client SE (1)ByteIO Read Request SABLE Service Storage Elements (2) GFAL API Calls EGEE Grid GenesisII Grid

26

Scientific CERN Linux 4 Intel Pentium GHz CPU 1 GB memory 27

28 Dir Size: lfc-ls Cmdline LFC API Axis2 Client Genesis II Client Small (10 entries)0.42s1.49s1.94s4.43s Medium (100 entries)0.47s11.73s15.92s20.27s Large (1000 entries)0.71s2m 0.37s2m 1.11s2m 5.05s

29 File Size: lcg-cp Cmdline GFAL API Axis2 Client Genesis II Client Small (1 KB)2.54s2.60s2.55s3.49s Medium (1 MB)2.83s3.61s2.93s3.64s

30

Add security context Implement MIME, DIME, MTOM Explore use as interface between LFC and other tools Update for RNS 1.1 Implement StreamableByteIO 31

Implementation of RNS and ByteIO specification for existing grid system is possible RNS and ByteIO provide an interface that enables the sharing of grid data 32

CERN & DM Team UVA Genesis II Team OpenLab Program at CERN 33

1."Open Grid Forum," 2.M. Pereira, O. Tatebe, L. Luan, and T. Anderson, "Resource Namespace Service Specification,” M. Morgan, “ByteIO Specification,” M. M. Morgan and A. S. Grimshaw, "Genesis II – Standards Based Grid Computing," Seventh IEEE International Symposium on Cluster Computing and the Grid, “Apache Axis2/C,” 34