OGSA-DAI Usage Scenarios and Behaviour: Determining good practice Mario Antonioletti EPCC, University of Edinburgh

Slides:



Advertisements
Similar presentations
Large-Scale, Adaptive Fabric Configuration for Grid Computing Peter Toft HP Labs, Bristol June 2003 (v1.03) Localised for UK English.
Advertisements

Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
OGSA-DAI Data Access and Integration for the Grid Neil Chue Hong
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
An Overview of OGSA-DAI Kostas Tourlas
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
FirstDIG First Data Investigation on the Grid Paul Graham, Terry Sloan, Adam Carter EPCC Ian Gregory, Darren Unwin First South Yorkshire tel:+44 (0)131.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Resource wrappers, web services, grid services Jaspreet Singh School of Computer.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
1 An Introduction to OGSA-DAI Konstantinos Karasavvas 13 th September 2005.
1 e-science & data mining workshop, NeSC, UK, November 30 th, 2004 Terry Sloan EPCC, The University of Edinburgh INWA : using OGSA-DAI.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
TeraGrid Information Services December 1, 2006 JP Navarro GIG Software Integration.
OGSA-DAI: Future Work and Wrap-up The OGSA-DAI Team
Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
1 UK NeSC Meeting, November 18 th, 2004 Terry Sloan EPCC, The University of Edinburgh INWA : using OGSA-DAI in a commercial environment.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
The Queen’s University of Belfast The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker Platform integration into WS-PGRADE/gUSE Zoltán Farkas MTA.
EdSkyQuery-G Overview Brian Hills, December
EQUELLA Product Strategy and Development
ES Metadata Management Enabling Grids for E-sciencE ES metadata OGSA-DAI NA4 GA Meeting, D. Weissenbach, IPSL, France.
1 EPCC Sun Data and Compute Grids Project Using Sun Grid Engine and Globus to Schedule Jobs Across a Combination of Local.
Using OGSA-DAI in a commercial environment Terry Sloan EPCC Telephone:
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Engr. M. Fahad Khan Lecturer Software Engineering Department University Of Engineering & Technology Taxila.
OGSA-DAI Architecture The OGSA-DAI Team
Introduction to OGSA-DAI The OGSA-DAI Team
DAIT (DAI Two) NeSC Review 18 March Description and Aims Grid is about resource sharing Data forms an important part of that vision Data on Grids:
OGSA-DAI Scenarios and Requirements OGSA-DAI for Developers GridWorld 2006, Washington DC 11 September 2006.
1 / 18 Federal University of Rio de Janeiro – COPPE/UFRJ Author : Wladimir S. Meyer – Doctorate Student Advisors : Jano Moreira de Souza – Ph.D. Milton.
ODD-Genes: Accelerating data-driven scientific discovery NeSC Review 2003 NeSC
INFSO-RI Enabling Grids for E-sciencE Supporting legacy code applications on EGEE VOs by GEMLCA and the P-GRADE portal P. Kacsuk*,
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
1 1 EPCC 2 Curtin Business School & Edinburgh University Management School Michael J. Jackson 1 Ashley D. Lloyd 2 Terence M. Sloan 1 Enabling Access to.
Usability Talk, 26 th January 2006 Development of Usable Grid Services for the Biomedical Community Prof Richard Sinnott Technical Director National e-Science.
Jian Gui WANG New Implementation of Agriculture Models APAN19---Jan New Implementations of Agriculture Models Using Mediate Architecture.
OGSA-DAI.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
CaGrid Overview and Core Services caGrid Knowledge Center February 2011.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI Technology Update GGF17, Tokyo (Japan)
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE User Forum, Manchester, 10 May ‘07 Nicola Venuti
Experiences with OGSA-DAI : Portlet Access and Benchmark Deepti Kodeboyina and Beth Plale Computer Science Dept. Indiana University.
1 OGSA-DAI Status Report Neil P Chue Hong 20 th May 2005.
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
OGSA-DAI & DAIT projects Update for TAG Prof. Malcolm Atkinson Director 30 th October 2003.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Requirements Gathering Exercise 2 nd DIALOGUE workshop eSI, 9-10.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
OGSA-DAI Users’ Meeting Introduction Malcolm Atkinson Director 7 th April 2004.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Status and Benchmarks All Hands Meeting 2005 Nottingham, 22 September.
The OGSA-DAI Project Databases and the Grid Neil Chue Hong Project Manager EPCC, Edinburgh
OGSA-DAI Open Grid Services Architecture – Data Access and Integration NeSC Review 18 March 2004.
INFSO-RI Enabling Grids for E-sciencE Web Services Mike Mineter National e-Science Centre, Edinburgh.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
1 A new Architecture for OGSA-DAI Malcolm Atkinson, K. Karasavvas, M. Antonioletti, R. Baxter, A. Borley, N. Chue Hong, A. Hume, M. Jackson, A. Krause,
OGSA-DAI Current Version Guy Warner.
OGSA-DAI.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Presentation transcript:

OGSA-DAI Usage Scenarios and Behaviour: Determining good practice Mario Antonioletti EPCC, University of Edinburgh AHM 2004, Data Services and Middleware, 2 nd September 2004

2http:// - AHM2004, 2nd September 2004 OGSA-DAI  OGSA-DAI is middleware –Middleware should be invisible –Provide additional functionality or hide complexity  Allows uniform access to data resources –data resources: relational and XML databases, files, …  Provides an extensible framework –You can extend functionality - fill any gaps  We think it works well –But need feedback  Recount how OGSA-DAI is being used  Some background first …

3http:// - AHM2004, 2nd September 2004 Basic Operational Model Data Resource Container DAISGR Client GDSF GDS

4http:// - AHM2004, 2nd September 2004 Why OGSA-DAI?  Why use OGSA-DAI over JDBC? –Language independence at the client end Do not need to use Java –Platform independence Do not have to worry about connection technology, drivers, etc –Can handle XML resources –Can embed additional functionality at the service end Transformations Third party delivery etc Avoiding unnecessary data movement –Provision of Metadata is powerful –Usefulness of the Registry for service discovery Dynamic service binding process

5http:// - AHM2004, 2nd September 2004 More Complex Behaviour Data Resource Container Client GDS GDT Data Resource Container GDS GDT Deliver data back to the client. Data Resource Deliver data to a third party. Deliver data another GDS. And there's a lot more that you can do …

6http:// - AHM2004, 2nd September 2004 Usage Patterns G A Q S+R Data Q - Query D - Delivery S - Status R - Result U - Update I - Data id Q+D A C G S R G C A Q S D R A G Q+U S RetrieveUpdate/InsertPipeline G2=C G1=P A I Q1 S2 S1 U/R Q2+D Q1+D G2=C A G1=P S2 S1 Q2 U/R Actors - OGSI process - Non-OGSI process A - Analyst C - Consumer G - GDS P - Producer Call Response Data Flow A P G U I Q S A P G U I S Q+D

7http:// - AHM2004, 2nd September 2004 Activities are the drivers  Express a task to be performed by a GDS  Three broad classes of activities: –Statement –Transformations –Delivery  Extensible: –Easy to add new functionality –Does not require modification to the service interface –Extension operate within the OGSA-DAI framework  Functionality: –Implemented at the service –Work where the data is (do not require to move data back)

8http:// - AHM2004, 2nd September 2004 OGSA-DAI Deck

9http:// - AHM2004, 2nd September 2004 Building Applications  Activities are grouped together –Perform document –Data can flow between activities  Optimisation –Avoids multiple message exchanges  Can deliver to other GDSs –Prerequisite for data integration  Base middleware for projects requiring data access –Some capability for data integration  That is the theory … now for the practice  OGSA-DAI being adopted by a number of projects …

10http:// - AHM2004, 2nd September 2004 Who is Using OGSA-DAI? OGSA-DAI ( AstroGrid ( BioSimGrid ( BioGrid ( Bridges ( eDiaMoND ( FirstDig ( GeneGrid ( GEON ( IU RGRBench ( myGrid ( N2Grid ( ODD-Genes ( OGSA-WebDB ( MCS ( INWA ( GridMiner (

11http:// - AHM2004, 2nd September 2004 Project classification

12http:// - AHM2004, 2nd September 2004 Projects using OGSA-DAI  These projects form a list of case studies –Need to capture requirements –How OGSA-DAI is being used –Where it succeeds and where it fails –Other issues that arise –An on-going process  Only time to outline salient points from a couple of projects –More detail in the paper … but this only gives a top level overview On-going process …  Solicit more –If you have more then please get in touch …

13http:// - AHM2004, 2nd September 2004  e-Digital MammOgraphy National Database  Built a prototype of a national database of mammographic images in support of the UK Breast screening programme  Employ Grid technologies to facilitate this process  Mike Brady gave a keynote that went over the details

14http:// - AHM2004, 2nd September 2004 DB2 Content Manager DB2 Content Manager DB2 Content Manager DB2 Content Manager DB2 Federation OGSA-DAI Database Files OGSA-DAI Core Services Core Services Core Services Core Services Data Load Training App Training Services UCL KCLUEDCHU Core API Training API Training Application Core & Training API OGSA-DAI Data Load Training App Core & Training API Data Load Training App Core & Training API Data Load Training App Core & Training API

15http:// - AHM2004, 2nd September 2004  eDiaMoND Findings: –OGSA-DAI provides a flexible framework –Dynamically configure the system through discovery –Activities can operate with different levels of granularity –Federation can introduced at various levels –Upgrading from R3  R4 broke some things Low level XML issues –Good documentation on how to extend the framework Extended Activities to access IBM DB2 Content Manager

16http:// - AHM2004, 2nd September 2004 INWA Objectives  Innovation Node Western Australia –Informing Business & Regional Policy: Grid-enabled fusion of global data and local knowledge  Project –Run from Nov Aug 2004 –Involved 10 partners (6 UK + 4 Australia)  Aim –Data mine commercially sensitive data –Security an absolute MUST –Employ Grid technologies –Need access to data and computational resources  Demonstrator using: –OGSA-DAI Incorporate data resources –Sun DCG's TOG (Transfer-queue Over Globus) Handle job submission to analyse micro array data

17http:// - AHM2004, 2nd September 2004 Curtin,Australia EPCC,UK INWA Grid Engine BankTelco Grid Engine BankTelco OGSA-DAI TOG Data Browser Telco data Bank data Australian property UK Property

18http:// - AHM2004, 2nd September 2004 INWA: Lessons Learned  Performing Data Integration: –TimeZone date problems  Security issues: –Bugs in JavaCoG in GT3 OGSA-DAI could not switch security for Grid data transfers TOG had no security option –All of these have been fixed  Middleware not mature enough for commercial deployment

19http:// - AHM2004, 2nd September 2004 ODD-Genes  OGSA-DAI Demo for Genetics  Collaboration between –EPCC –Scottish Centre for Genomic Technology and Informatics (GTI) –Human Genetics Unit (HGU)  ODD-Genes demonstrates: –Perform high-speed batch analysis of microarray data on the Grid –Browse the results of previous analyses stored in a database –View data from arbitrary databases as HTML –Discover related databases on the Grid –Perform coupled queries on newly- discovered databases to provide a richer analysis of gene data

20http:// - AHM2004, 2nd September 2004 ODD-Genes Actors HGU Mouse Genome Information OGSA-DAI GTI ODD-Genes Webapp OGSA-DAI Micro Array Data TOG GridEngine EPCC GridEngine Globus OGSA-DAI DAISGR 1. Client 2. EPCC is an example of a computational resource. 3. HGU is an example of a data repository.

21http:// - AHM2004, 2nd September 2004 ODD-Genes Findings  Data discovery perceived to be very important –Map data views: time -> spatial locations –Discovery of new resources  Transparency to data access had an XML database had a relational database –Deploy OGSA-DAI and not worry about databases  Issues –Registry maintenance policy –Semantics of the discovery process –Groups working the same area but different schemas, no generic metadata (schemas were the effective metadata)  Provides an additional tool for researchers

22http:// - AHM2004, 2nd September 2004 Other Projects  AstroGrid –Identified (and fixed) a number of bugs –Passed on requirements  FirstDig –Identified a number of bugs –Have contributed a data browser to OGSA-DAI  GeneGrid –Interfacing Perl through an OGSA-DAI service to access biological databases –Requirement for file support  EdSkyQuery-G –Collaboration between OGSA-DAI & Eldas –Based on SkyQuery project by John Hopkins University, Baltimore, USA

23http:// - AHM2004, 2nd September 2004 More on Projects  MSc at Edinburgh looking at data integration scenarios –Benchmarking OGSA-DAI –Investigating capabilities GDSActivity allowing perform documents to be executed at other GDSs Identifying further requirements for data integration - control flow –sequence –flow Question as to whether such capabilities should be included in OGSA-DAI or OGSA-DAI should interface with other workflow languages  MSc at Edinburgh looking at C bindings to the OGSA-DAI CTK –For language independance need to provide more of these… Perl, Python … Eiffel!!  GridMiner –Have a really cute logo –Have a member of that team currently at NeSC

24http:// - AHM2004, 2nd September 2004 Conclusions  Still early days –Standardisation process not stabilising quickly enough –Infrastructure still developing and prone to change  OGSA-DAI acting as an enabler –Showing people what can be done –However is it cracking a nut with a sledge hammer?  Usage patterns are similar –Call for people to work together to solve similar problems  Problems that are not OGSA-DAI specific –Metadata, Time zones, security, …  Data discovery perceived to be important –Is this in the scope of what OGSA-DAI should be doing?  Need to talk to users and gather war stories – –On going process …

25http:// - AHM2004, 2nd September 2004 Further Information  The OGSA-DAI Project Site: –  The DAIS-WG site: –  OGSA-DAI Users Mailing list –General discussion on grid DAI matters  Formal support for OGSA-DAI releases –  OGSA-DAI training courses