Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens.

Slides:



Advertisements
Similar presentations
Delivering User Needs: A middleware perspective Steven Newhouse Director.
Advertisements

© 2007 Open Grid Forum Data Management Challenge - The View from OGF OGF22 – February 28, 2008 Cambridge, MA, USA Erwin Laure David E. Martin Data Area.
Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
Neil Chue Hong Project Manager, EPCC OGSA-DAI data access and integration NERC GridGIS workshop eSI, 1 February.
The National Grid Service and OGSA-DAI Mike Mineter
OGSA-DAI Data Access and Integration for the Grid Neil Chue Hong
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
An Overview of OGSA-DAI Kostas Tourlas
OGSA-DAI: Patterns and Functions Supercomputing ‘06 AIST Stand Malcolm Atkinson Director e-Science Institute & e-Science Envoy 14 th November.
Facilitating the use of eInfrastructure: NeSC Training Team Enabling, facilitating and delivering quality training in the UK and Internationally.
EGEE is a project funded by the European Union under contract IST International Summer School on Grid Computing Vico Equense, 16 th July 2005.
Amy Krause Applications Consultant, EPCC Tom Sugden Applications Consultant, EPCC OGSA-DAI Client Toolkit Principles.
Distributed Heterogeneous Data Warehouse For Grid Analysis
Intelligent Grid Solutions 1 / 18 Convergence of Grid and Web technologies Alexander Wöhrer und Peter Brezany Institute for Software.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
1 An Introduction to OGSA-DAI Konstantinos Karasavvas 13 th September 2005.
Mike Jackson EPCC OGSA-DAI Today Release 2.2 Principles and Architectures for Structured Data Integration: OGSA-DAI.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
17 July 2006ISSGC06, Ischia, Italy1 Agenda Session 26 – 14:30-16:00 An Overview of OGSA-DAI OGSA-DAI today – and future features How to extend OGSA-DAI.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Globus 4 Guy Warner NeSC Training.
Introduction to OGSA-DAI Neil Chue Hong 15 th February 2006 GGF16, Athens.
1 OGSA-DAI: Status and Future Plans Neil Chue Hong.
OGSA-DAI: Future Work and Wrap-up The OGSA-DAI Team
Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Extensible Framework for Data Access & Integration Malcolm Atkinson Director 10 th November 2004.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Introduction to OGSA-DAI The OGSA-DAI Team
DAIT (DAI Two) NeSC Review 18 March Description and Aims Grid is about resource sharing Data forms an important part of that vision Data on Grids:
OGSA-DAI Scenarios and Requirements OGSA-DAI for Developers GridWorld 2006, Washington DC 11 September 2006.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
1 1 EPCC 2 Curtin Business School & Edinburgh University Management School Michael J. Jackson 1 Ashley D. Lloyd 2 Terence M. Sloan 1 Enabling Access to.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
Grids - the near future Mark Hayes NIEeS Summer School 2003.
OGSA-DAI.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Data access and integration with OGSA-DAI: OGSA-DQP Steven Lynden University of Manchester.
Grid Services I - Concepts
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
OGSA-UK: Putting the users first Steven Newhouse OMII Deputy Director.
Mike Jackson EPCC OGSA-DAI Architecture + Extensibility OGSA-DAI Tutorial GGF17, Tokyo.
OGSA-DAI Neil Chue Hong 29 th January 2007 OGF19, Chapel Hill.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI Technology Update GGF17, Tokyo (Japan)
IBM & HSBC visit Malcolm Atkinson Director & e-Science Envoy UK National e-Science Centre & e-Science Institute 30 th March 2006.
1 OGSA-DAI Status Report Neil P Chue Hong 20 th May 2005.
OGSA-DAI & DAIT projects Update for TAG Prof. Malcolm Atkinson Director 30 th October 2003.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Requirements Gathering Exercise 2 nd DIALOGUE workshop eSI, 9-10.
OGSA-DAI Users’ Meeting Introduction Malcolm Atkinson Director 7 th April 2004.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Neil Chue Hong Project Manager, EPCC OGSA-DAI Status and Benchmarks All Hands Meeting 2005 Nottingham, 22 September.
The OGSA-DAI Project Databases and the Grid Neil Chue Hong Project Manager EPCC, Edinburgh
OGSA-DAI Open Grid Services Architecture – Data Access and Integration NeSC Review 18 March 2004.
Mike Jackson EPCC OGSA-DAI Today – Release 8 OGSA-DAI Tutorial GGF17, Tokyo.
Neil Chue Hong EPCC Authorization Models for Data Services EGEE Workshop on Management of Rights in Production Grids.
OGSA-DQP Steven Lynden University of Manchester. Data access & integration with OGSA-DAI: GGF 17 2 Introduction OGSA-DQP is a service based distributed.
OGSA-DAI 简介及其它在 China-VO DAS 系统中的应用 杨阳 中国虚拟天文台研发团队 Chinese Virtual Observatory.
1 A new Architecture for OGSA-DAI Malcolm Atkinson, K. Karasavvas, M. Antonioletti, R. Baxter, A. Borley, N. Chue Hong, A. Hume, M. Jackson, A. Krause,
OGSA-DAI Current Version Guy Warner.
OGSA-DAI.
A Grid Data Integration Service (OGSA-DQP) Paul Watson, University of Newcastle-upon-Tyne based on the work of… Norman Paton, Tasos Gounaris,
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Wsdl.
Grid Systems: What do we need from web service standards?
Presentation transcript:

Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 © 2 Data Services: challenges Scale Many sites, large collections, many uses Longevity Research requirements outlive technical decisions Diversity No “one size fits all” solutions will work Primary Data, Data Products, Meta Data, Administrative data, … Many Data Resources Independently owned & managed Geographically distributed and I haven’t even mentioned security yet!

GGF16, Feb 2006 © 3 Use Cases for Data Services Data Filtering: Single source producing large amounts of data distributed to many sites downstream Data Discovery: many sources, many query entry points in a linked system Data Translation: source to sink, conversion of data model / structure Data Federation: many sources, linked to provide view as a single source Data Replication full or partial copies to improve throughput Data Integration (model aggregation) e.g. integration of time variant data, streams, files Data Integration (knowledge expansion) forming links between databases to increase knowledge

GGF16, Feb 2006 © 4 Trade Offs Speed vs completeness do you require the exact answer or an answer? Application specific vs language specific queries how will users interrogate a data service? Static system vs Dynamic Discovery can you actually have dynamic resources? Static vs Dynamic data READ only, INSERT only, UPDATE permitted Static vs Dynamic queries optimisation over flexibility Intranet vs Internet speed over security Single data model versus mixed data models ease/speed over integration Queries vs Questions assume that we know the structure when we form the query

GGF16, Feb 2006 © 5 Requirements on Data Services? Common Data Model e.g. RowSet Common Query Language(s) e.g. XQuery, SQL Standard access to data resource schema information physical data resource information for optimisation purposes data resource descriptive information for discovery / integration Single, seamless security model Dynamic publication and discovery Multiple, efficient delivery methods Move computation towards data Data aggregation functionality Replication information

GGF16, Feb 2006 © 6 OGSA-DAI In One Slide An engineered extensible framework for data access and integration. Expose heterogeneous data resources to a grid through web services. Interact with data resources: Queries and updates. Data transformation / compression Data delivery. Customise for your project using Additional Activities Client Toolkit APIs Data Resource handlers A base for higher-level services federation, mining, visualisation,…

GGF16, Feb 2006 © 7 MySQL OGSA-DAI service Engine SQLQuery JDBC Data Resources Activities DB2 GZipGridFTPXPath XMLDB XIndice readFile File SWISS PROT XSLT SQL Server Data- bases Application Client Toolkit

GGF16, Feb 2006 © 8 MySQL OGSA-DAI service Engine SQLQuery JDBC SQL JDBC SQL JDBC SQL JDBC SQL JDBC Multiple SQL GDS SQLQuery

GGF16, Feb 2006 © 9 Distributed Query Processing Higher level services building on OGSA-DAI Queries mapped to algebraic expressions for evaluation Parallelism represented by partitioning queries Use exchange operators table_scan (protein) table_scan termID=S92 (proteinTerm) reduce hash_join (proteinId) op_call (Blast) reduce exchange 3,4 12

GGF16, Feb 2006 © 10 DQP architecture

GGF16, Feb 2006 © 11 Contributing to OGSA-DAI Additional functionality: Provide activities which implement specific functionality Provide extra client functionality Provide different security mechanisms Provide higher level components and applications Different levels of contributions Based on OGSA-DAI? Works with OGSA-DAI? Part of OGSA-DAI?

GGF16, Feb 2006 © 12 Future plans A new version of the OGSA-DAI Engine better support for concurrency, sessions, monitoring and notification Implementing new DAIS specifications Key things that we will be addressing: Performance (particularly format representation and transport) Security Model which can be applied across platforms Transactions provision More data integration facilities Integration with other components registries (e.g. GRIMOIRES) workflow editors (e.g. Taverna) Working with new projects e.g. CancerGrid, iSpider, GEODE

GGF16, Feb 2006 © 13 Further information The OGSA-DAI Project Site: The DAIS-WG site: OGSA-DAI Users Mailing list General discussion on grid DAI matters Formal support for OGSA-DAI releases OGSA-DAI training courses

GGF16, Feb 2006 © 14 OMII-UK Context e-Science Reclamation Yard Collaborative Development Users OGSA-DAI Collaborative Development OMII OMII-UK

GGF16, Feb 2006 © 15 The OGSA-DAI Team IBM Development Team, Hursley NEReSC, Newcastle NeSC, Edinburgh ESNW, Manchester IBM Dissemination Team EPCC Team, Edinburgh

GGF16, Feb 2006 © 16 Software Process Testing Reqs. Prototype Prioritisation Fix Bugs Use Cases Requests Design ImplementQA Release Support Test Cases Programme Board Technical Review Board Technical Reviewer DEVELOPERS USERS REVIEW Contribs Ingest Dissem. Training Nightly unit + system tests Additional test cases System tests based on reqs Continual process → Deep track features Users’ Group Peer Review and Inspection

International Cooperation and Recognition USA: o Globus Alliance o IBM Corporation o caBIG o BIRN o Indiana University o GridSphere o GEON o LEAD o MCS o NCSA o Secure Data Grid o UNC Japan: o AIST o BioGrid o NAREGI Europe: o CERN o DataMiningGrid o GridMiner o GridSphere o inteligrid o N2Grid o OntoGrid o Provenance o SIMDAT UK: o OMII o NGS o NCeSS o NIeeS o AstroGrid o BioSimGrid o BRIDGES o CancerGrid o ConvertGrid o eDiaMonD o EDINA o First Group plc o Fujitsu Labs Europe o GEDDM o GeneGrid o Genomic Technology and Informatics o GOLD o Human Genetics Unit o IBM UK o my Grid o Oracle UK China: o CAS o ChinaGrid o cnGrid o INWA Australia: o Curtin Business School o INWA Tutorials BostonCambridge CERNChicago EdinburghLondon San FranciscoSeattle SeoulSingapore TokyoISSGC 03 to 05 DIALOGUE workshops Columbus, Edinburgh, Indiana, Vienna Chicago, Manchester, San Diego South Korea: o KISTI 1485 registered users downloads

LEAD GeneGrid caBIG BRIDGES OGSA WebDB FirstDIG ConvertGrid eDiaMoND OGSA-DQP Grid Miner Meeting User Requirements

GGF16, Feb 2006 © 19 Summary Experienced team delivering quality software mature software and process Engaging with international community understanding and reacting to user requirements Complementary to other nodes delivering a coordinated roadmap of software De facto standard software for DAI driving refinement of standards used by large and small scale projects

GGF16, Feb 2006 © 20 Comments and Questions Please

GGF16, Feb 2006 © 21 Number of users 1485 registered downloads 3 Users’ Group Meetings Edinburgh Brussels Edinburgh Contributors Austria, China, Finland, Poland, Spain, UK, USA Release Statistics 985 downloads of latest release -Actual user downloads not search engine crawlers -Does not include downloads as part of GT3.2 and GT4 releases R1.0 (Jan 03)109 R1.5 (Feb 03)110 R2.0 (Apr 03)254 R2.5 (Jun 03)294 R3.0 (Jul 03)792 R3.1 (Feb 04)686 R4.0 (May 04)1124 R5.0 (Dec 04)766 R6.0 (May 05)985 Meeting User Requirements

GGF16, Feb 2006 © 22 Core features of OGSA-DAI A framework for building data clients Client toolkit library for application developers Seamless abstraction across WSI and WSRF services Highly-extensible Customise out-of-the-box product A framework for developing functionality Compose existing activities with application specific activities Data service concurrency and sessions Comprehensive documentation and tutorials Shipped to run on OMII_2, GT4.0 and Axis 1.2

GGF16, Feb 2006 © 23 Functionality of OGSA-DAI A framework for data applications Data access, insert and update Relational: MySQL, Oracle, DB2, SQL Server, Postgres, … XML: Xindice, eXist Files – CSV, EMBL, OMIM, SWISSPROT,… Data delivery SOAP over HTTP FTP, GridFTP Inter-service Data transformation XSLT ZIP, GZIP Security X.509 certificates Message Level Transport Level