Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 1 Design Decisions Interoperability.

Slides:



Advertisements
Similar presentations
Current status of grids: the need for standards Mike Mineter TOE-NeSC, Edinburgh.
Advertisements

High Performance Computing Course Notes Grid Computing.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Resource wrappers, web services, grid services Jaspreet Singh School of Computer.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Technical Architectures
GenSpace: Exploring Social Networking Metaphors for Knowledge Sharing and Scientific Collaborative Work Chris Murphy, Swapneel Sheth, Gail Kaiser, Lauren.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Course Instructor: Aisha Azeem
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Accessing Biodiversity Resources in Computational Environments from Workflow Application J. S. Pahwa, R. J. White, A. C. Jones, M. Burgess, W. A. Gray,
Globus 4 Guy Warner NeSC Training.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
Designing and Building a Biodiversity Grid: the Biodiversity World Project A talk in the workshop “e-Research - Meeting New Research Challenges” at the.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
THE GITB TESTING FRAMEWORK Jacques Durand, Fujitsu America | December 1, 2011 GITB |
CSE 303 – Software Design and Architecture
1 G52IWS: Distributed Computing Chris Greenhalgh.
©Ian Sommerville 2000 Software Engineering, 6th edition. Slide 1 Component-based development l Building software from reusable components l Objectives.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
SAMANVITHA RAMAYANAM 18 TH FEBRUARY 2010 CPE 691 LAYERED APPLICATION.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 09. Review Introduction to architectural styles Distributed architectures – Client Server Architecture – Multi-tier.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Engr. M. Fahad Khan Lecturer Software Engineering Department University Of Engineering & Technology Taxila.
Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.
1 Introduction to Middleware. 2 Outline What is middleware? Purpose and origin Why use it? What Middleware does? Technical details Middleware services.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Component Technology. Challenges Facing the Software Industry Today’s applications are large & complex – time consuming to develop, difficult and costly.
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
Managing and communicating uncertainty in geospatial web service workflows Richard Jones, Dan Cornford, Lucy Bastin, Matthew Williams Computer Science,
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Grid Services I - Concepts
Cracow Grid Workshop ‘06 17 October 2006 Execution Management and SLA Enforcement in Akogrimo Antonios Litke Antonios Litke, Kleopatra Konstanteli, Vassiliki.
CORBA1 Distributed Software Systems Any software system can be physically distributed By distributed coupling we get the following:  Improved performance.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Data Integration in Bioinformatics Using OGSA-DAI The BioDA Project Shirley Crompton, Brian Matthews (CCLRC) Alex Gray, Andrew Jones, Richard White (Cardiff.
7. Grid Computing Systems and Resource Management
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Chapter 1: Computing with Services Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
EGI Technical Forum Amsterdam, 16 September 2010 Sylvain Reynaud.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
OGSA-DAI.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
RDA 9th Plenary Breakout 3, 5 April :00-17:30
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
University of Technology
PROCESS - H2020 Project Work Package WP6 JRA3
Service-centric Software Engineering
Service-centric Software Engineering 1
SAMANVITHA RAMAYANAM 18TH FEBRUARY 2010 CPE 691
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Presentation transcript:

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Design Decisions Interoperability in a changing architecture Andrew Jones

Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July BiodiversityWorld requirements (1) Biodiversity Problem Solving Environment – Heterogeneous diverse resources Facilitating integration of both legacy and newly- developed resources Flexible workflows Main challenges centre around metadata, interoperability, resource discovery, etc; High-performance computing secondary (though relevant)

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July BiodiversityWorld requirements (2) Distinctive features: a biodiversity informatics GRID interoperability with heterogeneous data, complex in structure resilience to infrastructure change & interoperation with other GRIDs interactive collaboration a secondary concern Assumptions about resources: A resource worked either: Essentially in ‘batch’ mode, or Supporting a sequence of operations on a single resource, but involving exchange of minimal data Reasonable to treat each resource (including databases) as a service offering its own, defined set of operations

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July BiodiversityWorld architectural overview BiodiversityWorld-GRID Interface (BGI) The GRID Workflow enactment engine Wrapped resources Native Biodiversity- World Resources Metadata repository Presentation BGI API User interface

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July The BGI concept Standardised invocation mechanism Wrappers notionally divided into Grid-facing and resource-facing parts

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Why we protected ourselves from ‘the Grid’(!) Rapidly evolving standards Previous experience in GRAB Globus 2 approach needed ‘canned queries’, temporary files, etc … unnatural for distributed request/response model BiodiversityWorld Globus and other software still evolving Globus 3: Grid Services; Globus 4: WSRF; … Trade-off: abstraction layer (BGI); invocation mechanism Insulates from change Performance penalty Assume computationally intensive applications lie in a single BDW resource Proprietary invocation mechanism hinders interoperation with other Grid/Web services

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Implementations of BGI RMI GT3 Grid Services (incomplete) Web services GT4/WSRF/Grid-Service-as-portal

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Benefits & limitations Too many standards, so we defined a new one!! Interoperability with other projects restricted Could wrap non-BDW resources, or Implement alternative Grid-facing “glue” replacing invocation mechanism with some other standard Restrictions on highly interactive applications BGI OK for coarse-grained interaction; not for dynamic interaction with potentially large data volumes Transmission and storage of intermediate results: method not specified Can pass URI instead of data, but no specifications restricting what this might refer to

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Transmission/storage of data Desirable to have uniform mechanisms for transmission and storage of data for: Efficient operation of workflows Re-use; composition of workflows Supporting more flexible experimentation

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Are workflows sufficient for flexible experimentation? Creating a workflow: Workflows clearly good for capturing complex tasks Good for ‘tweaking’ tasks But is this how users think? If not, we should provide an environment that supports a more exploratory approach too, e.g. User tries out some small subtasks (S)he joins results together Builds larger workflows from fragments This requires recording of interactions, so re-usable workflows can be composed Storage of intermediate data sets Provenance metadata (extending MDR)

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July How to achieve dynamic interaction? Some possibilities for future development Remote direct manipulation (And other remote interactions?) BGI not well suited to fine-grained interaction with resources Some resources may not be accessible except as stand-alone May need (less portable) ‘by-pass’ mechanisms, e.g. New BGI protocol Using existing techniques, such as VNC Local direct manipulation, etc. Achievable via component-based ‘plug-in’ approaches (e.g. using JavaBeans), but component interface must be defined Requires data to be present locally; bandwidth concerns Some bandwidth problems can be addressed by combining local specialised client component & remote server component (e.g. passing vectors, not bitmaps) BGI may or may not be fast enough in this case

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July How to achieve data transmission/intermediate result storage? Low level E.g. orchestrate facilities such as GridFTP, GRAM, … Higher-level E.g. Inferno, SRB

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Additional considerations Again, have problem of committing to other, evolving standards Need at least a thin API layer to protect resources from change And don’t want to break existing BDW system

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July More direct database exploitation with OGSA-DAI BioDA project is investigating relevance & suitability of OGSA-DAI in relation to bioinformatics projects 2 main possibilities within BDW: 1.Augment BGI to support inclusion of queries in workflows and to be sent directly to OGSA-DAI enabled databases. Distributed query processing facilities could assist in planning execution & distribution of data-orientated parts of a workflow. (For the current status of OGSA-DQP see Section 4.) Very major revision to BDW protocols; also, many resources of interest are simply not exposed as databases. 2.Provide facilities within individual wrappers that benefit from OGSA- DAI. Current exemplar (under development) takes approach (2) …

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July BDW OGSA-DAI initial exemplar

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July BDW OGSA-DAI exemplar extension

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Conclusions BDW interoperation layer designed to meet requirements we were given Suitable for high-level interactions Not so good for dynamic interaction with resources (need for this now generally recognised) Doesn’t specify how data is to be moved around Applicable to other domains meeting similar criteria Interesting possibilities for extension But we have achieved a sustainable architecture; this is an important feature to retain in future systems

Andrew Jones Interop. in changing infrastructure BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July Some discussion points (Arising from Jaspreet’s and Andrew’s talks) 1.Balance of requirements for different kinds of GRIDS – (performance, resource discovery, sustainability, …) – how does this affect decisions about architectures, protocols, … ? 2.How can BDW protocols best be enhanced in future projects? 3.How can we best achieve interoperability between grids from different projects (including BDW)? 4.How can we make it easier for 3 rd parties to Introduce their resources to an existing BgiWrapperService? Develop their own additional BgiWrapperServices?