The Grid versus O.G.C. What happens next? Bryan Lawrence Director of Environmental Data Archival and Associated Research, CCLRC Head of the British Atmospheric.

Slides:



Advertisements
Similar presentations
Page 1 CSISS LCenter for Spatial Information Science and Systems 03/19/2008 GeoBrain BPELPower Workflow Engine Liping Di, Genong Yu Center.
Advertisements

A Roadmap of Open Source components for GI Web Services and Clients A Paul R Cooper MAGIC.
Fujitsu Laboratories of Europe © 2004 What is a (Grid) Resource? Dr. David Snelling Fujitsu Laboratories of Europe W3C TAG - Edinburgh September 20, 2005.
Supporting further and higher education Grid Security: Present and Future Alan Robiette, JISC Development Group.
Semantically-Assisted Geospatial Workflow Design Gobe Hobona, David Fairbairn, Philip James ACM GIS – 8 th November Seattle.
Chapter 19 – Service-oriented Architecture
1 UK e-Science All-Hands Meeting Nottingham, 2004 Enterprise specification of the NERC DataGrid Andrew Woolf, Ray Cramer.
Crucial Patterns in Service- Oriented Architecture Jaroslav Král, Michal Žemlička Charles University, Prague.
Information Modelling MOLES Metadata Objects for Linking Environmental Sciences S. Ventouras Rutherford Appleton Laboratory.
This product includes material developed by the Globus Project ( Introduction to Grid Services and GT3.
SCIDIP-ES Components Oct ,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation.
OASIS Service Oriented Architecture Reference Model Technical Committee (SOA-RM) BOOT CAMP April DRAFT: Not approved by the OASIS SOA RM TC.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
A New Computing Paradigm. Overview of Web Services Over 66 percent of respondents to a 2001 InfoWorld magazine poll agreed that "Web services are likely.
Thee-Framework for Education & Research The e-Framework for Education & Research an Overview TEN Competence, Jan 2007 Bill Olivier,
The MashMyData project Combining and comparing environmental science data on the web Alastair Gemmell 1, Jon Blower 1, Keith Haines 1, Stephen Pascoe 2,
Creating Architectural Descriptions. Outline Standardizing architectural descriptions: The IEEE has published, “Recommended Practice for Architectural.
FREMA: e-Learning Framework Reference Model for Assessment David Millard Yvonne Howard IAM, DSSE, LTG University of Southampton, UK.
Semantic Mediation & OWS 8 Glenn Guempel
1 TECO-WIS, 6-8 November 2006 TECHNICAL CONFERENCE ON THE WMO INFORMATION SYSTEM Seoul, Republic of Korea, 6-8 November 2006 ISO 191xx series of geographic.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
BADC, BODC, CCLRC, PML and SOC The NERC Metadata Gateway: a product of the NERC DataGrid [ ]= Bryan Lawrence (on behalf of a big team)
OGC Liaison Report WGISS-20 Allan Doyle, EOGEO
Page 1 LAITS Laboratory for Advanced Information Technology and Standards 9/6/04 Briefing on Open Geospatial Consortium (OGC)’s Web Services (OWS) Initiative.
XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.
Possible Architectural Principles for OGSA-UK and other Grids UK e-Science Core Programme Town Meeting London Monday 31st January 2005 “Defining the next.
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
DISTRIBUTED COMPUTING
Bryan Lawrence on behalf of BADC, BODC, CCLRC, PML and SOC An Introduction to NDG concepts [ ]=
1 The NERC DataGrid DataGrid The NERC DataGrid DataGrid AHM 2003 – 2 Sept, 2003 e-Science Centre Metadata of the NERC DataGrid Kevin O’Neill CCLRC e-Science.
Grid-enabling OGC Web Services Andrew Woolf, Arif Shaon STFC e-Science Centre Rutherford Appleton Lab.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
EuroRoadS for JRC Workshop Lars Wikström, Triona Editor of EuroRoadS deliverables D6.3, D6.6, D6.7.
ET-ADRS-1, April ISO 191xx series of geographic information standards.
NOCS, PML, STFC, BODC, BADC The NERC DataGrid = Bryan Lawrence Director of the STFC Centre for Environmental Data Archival (BADC, NEODC, IPCC-DDC.
What is Information Modelling (and why do we need it in NEII…)? Dominic Lowe, Bureau of Meteorology, 29 October 2013.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
OGC/Grid activities in UK Chris Higgins (EDINA), Phil James (Uni of Newcastle), Andrew Woolf (CCLRC)
Managing and communicating uncertainty in geospatial web service workflows Richard Jones, Dan Cornford, Lucy Bastin, Matthew Williams Computer Science,
AUKEGGS Architecturally Significant Issues (that we need to solve)
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY WITHIN THE (SEMANTIC) WEB Matthew Williams
Semantic Web Technologies Research Topics and Projects discussion Brief Readings Discussion Research Presentations.
BADC, BODC, CCLRC, PML and SOC Interacting with NDG [ ]= Bryan Lawrence (on behalf of a big team)
1 ECCF Training 2.0 Implemental Perspective (IP) ECCF Training Working Group January 2011.
RSISIPL1 SERVICE ORIENTED ARCHITECTURE (SOA) By Pavan By Pavan.
Managing Enterprise GIS Geodatabases
Development of Semantically Aware Workflow Engines for GEOspatial Web Service Orchestration Open Grid Forum 20 (OGF20) 7 th May, 2007 Gobe Hobona (University.
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
SEcurE access to GEOspatial services OGC-OGF Collaboration workshop Open Grid Forum 21 (OGF21) October, 2007 Chris Higgins (EDINA, University of Edinburgh)
® Using (testing?) the HY_Features model, 95th OGC Technical Committee Boulder, Colorado USA Rob Atkinson 3 June 2015 Copyright © 2015 Open Geospatial.
E-Science Security Roadmap Grid Security Task Force From original presentation by Howard Chivers, University of York Brief content:  Seek feedback on.
1 Service Oriented Architecture SOA. 2 Service Oriented Architecture (SOA) Definition  SOA is an architecture paradigm that is gaining recently a significant.
INFSO-RI Enabling Grids for E-sciencE Web Services Mike Mineter National e-Science Centre, Edinburgh.
Open Grid Services for Earth Observation Pedro Gonçalves.
EGEE is a project funded by the European Union under contract IST Introduction to Web Services 3 – 4 June
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
SDI 4.0 Crowd-sourcing, Gov-sourcing Geographic Data via Open Geosynchronization Raj R. Singh Director, Interoperability Programs Open Geospatial Consortium.
A service Oriented Architecture & Web Service Technology.
ECMWF 24 th November 2008 Deploying secure OGC services in front of a heterogeneous data archive. Bryan Lawrence, Phil Kershaw, Dominic Lowe, and Stephen.
Beyond simple features: Do complex feature types need complex service descriptions?' B.N. Lawrence (1,2), D. Lowe (1,2), S. Pascoe (1,2) and A. Woolf (1).
Service Oriented Architecture (SOA) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
NERC DataGrid: Googling for Secure Data
Service Oriented Architecture (SOA)
Session 2: Metadata and Catalogues
Scott Thorne & Chuck Shubert
Grid Systems: What do we need from web service standards?
Presentation transcript:

The Grid versus O.G.C. What happens next? Bryan Lawrence Director of Environmental Data Archival and Associated Research, CCLRC Head of the British Atmospheric Data Centre, NCAS

Outline Clearly I don’t know what happens next! However, I intend to discuss: 1.Where I’m coming from 2.A brief comparison of Grid and OGC paradigms 3.Some Strengths and Weaknesses of the paradigms 4.Some crystal ball gazing

Where I’m coming from The British Atmospheric Data Centre, The NERC Earth Observation Data Centre The NERC DataGrid

British Atmospheric Data Centre Mission Statement includes Curation and Facilitation Curation: Preserves digital atmospheric science data for posterity (including ALL NERC-funded atmospheric science data and much else). Catalogues and Documents data. Holds ~50 TB terabytes of data! Facilitation: Provides effective access to data and information. Supports field campaigns and model development. Supports knowledge transfer from atmospheric science data to user knowledge in other domains. Knowledge Transfer: Half of the BADC users are from other fields: BADC data has been used to study bird feeding habits, radio communication modelling, A&E influenza cases, wind power research …

NERC Earth Observation Data Centre Will be part of new National Centre for Earth Observation, NCEO NERC commercial satellite data LANDSAT, SPOT, IKONOS NERC Airborne data NERC ARSF NEXTMap Britain Dedicated UK archive ATSR-1/2, ~40 TB AATSR ~80 TB Strong Liaison with ESA Working in ESA grid projects “To deliver effective services to the NERC community in locating, accessing, interpreting and exploiting Earth Observation data and information, and to ensure the long-term integrity of EO datasets produced and acquired by NERC projects and programmes”

Grid Complexity: Size and Heterogeneity British Atmospheric Data Centre British Oceanographic Data Centre NCAR

XML Intermediaries

NDG Architecture Data Providers NDG Core Services Users NDG “Portal” Interface(s) Vocab Services

Standards ISO 19101: Geographic Information Reference model A geospatial dataset… …consists of features and related objects… …in a defined logical structure… …delivered through services… …and described by metadata.

Geographic ‘features’ –“abstraction of real world phenomena” [ISO 19101] –Type or instance –Encapsulate important semantics in universe of discourse –“Something you can name” Application schema –Defines semantic content and logical structure –ISO standards provide toolkit: spatial/temporal referencing geometry (1-, 2-, 3-D) topology dictionaries (phenomena, units, etc.) –GML – canonical encoding [from ISO “Geographic information – Rules for Application Schema”] ISO19101 Features

A brief Comparison of Philosophy Grids The OGC

The Grid The Grid as defined by Foster 1 has three main characteristics: –standard open protocols, –no centralised control, and –non-trivial quality of service. The lack of central control requires secure access to grid resources, typically implemented though public key infrastructures. In practice, nearly all implementations of grids are themselves are Virtual Organisations, with some central control over authorisation and authentication! 1

The OGC web services Existing and planned OGC web services are open protocols designed for the situation where there is no centralised control, and are intended to provide non-trivial quality of service. The OGC has been developing Web Service interfaces over a number of years. I would assert that the OGC has been developing GRID components over a number of years. So what’s the difference?

Directions Grid World “Interesting” access control paradigms coupled with Virtual Organisations “Complex” workflows coupled with workflow management tools “Simple” data objects, or if not simple, homogeneous. Tight-binding in strongly-typed service descriptions OGC World Thus far little access control (but GeoDRM looming) Negligible orchestration of workflow (WPS is “internal” workflow). Registry formalism may change things. Standards support complex data objects, but implementations support simple data objects and relatively-weak binding.

The Collision In recent years, the OGC has commissioned work on producing SOAP-based implementations of the existing OGC web services as well as further developing the suite to include processing services. In practice then, the geospatial community may regard the development of “grid” principles as essentially the re- development and wider deployment of concepts already prevalent in their community. Of course the risk is also that the geospatial community is busy reinventing wheels that the wider community has already refined …

Strengths and Weaknesses Or the essential trade off between the rapid development of functionality versus interoperability … in each world!

Metadata in OGC One of the major identifying characteristics of OGC web services is that they (will) provide identifiers to datasets and services identified using the citation element of standardised metadata constructs (ISO19115). One of the major weaknesses of that approach is that ISO19139 (the xml implementation of ISO19115) is immature. Communities have not grasped the concept of profiles properly, and the tooling is even more immature. Fortunately, one can get a long way without ISO19139, but orchestrating services is going to require service description metadata!

Metadata in the Grid No real concept of metadata for data objects beyond “we can use OGSA/DAI for that”! A plethora of methods of describing services, and a plethora of methods for orchestrating services, but no real way of describing what the characteristics of of the data objects which services might manipulate. The goal of “agent-based” service negotiation is still a pipe dream – particularly if you want the service to do something with real data!

Security in the OGC-space 1.We can use https to secure the transport layer … 2.We can secure access to servers, but access- control granularity behind OGC services is difficult to engineer in an interoperable and scalable way! 3.Without something extra (e.g. NDG security) the BADC could not deploy the OGC services!) It turns out that OWS4 is doing something very similar!

Security in the Grid-world Strong access control based around virtual- organisations, but mostly need common authentication/authorisation governance within the V.O. –Not really good enough for true interoperability, and not yet enough to deal with NDG requirements –We’re bound to deploy Shibboleth and probably OpenID as well, but that’s only authentication. –Role based authorisation requires new infrastructure. PERMIS and other projects lead the way, but are not yet easily deployable. Significantly better than what’s available in the OGC-space, and on the right track …

On Service Description One of the key areas where both OGC and the Grid world have been active is in service description. I’m not that familiar with what the OGC has done 1 for the following reason: The rest of the IT world is doing SOA The rest of the IT world will describe services. It’s unlikely that the OGC will develop best practice which is well supported by software vendor tooling … At the moment WSDL2 is where we are investing our thinking time … 1 ISO19119 is a meta-model for services rather than a SDL

On religious wars (1) SOAP versus REST or POX (plain old XML)! OGC web services are POX. Most of the grid is SOAPacious. Most of the “Enterprise” world thinks they should use SOAP. Most Successful “Web 2.0” use REST. Nearly everyone thinks their way is the “one true way” (although see Ian Foster’s blog on “Fundamentalism”) I suspect that much of the argument is based around different requirements!

On Religious Wars (2) Fielding (who invented REST): “Some architectural styles are often portrayed as “silver bullet” solutions for all forms of software. However, a good designer should select a style that matches the needs of the particular problem being solved. Choosing the right architectural style for a network-based application requires an understanding of the problem domain and thereby the communication needs of the application, an awareness of the variety of architectural styles and the particular concerns they address, and the ability to anticipate the sensitivity of each interaction style to the characteristics of network-based communication.” Minar (implementor of the Google Soap Interface)”: “The deeper problem with SOAP is strong typing … Gregorio (co-inventor of the Atom Protocol): “This backs up my experience; if you don't have control of both ends of the wire then loosely typed documents beat strongly typed data- structure serializations.”

On religious wars (3) And for the grid/OGC collision? The Grid World: Are by and large are building things where the virtual organization’s members are in each other’s pockets. They do have control of both ends of the wire. They do have complex situations, where automatically parased service description languages do help. The tooling does mostly work. The OGC world : Don’t have control of the wire. Loose typing helps. REST and POX make sense. Minar again: “Truly, none of this protocol fiddling matters. Just do something that works” My collision perspective? OGC services have grown SOAP bindings. Grid services can grow REST bindings. WSDL2 (and other service description languages) can describe both! We will use what we need to get the job done, the information model including the interface descriptions is what matters, the bindings will follow. This means domain modellers who “think GML” will have to go back to ISO first principles to think about objects as more than just data types, as they are things with interfaces! It also means the Grid people need to think harder about the implication of the datatypes on their interface descriptions!

Crystal Ball Gazing The JISC-OGC projects Whither NDG Whither everyone else?

JISC-CALL 2006 call for projects to work in the area of OGC-Grid Collision. Two major initiatives: 1.Bringing grid security to the OGC world. 2.Bringing grid workflow concepts to the OGC world. (Implicitly bringing complex data-types aka feature-types into the thinking of the grid-world) Funded 1.SEE-GEO: SEcurE access to GEOspatial services 2.SAW-GEO: Semantically Aware Workflow Engines for GEOspatial Web Service Orchestration

SEE-GEO Aiming for access to geographic information via the National Grid Service Exact project detail not yet clear, but needing to address: 1.The role of Shibboleth (it is becoming the UK standard) 2.How WS-Security can be exploited 3.How to interface with the Globus-2 era NGS. 4.How the OGC web services are involved. Main deliverables will be a report and 3 demonstrators: 1.National datacentre 2.Social Science (NCeSS) 3.Orchestration (Newcastle) Project will involve integrating the WCS into OGSA/DAI SEE-GEO Slide adapted from material from Chris Higgins

SAW-GEO Proposed Architecture 1.Chaining multiple web services together 2.Semantically informed workflow management system and workflow engine 3.Workflow engine deployable onto Apache Tomcat 4.Web portal into the workflow engine 5.Use of OGSA-DAI wrappers and the Globus toolkit Workflow engines 1.Wide range of possible workflow engines will be examined –eg. TAVERNA, Apache Agila, ActiveBPEL 2.Newcastle has experience with SCUFL (TAVERNA), but it is application-specific, so are likely to use BPEL (supported by ActiveBPEL, Agila etc). –Which has a graphical capability and a web-based console running on apache Tomcat which meets the objectives above. SAW-GEO Slides adapted from material from David Fairbairn

SAW-GEO OGC WCS OGC WFS OGC WMS Clients OGSA -DAI WCS OGSA -DAI WFS OGSA -DAI WMS Workflow Management System MapServer or GeoServer Globus Toolkit SAW-GEO EDINA Newcastle SAW-GEO Slides adapted from material from David Fairbairn BNL: Not obviously the right methodology!!

SAW-GEO

SAW-GEO

NDG Futures NDG2 is doing nothing “new” now (project due to end Sep07). Consolidating what we know: –No role for OGSA/DAI Until the feature-type perspective is integral it is, for our purposes, little more than a WS-JDBC! –Will be exploiting OGC feature-type perspective significantly Development of CSML (an application schema of GML, along with some required standards enhancements) Services based on CSML (WCS, WFS, WMS) –Have rolled our own security paradigm Based on PKI, WS-Security, a local attribute- syntax for XML attribute certifcates and similar concepts to Shibboleth. Role-Mapping. Works now!

What would NDG3 consider? Our CSML2 domain modelling does include interface descriptions, so exploiting those in service registries will be important. Definitely want to support orchestration, and late-binding of services! Key technologies: –WSDL2 –ebRIM registries –SAW-GEO outputs in terms of BPEL, not at all convinced about OGSA/DAI! –Migration to shibboleth, XACML.

The next three years? What is everyone else thinking? The questions still remain, 1.Can we exploit the OGC Feature-type perspective, while 2.Keeping simple interfaces, but 3.Managing complex orchestration, of 4.Well described services, in a 5.Secure Manner Requires: 1.Semantic Tools, 2.Confronting implications of architectural paradigms (SOAP/REST) 3.Tooling (e.g. BPEL example from SAW-GEO) 4.Service and Data Metadata 5.Simpler Security a.WS-Security equivalent for POX? Tighter coupling of security concepts into WSDL b.Explicit recognition of access control beyond http authentication in OWS-common.

Summary OGC knows about data typing! –Feature-type concepts are not limited to geospatial! –Grid community need to understand implications of semantic data typing on service descriptions. GRID has more sophisticated service binding, access control and authentication, workflow! –OGC community should not reinvent tooling! Architectural decisions need to be based on pragmatic decisions about necessity for strong/weak typing and governance: –It is RIGHT to use strong-typing and tools (e.g. WSRF, SOAP) when the problem will benefit from doing so. –It is RIGHT to use weak-typing, late-binding, and REST when the problem will benefit from doing so. –Neither are silver bullets!