BADC, BODC, CCLRC, PML and SOC The NERC Metadata Gateway: a product of the NERC DataGrid + ++ + +[ ]= Bryan Lawrence (on behalf of a big team)

Slides:



Advertisements
Similar presentations
Page 1 © Crown copyright 2005 Workshop on Metadata Beijing27-29 September Report on Metadata Gil Ross (Met Office UK) WIS Working Group Geneva.
Advertisements

Information Modelling MOLES Metadata Objects for Linking Environmental Sciences S. Ventouras Rutherford Appleton Laboratory.
Page 1© Crown copyright 2006 Registry technology & case study implementation J. Tandy, D. Thomas - November 2006.
Community semantics and interoperability: the ISO/TC 211 framework and the “Hollow World” Simon Cox CSIRO Exploration and Mining 6 September.
OneGeology-Europe - the first step to the European Geological SDI INSPIRE Conference 2010, Session Thematic Communities: Geology Krakow, June 24 th 2010.
Nov Copyright Galdos Systems Inc. November 2001 Geography Markup Language Enabling the Geo-spatial Web.
Bryan Lawrence on behalf of BADC, BODC, CCLRC, PML and SOC The British Atmospheric Data Centre and the NERC DataGrid (for) [ ]=
NERC Data Grid Helen Snaith and the NDG consortium …
Session 1: Introduction to Geospatial Standards Why do we care about standards? Agreement on syntax, information models and interfaces lets us share information.
1 TECO-WIS, 6-8 November 2006 TECHNICAL CONFERENCE ON THE WMO INFORMATION SYSTEM Seoul, Republic of Korea, 6-8 November 2006 ISO 191xx series of geographic.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Introduction to Geospatial Metadata – FGDC CSDGM National Coastal Data Development Center A division of the National Oceanographic Data Center Please .
VO Sandpit, November 2009 Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan.
2 nd Training Workshop 4 – 5 June 2007 Common Data Index - CDI By Dick M.A Schaap Technical Coordinator SeaDataNet.
Metadata (for the data users downstream) RFC GIS Workshop July 2007 NOAA/NESDIS/NGDC Documentation.
Interoperability A simple case for standards Kim Finney JCADM – Rome 2007.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
U.S. Department of the Interior U.S. Geological Survey NWIS, STORET, and XML National Water Quality Monitoring Council August 20, 2003.
NERC DataGrid and CSML NDG Team. CSML: Context NERC DataGrid: the integration problem –multiple organisations, formats, storage mechanisms (file, relational)
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
Bryan Lawrence on behalf of BADC, BODC, CCLRC, PML and SOC An Introduction to NDG concepts [ ]=
BADC, BODC, CCLRC, PML and SOC Distributed Data, Distributed Governance, Distributed Vocabularies, with a dash of CLADDIER: The NERC DataGrid [
1 The NERC DataGrid DataGrid The NERC DataGrid DataGrid AHM 2003 – 2 Sept, 2003 e-Science Centre Metadata of the NERC DataGrid Kevin O’Neill CCLRC e-Science.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Marine Community and OGC Experiences of the MarineXML initiative EU MarineXML project SEEGrid IOC-UNESCO.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
ET-ADRS-1, April ISO 191xx series of geographic information standards.
NOCS, PML, STFC, BODC, BADC The NERC DataGrid = Bryan Lawrence Director of the STFC Centre for Environmental Data Archival (BADC, NEODC, IPCC-DDC.
What is Information Modelling (and why do we need it in NEII…)? Dominic Lowe, Bureau of Meteorology, 29 October 2013.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
GLOBAL BIODIVERSITY INFORMATION FACILITY Éamonn Ó Tuama Senior Programme Officer, IDA 21 June Metadata publishing with the IPT.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
NERC DataGrid NERC DataGrid Vocabulary Server Use Cases Vocabulary Workshop, RAL, February 25, 2009.
VO Sandpit, November 2009 CEDA Metadata Steve Donegan/Sam Pepler.
AUKEGGS Architecturally Significant Issues (that we need to solve)
Are Standards Really Standards Any More? Mélanie F. Meaux NASA / GCMD In response to Wyn Cudlip with regards to an IDN profile of ISO …
BADC, BODC, CCLRC, PML and SOC Interacting with NDG [ ]= Bryan Lawrence (on behalf of a big team)
NDG Discovery Gateway ISO19139 Issues Bryan Lawrence Director of Environmental Data Archival and Associated Research, CCLRC Head of the British Atmospheric.
XIth International Congress for Mathematical Geology - September 3-8, 2006 – Liège, Belgium Contribution of GeoScienceML to the INSPIRE data harmonisation.
Integrating the Climate Science Modelling Language with geospatial software and services Dominic Lowe British Atmospheric Data
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
AUKEGGSWorkshop ANU, Canberra, 29 November 2006 Implementing CSML Feature Types in applications within the NERC DataGrid Dominic Lowe, British Atmospheric.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Documenting UAF Data Ted Habermann NOAA/NESDIS/National Geophysical Data Center.
INFSO-RI Enabling Grids for E-sciencE A service oriented framework to create, manage and update metadata for earth system science.
Some international collaborations in geoscience informatics: IUGS GeoSciML testbed, & AUKEGGS forum Simon Cox CSIRO Exploration and Mining.
WIGOS Data model – standards introduction.
Slide 1 SDTSSDTS FGDC CWG SDTS Revision Project ANSI INCITS L1 Project to Update SDTS FGDC CWG September 2, 2003.
© MarineXML 2005 Slide 1 of 19 Geospatial Standards for Earth System Science Data Edinburgh, Scotland, 6 th -8 th September 2005 MarineXML & MOTIIVE Keiran.
Alison Pamment 1, Steve Donegan 1, Calum Byrom 2, Oliver Clements 3, Bryan Lawrence 1, Roy Lowry 3 1 NCAS/BADC, Science and Technology Facilities Council,
Geography Markup Language (GML). GML What is GML? – Scope  The Geography Markup Language is  a modeling language for geographic information  an encoding.
British Atmospheric Data Centre ( Searching: Whither NDG? Bryan Lawrence.
Interoperability in GSDI: Standards, Solutions, and Futures Douglas Nebert GSDI Secretariat.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
OGC Web Services with complex data Stephen Pascoe How OGC Web Services relate to GML Application Schema.
1 Alison Pamment, 2 Calum Byrom, 1 Bryan Lawrence, 3 Roy Lowry 1 NCAS/BADC,Science and Technology Facilities Council, 2 Tessella plc, 3 British Oceanogrphic.
Smart Maps and Dumb Questions: A Geospatial Semantic Web Interoperability Experiment Joshua Lieberman Traverse Technologies, Inc. & Northrop Grumman Information.
Bavarian Agency for Surveying and Geoinformation AAA - The contribution of the AdV in an increasing European Spatial Data Infrastructure - the German Way.
ECMWF 24 th November 2008 Deploying secure OGC services in front of a heterogeneous data archive. Bryan Lawrence, Phil Kershaw, Dominic Lowe, and Stephen.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
U.S. Department of the Interior U.S. Geological Survey WaterML Presentation to FGDC SWG Nate Booth January 30, 2013.
NERC DataGrid: Googling for Secure Data
The NERC Metadata Gateway: a product of the NERC DataGrid
Flanders Marine Institute (VLIZ)
BADC, BODC, CCLRC, PML and SOC
The Re3gistry software and the INSPIRE Registry
Session 2: Metadata and Catalogues
BADC, BODC, CCLRC, PML and SOC
Presentation transcript:

BADC, BODC, CCLRC, PML and SOC The NERC Metadata Gateway: a product of the NERC DataGrid [ ]= Bryan Lawrence (on behalf of a big team)

TECO-WIS, Nov 2006 Outline Introduction to NERC, the NERC Data Centres, and NCAS The NERC DataGrid Project –Key Components: Data Tools, Data Discovery, {Access Control} –NDG Information Environment Key Standards Structures: the ISO Family From CSML, {MOLES}, DIF to ISO19139 (NumSim) Distributed Content Search –Why we did it this way –Our Discovery Architecture NDG Discovery –Now … and –The Future – The “New NERC Metadata Gateway” ISO19139 Best Practice Summary

TECO-WIS, Nov 2006 Some Introductions NERC: The Natural Environment Research Council –The major player in UK environmental research –Is both a funding agency, and a conglomeration of “centres”: internal “research” institutes, The British Oceanographic Data Centre (BODC) is part of one of the internal institutes. And external “collaborative” centres, which include: The Plymouth Marine Laboratory The National Oceanographic Centre, Southampton The National Centre for Atmospheric Science, NCAS, mostly embedded in Universities, but part of which is the British Atmospheric Centre (BADC) which is embedded in the CCLRC: Council for the Central Laboratories of the Research Councils –Is about to be replaced by a new entity, which might be called the “Large Facilities Research Council” NERC has seven discipline based designated data centres (including the BODC and BADC), and requires as much integration of data access as possible. –From discovery to utilisation, from genomics to ecology, from oceanography to atmospheric science, from antarctic science to British geology …

TECO-WIS, Nov British Atmospheric Data Centre British Oceanographic Data Centre Complexity + Volume + Remote Access = Grid Challenge NCAR

TECO-WIS, Nov 2006 If it’s not obvious Lots of organisations –Varying membership, and trust internally and between each other is not consistent. Lots of priorities –Not all organisations are “about” data Different internal storage structures –Data stored in variety of databases and filesystems. –Some things well documented, but not automated –Some things automated, but information content is sparse … Integrating data access non-trivial And none of that includes the important relationships with customers and collaborators!

TECO-WIS, Nov 2006 Key Components Discovery Tools Discovery Portal –Metadata Search –Direct Links to Data and Services Data Tools Slice and Dice Visualisation Manipulation Access Control Systems are resource limited Data may access may be restricted by license Metadata Structures to support all the above

TECO-WIS, Nov 2006 Standards Landscape Or two: ISO TC211 Standards, e.g –ISO 19101: Geographic information – Reference model –ISO 19103: Geographic information – Conceptual schema language –ISO 19107: Geographic information – Spatial schema –ISO 19108: Geographic information – Temporal schema –ISO 19109: Geographic information – Rules for application schema –ISO 19111: Geographic information – Spatial referencing by coordinates –ISO 19115: Geographic information – Metadata Open Geospatial Consortium Specs –Geographic Markup Language, a toolkit for building data descriptions –WMS, WCS, WFS, WPS: the Web (Map, Coverage, Feature, and Processing) services.

TECO-WIS, Nov 2006 Standards ISO 19101: Geographic information – Reference model A geospatial dataset… …consists of features and related objects… …in a defined logical structure… …delivered through services… …and described by metadata.

TECO-WIS, Nov 2006 Data Description Standards Geographic ‘features’ –“abstraction of real world phenomena” [ISO 19101] –Type or instance –Encapsulate important semantics in universe of discourse –“Something you can name” Application schema –Defines semantic content and logical structure –ISO standards provide toolkit: spatial/temporal referencing geometry (1-, 2-, 3-D) topology dictionaries (phenomena, units, etc.) –GML – canonical encoding [from ISO “Geographic information – Rules for Application Schema”]

TECO-WIS, Nov 2006 CSML: Climate Science Modelling Language Fully Featured GML Application Schema, with extensions for –External binary data (Grib, netCDF etc) –Irregular Grids, “Proper” vertical coordinate systems (both activities now on OGC and ISO standards tracks) V1.0 included seven feature types and provided only “data” modelling. V1.0 CSML tooling includes a scanner (creates CSML from netCDF files), and a parser (instantiates python objects which can be manipulated scientifically (based on the XML CSML documents).

TECO-WIS, Nov 2006 XML Parser SeeMyDENC Data Dictionary S52 Portrayal Library SENC Marine GML (NDG) Feature Types XML Biological Species Chl-a from Satellite Modelled Hydrodynamics XSLT For each XSD (for the source data) there is an XSLT to translate the data to the Feature Types (FT) defined by CSML. The FT’s and XSLT are maintained in a ‘MarineXML registry’ The FTs can then be translated to equivalent FTs for display in the ECDIS system XSLT Features in the source XSD must be present in the data dictionary. XSD XML The result of the translation is an encoding that contains the marine data in weakly typed (i.e. generic) Features XSLT Phenomena in the XSD must have an associated portrayal ECDIS acts as an example client for the data. Data from different parts of the marine community conforming to a variety of schema (XSD) Measured Hydrodynamics S-57v3 GML XML XSD XML XSD Feature described using S-57v3.1Application Schema can be imported and are equivalent to the same features in CSML’ Slide adapted from Kieran Millard (AUKEGGS, 2005) MarineXML Testbed

TECO-WIS, Nov 2006 The Concept of re-using Features Here structured XML is converted to plain ascii text in the form required for a numerical model HTML warning service pages are generated ‘on the fly’ XML can also be converted to SVG to display data graphically Here the same XML is converted to the SENC format used in a proprietary tool for viewing electronic navigation charts. All this requires agreement on standards Slide adapted from Kieran Millard (AUKEGGS, 2005)

TECO-WIS, Nov 2006 CSML Round Tripping - 1 Managing semantics UGAS GML app schema XML GML dataset instance conceptual model Conforms to New Dataset Application produces parser V1.0 (Python, Complete)

TECO-WIS, Nov 2006 CSML Round Tripping - 2 Managing data - 1 parser V1.0 V2 in development GML dataset scanner V1.0 V2 in development GML app schema XML instance CF Dataset Application produces CF

TECO-WIS, Nov 2006 CSML2: Structure “Affords” Behaviour ISO coverage class ‘Affordance’ modelled with UML > Moving beyond GML, but staying in the ISO Frame!

TECO-WIS, Nov 2006 CSML2: Related to new OGC Observations and Measurements Spec An Observation is an Event whose result is an estimate of the value of some Property of the Feature-of-interest, obtained using a specified Procedure

TECO-WIS, Nov 2006 Managing Data CF Dataset GML dataset scanner XSLT ISO19115 XML PUBLISH DECISION PROCESSES CF Dataset Define Dataset Add Information

TECO-WIS, Nov 2006 The Most Important Decision What is a dataset? Granularity too coarse: can’t find what you want – not enough information exposed. Granularity too fine: can’t find what you want – buried in unordered results.

TECO-WIS, Nov 2006 Distributed Query Options: Harvest or Crawl Distribute Query to known targets versus harvest from known targets and do local query –Timeliness versus Responsiveness Decision: NDG Discovery based on Open Archives Initiative Protocol for Metadata Harvesting –Additional Partners include NCAR, MPI-WDCC, TPAC, UK-MDIP

TECO-WIS, Nov 2006 Discovery Metadata Usage XML: Metadata store: can support a limited variety of different xml schema provided WS-interface understands them (need unique xquery for each method, schema pair)

TECO-WIS, Nov 2006 Metadata Formats Currently Supporting NASA Global Change Master Directory: Directory Interchange Format (DIF) Experimenting with: Vanilla ISO19139 Dublin Core UK Gemini V1 format Will support following ISO profiles for harvest: (eventually) UK Gemini profile WMO profile IOC profile (whenever) US FGDC profile ALL SIMULTANEOUSLY: XML Database plus appropriate xqueries

TECO-WIS, Nov 2006 Simulation in the context of ISO19139: NumSim NDG Products: NumSim

TECO-WIS, Nov 2006 NumSim Example

TECO-WIS, Nov 2006 Firefox Search Plugin

TECO-WIS, Nov 2006 International Discovery - Climate

TECO-WIS, Nov 2006 NDG “New Interface”

TECO-WIS, Nov 2006 Within Record Scrolling Down

TECO-WIS, Nov 2006 New Interfaces (No CSS as yet) SimpleAdvanced Issues: Times (forecast, paleo etc) BBOX (near poles and dateline) Semantic Vocabulary matching (exploiting a new NDG web-service providing thesaurus content, and ontology mapping)

TECO-WIS, Nov 2006 Metadata extensions and profiles ISO

TECO-WIS, Nov 2006 ISO19139 Background: Designed to exploit as much as possible of the xml-schema machinery Not designed for Humans! Advice: Use in conjunction with a clear concept of why it’s being used: Decide on dataset granularity, and use other metadata schema to describe how to use content (“A” metadata; e.g. an application schema of GML). Devise a profile with utility then: restrict, restrict, restrict. Document. Register.

TECO-WIS, Nov 2006 On Restriction ISO19139 is also about INTEROPERABILITY! Don’t follow the ISO19139 advice and produce a new schema! Ensure that your profile instances are valid vanilla ISO19139 Restrict content out-of-band, e.g. schematron, etc. Agree on how you’re going to deploy ISO19139

TECO-WIS, Nov 2006 On Extension ISO19139 is also about INTEROPERABILITY! Do follow the ISO19139 advice and produce a new schema! Do what you need for your community, but: Design so that code expecting ISO19139 instances can parse yours! Make it easy for third party code to ignore your content!

TECO-WIS, Nov 2006 Summary NDG dealing with heterogeneous environment Successful deployment of OAI with discovery metadata (There are some issues differentiating between model simulations and ordering response sets) Directly linking to and exploiting GML application schema Web Service backends make deployment easier. Communities need to be very careful how they deploy ISO19139