© University of Reading 2008www.reading.ac.uk Reading e-Science Centre 9 September 2008 Harmonization of environmental data using the Climate Science Modelling.

Slides:



Advertisements
Similar presentations
Page 1 © Crown copyright 2005 Workshop on Metadata Beijing27-29 September Report on Metadata Gil Ross (Met Office UK) WIS Working Group Geneva.
Advertisements

Routemap to derive ISO models from BUFR Why do we need both ISO and BUFR models? –The BUFR data model is very large – much larger in principle than most.
A Unified Data Model and Programming Interface for Working with Scientific Data Doug Lindholm Laboratory for Atmospheric and Space Physics University of.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
University of Leeds Department of Chemistry The New MCM Website Stephen Pascoe, Louise Whitehouse and Andrew Rickard.
Dynamic Quick View, interoperability and the future Jon Blower, Keith Haines, Chunlei Liu, Alastair Gemmell Environmental Systems Science Centre University.
An Open Source Linked Data Infrastructure for Publishing Geospatial Data Arif Shaon, Andrew Woolf, Shirley Crompton, Will Rogers, Mike Jackson, Robert.
Ocean data dissemination Jon Blower, University of Reading, UK Steve Hankin, Bob Keeley, Sylvie Pouliquen, Jeff de la Beaujardière, Edward Vanden Berghe,
The MashMyData project Combining and comparing environmental science data on the web Alastair Gemmell 1, Jon Blower 1, Keith Haines 1, Stephen Pascoe 2,
NERC Data Grid Helen Snaith and the NDG consortium …
OCLC Online Computer Library Center Two Paths to Interoperable Metadata Jean Godby, Devon Smith, Eric Childress DC-2003 September 29, 2003.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
The use of standard OGC web services in integrating distributed model, satellite and in-situ datasets Alastair Gemmell Jon Blower Keith Haines Environmental.
Lesley Bross, August 29, 2010 ArcGIS 10 add-in glossary.
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
GADS: A Web Service for accessing large environmental data sets Jon Blower, Keith Haines, Adit Santokhee Reading e-Science Centre University of Reading.
CIM – The Common Information Model in Climate Research
© MarineXML 2006 Slide 1 of 14 AGI Marine and Coastal Zone SIG – Oceans of Change ExCeL, London 22 nd March 2006, Developments in Marine Standards Keiran.
Bryan Lawrence on behalf of BADC, BODC, CCLRC, PML and SOC An Introduction to NDG concepts [ ]=
INSPIRE, EC Water Standards and WaterML Workshop Keiran Millard Keiran Millard
1 The NERC DataGrid DataGrid The NERC DataGrid DataGrid AHM 2003 – 2 Sept, 2003 e-Science Centre Metadata of the NERC DataGrid Kevin O’Neill CCLRC e-Science.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Mapping between SOS standard specifications and INSPIRE legislation. Relationship between SOS and D2.9 Matthes Rieke, Dr. Albert Remke (m.rieke,
The OpenGIS Consortium Geog 516 Presentation #2 Rueben Schulz March 2004.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
NOCS, PML, STFC, BODC, BADC The NERC DataGrid = Bryan Lawrence Director of the STFC Centre for Environmental Data Archival (BADC, NEODC, IPCC-DDC.
What is Information Modelling (and why do we need it in NEII…)? Dominic Lowe, Bureau of Meteorology, 29 October 2013.
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
Metadata Maestro Practical (and the new ‘features’ I have found)
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
1 Global Systems Division (GSD) Earth System Research Laboratory (ESRL) NextGen Weather Data Cube Chris MacDermaid October, 2010.
1 NOAA IOOS Program Data Integration Framework (DIF) Project Overview Adapted from a brief to the NOAA Data Management Committee August 6, 2008 by Jeff.
Composing workflows in the environmental sciences using Web Services and Inferno Jon Blower, Adit Santokhee, Keith Haines Reading e-Science Centre Roger.
XIth International Congress for Mathematical Geology - September 3-8, 2006 – Liège, Belgium Contribution of GeoScienceML to the INSPIRE data harmonisation.
Environmental informatics at Reading Keith Haines, Jon Blower, Alastair Gemmell, Kevin Yang, Dan Bretherton, Marc Stringer Reading e-Science Centre.
Integrating the Climate Science Modelling Language with geospatial software and services Dominic Lowe British Atmospheric Data
AUKEGGSWorkshop ANU, Canberra, 29 November 2006 Implementing CSML Feature Types in applications within the NERC DataGrid Dominic Lowe, British Atmospheric.
Geography Markup Language (GML). What is GML? – Scope  The Geography Markup Language is  a modeling language for geographic information  an encoding.
1-2-3 February 2006 –Page 1 Mersea Integrated System How to improve Access/Downloading services ? How far do we go in terms of standardization ?
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
Exporting WaterML from the Earth System Modeling Framework Xinqi Wang Louisiana State University NCAR SIParCS Program August 4, 2009.
WIGOS Data model – standards introduction.
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
© MarineXML 2005 Slide 1 of 19 Geospatial Standards for Earth System Science Data Edinburgh, Scotland, 6 th -8 th September 2005 MarineXML & MOTIIVE Keiran.
Alison Pamment 1, Steve Donegan 1, Calum Byrom 2, Oliver Clements 3, Bryan Lawrence 1, Roy Lowry 3 1 NCAS/BADC, Science and Technology Facilities Council,
Geography Markup Language (GML). GML What is GML? – Scope  The Geography Markup Language is  a modeling language for geographic information  an encoding.
Data Interoperability at the IRI: translating between data cultures Benno Blumenthal International Research Institute for Climate Prediction Columbia University.
Using Google Maps and other OpenSource GIS software for displaying geospatial data Jon Blower, Dan Bretherton, Keith Haines, Chunlei Liu, Adit Santokhee.
GO-ESSP LLNL, June 2006 CSML – Stocktake and Forward Look Andrew Woolf Dominic CCLRC Rutherford Appleton Laboratory.
ISO 191** Overview A “Family” of Standards. Resources ISO Standards Web Page – Technical.
An introduction to the MEDIN Discovery Metadata Standard.
Grid Remote Execution of Large Climate Models (NERC Cluster Grid) Dan Bretherton, Jon Blower and Keith Haines Reading e-Science Centre
1 SIMDAT Simdat Project –GTD. Meteo Activity – SIMDAT Meteo Activity OGF June 2008 Barcelona Marta Gutierrez, Baudouin Raoult, Cristina.
WaterML 2.0 Part 1 - Time Series Hydro Informatics Conference Workshop *Standardization of Water Data Exchange: WaterML2.0 and Beyond* Presented by : Open.
OGC Web Services with complex data Stephen Pascoe How OGC Web Services relate to GML Application Schema.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
1 Alison Pamment, 2 Calum Byrom, 1 Bryan Lawrence, 3 Roy Lowry 1 NCAS/BADC,Science and Technology Facilities Council, 2 Tessella plc, 3 British Oceanogrphic.
NOAA EDMC Ocean Observatories Initiative Cyberinfrastructure Karen Stocks OOI CI Data Curator University of California, San Diego Ocean Observatories.
ECMWF 24 th November 2008 Deploying secure OGC services in front of a heterogeneous data archive. Bryan Lawrence, Phil Kershaw, Dominic Lowe, and Stephen.
Reading e-Science Centre Technical Director Jon Blower ESSC Director Rachel Harrison CS Director Keith Haines ESSC Associated Personnel External Collaborations.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Flanders Marine Institute (VLIZ)
Metadata for digital long-term preservation
Session 3: Information Modelling and Information Communities
School of Information Studies, Syracuse University, Syracuse, NY, USA
Proposal of a Geographic Metadata Profile for WISE
Presentation transcript:

© University of Reading 2008www.reading.ac.uk Reading e-Science Centre 9 September 2008 Harmonization of environmental data using the Climate Science Modelling Language Jon Blower, Alastair Gemmell (Reading e-Science Centre) Andrew Woolf, Dominic Lowe, Arif Shaon (STFC e-Science Centre) Stephen Pascoe (British Atmospheric Data Centre) Keiran Millard, Quillon Harphem (HR Wallingford)

We need to integrate and compare lots of different types of data…

SSM/I HadCM3 HiGEM ERA-40 Satellite Re-analysis product Low res. Climate GCM HadCM3 Hi-res Climate GCM, New physics Putt, Gurney and Haines …for validating numerical models…

… calibrating instruments … +=

…data assimilation… Black line: control run time Green stars: observations Red line: assimilation run

Flood prediction... and making predictions Search and rescue Climate prediction

Where we are now (mostly) Separate websites for each data provider

The need for harmonization Each community has evolved its own means for presenting data: –File formats –Metadata conventions –Coordinate systems These are not usually mutually compatible … and vital metadata can be missing No widely-accepted standards exist for certain types of data Hence scientists spend lots of time dealing with low-level technical issues Need a common view onto all these datasets

Open Geospatial standards Aim to describe all geographic data XML encoding –Geography Markup Language Web Services for data exchange Rooted in international standards Mandated by European INSPIRE directive But fiendishly complex Evolved from map-oriented systems –Vertical and temporal information not handled cleanly

Bridging the gap: CSML Climate Science Modelling Language –Abstract data model defined using ISO/OGC approach –XML encoding based upon GML Adapts open geospatial standards to environmental science data –“Best of both worlds” Wraps existing data –Doesn’t expect providers to convert data Data are seen as geographical “features”, not as a file system

Selected CSML Feature Types PointSeriesFeature (timeseries at a point) ProfileFeature (vertical profile at a point) GridSeriesFeature (series of multidimensional grids) SwathFeature (single satellite sweep) SectionFeature (vertical section) Feature Types are classified by their geometry

Harmonizing 2 databases using CSML Different data providers, different internal representation –Met Office “MIDAS” dataset –“Environmental Change Network” dataset Modelled both databases as collections of CSML PointSeriesFeatures Allowed sharing of plotting and analysis tools –CSML-XML documents converted to maps, plots and KML Intermediate step via XML not necessary in ideal world

Java-CSML Need reusable libraries to apply CSML more widely Aim is to reduce cost of developing data-driven applications Interoperates with other means of modelling data in Java: –GeoAPI, Common Data Model High-level analysis/visualization routines completely decoupled from low-level data access

Java-CSML: Design attempts 1.Transform CSML’s XML schema to Java code using automated tool Led to very deeply-nested code 2.Based upon OGC-sponsored GeoAPI Incomprehensible unless very familiar with ISO standards GeoAPI is a moving target 3.Based on well-known Java concepts Accessible to “typical” Java programmer Compatibility with other data models assured through wrappers Insulated against inevitable changes to standards More code needs to be written by Java-CSML designers Less code needs to be written by users

Java-CSML Application 1: Coastal oceanography decision support system Red line: Smartbuoy data Blue dots: model output

Behind the scenes Smartbuoys (via Web Feature Service) Physical model (via NetCDF files) Biological model (via OPeNDAP server) Java-CSML wrappers Java-CSML Plotting routines

Java-CSML Application 2: Atmospheric ozone Control run Assimilation run

Specializing CSML Features A generic data model can’t encode all possible metadata without becoming extremely complex In CSML generic feature types can be specialized –cf. object-oriented inheritance Hence core data model retains simplicity ProfileFeature ArgoProfileFeature int qualityFlag

Java-CSML Application 3: Ocean data assimilation ArgoProfileFeatureProfileFeature Red lines: Argo data Blue lines: model output

Summary CSML bridges gap between bottom-up (science) and top-down (GIS) approaches to modelling data –Wraps existing data holdings Data modelled as Feature Types distinguished by geometry and “sensible plotting” –Complexity managed through feature inheritance Doesn’t attempt to model everything! –Other technologies deal with discovery, provenance, security… Java-CSML framework allows data intercomparison applications to be built quickly –Automates tedious and error-prone tasks

Wider lessons “Interoperable” data formats not necessarily suitable for storage –Because no single data model can satisfy every application –Abstraction usually leads to data loss! Trade-offs between scope and complexity –Don’t attempt to put everything in one specification Symbiotic relationship between standards, tools and applications –Must be developed in parallel