NIEeS Workshop, Cambridge (UK), Sep 2002 Luca Cinquini for the Earth System Grid METADATA DEVELOPMENT for the EARTH SYSTEM GRID Luca Cinquini (SCD/NCAR)

Presentation transcript:

METADATA DEVELOPMENT for the EARTH SYSTEM GRID
Luca Cinquini (SCD/NCAR) for the Earth System Grid collaboration

Metadata-centric view of ESG services
[Diagram labels]
- METADATA SERVICES
- Services: USER AUTHENTICATION AND AUTHORIZATION; DATA TRANSPORT; SYSTEM MONITORING AND CONTROL; DATA SEARCH & DISCOVERY; DATA ANALYSIS & VISUALIZATION; DATA BROWSING
- Metadata types: ACCESS AND AUTHORIZATION METADATA; LOCATION METADATA; LOGGING METADATA; CONTENT METADATA; ANNOTATION & HISTORY METADATA; AGGREGATION METADATA; CATALOGUING METADATA

ESG Metadata Services: Goal and Functionality
Goal: services responsible for the creation, management and utilization of metadata associated with geophysical data
Functionality:
- Metadata extraction (automatically, from files in different formats and according to various possible metadata standards)
- Metadata conversion (from one standard to another)
- Metadata aggregation (associated with data collections)
- Metadata annotation (manually by humans)
- Metadata validation (basic quality control of metadata)
- Registration (population of metadata holdings)
- Harvesting (combination of metadata from different repositories)
- Metadata browsing and display (for humans)
- Search and discovery of data through metadata
- Metadata query (by agents or clients for data analysis and visualization)

ESG Metadata Services: Architecture
Three-layer architecture:
- Metadata Holdings: physical metadata content, stored in a system of relational and/or native XML databases
- Core Metadata Services: modules and libraries that mediate all access to the Metadata Holdings (insert, update, delete, query); they expose an API that hides the specific implementation of the databases and query languages
- High Level Metadata Services: system of applications that use the Core Metadata Services to fulfill a specific atomic functionality; these are invoked by external clients
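The layering described on this slide can be illustrated with a minimal sketch. It is not the ESG implementation: the class names, the in-memory backend, and the record keys are all hypothetical, chosen only to show how a core API can hide the storage backend from a high-level service.

```python
from abc import ABC, abstractmethod


class MetadataHoldings(ABC):
    """Bottom layer: a physical metadata store (relational or XML database)."""

    @abstractmethod
    def insert(self, key: str, record: dict) -> None: ...

    @abstractmethod
    def delete(self, key: str) -> None: ...

    @abstractmethod
    def query(self, **criteria) -> list: ...


class InMemoryHoldings(MetadataHoldings):
    """Hypothetical stand-in backend, used only to keep the sketch runnable."""

    def __init__(self):
        self._records = {}

    def insert(self, key, record):
        self._records[key] = dict(record)

    def delete(self, key):
        self._records.pop(key, None)

    def query(self, **criteria):
        # Return the keys of all records matching every criterion.
        return sorted(
            key for key, rec in self._records.items()
            if all(rec.get(k) == v for k, v in criteria.items())
        )


class CoreMetadataService:
    """Middle layer: the only code that talks to the holdings directly."""

    def __init__(self, holdings: MetadataHoldings):
        self._holdings = holdings

    def insert(self, key, record):
        self._holdings.insert(key, record)

    def update(self, key, record):
        # Replace the stored record with the new one.
        self._holdings.delete(key)
        self._holdings.insert(key, record)

    def delete(self, key):
        self._holdings.delete(key)

    def query(self, **criteria):
        return self._holdings.query(**criteria)


def search_and_discovery(core: CoreMetadataService, **criteria) -> list:
    """Top layer: one atomic function built only on the core API."""
    return core.query(**criteria)


core = CoreMetadataService(InMemoryHoldings())
core.insert("ccsm/run1", {"model": "CCSM", "variable": "T"})
core.insert("ccsm/run2", {"model": "CCSM", "variable": "P"})
print(search_and_discovery(core, model="CCSM"))
```

Swapping InMemoryHoldings for a relational or XML backend would leave search_and_discovery unchanged, which is the point of the middle layer.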

The Earth System Grid
- ORNL: Climate storage & computational resources
- LANL: Next generation coupled models & computing
- ANL: Computational grids & grid-based applications
- USC/ISI: Computational grids & grid-based applications
- NCAR: Climate change prediction and scenarios
- LBNL: Climate storage facility
- LLNL: Model diagnostics & inter-comparison

The Earth System Grid
- "Synergistic collaboration" among several US national labs and research centers (ANL, ISI, LBNL, LLNL, NCAR, ORNL) involved in atmospheric science and scientific computing
- 3-year project funded by the DOE Scientific Discovery through Advanced Computing (SciDAC) program
- Goal: build the next-generation computational and data management environment for the geosciences: a system of geographically distributed data and computational centers that will give earth scientists seamless access to data repositories, analysis tools and computational resources
- Strategy: application and extension of Grid technologies (and other IT innovations) to the geosciences
- Initial focus on next-generation climate model data (CCSM)

ESG areas of development
- Authentication and Authorization services: application of Globus technologies for secure data management and access (PKI certificates, proxy delegation, Community Authorization Service, web interfaces)
- Data Transport Services: based on the GridFTP protocol and implementation (high speed, tunable, multi-stream, reliable), with extensions for multi-file management and connection to offline storage systems (Hierarchical Storage Management), and for transparent data access and operations (grid-enabled DODS)
- Metadata services (for data management, access, search & discovery, annotation, analysis, etc.)
- Other services: Data Analysis and Visualization, Task Management, Monitoring and Control, etc.

ESG Metadata Services architecture
[Diagram labels, grouped by layer]
- ESG CLIENTS API & USER INTERFACES: PUBLISHING; SEARCH & DISCOVERY; ADMINISTRATION; BROWSING & DISPLAY; ANALYSIS & VISUALIZATION
- HIGH LEVEL METADATA SERVICES: METADATA EXTRACTION; METADATA DISPLAY; METADATA BROWSING; METADATA SEARCH, QUERY & DISCOVERY; METADATA ANNOTATION; METADATA VALIDATION; METADATA AGGREGATION; METADATA CONVERSION; METADATA & DATA REGISTRATION
- CORE METADATA SERVICES: METADATA ACCESS (update, insert, delete, query); SERVICE TRANSLATION LIBRARY
- METADATA HOLDINGS: Replica Location Services; Metadata Cataloguing Services; XML DB; THREDDS catalogs

ESG Metadata Services: Current Development
Currently developing or evaluating the following technologies:
- Replica Location Services: database to manage and index multiple copies of the same data stored at different centers
- Metadata Cataloguing Services: relational database to store scientific metadata (developed for high energy physics and geophysical data)
- Native XML databases (Apache Xindice)
- THREDDS (by Unidata): system for hierarchical cataloguing of datasets and associated metadata
- NcML (Netcdf Markup Language): XML language for encoding of metadata associated with data in netcdf format
- (and more…)
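As a rough illustration of what a replica index does (this is not the Replica Location Services API), the sketch below maps one logical file name to the physical copies registered by different centers. The class, its methods, the file name and the example.org URLs are all hypothetical.

```python
from collections import defaultdict


class ReplicaIndex:
    """Maps a logical file name to the physical copies held at different centers."""

    def __init__(self):
        self._replicas = defaultdict(set)

    def register(self, logical_name: str, physical_url: str) -> None:
        # Record one more physical copy of the same logical file.
        self._replicas[logical_name].add(physical_url)

    def unregister(self, logical_name: str, physical_url: str) -> None:
        self._replicas[logical_name].discard(physical_url)

    def locate(self, logical_name: str) -> list:
        # Unknown names simply have no replicas.
        return sorted(self._replicas.get(logical_name, ()))


index = ReplicaIndex()
# Hypothetical logical name and replica locations.
index.register("ccsm.run1.T.nc", "gsiftp://site-a.example.org/data/ccsm.run1.T.nc")
index.register("ccsm.run1.T.nc", "gsiftp://site-b.example.org/archive/ccsm.run1.T.nc")
print(index.locate("ccsm.run1.T.nc"))
```

A client would then choose among the returned locations, for example by network proximity, before starting a transfer.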

ESG Metadata Policy
- Premise: the geophysical sciences are too broad and complex to impose a single, all-encompassing metadata standard that captures the relevant information for all datasets, projects, instruments and scientists
- ESG will not mandate use of any metadata schema or convention
- Allow data providers and scientists to use their metadata of choice; provide technologies and tools to store and access metadata through common services (MCS, XML DB, THREDDS catalogs)
- Encourage development and reuse of a limited set of domain-specific standards (climate data, radar data, airborne instrumentation, etc.), encoding in XML (according to community-developed schemas), and interoperability and combination of schemas (XML namespaces, RDF, ontologies)
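One way to picture the "combination of schemas" point is a single XML record that carries elements from two domain vocabularies, kept apart by XML namespaces. The sketch below uses Python's standard xml.etree.ElementTree; the namespace URIs, prefixes, element names and values are invented for illustration and do not correspond to any ESG schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespaces for two domain-specific metadata vocabularies.
CLIMATE_NS = "http://example.org/climate"
RADAR_NS = "http://example.org/radar"
ET.register_namespace("clim", CLIMATE_NS)
ET.register_namespace("radar", RADAR_NS)

# One record mixing elements from both vocabularies without name clashes.
record = ET.Element("dataset", name="example")
ET.SubElement(record, f"{{{CLIMATE_NS}}}model").text = "CCSM"
ET.SubElement(record, f"{{{RADAR_NS}}}instrument").text = "example-radar"

xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)

# A consumer that only understands the climate vocabulary reads its own
# elements and ignores the rest.
parsed = ET.fromstring(xml_text)
model = parsed.find(f"{{{CLIMATE_NS}}}model").text
print(model)
```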

Netcdf Markup Language (NcML)
Work in progress, a collaboration between ESG, Unidata and the University of Florence
Definition: XML representation for data following the netcdf model
Features:
- Express metadata associated with data in netcdf format
- Definition of coordinates and coordinate systems (capturing netcdf conventions)
- Aggregation/subsetting
- Definition of new data, restructuring of existing data (virtual datasets)
- Interoperability with openGIS and ISO
- Also, possibly extend the model to other data formats (HDF, Grib, etc.)
Strategy: develop a system of XML schemas, each covering a specific domain (advantages: more flexible, maintainable and extensible). Keep it simple!

NcML: schemas architecture
- Netcdf core (generic netcdf data)
- Netcdf Coordinate Systems (netcdf conventions for coord, coord systems)
- Netcdf (virtual) dataset (operations on data)
- Netcdf Geo Coordinate Systems (geo-referenced coord systems)
- openGIS-ISO Reference Coordinate Systems
- Other schemas for openGIS-ISO

NcML: core schema
- For XML encoding of metadata (and data) of any generic netcdf file
- Objects: Netcdf, Dimension, Variable, Attribute
- Beta version reference implementation as Java library

Example: two-dimensional latitude, longitude coordinate variables (CDL)

dimensions:
    xc = 128;
    yc = 64;
    lev = 18;
variables:
    float T(lev,yc,xc);
        T:long_name = "temperature";
        T:units = "K";
        T:coordinates = "lon lat";
    float xc(xc);
        xc:long_name = "x-coordinate in Cartesian system";
        xc:units = "m";
    float yc(yc);
        yc:long_name = "y-coordinate in Cartesian system";
        yc:units = "m";
    float lev(lev);
        lev:long_name = "altitude levels";
        lev:units = "km";
    float lon(yc,xc);
        lon:long_name = "longitude";
        lon:units = "degrees_east";
    float lat(yc,xc);
        lat:long_name = "latitude";
        lat:units = "degrees_north";
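To give a feel for how this CDL header might look once encoded with the four core objects (Netcdf, Dimension, Variable, Attribute), here is a small sketch using Python's standard xml.etree.ElementTree. The variable element with name, shape and type attributes follows the fragment shown elsewhere in this talk; the dimension length attribute and the attribute name/value pair are assumptions about the schema, and the namespace prefix is omitted because the transcript does not give its URI. Only part of the CDL header is encoded to keep the sketch short.

```python
import xml.etree.ElementTree as ET

# A subset of the CDL header above, as plain Python data.
dimensions = {"xc": 128, "yc": 64, "lev": 18}
variables = [
    ("T", "float", "lev yc xc",
     {"long_name": "temperature", "units": "K", "coordinates": "lon lat"}),
    ("lon", "float", "yc xc",
     {"long_name": "longitude", "units": "degrees_east"}),
    ("lat", "float", "yc xc",
     {"long_name": "latitude", "units": "degrees_north"}),
]


def to_ncml(dimensions, variables):
    """Build an NcML-like tree: netcdf > dimension / variable > attribute."""
    root = ET.Element("netcdf")
    for name, length in dimensions.items():
        # "length" is an assumed attribute name for the dimension size.
        ET.SubElement(root, "dimension", name=name, length=str(length))
    for name, dtype, shape, attrs in variables:
        var = ET.SubElement(root, "variable", name=name, type=dtype, shape=shape)
        for attr_name, value in attrs.items():
            # name/value is an assumed encoding for netcdf attributes.
            ET.SubElement(var, "attribute", name=attr_name, value=value)
    return root


root = to_ncml(dimensions, variables)
print(ET.tostring(root, encoding="unicode"))
```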

NcML core schema


NcML: coordinate systems schema
Generalization and unification of netcdf conventions for coordinates and coordinate systems

Coordinate Systems extension to NcML


Coordinate Systems extension to NcML
<nc:variable name="T" shape="lev yc xc" type="float" coordinateSystems="implicit geo pressure">

Aggregation in NcML
XML is naturally suited to represent aggregation of netcdf data
Rules for representing an aggregation hierarchy:
- Allow netcdf nodes to contain other netcdf nodes
- Factor out (i.e. into the parent netcdf node) all structure common to two nodes
- Structure defined in a netcdf node overrides that defined in a parent netcdf node
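The three rules amount to inheritance with override along the node hierarchy. The sketch below works on plain dictionaries rather than real NcML documents; the "structure" and "children" keys, the file names and the attribute values are hypothetical.

```python
def resolve(node: dict, inherited: dict = None) -> list:
    """Return the effective structure of every leaf netcdf node.

    Each node inherits the structure factored out into its ancestors,
    and anything it defines itself overrides the inherited value.
    """
    effective = dict(inherited or {})
    effective.update(node.get("structure", {}))  # child overrides parent
    children = node.get("children", [])
    if not children:
        return [effective]
    leaves = []
    for child in children:
        leaves.extend(resolve(child, effective))
    return leaves


# Parent node holds the common structure; children hold only what differs.
aggregation = {
    "structure": {"shape": "lev yc xc", "units": "K"},
    "children": [
        {"structure": {"source": "jan.nc"}},
        {"structure": {"source": "feb.nc", "units": "degC"}},
    ],
}

for leaf in resolve(aggregation):
    print(leaf)
```

The second child keeps the shared shape from the parent but replaces the inherited units with its own value.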

NcML aggregation over existing coordinate (time)

NcML aggregation over variables

NcML double aggregation


Other NcML planned development
- Subsetting of data
- Computation of derived data
- Extensions for interoperability with openGIS and ISO standards:
  - Establish a bond between the atmospheric research and geo-spatial communities
  - Allow import of NcML data into GIS tools and export of GIS data in netcdf format

Conclusions
- ESG is very active in the research and development of metadata schemas, services and technologies
- We are very interested in collaborating with other projects and institutions on the definition and adoption of metadata standards for the geosciences, and in working on interoperability technologies among standards