Server-side Analysis and a Semantic Framework for Metadata M. Benno Blumenthal International Research Institute for Climate and Society Columbia University.

Data Analysis as a Service: The Data Library's open data model and its ability to create networks of virtual web pages and other web resources lead to some powerful applications. As datasets become more complicated and difficult to handle, systems that hide that complexity and facilitate analysis become more essential. Metadata and its transforms are essential. Archived but accessible data are ever more important.

Complexity pervades (*)

Overview: IRI Data Collection (Dataset, Variable, ivar; multidimensional); Generalized Data Tools (Data Viewer, Data Language); Specialized Data Tools (Maproom); URL/URI for data, calculations, figures, etc.

IRI Data Collection: data by geolocation type [diagram labels: Dataset, Variable, ivar, multidimensional; Economics, Public Health: "geolocated by entity"; GIS: "geolocation by vector object or projection metadata"; Ocean/Atm: "geolocated by lat/lon"; multidimensional, spectral harmonics, equal-area grids, GRIB grid codes, climate divisions]

IRI Data Collection: data by format [diagram labels: Dataset, Variable, ivar; Servers: OpenDAP, THREDDS; GRIB, netCDF, images, binary; Database: Tables, queries; spreadsheets, shapefiles, images w/proj]

IRI Data Collection: data as services [diagram labels: Dataset, Variable, ivar; Calculations: "virtual variables", images, graphics, descriptive and navigational pages; OpenGIS: WMS/WCS; KML; Data Files: netcdf, binary, images; Clients: OpenDAP, THREDDS, Tables; Servers: OpenDAP, THREDDS; GRIB, netCDF, images, binary; Database: Tables, queries; spreadsheets, shapefiles, images w/proj]

IRI General Data Tools Data page

IRI General Data Tools Data viewer

IRI General Data Tools: Calculations: svd (link: svdview) (link: svd results dataset) (link: svd documentation)

svd program

IRI General Data Tools: Calculations: Cluster Analysis (link: cluster view) (link: cluster results dataset) (link: k-means fn)

IRI General Data Tools: WMS and KML: land cover (link: figure page)

IRI General Data Tools: WMS and KML: precipitation (link: figure page)

IRI Map Room Maproom Animation

IRI Map Room: Malaria Early Warning System. The front page illustrates the most recent dekadal rainfall estimates (FEWS RFE). Change dates to view different time periods. Administrative and epidemiological overlays are available. Click and drag a box across the map to zoom.

IRI Map Room: MEWS Time Series Analyses. STEP 1: Select the size of the domain for analysis: Administrative District OR Box (11 km, 33 km, 55 km, 111 km). STEP 2: Select the location for which the analysis will be created.

MEWS Time Series Analyses IRI Map Room

The MEWS tool transparently interrelates the three geospatial models: dekadal precipitation (longitude, latitude, time); district outlines; and time series for districts, generated on the fly from the first two.
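The on-the-fly step can be pictured as masking the gridded field by district membership and averaging per time step. The following is a minimal sketch in plain Python, not the Data Library's implementation: the grid, the rasterized district mask, and the names `precip`, `district_of_cell`, and `district_time_series` are all hypothetical, and a real tool would likely also weight cells by area and handle partial overlap with district outlines.

```python
from collections import defaultdict

def district_time_series(precip, district_of_cell):
    """Average a gridded field over each district for every time step.

    precip: list over time of 2-D grids (rows = latitude, cols = longitude).
    district_of_cell: 2-D grid of district names (None = outside all districts).
    Returns {district: [mean at t0, mean at t1, ...]}.
    """
    series = defaultdict(list)
    for grid in precip:
        totals = defaultdict(float)
        counts = defaultdict(int)
        for i, row in enumerate(grid):
            for j, value in enumerate(row):
                district = district_of_cell[i][j]
                if district is not None:  # skip cells outside every district
                    totals[district] += value
                    counts[district] += 1
        for district in counts:
            series[district].append(totals[district] / counts[district])
    return dict(series)

# Hypothetical 2x3 grid, two dekads, two districts (assumed data, for illustration only).
district_of_cell = [["A", "A", "B"],
                    ["A", None, "B"]]
precip = [
    [[3.0, 6.0, 10.0], [9.0, 99.0, 20.0]],
    [[0.0, 0.0, 4.0], [3.0, 99.0, 8.0]],
]
result = district_time_series(precip, district_of_cell)
print(result)
```

The cell marked `None` lies outside both districts, so its value (99.0) never enters any average.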

Data Flow based Analysis with explicit semantics Results data analysis Data analysis Semantic Web

Faceted Search (link)

Models, Crosswalks, and Objects in a single RDF/OWL framework s/

Standard metadata schema [diagram: several communities, each with its own Tools, Users, Datasets, and Standard Metadata Schema, each connected through RDF to a central RDF Data Model Exchange]

IRI RDF Architecture [diagram labels: Data Servers; Ontologies; MMI; JPL; Standards Organizations; Start Point; RDF/XML-Schema Crawler; XSLT/GRDDL ingest; XML Schema to OWL translation; OWL Semantics; SWRL Rules; SeRQL CONSTRUCT; Search Queries; Location Canonicalizer; Time Canonicalizer; Sesame; Search Interface; bibliography]

Semantic Crosswalk for metadata translation

Semantic metadata translation: maproom to GCMD DIF

Sample GCMD DIF-CD Record

OpenDAP CF to WCS Service

Function Documentation (*)

Function Semantics: Used to generate function documentation. Basis for more extensive function semantics. Eventually we would like to use them to generate workflows. Currently working with SSWAP to ensure that it can describe these workflow steps, e.g. variable to transformed variable, and variable to figure to image file.

SSWAP (Simple Semantic Web Architecture and Protocol): A way of providing a service that semantically describes its domain and range to advertise it. To invoke it, both domain and range are restricted. Traditionally we specify a chain of processing steps, and provenance documents that effort. SSWAP specifies an object by constraining it: you could specify its provenance to get it "traditionally", or specify some other quality.
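The idea of services that advertise a domain and a range, and of obtaining an object by constraining what it must be, can be illustrated with a small sketch. This is not the SSWAP protocol itself: it assumes each service is reduced to a pair of type names, and every service and type name below is hypothetical. Given a starting kind of object and a wanted kind, a breadth-first search finds the shortest chain of services whose ranges and domains line up.

```python
from collections import deque

# Hypothetical service descriptions: name -> (domain, range).
SERVICES = {
    "anomaly": ("variable", "transformed_variable"),
    "plot": ("transformed_variable", "figure"),
    "render_png": ("figure", "image_file"),
    "to_table": ("variable", "table"),
}

def find_pipeline(services, have, want):
    """Return the shortest list of service names turning `have` into `want`, or None."""
    queue = deque([(have, [])])
    seen = {have}
    while queue:
        kind, steps = queue.popleft()
        if kind == want:
            return steps
        for name, (domain, range_) in services.items():
            # A service applies when its advertised domain matches what we hold.
            if domain == kind and range_ not in seen:
                seen.add(range_)
                queue.append((range_, steps + [name]))
    return None

pipeline = find_pipeline(SERVICES, "variable", "image_file")
print(pipeline)
```

Here the caller states only the constraint (an image file derived from a variable); the chain of steps is assembled from the advertised descriptions rather than written out by hand.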

Multiplicity of Data Representations: RDF provides a unifying framework to simultaneously hold and deliver dataset metadata according to multiple standards. "Models, Crosswalks, and Objects" organizes that framework, clarifying the semantic distance spanned. Bidirectional XML Schema to OWL translation enables delivery of inferred metadata to existing XML-based systems. Persistence with inference/transform is the underlying technology. A Semantic Service Framework could extend this framework to semantically informed workflow generation.
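As a rough illustration of holding metadata once and delivering it under several standards, the sketch below stores triples in a plain Python set and applies crosswalk rules until nothing new can be inferred. It is an assumption-laden toy, not the IRI architecture: the `ex:`, `a:`, `b:`, and `c:` prefixes, the property names, and the rule format are hypothetical, and a real system would use an RDF store with OWL or rule-based inference.

```python
# Triples are (subject, predicate, object) tuples, the basic RDF data model.
# All identifiers below are hypothetical.
facts = {
    ("ex:dataset1", "a:title", "Dekadal rainfall estimate"),
    ("ex:dataset1", "a:units", "mm"),
}

# Crosswalk rules: a property in one vocabulary implies a property in another,
# in the spirit of rdfs:subPropertyOf.
crosswalk = {
    ("a:title", "b:Entry_Title"),
    ("b:Entry_Title", "c:name"),
}

def infer(facts, crosswalk):
    """Forward-chain the crosswalk rules until a fixed point is reached."""
    closed = set(facts)
    changed = True
    while changed:
        changed = False
        for s, p, o in list(closed):
            for source, target in crosswalk:
                new = (s, target, o)
                if p == source and new not in closed:
                    closed.add(new)
                    changed = True
    return closed

def describe(triples, subject, prefix):
    """Deliver one subject's metadata as seen through a single vocabulary."""
    return {p: o for s, p, o in triples if s == subject and p.startswith(prefix)}

closed = infer(facts, crosswalk)
print(describe(closed, "ex:dataset1", "b:"))
print(describe(closed, "ex:dataset1", "c:"))
```

The title is entered once in vocabulary `a:` and becomes available in `b:` and `c:` by inference, while `a:units` stays where it is because no crosswalk rule covers it.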

21st Century data analysis: Definitive web-accessible data archives. Cloud data analysis services based on those archives. Semantic descriptions of datasets. Semantic descriptions of analysis steps. Semantic assembly of workflow pipelines. Science is about reproducibility, as are virtual dataset services. This means access to the data, access to the analysis methods, and commitment to archives.

Other Maproom Examples

I. Food Security: Application. At the request of the UN FAO, a web-based tool was created to support Desert Locust management and control. It eliminates NDVI-based error in the identification of locust habitat and adds daily and 10-day CMORPH rainfall estimates for the identification of potential breeding areas. (Michael Bell, Benno Blumenthal)

I. Human Health: Application. MODIS images (composite and NDVI) are now available through the IRI Health Maproom. The Ministry of Health in Eritrea follows NDVI indices on a regular basis and provides warnings to the sub-districts. (Michael Bell, Benno Blumenthal, John del Corral, Emily Grover-Kopec)

Fire Management: The tool was presented to CARE and the Ministry of Environment (Indonesia). Improvements and publications are in progress. (Michael Bell, Benno Blumenthal, Joshua Qian, Andy Robertson, Michael Tippett)