Where Should the GALION Data Reside? Centrally or Distributed? Introduction to the Discussion Fiebig, M.; Fahre Vik, A. Norwegian Institute for Air Research.


User Demands for GALION Data Management
- Data should be easy to find and accessible via one common location.
- Data should be searchable by location, time window, parameter, …
- A plotting and browsing tool should allow online comparison.
- Data should be downloadable in a homogeneous format, with the option for the user to select among a few commonly used formats.
- Data should be of homogeneous high quality, including detailed documentation of processing steps for assessing comparability.
- Different applications require different proximity to the raw measurement.
- Data should include a measure of uncertainty and variability.
- Data should be available in near-real-time (crisis management, forecast, …) -> one location, one format!
- Option for aggregating datasets into climatologies.
- …

Current Strategy for Data Management in GALION
- At least one common point of access for the common data pool.
- Responsibility for QA and long-term availability remains with the contributing institutions / networks.
- Features of the common access portal:
  - Holds access metadata from all contributing stations, i.e. dates, times, and type of measurements.
  - Allows search with criteria such as network, date, location, …
  - Browsing / quicklook of data.
  - Link to download from the original location.
  - Tools for format conversion.
  - Control of access rights.
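The portal described above holds only access metadata and links back to the original repositories. A minimal sketch of such a search, assuming a hypothetical record layout (the field names, station names, and example.org URLs below are illustrative and not taken from GAWSIS or any GALION portal), could look like this:

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class AccessRecord:
    """Hypothetical access-metadata entry; all field names are illustrative."""
    station: str
    network: str
    parameter: str
    lat: float
    lon: float
    start: date
    end: date
    url: str  # link to the dataset at its original location


def search(records, network=None, parameter=None, window=None, bbox=None):
    """Return the records matching every criterion that is not None.

    window is (first_day, last_day); bbox is (south, west, north, east).
    """
    hits = []
    for r in records:
        if network is not None and r.network != network:
            continue
        if parameter is not None and r.parameter != parameter:
            continue
        # Keep records whose coverage overlaps the requested time window.
        if window is not None and (r.end < window[0] or r.start > window[1]):
            continue
        if bbox is not None and not (
            bbox[0] <= r.lat <= bbox[2] and bbox[1] <= r.lon <= bbox[3]
        ):
            continue
        hits.append(r)
    return hits


# Made-up catalogue entries, for illustration only.
catalogue = [
    AccessRecord("Station A", "NET1", "backscatter", 60.0, 10.0,
                 date(2008, 1, 1), date(2009, 12, 31), "https://example.org/a"),
    AccessRecord("Station B", "NET2", "extinction", 40.0, 15.0,
                 date(2010, 1, 1), date(2010, 6, 30), "https://example.org/b"),
]

for record in search(catalogue, network="NET1", window=(date(2009, 6, 1), date(2009, 6, 30))):
    print(record.station, record.url)
```

The data themselves never pass through this function; only the link to the original location is returned, which is what keeps QA and long-term availability with the contributing institutions.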

Solution 1: GAWSIS as Data Discovery Portal

GAWSIS Features
- Data directory encompassing all GAW data centres, holds access metadata.
- Search data availability by country, network, station name, station ID, station type, and parameter.
- Map visualisation of availability.
- Station page with station metadata and list of available datasets.
- Link to original repository, direct link to dataset if available.
- Functionality similar to a Global Information System Centre (GISC) in the WMO Information System (WIS) concept.
- GAWSIS plans include WIS compliance (once that is defined) and a plotting tool.

Solution 2: EARLINET-ASOS Database and Portal

EARLINET-ASOS Database Features
- Search all EARLINET-ASOS data by date, daytime, season, station, event category, parameter.
- Select and download data (NetCDF format).
- Plotting, browsing, and comparing functions.
- The EARLINET-ASOS database will be part of the ACTRIS distributed database, which is planned to be WIS compliant (when we know what that means).
- ACTRIS: EU FP7 project that will network European ground-based in situ and lidar aerosol observations, cloud property observations, and reactive trace gas observations.

Solution 3: GEOmon Distributed Database
- Data discovery portal holding access metadata.
- Data may be searched by parameter, station, home database, type (in situ, remote sensing, simulation), platform, matrix, geolocation, altitude, temporal availability.
- Portal links to the individual dataset where possible, to the database homepage otherwise.
- Will be developed into the entry portal of the ACTRIS distributed database.

Distributed Data Architecture Pros & Cons
Pros:
- Institutions / networks keep control over data access, data quality, and long-term availability, and maintain visibility.
- Know-how on measurement principle and data management is combined for tailored solutions.
Cons:
- All institutions / networks have to maintain server infrastructure (file archive, metadata server, webservice, WIS compliance, …).
- Well-defined formats are essential for smooth interoperability. Implementing on-the-fly conversion of dozens of formats would be a resource drain and a predefined vulnerability.
- Near-real-time dissemination with uniform QA is almost impossible to implement.
- Long-term availability is not ensured.

Centralised Data Architecture Pros & Cons
Pros:
- Server infrastructure needs to be maintained only once / a few times (economy of scale).
- Long-term availability is ensured.
- Easy to ensure homogeneous data formatting and quality; frequent reformatting is not necessary.
- Almost the only option for implementing an NRT service with homogeneous automated QA.
Cons:
- Somewhat less visibility of the individual institution / network.
- Institution(s) hosting the data centre(s) need to ensure access management.
- Institution(s) hosting the data centre(s) also need experimental expertise.

Well-Defined Common Data Formats are Essential for any Data Architecture
- Data format is more than just selecting NASA-Ames, NetCDF, …
- Needs to include: an implementation profile for the format standard and a defined vocabulary, i.e. which parameters / metadata are included in what unit and how they are named, which processing steps were conducted (all self-explaining), and flags to indicate special conditions.
- Example: EUSAAR data formats (all NASA-Ames 1001):
  - Level 0: annotated, instrument-specific raw data, "native" time resolution.
  - Level 1: processed to final physical variable, original time resolution.
  - Level 1.5: automatically aggregated to (hourly) averages, includes uncertainty for averaging period.
  - Level 2: same as level 1.5, but manually quality assured.
- Well-defined common processing steps between levels establish traceability.
- Well-defined formats don't limit usability of data, but make routine work more efficient.
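The step from level 1 to level 1.5 listed above can be sketched as a simple hourly aggregation. The slide does not say how the uncertainty for the averaging period is computed, so the sample standard deviation used here is an assumption chosen for illustration, and the function name and tuple layout are hypothetical rather than part of any EUSAAR specification.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean, stdev


def aggregate_hourly(samples):
    """Aggregate (timestamp, value) pairs into hourly statistics.

    Returns {hour_start: (mean, spread, n)}, where spread is the sample
    standard deviation (an assumed stand-in for the uncertainty measure)
    or NaN when the hour holds a single sample.
    """
    buckets = defaultdict(list)
    for ts, value in samples:
        # Truncate the timestamp to the start of its hour.
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[hour].append(value)

    result = {}
    for hour in sorted(buckets):
        values = buckets[hour]
        spread = stdev(values) if len(values) > 1 else float("nan")
        result[hour] = (mean(values), spread, len(values))
    return result


# Made-up level 1 samples at original time resolution.
level1 = [
    (datetime(2010, 9, 20, 10, 5), 1.0),
    (datetime(2010, 9, 20, 10, 35), 3.0),
    (datetime(2010, 9, 20, 11, 10), 5.0),
]

for hour, (avg, spread, n) in aggregate_hourly(level1).items():
    print(hour.isoformat(), avg, spread, n)
```

Keeping the sample count alongside the average lets a later manual QA step (level 2) see how well each hour was covered.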

Efficient Use of Project Resources: GAW aerosol NRT
Data flow, station reporting directly:
- Station: auto-creates hourly data files (level 0), initiates auto-upload to NRT server.
- FTP transfer to data centre, automatic feedback.
Data flow, station reporting via sub-network:
- Station: collects raw data in custom format, transfers them to the sub-network data centre.
- Sub-network data centre: auto-creates hourly data files (level 0), initiates auto-upload to NRT server.
- FTP transfer to data centre, automatic feedback.
Data Centre:
- Check for correct data format (level 0).
- Check whether data stay within specified boundaries (sanity check).
- Processing to level 1 -> hourly level 1 data file.
- Processing to level 1.5 -> hourly level 1.5 data file.
- EBAS database.
User access:
- (restricted) via web interface: ebas.nilu.no
- via machine-to-machine web service.
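The two data centre checks on incoming level 0 files (correct format, values within specified boundaries) might be sketched as follows. The record layout, field names, and boundary values are hypothetical and serve only to illustrate the idea; they are not the checks actually run by the NRT server.

```python
def check_level0(record, required_fields, bounds):
    """Return a list of problems found in one record; an empty list means it passes.

    record          -- dict of field name -> value (hypothetical layout)
    required_fields -- names that must be present (format check)
    bounds          -- {field: (low, high)} limits for the sanity check
    """
    problems = []
    for name in required_fields:
        if name not in record:
            problems.append(f"missing field: {name}")
    for name, (low, high) in bounds.items():
        value = record.get(name)
        if value is None:
            continue  # absence is already reported by the format check
        if not (low <= value <= high):
            problems.append(f"{name}={value} outside [{low}, {high}]")
    return problems


# Illustrative configuration and records; all values are made up.
REQUIRED = ["start_time", "flow_rate", "temperature"]
BOUNDS = {"flow_rate": (0.5, 2.0), "temperature": (250.0, 320.0)}

good = {"start_time": "2010-09-20T10:00", "flow_rate": 1.0, "temperature": 293.0}
bad = {"start_time": "2010-09-20T11:00", "flow_rate": 3.5}

for rec in (good, bad):
    issues = check_level0(rec, REQUIRED, BOUNDS)
    print(rec["start_time"], "OK" if not issues else issues)
```

The returned problem list could feed the automatic feedback to the station, so that a malformed file is reported within the same hourly cycle.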

How Do You Access the Data?

NRT-Example: Auto-Processed DMPS data