Data Flow & Data Services for MOSAiC

Presentation transcript:

Data Flow & Data Services for MOSAiC: Challenges from a large-scale program. Arndt Steinhage. MOSAiC Implementation Workshop, 13-16/11/2017. Stephan Frickenhaus, Hans Pfeiffenberger, AWI Computing and Data Centre.

Vision for MOSAiC Data Management: Interdisciplinary Data Collection, Collective Data Publications; Data Policy; Co-ordinated Data Flows; Data Protocol, Data Management Plan; Underway Data Science Support/Service, On-shore Data Scientist; Taking care of the MOSAiC Data Collection.

Challenges from the data point of view: Harmonizing science and infrastructure/logistics/services in interdisciplinary work. Need for a common data protocol. Need for a commitment from the groups to help organize this.

The Data Protocol: MOSAiC [2019/20] is an observatory producing meaningful scientific data. Successful data management needs the scientists' 100% commitment. The MOSAiC Data Legacy [2021] is the primary and lasting output, and the basis for science. Prerequisites for high-impact publications: common metadata and procedure standards; quality-managed and published data; consortium agreements on publication strategy. Open by default; group-specific embargo timing tbd.

Chapters of the Data Protocol: Objective; Definitions (raw data, primary data, data products, …); Data Policy, incl. fair publication rules; Data Standards; Data Management (roles and responsibilities, common data pool); Sample Management; Data Archival; Amendments and resolution of conflicts.

Data Protocol Group. AWI: Roland Neuber (Atmos), Thomas Krumpen (Ice/Snow), Allison Fong (Eco, Bio-Sampling), Ellen Damm (BGC), Ben Rabe (Ocean), Andreas Herber (airborne), Amelie Driemel (PANGAEA), Peter Gerchow, Angela Schäfer, Hans Pfeiffenberger, Ingo Schewe (Logistics, Data Flow/Data Services/Data Science). Further members tbd during breakout! Gunnar Spreen (remote sensing), …

Expected Contributions from Groups: Initial work. One person per group for the Data Protocol, available for regular telcos. Device lists (until next spring meeting): responsible person; sensor type; transport through POLARSTERN Satcom | access through home institution, commercial processor, …; data volumes, data frequencies, near-real-time needs, data processing tools, shore-to-ship needs? Embargo periods. … (input from breakouts)
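A device list of the kind requested above could be captured as one structured record per device. The following is only a minimal sketch: the field names, the helper function, and all example values are hypothetical placeholders, not part of the MOSAiC Data Protocol.

```python
from dataclasses import dataclass


@dataclass
class DeviceEntry:
    """One row of a group's device list (all field names are hypothetical)."""
    group: str
    responsible_person: str
    sensor_type: str
    transport: str              # e.g. "Polarstern Satcom" or "home institution"
    volume_mb_per_day: float    # expected data volume
    near_real_time: bool        # does the group need near-real-time transfer?
    embargo_months: int         # requested embargo period


def near_real_time_volume(devices):
    """Sum the daily volume (MB) of devices that ask for near-real-time transfer."""
    return sum(d.volume_mb_per_day for d in devices if d.near_real_time)


# Made-up example entries, for illustration only.
devices = [
    DeviceEntry("Ocean", "N.N.", "CTD", "Polarstern Satcom", 5.0, True, 6),
    DeviceEntry("Atmos", "N.N.", "lidar", "home institution", 800.0, False, 12),
    DeviceEntry("Ice/Snow", "N.N.", "snow buoy", "Polarstern Satcom", 0.5, True, 6),
]
print(near_real_time_volume(devices))
```

Collecting the entries in one uniform shape would make it straightforward to total the near-real-time demand across groups before the transfer capacity is planned.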

Data science services: Managed database on board for core variables as multivariate time series. Multivariate Visualization as an App: show full time series (PCA, MDS, …); allow for clustering; enable focussing on subsets/events; export selected data; allow for comparing events.
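To illustrate the PCA step named above, here is a minimal sketch that finds the first principal component of a small multivariate time series by power iteration, using only the standard library. The function names and the sample values are assumptions for illustration; the slides do not specify how the app would implement PCA, and a real service would more likely use a numerical library.

```python
import math


def first_principal_component(rows, iterations=200):
    """Return (means, unit vector of largest variance) for equal-length rows."""
    n, dims = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(dims)]
    centered = [[r[j] - means[j] for j in range(dims)] for r in rows]
    # Sample covariance matrix (dims x dims).
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(dims)]
           for i in range(dims)]
    # Power iteration; the asymmetric start vector reduces the chance of
    # starting exactly orthogonal to the dominant direction.
    vec = [1.0 / (j + 1) for j in range(dims)]
    for _ in range(iterations):
        nxt = [sum(cov[i][j] * vec[j] for j in range(dims)) for i in range(dims)]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:
            raise ValueError("no variance along the current direction")
        vec = [x / norm for x in nxt]
    return means, vec


def scores(rows, means, vec):
    """Project each time step onto the principal component."""
    return [sum((r[j] - means[j]) * vec[j] for j in range(len(vec))) for r in rows]


# Made-up two-variable series in which the second variable is twice the first.
series = [[0.0, 0.0], [1.0, 2.0], [2.0, 4.0], [3.0, 6.0]]
means, component = first_principal_component(series)
print(component, scores(series, means, component))
```

The one-dimensional scores give each time step a position along the dominant direction of variation, which is the kind of reduced view that clustering or event selection could then work on.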

Offer to publish MOSAiC data through an ESSD Special Issue: Chief editors offer support in structuring the issue. At least one guest editor from MOSAiC, at least one guest editor from outside.

O2A – Observations to Archive Data Flow Framework Arndt Steinhage

Current Use Case: FRAM (Frontiers in Arctic Marine Monitoring). Since 2014, 25 M Euro. Ice-tethered platforms: radiation, snow height, depth, ice thickness, temperature, salinity, oxygen, chlorophyll a, … Ice tethered: distributed buoys and networks. Water column: moorings, winched profiling, sampler, autonomous underwater vehicle (AUV). Deeper water column: particle cameras, light frame on-sight key species investigation (LOKI), acoustic recorders. Ocean floor: ocean floor observing system (OFOS), benthic lander system, autonomous crawler (Tramper). Medieningenieure Bremen / Sabine Lüdeling

Current Use Case: FRAM. Water column: fluorescence, nutrients, salinity, temperature, conductivity, depth, acoustic Doppler current profiler, water and phytoplankton samples, … Medieningenieure Bremen / Sabine Lüdeling

Next Use Case: MOSAiC? Multidisciplinary drifting Observatory for the Study of Arctic Climate, 2019-2020, > 60 M Euro.

Data Flow Framework

Objectives: Generic infrastructure for data flows. Sustainability and up-to-date services. Interoperability and standards, e.g. Open Geospatial Consortium. Seamless integration with existing infrastructure: Web GIS, web portals, data archive.

Challenges: Heterogeneity of scientific needs and workflows. Number of different instruments, data sources and formats. Integration with existing solutions, e.g. for the data flow, but also for administrative information. Only limited additional effort is acceptable. Multitude of standards.

Sensor Description: Platform and device descriptions for provenance information and reduced data integration effort. Versioning and citability. Interoperability and standards. ~1200 descriptions available and counting.
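The combination of versioning and citability can be sketched as an immutable description record, where every change produces a new version instead of overwriting the old one. This is an illustrative assumption only: the class, its fields, the identifier format, and the citation string are hypothetical and do not reflect the actual O2A sensor description schema.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class SensorDescription:
    """Immutable platform/device description (hypothetical fields)."""
    identifier: str   # hypothetical persistent identifier
    platform: str
    device: str
    version: int = 1

    def citation(self):
        """Return a string that pins down exactly one version."""
        return f"{self.identifier}, version {self.version}"


def revise(description, **changes):
    """Create the next version; the earlier version stays untouched and citable."""
    return replace(description, version=description.version + 1, **changes)


v1 = SensorDescription("vessel:polarstern:ctd_01", "Polarstern", "CTD")
v2 = revise(v1, device="CTD (recalibrated)")
print(v1.citation())
print(v2.citation())
```

Because old versions are never modified, data that cites version 1 keeps pointing at the description that was valid when the data were recorded, which is what provenance information requires.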

Dashboard: User-customizable, flexible dashboards for data monitoring. Automatic data streaming of near-real-time and delayed-mode data. Based on sensor descriptions and configurations. Since 2011. Fast-growing number of values and sensors: 350 M measurements, 460 sensors.

Maps and Portals

Data Flow Framework

Current work for FRAM: Developing a science community workspace for data sharing and data analyses. State-of-the-art storage, replicated between Bremerhaven and Potsdam. User-friendly “one-click” compute solutions with virtual machines and containers. Hadoop big data analysis based on Hortonworks data flow and data platform. Raster data management and analysis with rasdaman.

Use Case: MOSAiC. Multidisciplinary drifting Observatory for the Study of Arctic Climate, 2019-2020, > 60 M Euro.

O2A: MOSAiC. Only 2 x 100 MB/day? Polarstern; satellite link for data monitoring and remote service; Polarstern data storage; MOSAiC raw data; ship-to-shore data transfer; onboard data transfer; “direct” satellite links to partner sites.
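To put the "2 x 100 MB/day" figure in perspective, the short calculation below converts it into an average sustained data rate and checks whether a set of daily volumes fits the budget. It assumes that the figure means two transfers of 100 MB each and that 1 MB is 10^6 bytes. The slide itself marks the number with a question mark, so this is an estimate under those assumptions and not a confirmed link specification. The function names and example volumes are hypothetical.

```python
SECONDS_PER_DAY = 86_400


def sustained_rate_kbit_s(transfers_per_day, megabytes_per_transfer):
    """Average data rate in kbit/s if the daily budget were spread over 24 hours."""
    bits_per_day = transfers_per_day * megabytes_per_transfer * 1_000_000 * 8
    return bits_per_day / SECONDS_PER_DAY / 1_000


def fits_budget(volumes_mb, budget_mb):
    """True if the summed daily volumes (MB) stay within the daily budget (MB)."""
    return sum(volumes_mb) <= budget_mb


rate = sustained_rate_kbit_s(2, 100)
print(round(rate, 1))                     # about 18.5 kbit/s on average
print(fits_budget([5.0, 0.5], 2 * 100))   # small monitoring streams fit
print(fits_budget([800.0], 2 * 100))      # a bulk raw-data stream does not
```

Under these assumptions the link averages under 20 kbit/s, which is consistent with the slide's split between monitoring data sent over the satellite link and raw data kept in Polarstern data storage.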

O2A: MOSAiC. FTP from third-party sites; raw data; aircraft data; FTP from partner sites; FTP from EO (satellite) providers; Polarstern data; MOSAiC comprehensive onshore data collection; primary data.

With contributions from: Roland Koppe, Peter Gerchow, Angela Schäfer, Ana Macario