Digital Object Management for ENES: Challenges and Opportunities

Digital Object Management for ENES: Challenges and Opportunities GEDE workshop, Brussels, 2018-09-26

Scientific Driver: International Climate Model Intercomparison Projects
Intergovernmental Panel on Climate Change (IPCC): CMIP data history (data volume in bytes)
- CMIP1 (1 GB of data): “The balance of evidence suggests a discernible human influence on global climate”
- CMIP2 (500 GB of data): “There is new and stronger evidence that most of the warming observed over the last 50 years is attributable to human activities”
- CMIP3 (35 TB of data): “Most of the observed increase in globally averaged temperatures since the mid-20th century is very likely* due to the observed increase in anthropogenic greenhouse gas concentrations”
- CMIP5 (3.5 PB of data): “This evidence for human influence has grown since AR4. It is extremely likely that human influence has been the dominant cause of the observed warming since the mid-20th century.”
- CMIP6 (300 PB to 3 EB ?)
Courtesy of Dean Williams
Workshop on Digital Objects, Brussels 2018/09/26

CMIP6 experiment design
Eyring, Bony, Meehl, Senior, Stevens et al., Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization. Geosci. Model Dev., EGU, 2016. doi:10.5194/gmd-9-1937-2016

The Earth System Grid Federation (ESGF)
- ESGF is a coordinated multiagency, international collaboration of institutions that continually develop, deploy, and maintain software needed to facilitate and empower the study of climate
- IS-ENES: European ESGF federation part . . .
Courtesy of Dean Williams

Challenges and opportunities
- Automated digital object management
- Workflow support and provenance aggregation
- Support for work at higher levels of abstraction
- Services to new user communities
- Sustainable funding and business models

ESGF publication and versioning
- Raw data (model data, obs data)
- Pre-processing
- Iterate on new versions
- “ESGF publishable” files
- ESGF (re-)publication
- Queueing system
- Handle System
- Agreed processes; Kernel Information schema; Governance
RDA Fifth Plenary: Large scale data projects 21.06.2019
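The versioning step above can be sketched in a few lines. This is a minimal illustration, not the ESGF PID service itself: the `PidRegistry` class, the `KernelRecord` field names, the `hdl:00.EXAMPLE` prefix, and the dataset identifier are all assumptions made up for the example. It only shows the idea that each (re-)publication gets its own PID and that the kernel records of successive versions point at each other.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class KernelRecord:
    """Hypothetical kernel information record; field names are illustrative."""
    pid: str
    dataset_id: str
    version: str
    checksum: str
    replaces: Optional[str] = None      # PID of the previous version, if any
    replaced_by: Optional[str] = None   # PID of the next version, if any


class PidRegistry:
    """In-memory stand-in for a PID service; not a real Handle System client."""

    PREFIX = "hdl:00.EXAMPLE"  # made-up prefix for illustration only

    def __init__(self) -> None:
        self.records: Dict[str, KernelRecord] = {}
        self.latest: Dict[str, str] = {}  # dataset_id -> PID of newest version

    def publish(self, dataset_id: str, version: str, payload: bytes) -> KernelRecord:
        checksum = hashlib.sha256(payload).hexdigest()
        # Derive a deterministic PID suffix from dataset id and version.
        suffix = hashlib.sha256(f"{dataset_id}.{version}".encode()).hexdigest()[:12]
        pid = f"{self.PREFIX}/{suffix}"
        previous = self.latest.get(dataset_id)
        record = KernelRecord(pid, dataset_id, version, checksum, replaces=previous)
        if previous is not None:
            # Link the older version forward to the one that supersedes it.
            self.records[previous].replaced_by = pid
        self.records[pid] = record
        self.latest[dataset_id] = pid
        return record


registry = PidRegistry()
v1 = registry.publish("CMIP6.example.tas", "v20180101", b"first version")
v2 = registry.publish("CMIP6.example.tas", "v20180926", b"second version")
```

Resolving the PID of an old version still works after republication, and its record tells a user that a newer version exists.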

ESGF PID services
- Scalability, reliability, governance
- Future option: replication support (package, replicate, verify)
- Will require clear interfaces such as the DOIP
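One way to read the package, replicate, verify sequence is as a checksum manifest that travels with the data. The sketch below is an assumption about how such a step could look on a local file system; the function names and the `tas.nc` file are hypothetical, and a real deployment would go through PID and transfer services rather than `shutil`.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path
from typing import Dict, List, Tuple


def sha256_of(path: Path) -> str:
    """Checksum a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def package(source: Path) -> Dict[str, str]:
    """Package: build a manifest mapping relative file names to checksums."""
    return {
        str(p.relative_to(source)): sha256_of(p)
        for p in sorted(source.rglob("*"))
        if p.is_file()
    }


def replicate(source: Path, target: Path) -> None:
    """Replicate: copy the whole directory tree (target must not exist yet)."""
    shutil.copytree(source, target)


def verify(target: Path, manifest: Dict[str, str]) -> List[str]:
    """Verify: return the names of files that are missing or have changed."""
    failed = []
    for name, expected in manifest.items():
        candidate = target / name
        if not candidate.is_file() or sha256_of(candidate) != expected:
            failed.append(name)
    return failed


def demo() -> Tuple[List[str], List[str]]:
    with tempfile.TemporaryDirectory() as tmp:
        source = Path(tmp) / "source"
        source.mkdir()
        (source / "tas.nc").write_bytes(b"placeholder bytes, not real NetCDF")
        manifest = package(source)
        target = Path(tmp) / "replica"
        replicate(source, target)
        clean = verify(target, manifest)
        # Simulate a damaged replica to show that verification catches it.
        (target / "tas.nc").write_bytes(b"corrupted")
        damaged = verify(target, manifest)
        return clean, damaged
```

Calling `demo()` returns one list of failures for the intact replica and one for the deliberately corrupted replica.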

Automated DO management & Workflow support
- Example: Replication support
- Example: HPC workflow support
  Models should be able to record who they are and what they did
- Example: Workflow brokering, matching, data transformations
  We discussed this in the frame of T-TAP in the past
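The idea that a model should record who it is and what it did can be illustrated with a small provenance decorator. This is only a sketch under assumptions: the `records_provenance` decorator, the record fields, and the agent name `example-model-1.0` are invented for the example and do not correspond to an ENES or ESGF interface.

```python
import functools
from typing import Any, Callable, Dict, List

# Hypothetical in-memory provenance store; a real workflow would persist this.
provenance_log: List[Dict[str, Any]] = []


def records_provenance(agent: str) -> Callable:
    """Wrap a processing step so each call logs agent, activity, inputs, output."""
    def decorator(func: Callable) -> Callable:
        @functools.wraps(func)
        def wrapper(*args: Any, **kwargs: Any) -> Any:
            result = func(*args, **kwargs)
            provenance_log.append({
                "agent": agent,              # who performed the step
                "activity": func.__name__,   # what was done
                "inputs": {"args": args, "kwargs": kwargs},
                "output": result,
            })
            return result
        return wrapper
    return decorator


@records_provenance(agent="example-model-1.0")
def global_mean(values: List[float]) -> float:
    return sum(values) / len(values)
```

Each call to `global_mean` appends one entry to `provenance_log`, so the aggregated log describes the chain of steps that produced a result.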

Type-Triggered Automated Processing (T-TAP): Status for climate data
[Architecture diagram. Components shown: Data distribution service; User; ESGF search; B2FIND; Processing controller; Search service; Agent / Climate processing controller; Structured resource market; ECAS; birdhouse; Schema registry; ESGF PID KI; PID registry; CMIP6 (ESGF); ePIC DTR; Type Registry; Collection management; Collection builder (cross-discipl.); Computing resources; obs4MIPs (ESGF); CORDEX (ESGF); Copernicus; Broker (various environments); DTR-aware Broker; FAIR repositories; Generic interfaces]
Legend:
- red: operational / ready
- orange: under construction (e.g. via confirmed projects), but likely to become operational
- yellow: more work to be done
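The core mechanism behind type-triggered processing, as the name suggests, is that the registered type of a digital object selects which processing runs on it. The following is a minimal sketch of that dispatch pattern, assuming a plain dictionary as the digital object and made-up type identifiers; the `TypeTriggeredController` class is hypothetical and stands in for the processing controller and type registry shown in the diagram.

```python
from typing import Callable, Dict, List

Handler = Callable[[dict], str]


class TypeTriggeredController:
    """Hypothetical controller: maps type identifiers to processing handlers."""

    def __init__(self) -> None:
        self._handlers: Dict[str, List[Handler]] = {}

    def register(self, type_id: str, handler: Handler) -> None:
        """Associate a handler with a type identifier."""
        self._handlers.setdefault(type_id, []).append(handler)

    def process(self, digital_object: dict) -> List[str]:
        """Run every handler registered for the object's type, if any."""
        handlers = self._handlers.get(digital_object.get("type", ""), [])
        return [handler(digital_object) for handler in handlers]


def summarize(digital_object: dict) -> str:
    # Placeholder for real processing such as subsetting or regridding.
    return f"summarize {digital_object['pid']}"


controller = TypeTriggeredController()
controller.register("example/netcdf-climate-file", summarize)

results = controller.process(
    {"pid": "hdl:00.EXAMPLE/abc", "type": "example/netcdf-climate-file"}
)
ignored = controller.process(
    {"pid": "hdl:00.EXAMPLE/def", "type": "example/unknown"}
)
```

Objects whose type has no registered handler pass through untouched, which keeps the controller safe to run over mixed collections.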

Support for work at higher levels of abstraction
- DOs as first-class citizens in ENES
- But: abstraction is not limited to the DO concept
- Users should concentrate on analysis problems, not data wrangling
- Example: Data I/O layer for Jupyter environments
- Example: Machine Learning support VRE

Bridging one gap: Processing services (ECAS)
- Opportunity to put Kernel Information in place
- Envisaged development for mid 2019

Support for new user communities
- Knowledge of limitations and assumptions is not obvious to non-ENES users (social sciences, public administration, policy making)
- DO angle: Abstraction & Research Object approach

Thank you for your attention! weigel@dkrz.de