Earth System CoG and the Earth System Grid Federation: A Partnership for Improved Data Management and Project Coordination NOAA ESRL Seminar April 8, 2014.

Slides:



Advertisements
Similar presentations
© 2011 Delmar, Cengage Learning Chapter 1 Getting Started with Dreamweaver.
Advertisements

Earth System CoG and the Earth System Grid Federation: A Partnership for Improved Data Management and Project Coordination BESSIG March 18, 2014 Boulder,
WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Business Development Suit Presented by Thomas Mathews.
The National Climate Predictions and Projections (NCPP) Platform: Development of Capacity to Support Planning and Management NCPP Core Team Richard B.
® Microsoft Office 2010 Browser and Basics.
Unveiling ProjectWise V8 XM Edition. ProjectWise V8 XM Edition An integrated system of collaboration servers that enable your AEC project teams, your.
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
Project 1 Introduction to HTML.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
1st Project Introduction to HTML.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
Welcome to the Minnesota SharePoint User Group. Introductions / Overview Project Tracking / Management / Collaboration via SharePoint Multiple Audiences.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
GMD German National Research Center for Information Technology Innovation through Research Jörg M. Haake Applying Collaborative Open Hypermedia.
GEOSS Common Infrastructure: A practical tour Doug Nebert U.S. Geological Survey September 2008.
US NITRD LSN-MAGIC Coordinating Team – Organization and Goals Richard Carlson NGNS Program Manager, Research Division, Office of Advanced Scientific Computing.
Web 2.0: Concepts and Applications 4 Organizing Information.
Trimble Connected Community
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
The Earth System CoG Collaboration Environment Sylvia Murphy and Cecelia DeLuca (NOAA/CIRES), and Luca Cinquini (NASA/JPL) AGU Ocean Sciences February.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
The GeoConnections Discovery Portal Michael Robson MacDonald Dettwiler and Associates Brian McLeod, Michael Adair Natural Resources Canada.
Using the SAS® Information Delivery Portal
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
NE II NOAA Environmental Software Infrastructure and Interoperability Program Cecelia DeLuca Sylvia Murphy V. Balaji GO-ESSP August 13, 2009 Germany NE.
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
Portal for ArcGIS An Introduction
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Marcus Barnes, Simon Fraser University, June 2, 2012 Drupal with CONTENTdm Digital Collections.
Leveraging Globus Services to Support Climate Model Data Access Through the Earth System Grid Federation (ESGF) Brian Knosp 1, Luca Cinquini 1, Lukasz.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
1 Earth System Modeling Framework Documenting and comparing models using Earth System Curator Sylvia Murphy: Julien Chastang:
Internet Architecture and Governance
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Afresco Overview Document management and share
Module 9 User Profiles and Social Networking. Module Overview Configuring User Profiles Implementing SharePoint 2010 Social Networking Features.
1 Accomplishments. 2 Overview of Accomplishments  Sustaining the Production Earth System Grid Serving the current needs of the climate modeling community.
1 Overall Architectural Design of the Earth System Grid.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
Windows SharePoint Services. Overview Windows SharePoint Services (WSS) Information Worker Infrastructure component delivered in Windows Server 2003 Enables.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
ESMF and the future of end-to-end modeling Sylvia Murphy National Center for Atmospheric Research
Application of RDF-OWL in the ESG Ontology Sylvia Murphy: Julien Chastang: Luca Cinquini:
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Metadata Support for Model Intercomparison Projects Sylvia Murphy: Cecelia DeLuca: Julien.
Web Page Programming Terms. Chapter 1 Objectives Describe Internet and Understand Key terms Describe World Wide Web and its Key terms Identify types and.
A Quick Tour of the NOAA Environmental Software Infrastructure and Interoperability Group Cecelia DeLuca Dr. Robert Detrick visit March 28, 2012
HTML PROJECT #1 Project 1 Introduction to HTML. HTML Project 1: Introduction to HTML 2 Project Objectives 1.Describe the Internet and its associated key.
Accessing the VI-SEEM infrastructure
Building Distributed Educational Applications using P2P
Project 1 Introduction to HTML.
VI-SEEM Data Repository
WGISS Connected Data Assets April 9, 2018 Yonsook Enloe
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Presentation transcript:

Earth System CoG and the Earth System Grid Federation: A Partnership for Improved Data Management and Project Coordination NOAA ESRL Seminar April 8, 2014 Boulder, CO Sylvia Murphy (NOAA/CIRES) Luca Cinquini (JPL/NOAA), Cecelia DeLuca (NOAA/CIRES), Allyn Treshansky (NOAA/CIRES)

Presentation Outline ESGF-CoG Integration Overview of ESGF ESGF Architecture and Local Data Holdings Overview of CoG CoG Capabilities (Live Demo) ESGF-CoG Integration Development Tasks Upcoming Tutorials

ESGF-CoG Integration ESGF is an international, federated data archive focused on climate projects. CoG is a collaboration environment and hub to connect projects in the Earth sciences. CoG is going to become the new front-end for ESGF. – This will mean a superior interface to ESGF users and data managers in terms of: Overall usability Content management Model Intercomparison Project (MIP) support Multi-project support Online collaboration tools Reference: 3 rd Annual Earth System Grid Federation and Ultrascale Visualization Climate Data Analysis Tools Face-to-Face Meeting Report December (

ESGF Overview The Earth System Grid Federation (ESGF) is a multi-agency, international collaboration of people and institutions working together to build an open source software infrastructure for the management and analysis of Earth Science data on a global scale Collaboration led by PCMDI, includes institutions from several agencies from the U.S.A. (DOE, NASA, NOAA), Canada, Europe (IS-ENES-2), Australia and Asia ESGF manages and serves a global archive of climate data including: CMIP5 model output (basis of IPCC-AR5) Possibly the largest modeling effort in history: 40+ models, 25+ modeling centers, 17 countries, 2 PB of data Obs4MIPs: selected observations from NASA and DOE especially formatted for comparison and evaluation of CMIP5 models Ana4MIPs: reanalysis data also formatted as CMIP5 model output CORDEX: regional climate models, 2 PB of data TAMIP: atmospheric model intercomparison GeoMIP: geo-engineering model intercomparison DCMIP: atmospheric dynamical core model intercomparison WCRP recommended use of ESGF infrastructure for all future MIPs

ESGF System Architecture ESGF is a system of distributed and federated Nodes that interact dynamically through a Peer-To-Peer (P2P) protocol Distributed: data and metadata are published, stored and served from multiple centers (“Nodes”) Federated: Nodes interoperate because of the adoption of common services, protocols and APIs, and the establishment of mutual trust relationships Dynamic: Nodes can join/leave the federation dynamically – global data and services will change accordingly A client (browser or program) can start from any Node in the federation and discover, download and analyze data from multiple locations as if they were stored in a single central archive

ESGF Software Stack Software components can be grouped into 4 areas of functionality – the Node “flavors”: Data Node: secure data publication and access Index Node: metadata indexing and searching Identity Provider: user authentication and group membership Compute Node: analysis and visualization The ESGF software stack is based on the integration of several applications, APIs: – Open source engines (Postgres, Tomcat, Solr) – Geo-spatial servers (Thredds Data Server, Live Access Server) – Industry standards: OpenSSL, X509, OpenID, REST, … – Custom ESGF software Node flavors can be installed in various combinations depending on site needs, or to achieve higher performance and scalability All ESGF software is Open Source (BSD License) and freely available on GitHub –

ESGF ESRL Node NOAA/ESRL is hosting a full-featured ESGF Node: – Node system administrator: Doug Ohlhorst (big thanks!) Available data collections: – Ana4MIPs 20 th Century Reanalysis (Gil Compo, Cathy Smith) – DCMIP-2012 ( Atmospheric Dynamical Core Inter-Comparison workshop at NCAR, led by Christiane Jablonowski), including NOAA FIM model – QED-2013 (Quantitative Evaluation of Downscaling workshop at NCAR, sponsored by National Climate Projection and Prediction –NCPP- project) ESRL Node is part of ESGF federation: – ESRL collections can be accessed and discovered from other ESGF sites – Vice versa, a user can start from ESRL Node and find CMIP5 data throughout ESGF Vertical mesh layout from FIM test 5-1 (idealized tropical cyclone) conducted during DCMIP-2012.

Summary of ESGF Achievements ESGF represents a significant step forward for the management and access of climate data world-wide: Established the first global, distributed database of PB of climate model output and observations Data can be discovered through a federated faceted search or RESTful API Data download can be scripted and executed by programs Users need register only once, authenticate everywhere Architecture is scalable (for increased model and instrument resolution and rates) and extensible (to other formats, repositories and scientific domains) ESGF has established an open source collaboration across agencies and international boundaries Image courtesy of NCAR/CGD

Overview of CoG CoG is a collaboration environment and hub to connect projects in the Earth sciences. It hosts software development projects, model intercomparison projects (MIPS), university short-courses, and workshops. It includes a configurable search to data on ANY ESGF data node. It provides projects with a wiki and customizable navigation to wiki content. Projects, files, or pages can be made private. It contains an ontology for the description and management of projects and provides a consolidated look at this content across a project’s network. It contains a file server for documents and images. It provides services for Earth system model metadata collection and display. Some of the 74 projects hosted on CoG include: NOAA’s High Impact Weather Prediction Project (HIWPP) Atmospheric Dynamical Core Model Intercomparison Project (DCMIP) Reanalysis Data for CMIP5 (Ana4MIPs) Observational Data for CMIP5 (Obs4MIPs) National Unified Operational Prediction Capability (NUOPC) National Climate Predictions and Projections Platform (NCPP) Earth System Documentation (ES- DOC) Earth System Prediction Capability (ESPC) CoG Development Partners

Who’s Using CoG HIWPP (NOAA): NCPP (NOAA): Ana4MIPs (NOAA): NUOPC (Navy, USAF, NOAA): DCMIP-2012:

Wiki and Collaboration Tools The CoG layout is color- coded: The right-hand side (dark yellow) is where services (data, news, project connectivity) are located. The Upper Navigation bar (dark teal) contains links to project-level metadata. On the left (light teal) is an auto-generated navigation system created when projects develop freeform content. The central portion of the site is a wiki that allows projects to create their own content. Screenshot of the CoG project workspace for the 2012 Dynamical Core Model Intercomparison (DCMIP) Workshop.

Customizable Data Services…Interfacing with ESGF Search widget can be turned on/off. Search can be narrowed to any ESGF node and to any project (e.g. CMIP). Search facets can be created, deleted, and grouped. Help text can be added to the top of the search page. Search results can be saved to a Data Cart associated with a user. Items in the Data Cart persist. Search results can be: – Forwarded to the Live Access Server (LAS) for simple visualization. – Downloaded directly via a WGET script. – Associated with model metadata if it exists.

ESGF Search Customization downscaling-2013/

Data Cart Items in the Data Cart can be sent individually or collectively to LAS or WGET. The Data Cart is associated with a user and not a project.

Show Metadata

Project Networks and the Project Browser Projects in CoG are arranged in a hierarchy of Parents, Peers, and Children. The Project Browser displays the network and allows for inter-project navigation. Projects can be tagged with keywords and projects can be searched for using keywords.

CoG Schema The CoG schema contains classes to describe software development projects, short- courses or meetings, and overall project coordination. Projects select which metadata to display via a simple web form. Project-level metadata is linked in standardized locations via the upper navigation bar.

Project-level Metadata Roll-up Management of information is a major problem in projects that involve many sub-projects, partners, multiple leads, and many resources. CoG acts as an index into project information that is necessary for coordination and collaboration and enables people responsible for overall coordination to quickly get consolidated views of information. This example shows the Partners feature that allows projects to list their project partners and include a logo for each. Below the list for ED-DOC is a consolidated view of the partners for ES-DOC’s peer projects.

Resources Resources are pointers to data, files, and URLs. Resources folders can be created, moved, and deleted. Projects can turn on a set of standardized Resources folders (e.g. Presentations, Minutes). Saved data searches can be saved as a Resource. Each Resource can have a private wiki-based notes page to facilitate discussions.

News News is a way to send announcements across a project network. News is visible in the news widget on any targeted project. News will be added to social media (Google+, Facebook, Twitter, RSS) in a future release.

Model Metadata Services The CoG Team is partnering with the international Earth System Documentation (ES- DOC) project to develop and use an Earth System Model metadata entry and view capability. The ES-DOC Viewer is a lightweight JavaScript plugin that will display any Common Information Model (CIM) record. The ES-Questionnaire collects standardized CIM metadata through a high-customizable web form. The output is saved to a community CIM repository.

CoG-ESGF Future Work Requirements are coming from HIWPP, CMIP6, the ESGF integration, and other projects. CMIP6 will include a set of interconnected MIPs. – Work is starting on the CMIP6 sites. CoG is going to replace the ESGF web front end. – Work should be completed by the end of the summer 2014 with a production system in place by the end of the year. – CoG will be federated so that projects hosted on one CoG-ESGF instance will be visible on others. CoG is being modified to conform to Federal and DOC requirements. OpenID access will be added to CoG, which will improve the security of the site. The local CoG URL will be changed to a.gov address.

Webinar/Tutorials Fridays at 10am Mountain Time Contact for more Other group or individual sessions available on demand. Scheduled Sessions: 11 Apr: HIWPP 02 May: ESPC

Questions? CoG: ESRL ESGF data node: PCMDI ESGF data node: JPL ESGF data node: