NOAA Data Management Perspective & Plans NSF RDMI Workshop

Slides:



Advertisements
Similar presentations
Theme 3: Architecture. Q1: Who houses stuff, both records and identifiers All useful services and repositories are centralized (latency, etc.) … but centralizing.
Advertisements

Peter Griffith and Megan McGroddy 4 th NACP All Investigators Meeting February 3, 2013 Expectations and Opportunities for NACP Investigators to Share and.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements Workforce Demand and Career Opportunities From.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Research Data Service at the IT Pro Forum HEIDI IMKER, DIRECTOR.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
Research Data Management Philip Tarrant Global Institute of Sustainability.
NOAA Data Management Activities Deirdre Jones, EDMC Chair Jeff de La Beaujardière, DM Architect Prepared for DAARWG
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Data Interoperability and Access Activities Prepared for the Data Archiving and Access Requirements Working Group (DAARWG) Ken McDonald, TPIO/GEO-IDE Jeff.
Agenda: DMWG SM policy status ESIP meeting recap Reminder - DM Webinar Series New and updated web pages on DM website Metadata Training Sessions CDI meeting.
U.S. Department of the Interior U.S. Geological Survey Next Generation Data Integration Challenges National Workshop on Large Landscape Conservation Sean.
The Department of Energy’s Public Access Solution Giving Voice to Energy and Science R&D Results Jeffrey Salmon Deputy Director for Resource Management.
Leveraging research and future funding opportunities Hajo Eicken Geophysical Institute & International Arctic Research Center University of Alaska Fairbanks.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
CRISP WP17 2/2 Data Continuum Achievements & Perspectives 18th March 2013Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting1.
NOAA Administrative Order : Management of Environmental and Geospatial Data and Information Jeff Arnfield NOAA’s National Climatic Data Center Version.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Data Management at the National Climate Change and Wildlife Science Center.
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Climate Archives in NOAA: Challenges and Opportunities March 4, 2014.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
DRAFT EDMC Procedural Directives NOAA Environmental Data Management Committee 12/3/2015 1
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Convergence And Trust in Earth and Space Science Data Systems Ted Habermann, NOAA National Geophysical Data Center Documentation: It’s not just discovery...
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
E ARTHCUBE C ONCEPTUAL D ESIGN A Scalable Community Driven Architecture Overview PI:
SciencePAD Open Software for Open Science Alberto Di Meglio – CERN.
Working with Your Archive : Broadening Your User Community Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Aalto Data.
| 1 Anita de Waard, VP Research Data Collaborations Elsevier RDM Services May 20, 2016 Publishing The Full Research Cycle To Support.
November 2, 2009 NOAA Climate Services Portal Prototype A collaborative, NOAA-wide prototyping effort featuring CPC, CPO, CSC, and NCDC A collaborative,
New data sources (such as Big Data) and Traditional Sources Work Package 2.
WMO WIS strategy – Life cycle data management WIS strategy – Life cycle data management Matteo Dell’Acqua.
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Intentions and Goals Comparison of core documents from DFIG and Publishing Workflow IG show that there is much overlap despite different starting points.
NRF Open Access Statement
Jeff Moon Data Librarian &
CESSDA SaW Training on Trust, Identifying Demand & Networking
Robert R. Downs1and Robert S. Chen2
The NOAA Big Data Project ESIP Cloud Computing Panel
Redesigning the DOE Data Explorer to embed dataset relationships at the point of search and to reflect landing page organization Sara Studwell Department.
Discovering Computers 2010: Living in a Digital World Chapter 14
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Developing Criteria to Establish Trusted Digital Repositories
DataNet Collaboration
2. ISO Certification Discussed already at 2015 PoW and several WLCG OB meetings Proposed approach: An Operational Circular that describes the organisation's.
Conceptual Overview of NOAA Big Data Project
Summit 2017 Breakout Group 2: Data Management (DM)
Joseph JaJa, Mike Smorul, and Sangchul Song
Common Framework for Earth Observation Data
Working with your archive organization Broadening your user community
Agency Requirements: NOAA Administrative Order Management of environmental and geospatial data and information This training module is part of.
Data Stewardship Interest Group WGISS-45 Meeting
OpenML Workshop Eindhoven TU/e,
Prepared by: Jennifer Saleem Arrigo, Program Manager
Data Management Writers Workshop
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Bird of Feather Session
Introduction to SOA Part II: SOA in the enterprise
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

NOAA Data Management Perspective & Plans NSF RDMI Workshop 2017-09-15 NOAA briefing at NSF RDMI Workshop 2016-11-17 NOAA Data Management Perspective & Plans NSF RDMI Workshop 2017-09-15 Jeff de La Beaujardière, PhD National Oceanic and Atmospheric Administration NOAA Data Management Architect jeff.deLaBeaujardiere@noaa.gov jeff.deLaBeaujardiere@noaa.gov

NOAA has "Big Data" (Volume, Variety, Velocity, ...) NOAA briefing at NSF RDMI Workshop 2016-11-17 Satellites Weather radars Ocean bathymetry Buoy networks Tide gauges Ships Aircraft Autonomous vehicles Human observers Numerical models + Extramurally-funded data Jeff.deLaBeaujardiere@noaa.gov 2016-09-15 These data are unique, valuable, irreplaceable, and collected at public expense jeff.deLaBeaujardiere@noaa.gov

Vision for NOAA Data Management NOAA briefing at NSF RDMI Workshop 2016-11-17 Discoverable All NOAA environmental data shall be for all types of users and applications. Accessible Usable Preserved Jeff.deLaBeaujardiere@noaa.gov Vision: All NOAA data will be discoverable, accessible, documented, and preserved for all types of users and applications. 2016-09-15 jeff.deLaBeaujardiere@noaa.gov

NOAA Data Policies https://nosc.noaa.gov/EDMC/ NOAA briefing at NSF RDMI Workshop 2016-11-17 NOAA Administrative Order 212-15: Management of Environmental Data (2010) Jeff.deLaBeaujardiere@noaa.gov NOAA Environmental Data Management Framework (2012-2013) Data Management Planning Directive (2011; rev. 2015) Data Documentation Directive (2011; rev. 2016) Data Access Directive (2015) Archive Appraisal Procedure (2008) 2016-09-15 Data Citation Directive (2015) Data Sharing Directive for NOAA Grantees (2012; rev. 2016) jeff.deLaBeaujardiere@noaa.gov

Implementation Activities Unified Access Framework Standardized formatting & access for gridded data and in-situ observations Big Earth Data Initiative Small NOAA internal grants to improve discovery, access, and usability of datasets Enterprise Metadata Metrics & Assessment Tools to create metadata & evaluate completeness Other projects throughout NOAA Jeff.deLaBeaujardiere@noaa.gov 2016-09-15

NOAA Data Catalog Jeff.deLaBeaujardiere@noaa.gov 2016-09-15

Dataset Identifier Project DOI benefits: Permanent, citable ID. International standard (ISO 26324). Recognition by publishers. Credit from your boss during annual review...? Not yet! DOI (Digital Object Identifier) Jeff.deLaBeaujardiere@noaa.gov landing page Data & Metadata NCEI Archive (National Centers for Environmental Info.) links to 2016-09-15

Challenges managing NOAA Internal Data The good news: NOAA has dedicated, intelligent personnel working assiduously to ensure data are of good quality, publicly accessible, and archived. Less good: Much effort required No existing enterprise-wide approach Lack of resources, tools, training DM often a side-job in addition to regular duties Jeff.deLaBeaujardiere@noaa.gov 2016-09-15

Repository (documents) Conceptual Overview of Grant Data Sharing Directive Data & Publication Sharing Directive for NOAA Grants, Cooperative Agreements, and Contracts (v.3, 2016) https://nosc.noaa.gov/EDMC/PD.DSP.php Federal Funding Opportunity Data Mgmt Guidance Proposal Data Mgmt Plan Researchers Jeff.deLaBeaujardiere@noaa.gov $ cite funding w/FundRef Data Access NOAA Institutional Repository (documents) Link to published version; expose after embargo Data collected by Grantee Research Articles cite data w/DOI 2016-09-15 deposit accepted manuscript 9

Grantee Data Sharing Challenges https://nosc.noaa.gov/EDMC/PD.DSP.php Researchers Data Access Data Jeff.deLaBeaujardiere@noaa.gov Challenges: Compliance monitoring 2yrs after grant end Data hosting NOAA archive may not be able to accept all data size, type, or stewardship issues Need approved repositories permanent or short/medium-term? Limited reusability of multi-source data Data scattered across multiple sites Lack of data standards & interoperability 2016-09-15 10

Data Management is not the goal We don't want to just "manage" data – we want to use and reuse data, and extract maximum value from it Jeff.deLaBeaujardiere@noaa.gov 2016-09-15

Users need answers, not huge datasets (... or 100s of tiny datasets) Jeff.deLaBeaujardiere@noaa.gov Data to Decisions: Distill huge & complex data to ~1 bit: plant crop? evacuate? build wind farm? go skiing? Support non-expert data users 2016-09-15

Challenges Data Volume Data Complexity Jeff.deLaBeaujardiere@noaa.gov 2016-09-15 Data Volume Data Complexity

Traditional Data Services Approach NOAA briefing at NSF RDMI Workshop 2016-11-17 Data.gov and Other Portals Decision Support Tools Scientific Software Numerical Models Value- Adding Reseller User Tools Jeff.deLaBeaujardiere@noaa.gov data services layer shared standards Data Search & Discovery Services Data Access Services Data Documentation (Metadata) Compatible Formats and Vocabularies 2016-09-15 Data Sources Satellite Radar Buoy Ship Sonar Surveys Gliders Models

Traditional Data Services Approach NOAA briefing at NSF RDMI Workshop 2016-11-17 User Hardware User Hardware User Hardware User Hardware User Facilities copy of data Jeff.deLaBeaujardiere@noaa.gov Not scalable as data volumes increase Security risk of every on-premises service Maintenance burden of on-prem infrastructure Data Discovery data access data access data access data access data access data access data access data access 2016-09-15 Data Sources Satellite Radar Buoy Ship Sonar Surveys ROV/UAV Models

Notional Cloud Deployment Scenario Commercial Cloud Information Products Public users Decision-support functions Jeff.deLaBeaujardiere@noaa.gov One-way push NOAA security boundary On-premises Computing Master copy of NOAA Data Operational customers 2016-09-15 Operational Processing Forecast Models Derived from NOAA EDM Framework (2013), figure 8

Wish #1: Fully Leverage the Cloud Operational Customers (e.g., NWS) Jeff.deLaBeaujardiere@noaa.gov Archive Cloud Challenges: Egress costs vs free data Uncertain/unbounded costs Re-architecting for performance vs fork-lifting existing apps IT security policy mis-match 2016-09-15

NOAA Big Data Project (R&D) NOAA briefing at NSF RDMI Workshop 2016-11-17 www.noaa.gov/big-data-project Jeff.deLaBeaujardiere@noaa.gov 2016-09-15 selected datasets Briefing to OSTP PARR meeting

Wish #2: More Tools for Decision-making NOAA briefing at NSF RDMI Workshop Wish #2: More Tools for Decision-making complicated, multi-source data Earth Observations non-scientist users Jeff.deLaBeaujardiere@noaa.gov Policy & Business Decisions Model Outputs Decision Support Functions Ancillary Data Composable functions to create workflows for: Derived information products Multi-source data integration Location-specific analysis Statistics & Trends Novel analyses & discoveries 2016-09-15 jeff.deLaBeaujardiere@noaa.gov

NOAA briefing at NSF RDMI Workshop Questions? NOAA briefing at NSF RDMI Workshop 2016-11-17 Jeff de La Beaujardière, PhD jeff.deLaBeaujardiere@noaa.gov Jeff.deLaBeaujardiere@noaa.gov 2016-09-15 jeff.deLaBeaujardiere@noaa.gov