Improving Information Quality for Earth Science Data and Products – An Overview H. K. (Rama) Ramapriyan Science Systems and Applications, Inc. & NASA Goddard.

Presentation transcript:

Improving Information Quality for Earth Science Data and Products – An Overview
H. K. (Rama) Ramapriyan, Science Systems and Applications, Inc. & NASA Goddard Space Flight Center
David Moroni, Jet Propulsion Laboratory, California Institute of Technology
Ge Peng, North Carolina State University
December 14, 2015
Paper #IN14A-01 - Presented at AGU Fall Meeting, San Francisco
H. K. Ramapriyan’s work was supported by NASA under contract NNG15HQ01C. David Moroni’s work is supported by a NASA contract with the Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA. Ge Peng is supported by NOAA under Cooperative Agreement NA14NES
Government sponsorship acknowledged.

ESIP Information Quality Cluster - Objectives
- Bring together people from various disciplines to assess aspects of quality of Earth science data
- Establish and publish baseline of standards and best practices for data quality for adoption by inter-agency and international data providers
- Become an authoritative and responsive resource of information and guidance to data providers on how best to implement certain data quality standards and best practices for their datasets
- Build framework for consistent capture, harmonization, and presentation of data quality for the purposes of climate change studies, Earth science and applications
(Objectives evolve with participant inputs)

Information Quality
- Scientific quality
  - Accuracy, precision, uncertainty, validity and suitability for use (fitness for purpose) in various applications
- Product quality
  - How well the scientific quality is assessed and documented
  - Completeness of metadata and documentation, provenance and context, etc.
- Stewardship quality
  - How well data are being managed and preserved by an archive or repository
  - How easy it is for users to find, get, understand, trust, and use data
  - Whether archive has people who understand the data available to help users
Information Quality is a combination of all of the above

Background
- QA4EO
- ISO 19157:2013 Standard “Geographic information -- Data quality”
- NOAA Climate Data Record (CDR) Maturity Matrix
- NOAA Data Stewardship Maturity Matrix
- NCAR Community Contribution Pages
- NASA Making Earth System Data Records (ESDRs) for use in Research Environments (MEaSUREs) Product Quality Checklists
- NASA Earth Science Data System Working Groups (ESDSWG) Data Quality Working Group
Much related work has occurred in recent years

QA4EO
- Established and endorsed by the Committee on Earth Observation Satellites (CEOS) in response to a Group on Earth Observations (GEO) Task DA (now Task DA-09-01a)
- Four International Workshops , 2008, 2009, and 2011
- Key Principles:
  - “In order to achieve the vision of GEOSS, Quality Indicators (QIs) should be ascribed to data and products, at each stage of the data processing chain - from collection and processing to delivery
  - A QI should provide sufficient information to allow all users to readily evaluate a product’s suitability for their particular application, i.e. its “fitness for purpose”.
  - To ensure that this process is internationally harmonized and consistent, the QI needs to be based on a documented and quantifiable assessment of evidence demonstrating the level of traceability to internationally agreed (where possible SI) reference standards.”
- Framework and 10 Key Guidelines established (e.g., establishing Quality Indicator, establishing measurement equivalence, expressing uncertainty)
- A few case studies are available that illustrate QA4EO-compliant methodologies [e.g., NOAA Maturity Matrix for CDRs, WELD: Web-Enabled Landsat Data (NASA-funded MEaSUREs Project), ESA Sentinel-2 Radiometric Uncertainty Tool]

ISO 19157: Geographic information -- Data quality*
- Establishes principles for describing the quality of geographic data
  - Defines components for describing data quality
  - Specifies components and content structure of a register for data quality measures
  - Describes general procedures for evaluating the quality of geographic data
  - Establishes principles for reporting data quality
- Defines a set of data quality measures for use in evaluating and reporting data quality
- Applicable to data producers providing quality information to describe and assess how well a data set conforms to its product specification
- Applicable to data users attempting to determine whether or not specific geographic data are of sufficient quality for their particular application
- Examples of DQ Elements: Completeness, Thematic Accuracy, Logical Consistency, Temporal Quality, Positional Accuracy
* From:
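As a rough illustration of what a quantitative measure for the Completeness element might look like, the sketch below compares the items a product specification expects against the items actually delivered. The function name, the use of item identifiers, and the choice to express both rates relative to the number of expected items are assumptions made for this example; they are not taken from the slides or quoted from the standard.

```python
def completeness_rates(expected_ids, delivered_ids):
    """Illustrative completeness check (hypothetical helper, not from ISO 19157).

    omission_rate:   share of expected items that are missing from the delivery
    commission_rate: number of unexpected (excess) items, relative to the
                     number of expected items
    """
    expected = set(expected_ids)
    delivered = set(delivered_ids)
    if not expected:
        raise ValueError("the product specification must list at least one item")

    missing = expected - delivered   # should be present but are not
    excess = delivered - expected    # present but not called for

    return {
        "omission_rate": len(missing) / len(expected),
        "commission_rate": len(excess) / len(expected),
    }


# Ten items expected (0..9); eight of them delivered, plus one unexpected item.
rates = completeness_rates(range(10), [0, 1, 2, 3, 4, 5, 6, 7, 99])
print(rates)  # {'omission_rate': 0.2, 'commission_rate': 0.1}
```

A data producer could report such rates alongside a dataset, and a data user could compare them against the threshold their application tolerates.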

CDR Maturity Matrix
- NOAA NCEI Climate Data Record (CDR) Maturity Matrix assesses readiness of a product as a NOAA satellite CDR
  - Bates, J. J. and Privette, J. L., “A Maturity Model for Assessing the Completeness of Climate Data Records”, Eos, Vol. 93, No. 44, 30 October 2012
- Assesses maturity in 6 categories (software readiness, metadata, documentation, product validation, public access, utility) at 6 levels
- Provides consistent guidance to data producers for improved data quality and long-term preservation
- EUMETSAT’s CORE-CLIMAX Matrix – based on CDR Maturity Matrix; contains guidance on uncertainty measures
urity_Matrix_Template.xlsx
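One way to picture an assessment against this matrix is as a level from 1 to 6 recorded for each of the six categories. The sketch below is a minimal illustration under that assumption. The class name `CdrAssessment`, the product name, and the `weakest_categories` helper are hypothetical, and the slides do not prescribe any rule for combining category levels into a single score, so none is computed here.

```python
from dataclasses import dataclass

# The six categories and the level range named on the slide.
CATEGORIES = (
    "software readiness",
    "metadata",
    "documentation",
    "product validation",
    "public access",
    "utility",
)
MIN_LEVEL, MAX_LEVEL = 1, 6


@dataclass
class CdrAssessment:
    """Hypothetical record of one product's maturity level per category."""
    product: str
    levels: dict  # category name -> integer level

    def __post_init__(self):
        # Every category must be rated exactly once.
        if set(self.levels) != set(CATEGORIES):
            raise ValueError("levels must cover exactly the six categories")
        # Every rating must be an integer within the matrix's range.
        for category, level in self.levels.items():
            if not isinstance(level, int) or not MIN_LEVEL <= level <= MAX_LEVEL:
                raise ValueError(f"{category}: level must be an integer from 1 to 6")

    def weakest_categories(self):
        """Return the categories sitting at the lowest assessed level."""
        lowest = min(self.levels.values())
        return [c for c in CATEGORIES if self.levels[c] == lowest]


# Example with made-up levels for an imaginary product.
example = CdrAssessment(
    product="example-cdr",
    levels={
        "software readiness": 4,
        "metadata": 3,
        "documentation": 5,
        "product validation": 3,
        "public access": 4,
        "utility": 5,
    },
)
print(example.weakest_categories())  # ['metadata', 'product validation']
```

Listing the weakest categories, rather than averaging, keeps the focus on where a producer would need to improve next; that choice is an illustrative one.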

Data Stewardship Maturity Matrix
- NOAA NCEI/CICS-NC Scientific Data Stewardship Maturity Matrix (SMM) provides a unified framework for assessing the maturity of measurable stewardship practices applied to individual digital Earth Science datasets that are publicly available
- Assesses maturity in 9 categories (e.g., preservability, accessibility, data quality assessment, data integrity) at 5 levels
- Provides understandable data quality information to users including scientists and actionable information to management
- Peng, G. et al., “A unified framework for measuring stewardship practices applied to digital environmental datasets”, Data Science Journal, 13. doi: /dsj
More details in paper #IN14A-05

NCAR Climate Data Guide*
- Community contributed datasets, reviews
- Focuses on “limited selection of data sets that are most useful for large-scale climate research and model evaluation”
- Contributed reviews answer 10 key questions; examples of topics addressed:
  - Strengths, limitations, and typical applications of datasets
  - Comparable datasets
  - Methods of uncertainty characterization
  - Utility for climate research and model evaluation
* From Schneider, D. P., et al. (2013), Climate Data Guide Spurs Discovery and Understanding, Eos Trans. AGU, 94(13), 121.

NASA MEaSUREs - Product Quality Checklists
- Making Earth System Data Records for Use in Research Environments (MEaSUREs)
  - NASA-funded, typically 5-year projects generating long-term consistent time series
- Product Quality Checklists (PQC) indicate completeness of Quality Assessment, metadata, documentation, etc.
- PQC templates - developed in 2011 and adopted in 2012
- Questions asked address science quality, documentation quality, usage and user satisfaction

NASA Earth Science Data System Working Groups (ESDSWG) – Data Quality Working Group (DQWG)
- Mission: “Assess existing data quality standards and practices in the inter-agency and international arena to determine a working solution relevant to Earth Science Data and Information System Project (ESDIS), Distributed Active Archive Centers (DAACs), and NASA-funded Data Producers.”
- Initiated in March
  - 16 use cases analyzed, issues identified from users’ points of view and ~100 recommendations made for improvement
  - Consolidated into 12 high-priority recommendations
  - Extracted 4 “Low Hanging Fruit” (LHF) recommendations from previous 12
  - Implementation strategies for comprehensive integration across NASA ESDIS have been scoped out for LHF recommendations
Details will be covered in paper #IN14A-08

ESIP Information Quality Cluster Activities
- Coordinate use case studies with broad and diverse applications, collaborating with the ESIP Data Stewardship Committee and various national and international programs
- Identify additional needs for consistently capturing, describing, and conveying quality information
- Establish and provide community-wide guidance on roles and responsibilities of key players and stakeholders including users and management
- Prototype innovative ways of conveying quality information to users
- Evaluate NASA ESDSWG DQWG recommendations and propose possible implementations
- Establish a baseline of standards and best practices for data quality, collaborating with the ESIP Documentation Cluster and Earth Science agencies
- Engage data providers, data managers, and data user communities as resources to improve our standards and best practices

Thank you for your attention!

NOAA CDR Maturity Matrix

Maturity level 1
- Software Readiness: Conceptual development
- Metadata: Little or none
- Documentation: Draft Climate Algorithm Theoretical Basis Document (C-ATBD); paper on algorithm submitted
- Product Validation: Little or none
- Public Access: Restricted to a select few
- Utility: Little or none

Maturity level 2
- Software Readiness: Significant code changes expected
- Metadata: Research grade
- Documentation: C-ATBD Version 1+; paper on algorithm reviewed
- Product Validation: Minimal
- Public Access: Limited data availability to develop familiarity
- Utility: Limited or ongoing

Maturity level 3
- Software Readiness: Moderate code changes expected
- Metadata: Research grade; meets int'l standards: ISO or FGDC for collection; netCDF for file
- Documentation: Public C-ATBD; peer-reviewed publication on algorithm
- Product Validation: Uncertainty estimated for select locations/times
- Public Access: Data and source code archived and available; caveats required for use
- Utility: Assessments have demonstrated positive value

Maturity level 4
- Software Readiness: Some code changes expected
- Metadata: Exists at file and collection level. Stable. Allows provenance tracking and reproducibility of dataset. Meets international standards for dataset
- Documentation: Public C-ATBD; Draft Operational Algorithm Description (OAD); peer-reviewed publication on algorithm; paper on product submitted
- Product Validation: Uncertainty estimated over widely distributed times/location by multiple investigators; differences understood
- Public Access: Data and source code archived and publicly available; uncertainty estimates provided; known issues public
- Utility: May be used in applications; assessments demonstrating positive value

Maturity level 5
- Software Readiness: Minimal code changes expected; stable, portable and reproducible
- Metadata: Complete at file and collection level. Stable. Allows provenance tracking and reproducibility of dataset. Meets international standards for dataset
- Documentation: Public C-ATBD, review version of OAD, peer-reviewed publications on algorithm and product
- Product Validation: Consistent uncertainties estimated over most environmental conditions by multiple investigators
- Public Access: Record is archived and publicly available with associated uncertainty estimate; known issues public. Periodically updated
- Utility: May be used in applications by other investigators; assessments demonstrating positive value

Maturity level 6
- Software Readiness: No code changes expected; stable and reproducible; portable and operationally efficient
- Metadata: Updated and complete at file and collection level. Stable. Allows provenance tracking and reproducibility of dataset. Meets current international standards for dataset
- Documentation: Public C-ATBD and OAD; multiple peer-reviewed publications on algorithm and product
- Product Validation: Observation strategy designed to reveal systematic errors through independent cross-checks, open inspection, and continuous interrogation; quantified errors
- Public Access: Record is publicly available from Long-Term archive; regularly updated
- Utility: Used in published applications; may be used by industry; assessments demonstrating positive value

Data Stewardship Maturity Matrix