Rule-Based Approach for Earth Science Metadata Quality Assurance (QA) Tyler Stevens and Ellen Neff NASA’s Global Change Master Directory (GCMD) WYLE Information.

Rule-Based Approach for Earth Science Metadata Quality Assurance (QA)
Tyler Stevens and Ellen Neff
NASA’s Global Change Master Directory (GCMD)
WYLE Information Systems
ESIP Winter Meeting 2015
Session - Metadata Evaluation: Consistency, Compliance, and Improvement
January 7, 2015

Outline
1. Importance of Metadata QA
2. Metadata QA Rules
3. Implementation Concept: QA Viewer
4. Conclusions

1. Importance of Metadata QA
The metadata QA process improves the efficiency and reliability with which users can discover, find, and make use of data, all of which rely on accurate and complete metadata.
Process:
- Automated Validation
- Manual Review
- Make Changes to Metadata
- Notification of Metadata Changes
- Publish Metadata

1.2 Principles of High-Quality Metadata
- Accuracy
- Completeness
- Consistency
- Conciseness
- Readable/Understandable

2. Metadata QA Rules
A QA rule is a check applied to the content of a field in a metadata record to assess its quality. Checks include:
- Controlled Vocabulary Use
- URL Validity
- Field Lengths
- Uniqueness
- Required Fields Populated
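As a rough illustration of what such field-level checks could look like, here is a minimal Python sketch. It is not the GCMD implementation: the function names, the sample record, and the choice to test URLs syntactically only (without contacting the server) are all assumptions made for this example.

```python
from urllib.parse import urlparse


def required_populated(record, fields):
    """Return the required fields that are missing or empty."""
    return [f for f in fields if not str(record.get(f, "")).strip()]


def within_length(value, max_len):
    """True if the string fits within the allowed character length."""
    return len(value) <= max_len


def is_well_formed_url(value):
    """Syntactic check only: a known scheme and a host are present."""
    parts = urlparse(value)
    return parts.scheme in ("http", "https", "ftp") and bool(parts.netloc)


def duplicates(values):
    """Return values that occur more than once, in first-seen order."""
    seen, dups = set(), []
    for v in values:
        if v in seen and v not in dups:
            dups.append(v)
        seen.add(v)
    return dups


# Hypothetical record, represented as a flat dict for simplicity.
record = {
    "Entry_ID": "EXAMPLE_01",
    "Entry_Title": "",
    "Related_URL": "http://example.org/data",
}

print(required_populated(record, ["Entry_ID", "Entry_Title"]))
print(is_well_formed_url(record["Related_URL"]))
print(duplicates(["EXAMPLE_01", "EXAMPLE_02", "EXAMPLE_01"]))
```

A full URL validity check would also need to request each link, which this sketch deliberately leaves out.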

Rule Development
Rules are driven by:
- Metadata Formats (DIF, ECHO, ISO)
  o Example: ‘Controlled Vocabulary Check’ can check that valid GCMD keywords are being used in the DIF format.
- Metadata Models (UMM-C)
  o Example: ‘Required Field Check’ can check that there is content for a required field, i.e., required by the UMM-C model or by the metadata format.
- System Requirements (GCMD, CMR)
  o Example: ‘Max Field Length Check’ can check that a string is within a designated character length.
- Experience
  o Example: ‘Non-identical Field Check’ can encourage a metadata author to provide a descriptive title, rather than just repeating the EntryID.
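Two of the examples above, the controlled vocabulary check and the non-identical field check, might be sketched as follows. The keyword set is a small hard-coded sample standing in for the real GCMD keyword list, and the function names and case-insensitive comparison are assumptions for illustration.

```python
# Hypothetical stand-in for a controlled keyword list; a real check would
# use the full GCMD keyword set rather than this hard-coded sample.
VALID_KEYWORDS = {"ATMOSPHERE", "OCEANS", "CRYOSPHERE"}


def controlled_vocabulary_check(values, vocabulary):
    """Return the values that are not in the vocabulary (case-insensitive)."""
    return [v for v in values if v.strip().upper() not in vocabulary]


def non_identical_check(a, b):
    """True when the two fields differ after trimming and case-folding."""
    return a.strip().casefold() != b.strip().casefold()


# Unknown keywords are reported back to the author.
print(controlled_vocabulary_check(["Oceans", "Weather"], VALID_KEYWORDS))

# A title that only repeats the entry ID fails the check.
print(non_identical_check("EXAMPLE_01", "example_01 "))
```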

2.2 Rule Categories
- Link Rules: Applied to fields that are, or may contain, links to an external source.
- Character Rules: Applied to the number, type, or pattern of characters allowed within a field.
- Date Rules: Applied to date fields to check that dates are in the proper format.
- Numeric Rules: Applied to fields whose content should be or include a numeric value.
- Controlled Vocabulary Rules: Applied to the content of a field to check that it matches a valid keyword, either by comparing to an author-provided list or to an external source (i.e., KMS).
- Miscellaneous Rules: Other checks, such as existence checks and suffix checks.
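The date, character, numeric, and suffix categories lend themselves to small predicate functions. The sketch below is one possible shape, not the presenters' code: the function names, the default date format, and the sample patterns are assumptions. The date check round-trips the parsed value so that unpadded input such as 2015-1-7 is rejected.

```python
import re
from datetime import datetime


def date_rule(value, fmt="%Y-%m-%d"):
    """True if value parses with fmt and matches it exactly when re-formatted."""
    try:
        return datetime.strptime(value, fmt).strftime(fmt) == value
    except ValueError:
        return False


def character_rule(value, pattern):
    """True if the whole value matches the regular expression."""
    return re.fullmatch(pattern, value) is not None


def numeric_rule(value):
    """True if the string can be read as a number."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def suffix_rule(value, suffixes):
    """True if value ends with one of the allowed suffixes (case-insensitive)."""
    return value.lower().endswith(tuple(s.lower() for s in suffixes))


print(date_rule("2015-01-07"), date_rule("01/07/2015"))
print(character_rule("ABC_123", r"[A-Za-z0-9_]+"))
print(numeric_rule("-45.5"), numeric_rule("north"))
print(suffix_rule("browse.PNG", [".png", ".jpg"]))
```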

2.3 Rule Example
Date Rule and Link Rule

| Field Name (XPath) | RuleName | Configuration | severityIfFail | MessageIfFail |
|---|---|---|---|---|
| /DIF/DIF_Creation_Date | DateFormat Check | Date is yyyy-MM-dd | ERROR | ${name} should be in ${format} date format only. Your ${name} is formatted ${actual} |
| /DIF/Summary/Abstract | AllURLsExist Check | | WARN | Potential broken link found in ${xpath} |
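To make the date rule concrete, the sketch below stores it as data and renders the failure message with Python's `string.Template`, which happens to use the same `${name}` placeholder syntax. Several things here are assumptions rather than details from the slides: the dictionary keys, the mapping from `yyyy-MM-dd` to a `strptime` pattern, the reading of `${actual}` as the field's actual value, and a namespace-free sample record. The link rule is not covered, since verifying that URLs exist requires network access.

```python
import xml.etree.ElementTree as ET
from datetime import datetime
from string import Template

# Rule expressed as data; the key names are hypothetical.
DATE_RULE = {
    "xpath": "/DIF/DIF_Creation_Date",
    "rule": "DateFormat Check",
    "format": "yyyy-MM-dd",
    "severity": "ERROR",
    "message": "${name} should be in ${format} date format only. "
               "Your ${name} is formatted ${actual}",
}

# Assumed translation from the rule's date pattern to a strptime pattern.
STRPTIME_FORMATS = {"yyyy-MM-dd": "%Y-%m-%d"}


def apply_date_rule(root, rule):
    """Return None if the field passes, else a (severity, message) tuple."""
    # ElementTree paths are relative to the root element, so split it off.
    head, _, relative = rule["xpath"].lstrip("/").partition("/")
    if head != root.tag:
        raise ValueError(f"rule targets <{head}>, record root is <{root.tag}>")
    actual = (root.findtext(relative) or "").strip()
    try:
        datetime.strptime(actual, STRPTIME_FORMATS[rule["format"]])
        return None
    except ValueError:
        name = rule["xpath"].rsplit("/", 1)[-1]
        message = Template(rule["message"]).substitute(
            name=name, format=rule["format"], actual=actual
        )
        return rule["severity"], message


# Hypothetical record fragment without XML namespaces.
record = ET.fromstring(
    "<DIF><DIF_Creation_Date>01/07/2015</DIF_Creation_Date></DIF>"
)
print(apply_date_rule(record, DATE_RULE))
```

Keeping rules as data in this way means new checks can be added by editing configuration rather than code, which fits the idea of community-authored rule sets.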

3. Implementation Concept: QA Viewer
A web-based viewer/tool that:
- Accesses metadata records via local files and the GCMD and CMR APIs.
- Applies QA rules for DIF and ECHO metadata.
- Generates a table showing each field name and its content along with the results of each rule applied.
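The core of such a viewer, applying rules to a record and tabulating the outcome, could be sketched as below. This is not the QA Viewer's code: the record is a plain dict rather than parsed DIF or ECHO XML, the rule tuples and the 80-character limit are hypothetical, and the output is a text table instead of a web page.

```python
def run_rules(record, rules):
    """Apply each rule to its field; return one row per (field, rule) pair."""
    rows = []
    for field, rule_name, check in rules:
        content = record.get(field, "")
        rows.append({
            "field": field,
            "content": content,
            "rule": rule_name,
            "result": "PASS" if check(content) else "FAIL",
        })
    return rows


def format_table(rows):
    """Render rows as a fixed-width text table with a header line."""
    headers = ["field", "content", "rule", "result"]
    widths = {
        h: max([len(h)] + [len(str(r[h])) for r in rows]) for h in headers
    }
    lines = ["  ".join(h.ljust(widths[h]) for h in headers)]
    for r in rows:
        lines.append("  ".join(str(r[h]).ljust(widths[h]) for h in headers))
    return "\n".join(lines)


# Hypothetical rule set: (field name, rule name, predicate on field content).
RULES = [
    ("Entry_ID", "Required Field Check", lambda v: bool(v.strip())),
    ("Entry_Title", "Required Field Check", lambda v: bool(v.strip())),
    ("Entry_Title", "Max Field Length Check", lambda v: len(v) <= 80),
]

rows = run_rules({"Entry_ID": "EXAMPLE_01"}, RULES)
print(format_table(rows))
```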

3.1 QA Viewer Display Panel
- View QA Results
- Displays Field Name, Rules Applied, Rule Results, and Message
- Sorts Results by Name and Type
- Filters Results by Type
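Sorting and filtering of result rows can be illustrated with a few lines of Python. The row layout, the `PASS` result type, and the severity ordering are assumptions for this sketch; only `ERROR` and `WARN` appear in the rule example.

```python
# Assumed ordering so that the most severe results sort first.
SEVERITY_ORDER = {"ERROR": 0, "WARN": 1, "PASS": 2}


def filter_results(rows, result_types):
    """Keep only rows whose result type is in result_types."""
    return [r for r in rows if r["result"] in result_types]


def sort_results(rows, key="field"):
    """Sort by field name, or by result type with the most severe first."""
    if key == "result":
        unknown = len(SEVERITY_ORDER)
        return sorted(
            rows, key=lambda r: SEVERITY_ORDER.get(r["result"], unknown)
        )
    return sorted(rows, key=lambda r: r[key])


# Hypothetical result rows.
results = [
    {"field": "Summary/Abstract", "rule": "AllURLsExist Check", "result": "WARN"},
    {"field": "DIF_Creation_Date", "rule": "DateFormat Check", "result": "ERROR"},
    {"field": "Entry_Title", "rule": "Required Field Check", "result": "PASS"},
]

for row in sort_results(filter_results(results, {"ERROR", "WARN"}), key="result"):
    print(row["result"], row["field"], row["rule"])
```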

4. Conclusions
- Metadata QA rules can assist in assessing and improving metadata.
- Tools can support initial metadata screening as part of GCMD science metadata QA efforts.
- Engaging the community in rule-set authoring will improve the quality of the rules and the metadata.
- Tools can help automate some of the metadata QA process to reduce the workload for manual review.

If you have questions or need additional information, please contact Tyler Stevens, Ellen Neff, or our user support office.
Thank You!