Documenting Data Quality Ted Habermann, NOAA/NESDIS/NGDC

Slides:



Advertisements
Similar presentations
GEOSS AIP Phase 2 Kickoff Workshop September, Boulder Colorado, USA AP ISO 1.0 Jürgen Walther Office of the Interministerial Committee for Geo Information.
Advertisements

Evolution of Metadata Standards: New Features in ISO Ted Habermann NOAA National Data Centers September, 2008
Merging Metadata Standards: FGDC CSDGM and ISO Sharon Shin Federal Geographic Data Committee Metadata Coordinator
NOAA Documentation Improvement Ted Habermann How do we measure and visualize improvements in NOAA Documentation? Record Count Completeness (Rubric Scores)
WMO UNEP INTERGOVERNMENTAL PANEL ON CLIMATE CHANGE NATIONAL GREENHOUSE GAS INVENTORIES PROGRAMME WMO UNEP IPCC Good Practice Guidance Simon Eggleston Technical.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
National Coastal Data Development Center A division of the National Oceanographic Data Center Please a list of participants at each location to
Evolution of Metadata Standards: New Features in ISO Ted Habermann NOAA National Data Centers December, 2007
ISO Standards: Status, Tools, Implementations, and Training Standards/David Danko.
Documentation Developments Ted Habermann, NOAA/NESDIS/NGDC Adoption of ISO Metadata Standards - Crossing the Chasm Facilitating the Transition Documentation.
The HDF Group July 8, 2014HDF 2014 ESIP Summer Meeting HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann The.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
Interoperable Documentation Ted Habermann, NOAA/NESDIS/NGDC NCAR Earth Observing Laboratory, June 2010 Links: GEO-IDE Wiki:
Documenting Data Quality Ted Habermann, NOAA/NESDIS/NGDC Documentation: It’s not just discovery... 50% change in global average Why? i checked my 2002.
Metadata Implementation Ted Habermann NOAA National Geophysical Data Center Documentation: It’s not just discovery... 50% change in global average Why?
Documentation from NcML to ISO Ted Habermann, NOAA NESDIS NGDC.
Documenting Data Quality Ted Habermann, NOAA National Geophysical Data Center.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database.
® Reading meeting. December 12-14th, 2011 QUAlity aware VIsualisation for the Global Earth Observation system of systems GCI Analysis December 12-14th,
Metadata for Data Understandability Ted Habermann NOAA National Geophysical Data Center AGU, Spring 2008.
National Geospatial Digital Archive Greg Janée University of California at Santa Barbara.
Johannes Keizer Food and Agriculture Organization of the UN Library and Documentation Systems Division Slide 1 AGRIS the next steps of the network
The data standards soup … Is the most exciting topic you can dream of.
Meteorological Assimilation Data Ingest System (MADIS) and ISO Data Quality Ted Habermann NOAA National Data Centers MADIS observations on April 29, 2004.
Documenting UAF Data Ted Habermann NOAA/NESDIS/National Geophysical Data Center.
Creating Good Documentation NOAA National Geophysical Data Center
Spatial Databases and Metadata.
Barry Weiss 1/4/ Jet Propulsion Laboratory, California Institute of Technology Quality Elements in ISO Metadata Design for Proposed SMAP Data.
GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
DOE Data Management Plan Requirements
Merging Metadata Standards: FGDC CSDGM and ISO and Sharon Shin Federal Geographic Data Committee Metadata Coordinator
Forecast Model Run Collections and ISO Ted Habermann There has been considerable discussion of describing multiple times in forecast datasets This is not.
Data Quality for Long-Term Datasets
XML 1. Chapter 8 © 2013 Pearson Education, Inc. Publishing as Prentice Hall SAMPLE XML SCHEMA (XSD) 2 Schema is a record definition, analogous to the.
TRIG: Truckee River Info Gateway Dave Waetjen Graduate Student in Geography Information Center for the Environement (ICE) University of California, Davis.
Software Quality Control and Quality Assurance: Introduction
DSpace standard Data model and DSpace-CRIS
Working in Groups in Canvas
Topic 2 (ii) Metadata concepts, standards, models and registries
Title of Proposal Objective Research Impact Illustrative Figure
The Information Side of System Engineering
Gap: Poorly Understood Responsibilities for Integration
Improving Braille accessibility and personalization on Internet
Metadata Evaluation & Improvement, Case Studies
Week by Week: Plans for Documenting Children’s Development
Writing a Research Abstract
INSPIRE Geoportal Thematic Views Application
Data Management: Documentation & Metadata
Introduction to Research Data Management
GDI ISO Standards GovData Metadata
2. An overview of SDMX (What is SDMX? Part I)
Attribution…. Self Plagiarism – what to do for the thesis writing
SYS466 Domain Classes – Part 1.
2. An overview of SDMX (What is SDMX? Part I)
Health On-Line Patient Education Web Site
FDA-08 FDA Whitepaper Update
Social Research Methodology and Supplementary Documentation John Kallas University of the Aegean, Department of Sociology.
What is Science? Review This slide show will present a question, followed by a slide with an acceptable answer. For some questions, there is a definite.
Foundations of Technology The Engineering Design Process
Workshop (… 2016) WIGOS Project Office
The Generic Statistical Business Process Model
Proposal of a Geographic Metadata Profile for WISE
Metadata Updates (for S / 4
Data Management Components for a Research Data Archive
Geoscience Australia Service Metadata
Subject Name: SOFTWARE ENGINEERING Subject Code:10IS51
Attribution…. Self Plagiarism – what to do for the thesis writing
Chapter 5.5 Metadata John Cima.
Presentation transcript:

Documenting Data Quality Ted Habermann, NOAA/NESDIS/NGDC Please view these slides as a slide show. High-quality documentation serves many roles. The contemporaneous emergence of metadata standards and the World Wide Web during the mid-1990's focused significant attention on the discovery role. International collaboration, the Open Archival Information System (OAIS) Reference Model, increased data transparency, and public scrutiny of all climate data and interpretations have recently brought focus back to the importance of the role of documentation in enabling independent understanding of data. This Figure shows the global average of a parameter calculated from a NESDIS Satellite Product between 2002 and 2006. There is an obvious 50% increase in this parameter during late 2002. Such an abrupt large change would not be expected over the whole globe. Why did this happen? The text box reflects the current state of the documentation for this dataset. The scientist responsible for the product checked their e-mail archive and came up with a very vague explanation of the change from indirect references in the e-mail. The final statement "hopefully this settles the issue.." may be sufficient in the scientific community of experts that know these data, but it is unlikely to satisfy non-experts that question the integrity of scientists and the scientific process. Also, of course, e-mail archives and personal recollections are impossible to reliably preserve. This example is, unfortunately, not an exception to the rule. Documentation: It’s not just discovery... 1

DQ_Scope <<CodeList>> MD_ScopeCode + attribute + feature + attributeType + featureType + collectionHardware + propertyType + collectionSession + fieldSession + dataset + software + series + service + nonGeographicDataset + model + dimensionGroup + tile <<DataType>> DQ_Scope + level : MD_ScopeCode + extent [0..1] : EX_Extent + levelDescription [0..*] : MD_ScopeDescription <<Union>> MD_ScopeDescription + attributes : Set<GF_AttributeType> + features : Set<GF_FeatureType> + featureInstances : Set<GF_FeatureType> + attributeInstances : Set<GF_AttributeType> + dataset : CharacterString + other : CharacterString

DQ_Result DQ_Result + resultScope: DQ_Scope [0..1] DQ_ConformanceResult + specification : CI_Citation + explanation : CharacterString + pass : Boolean DQ_QuantitativeResult + valueType [0..1] : RecordType + valueUnit : UnitOfMeasure + errorStatistic [0..1] : CharacterString + value [1..*] : Record DQ_DescriptiveResult + statement: CharacterString QE_CoverageResult + resultFile : MX_DataFile + resultFormat: MD_Format + resultContentDescription: MD_CoverageDescription + resultSpatialRepresentation: MD_SpatialRepresentation + spatialRepresentationType : MD_SpatialRepresentationTypeCode

Measure Registry / Database DQ_MeasureReference + measureIdentification: MD_Identifier [0..1] + nameOfMeasure: CharacterString [0..*] + measureDescription: CharacterString [0..1] Quality Measure measure identifier name alias element name basic measure definition description parameter value type value structure source reference example <<DataType>> MD_Identifier + authority [0..1] : CI_Citation + code : CharacterString + codeSpace [0..1] : CharacterString + version [0..1] : CharacterString

Data Quality - Granules

Data Quality - Standards LI_Lineage <<Union>> MD_ScopeDescription + attributes : Set<GF_AttributeType> + features : Set<GF_FeatureType> + featureInstances : Set<GF_FeatureType> + attributeInstances : Set<GF_AttributeType> + dataset : CharacterString + other : CharacterString MI_Metadata DQ_DataQuality + scope : DQ_Scope DQ_StandaloneReportInformation + reportReference : CI_Citation + abstract: CharacterString + standAloneReport 0..1 + report 0..* <<Abstract>> DQ_Element <<DataType>> DQ_Scope + level : MD_ScopeCode + extent [0..1] : EX_Extent + levelDescription [0..*] : <<CodeList>> MD_EvaluationMethodTypeCode + directInternal + directExternal + indirect DQ_MeasureReference DQ_Evaluation DQ_Result + resultScope: DQ_Scope [0..1] DQ_DescriptiveResult DQ_CoverageResult <<CodeList>> MD_ScopeCode + attribute + feature + attributeType + featureType + collectionHardware + propertyType + collectionSession + fieldSession + dataset + software + series + service + nonGeographicDataset + model + dimensionGroup + tile DQ_QuantitativeResult + valueType [0..1] : RecordType + valueUnit : UnitOfMeasure + errorStatistic [0..1] : CharacterString + value [1..*] : Record DQ_ConformanceResult + specification : CI_Citation + explanation : CharacterString + pass : Boolean

Community - the Wiki