Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Data Products and Attributes 1 Karen Stocks Data.


Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review, January 8-9, 2013
Data Products and Attributes
Karen Stocks, Data Curator

“Effective management and storage of data are fundamental requirements” - OOI Data Management Plan. DCN

OOI Data Products

Core Data Products
- Produced by OOI owned and operated instruments
- OOI is responsible for the long-term provision of these data products
- e.g. salinity, vent fluid temperature, pCO2, seafloor pressure (~90)
- Undergo quality control procedures
- Defined by Data Product Specifications…

Data Product Specifications
- Produced by Project Scientists, implemented by CI
- Reviewed, controlled documents

Data Processing Levels
- Raw: The full “conversation” with the instrument, incl. raw data
- Parsed Canonical: Data streams converted to internal form (e.g. hex to float); still bundles multiple parameters (interleaved). Often separate streams are defined for data vs. instrument status, calibration
DPS
- L0: Unprocessed data. Data that are in instrument/sensor units and at instrument/sensor resolution. They are sensor by sensor (unpacked and/or de-interleaved) and are available in OOI supported formats (e.g., NetCDF)
- L1: Basic Data: L0 data that have been calibrated, are in well-defined scientific units and may have some QC applied
- L2: Derived Data Products: Data that have been manipulated and/or combined and include new values not present in original data
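The progression from a parsed record to L0, L1 and L2 can be sketched roughly as follows. This is an illustration only: the record layout, the linear calibration coefficients, and every function name are assumptions made for the example, and the L2 "product" is a placeholder combination rather than a real oceanographic formula.

```python
from dataclasses import dataclass
from enum import Enum


class Level(Enum):
    L0 = "L0"
    L1 = "L1"
    L2 = "L2"


@dataclass
class Sample:
    level: Level
    name: str
    value: float
    units: str


def parse_record(raw_hex):
    """Parsed canonical form: hex text converted to integer counts.

    Hypothetical layout: 4 hex digits of temperature, then 4 of conductivity.
    The parameters are still bundled together in one record.
    """
    return {
        "temperature": int(raw_hex[0:4], 16),
        "conductivity": int(raw_hex[4:8], 16),
    }


def to_l0(parsed):
    """L0: de-interleave into one sample per sensor, still in instrument units."""
    return {name: Sample(Level.L0, name, counts, "counts")
            for name, counts in parsed.items()}


def to_l1(sample, slope, offset, units):
    """L1: apply a calibration (assumed linear here) to reach scientific units."""
    return Sample(Level.L1, sample.name, slope * sample.value + offset, units)


def to_l2(name, units, combine, *inputs):
    """L2: combine L1 inputs into a new value not present in the original data."""
    return Sample(Level.L2, name, combine(*(s.value for s in inputs)), units)


l0 = to_l0(parse_record("003C0320"))
temp_l1 = to_l1(l0["temperature"], slope=0.5, offset=-5.0, units="degC")
cond_l1 = to_l1(l0["conductivity"], slope=0.0625, offset=0.0, units="mS/cm")
# Placeholder derivation, not a real salinity algorithm.
ratio_l2 = to_l2("conductivity_per_degree", "mS/cm/degC",
                 lambda c, t: c / t, cond_l1, temp_l1)
```

Keeping the level on each sample means a consumer can tell whether a value is still in instrument units or has already been calibrated.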

Quality Control Levels
- a: no QC
- b: programmatic/automated QC, e.g. spike test, range test, drift test
- c: human-in-the-loop
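As a rough illustration of level b (automated QC), a range test and a spike test might look like the sketch below. These are simplified, generic forms chosen for the example; the thresholds and the neighbour-mean spike rule are assumptions, not the algorithms specified by OOI.

```python
def range_test(values, lo, hi):
    """Pass (True) when a value lies inside the allowed range [lo, hi]."""
    return [lo <= v <= hi for v in values]


def spike_test(values, threshold):
    """Fail (False) when a point departs from the mean of its two
    neighbours by more than threshold. End points always pass because
    they have only one neighbour."""
    flags = [True] * len(values)
    for i in range(1, len(values) - 1):
        neighbour_mean = (values[i - 1] + values[i + 1]) / 2
        if abs(values[i] - neighbour_mean) > threshold:
            flags[i] = False
    return flags


temperatures = [10.0, 10.5, 30.0, 10.0, -2.0]
in_range = range_test(temperatures, lo=0.0, hi=40.0)
no_spike = spike_test(temperatures, threshold=15.0)
# A sample passes automated QC only if it passes every test.
passed = [a and b for a, b in zip(in_range, no_spike)]
```

With a tight threshold this naive spike rule also flags the neighbours of a spike, which is one reason operational tests tend to be more elaborate.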

Categories of data in OOI
- Core data products from OOI owned and operated instruments, both raw and derived
- Additional data from OOI owned and operated instruments (ancillary data, engineering data, instrument status data)
- Data from External Observatories (e.g. IOOS, NEPTUNE, WMO)
- Data from PI-owned instruments deployed on OOI platforms
- Additional user-provided or user-derived data

OOI Metadata

Metadata goals for Data Products
- Assure the capture of appropriate information for the data to be found and used, including assessing fitness for use, on the decadal timeframe
- Enable interoperability
- Borrow from existing work rather than reinventing a local solution (ISO, OGC, IOOS, MBARI, NEON, QARTOD, R2R, SDN, etc.)

OOI Metadata Approach
- IO operates on “Resources”
- Resources can be digital items held in the system (data products, documents), or representations of physical items (instruments, observatories)

Associations
Resources have associations to other resources. Examples:
- An instrument device is associated with the platform it is deployed on
- A Data Product is associated with the process that created it (for processed products)
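One common way to represent resources and their associations is as subject-predicate-object triples. The sketch below assumes that representation; the class names, resource types and predicate names (deployedOn, producedBy) are invented for illustration and are not taken from the OOI resource model.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Resource:
    id: str
    type: str
    name: str


@dataclass
class Registry:
    resources: dict = field(default_factory=dict)
    # Each association is a (subject_id, predicate, object_id) triple.
    associations: set = field(default_factory=set)

    def add(self, resource):
        self.resources[resource.id] = resource
        return resource

    def associate(self, subject, predicate, obj):
        self.associations.add((subject.id, predicate, obj.id))

    def objects(self, subject, predicate):
        """Resources reachable from subject via the given predicate."""
        return [self.resources[o]
                for s, p, o in sorted(self.associations)
                if s == subject.id and p == predicate]


registry = Registry()
platform = registry.add(Resource("plat-1", "PlatformDevice", "Mooring A"))
ctd = registry.add(Resource("inst-1", "InstrumentDevice", "CTD 01"))
process = registry.add(Resource("proc-1", "DataProcess", "L1 calibration"))
product = registry.add(Resource("dp-1", "DataProduct", "CTD temperature L1"))

registry.associate(ctd, "deployedOn", platform)
registry.associate(product, "producedBy", process)
```

Storing associations separately from the resources keeps each resource record small and lets new relationship types be added without changing the resource definitions.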

Resource Model

Attributes
Resources have attributes, i.e. “traditional” metadata. Examples:
- Instrument Models have a manufacturer
- Deployments have start and end dates
- Observatories have contact(s)
- Parameters have units

Tracking Attributes
- Resources and Attributes are managed in a relational database that is kept in alignment with the CIUX database & the YML code
- Tracked for each attribute: name, definition, type, controlled vocabulary, reference, source
- In R3, attributes will become a Resource
- ~149 Resources; 95 excluding abstract base; 35 primarily user-facing
- 410 attributes; 95 primarily user-facing

Attachments
Resources can have one or many attachments. Examples:
- Instrument Models can have user manuals
- Instrument Devices can have scanned factory calibration sheets
- Core Data Products have Data Processing Specifications

R2.0

R2.0 metadata approach
- Collect metadata required to support NetCDF/CF + ACDD, ISO 19115, and OGC CSW
- Plus additional attributes needed to meet the 2.0 Use Cases & Requirements (i.e. manage the system, and operate instruments)

R2.0 Data Product Attributes
- Identity: ID, various names, description, keywords, processing code, QC level, reference URL(s)
- geospatial and temporal descriptors
- available formats
- provenance
- events
- acknowledgement & use license
- housekeeping: lifecycle state, contact(s), proprietary hold info, external archiving info
- Parameters…

Parameter Attributes
Identity
- IDs: primary ID, human-readable identifier (automatically composed from other attributes), alternate IDs
- Names: display name, CF standard name, long name, OOI data product identifier
- Description
- Reference URLs
Housekeeping
- Date created
- Date modified
- Recent events
- User notification requests (list)
- Life cycle state
Use
- Type: e.g. quantity, category
- Value encoding: e.g. int32, float64
- Code Set: for category parameters, it defines the categories and their meanings
- Unit of Measure
- Fill value
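A parameter record carrying a subset of these attributes could be modelled as in the sketch below. The field names and, in particular, the rule used to compose the human-readable identifier are assumptions for illustration; the slides do not say which attributes the identifier is built from.

```python
from dataclasses import dataclass, field


@dataclass
class Parameter:
    primary_id: str
    display_name: str
    cf_standard_name: str
    parameter_type: str      # "quantity" or "category"
    value_encoding: str      # e.g. "int32", "float64"
    unit_of_measure: str
    fill_value: float
    code_set: dict = field(default_factory=dict)   # category code -> meaning
    alternate_ids: list = field(default_factory=list)

    def __post_init__(self):
        # A category parameter is only interpretable with its code set.
        if self.parameter_type == "category" and not self.code_set:
            raise ValueError("category parameters need a code set")

    @property
    def human_readable_id(self):
        """Composed automatically from other attributes (hypothetical rule)."""
        return f"{self.cf_standard_name}.{self.value_encoding}.{self.primary_id}"


salinity = Parameter(
    primary_id="p-001",
    display_name="Practical Salinity",
    cf_standard_name="sea_water_practical_salinity",
    parameter_type="quantity",
    value_encoding="float64",
    unit_of_measure="1",
    fill_value=-9999.0,
)

qc_flag = Parameter(
    primary_id="p-002",
    display_name="QC Flag",
    cf_standard_name="status_flag",
    parameter_type="category",
    value_encoding="int32",
    unit_of_measure="1",
    fill_value=-1,
    code_set={0: "fail", 1: "pass"},
)
```

Deriving the identifier as a property rather than storing it avoids the composed value drifting out of step with the attributes it is built from.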

R2.1

R2.1 Improvements
- Overall, largely stable with respect to user-facing resources
- Additions/refinements stemming from:
  - high priority user-identified gaps (next slide)
  - platform-specific deployment information (e.g. cruise metadata)
  - a small number of calculated attributes deferred out of R2.0 (e.g. last data ingestion time, attachment size)
  - minor refinements for consistency (e.g. using the same set of contact information in different contexts)
- (Populating via preload pages)

R2.1 UX Proposed Attributes
- Summary status indicators
  - counts of critical alerts, warning alerts, help tickets
  - aggregated status: Red-Yellow-Green status indicator summarizing all other status assessments
- Improved Device history representation: location history, power history, data history
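One plausible way to compute an aggregated Red-Yellow-Green indicator is a "worst status wins" roll-up, sketched below. The slides do not state the aggregation rule, so both the roll-up and the mapping from alert counts to a colour are assumptions.

```python
from enum import IntEnum


class Status(IntEnum):
    GREEN = 0
    YELLOW = 1
    RED = 2


def status_from_counts(critical_alerts, warning_alerts):
    """Assumed rule: any critical alert is red, any warning is yellow."""
    if critical_alerts > 0:
        return Status.RED
    if warning_alerts > 0:
        return Status.YELLOW
    return Status.GREEN


def aggregate(statuses):
    """Worst status wins; an empty collection is treated as green."""
    return max(statuses, default=Status.GREEN)


instrument_statuses = [
    status_from_counts(critical_alerts=0, warning_alerts=0),
    status_from_counts(critical_alerts=0, warning_alerts=3),
    status_from_counts(critical_alerts=0, warning_alerts=0),
]
platform_status = aggregate(instrument_statuses)
```

Because the enum is ordered, the same aggregate function can roll instruments up to a platform and platforms up to an observatory.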

R3

R3 Expansion: metadata “externalizations”
- In R2, the information needed to support key standards (ISO 19115, NetCDF/CF + ACDD, and OGC CSW) is captured.
- In R3, processes will be created to map the internal attributes to external labels and formats (e.g. XML) to create metadata reports meeting different specifications.
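The mapping step described here could be sketched as a lookup table from internal attribute names to external labels, followed by serialization to XML. In the sketch below the internal names and the sample record are invented; the external labels are ACDD global attribute names, and the output is a simple NcML-like fragment rather than a complete, validated document.

```python
import xml.etree.ElementTree as ET

# Hypothetical internal attribute name -> ACDD global attribute name.
INTERNAL_TO_ACDD = {
    "display_name": "title",
    "description": "summary",
    "keyword_list": "keywords",
    "use_license": "license",
    "start_time": "time_coverage_start",
}


def externalize(internal_record, mapping):
    """Build an XML report containing only the attributes the target
    specification knows about, under that specification's labels."""
    root = ET.Element("metadata")
    for internal_name, external_name in mapping.items():
        if internal_name in internal_record:
            ET.SubElement(root, "attribute",
                          name=external_name,
                          value=str(internal_record[internal_name]))
    return ET.tostring(root, encoding="unicode")


record = {
    "display_name": "CTD Salinity",
    "description": "Practical salinity from a moored CTD",
    "lifecycle_state": "DEPLOYED",   # internal only, not mapped
    "start_time": "2013-01-08T00:00:00Z",
}
report = externalize(record, INTERNAL_TO_ACDD)
exported = {a.get("name"): a.get("value")
            for a in ET.fromstring(report).findall("attribute")}
```

Supporting another specification then means adding another mapping table instead of changing how the attributes are stored internally.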

R3 Expansion: Semantic support
Controlled vocabularies/ontologies and mapping between controlled vocabularies/ontologies are required to:
- improve data discovery (e.g. temperature vs SST vs sea water temperature)
- facilitate data entry & ensure consistency (e.g. drop-down lists of instrument manufacturer names)
- verify metadata quality (e.g. IOOS parameter name entries must be from the valid IOOS parameter list)
- provide referencing (e.g. URLs to external community definitions)
- facilitate interoperability: linking OOI concepts to externally-defined concepts
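Two of these uses, discovery and metadata validation, can be illustrated with a small term mapping. The sketch below is an assumption-laden toy: the search-term table and the catalog entries are invented, and only the target names (CF standard names) come from an external vocabulary.

```python
# Free-text search term -> controlled concepts it may refer to.
TERM_TO_CONCEPTS = {
    "temperature": {"sea_water_temperature", "sea_surface_temperature"},
    "sst": {"sea_surface_temperature"},
    "sea water temperature": {"sea_water_temperature"},
}

# The controlled vocabulary itself: every concept any term maps to.
VOCABULARY = set().union(*TERM_TO_CONCEPTS.values())

# Hypothetical catalog: data product name -> controlled concept.
CATALOG = {
    "CTD temperature": "sea_water_temperature",
    "Satellite SST": "sea_surface_temperature",
    "CTD salinity": "sea_water_practical_salinity",
}


def expand(term):
    """Map a user's search term onto controlled concepts."""
    return TERM_TO_CONCEPTS.get(term.strip().lower(), set())


def search(term):
    """Discovery: find products whose concept matches the expanded term."""
    concepts = expand(term)
    return sorted(name for name, concept in CATALOG.items()
                  if concept in concepts)


def is_valid(entry):
    """Metadata quality check: entries must come from the controlled list."""
    return entry in VOCABULARY


broad_hits = search("temperature")
narrow_hits = search("SST")
```

The same table can also back a drop-down list at data entry, so that values are constrained before they ever need validating.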

R3 Expansions: Semantic support, cont.
- Vocabularies/Ontologies will become first-class resources within ION (versioning, URIs, etc.)

R3 Expansions: New resource types
- sampling plans/mission plans
- physical samples
- instrument calibration and service history
- annotations
- workflows
- numerical models
- self-reported instrument metadata (e.g. PUCK)
- additional provenance representation

R3 Expansions: other
- Universal Resource Identifiers (e.g. DOIs) for both closed and aggregating data products
- Setting and enforcing minimal metadata standards for different data categories (external observatories, PI-owned instruments, etc.)
- Evaluating the importance of supporting additional standards (ongoing)

References
- Data Management Plan (Alfresco)
- Metadata Implementation Plan (Alfresco)
- Resource Model: ay/syseng/CIAD+APP+Resource+Model
- Data Product Specifications: ay/science/Data+Product+Specifications