1 - A View from the Field - The Next Generation Data Standards For the PDS - PDS4 - ESIP Federation Meeting July 8, 2009 J. Steven Hughes JPL Copyright.

Slides:



Advertisements
Similar presentations
The Seven Pillars of Open Language Archiving: A Vision Statement Gary Simons and Steven Bird Workshop on Web-based Language Documentation and Description.
Advertisements

Chapter 10: Designing Databases
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Managing Data Resources
Corporation For National Research Initiatives NSF SMETE Library Building the SMETE Library: Getting Started William Y. Arms.
Lecture Nine Database Planning, Design, and Administration
IRS XML Standards & Tax Return Data Strategy For External Discussion June 30, 2010.
PV2013 Summary Results Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
March 2010 PDS Imaging Node 1 NASA PDS Imaging Node: NASA PDS Imaging Node: Digital Data Archives and Distribution Archiving and distributing data and.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Developing Enterprise Architecture
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
IPDA Update Dan Crichton Tom Stein August 2014 International Planetary Data Alliance.
Database Design - Lecture 1
Demystifying the Business Analysis Body of Knowledge Central Iowa IIBA Chapter December 7, 2005.
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Page 1 Informatics Pilot Project EDRN Knowledge System Working Group San Antonio, Texas January 21, 2001 Steve Hughes Thuy Tran Dan Crichton Jet Propulsion.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
PDS4 Tool Development Strategy PDS Management Council Meeting November 18, 2014 Dan Crichton.
Enterprise Architecture, Enterprise Data Management, and Data Standardization Efforts at the U.S. Department of Education May 2006 Joe Rose, Chief Architect.
PDS Geosciences Node Page 1 Archiving Mars Mission Data Sets with the Planetary Data System Report to MEPAG Edward A. Guinness Dept. of Earth and Planetary.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
Label Design Tool Management Council F2F Washington, D.C. November 29-30, 2006
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Data Standards and Build 3b Plans Steve Hughes MC Face-to-Face UCLA, Los Angeles, CA November 28-29, 2012.
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
Object-Oriented Software Engineering using Java, Patterns &UML. Presented by: E.S. Mbokane Department of System Development Faculty of ICT Tshwane University.
Data resource management
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
1 PDS 4 Data Design Report PDS Management Council April 2, 2009 PDS 2010 Data Design WG.
Science Data in the Science Mission Directorate (SMD) Jeffrey J.E. Hayes Program Executive for MO & DA, Heliophysics Division August 17, 2011.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
Data Standards Development August 29, Topics 1.Current Status 2.What was delivered for Build 2c 3.How was IPDA supported 4.What mission support.
PDS Geosciences Node Page 1 Archiving LCROSS Ground Observation Data in the Planetary Data System Edward Guinness and Susan Slavney PDS Geosciences Node.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Djc -1 Daniel J. Crichton NASA/JPL 9 May 2006 CCSDS Information Architecture Working Group.
SOFTWARE ARCHIVE WORKING GROUP (SAWG) REPORT TODD KING PDS MANAGEMENT COUNCIL MEETING FEB. 4-5, 2016.
1 Steve Hughes Daniel J. Crichton NASA/JPL January 16, 2007 CCSDS Information Architecture Working.
A Perspective on the Electronic Geophysical Year Raymond J. Walker UCLA Presented at eGY General Meeting Boulder, Colorado March 13, 2007.
Planetary Data System (PDS) Tom Morgan November 24, 2014.
PDS4 Project Report PDS MC F2F UCLA Dan Crichton November 28,
PDS4 Project Report PDS MC F2F University of Maryland Dan Crichton March 27,
ESA UNCLASSIFIED – For Official Use INSPIRE Orthoimagery TWG Status Report Antonio Romeo ESRIN 15/02/2012.
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Managing Data Resources File Organization and databases for business information systems.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
The Next Generation PDS Archive Data Standards
IPDA PDS4 Report PDS Team July 2015.
PDAP Query Language International Planetary Data Alliance
Outline Pursue Interoperability: Digital Libraries
The Re3gistry software and the INSPIRE Registry
MANAGING DATA RESOURCES
Data Model.
MSDI training courses feedback MSDIWG10 March 2019 Busan
Reportnet 3.0 Database Feasibility Study – Approach
Presentation transcript:

1 - A View from the Field - The Next Generation Data Standards For the PDS - PDS4 - ESIP Federation Meeting July 8, 2009 J. Steven Hughes JPL Copyright 2009 California Institute of Technology Government sponsorship acknowledged

PDS Mission and Vision Statement Mission: The mission of the Planetary Data System is to facilitate achievement of NASA’s planetary science goals by efficiently collecting, archiving, and making accessible digital data produced by or relevant to NASA’s planetary missions, research programs, and data analysis programs Vision: To gather and preserve the data obtained from exploration of the Solar System by the U.S. and other nations To facilitate new and exciting discoveries by providing access to and ensuring usability of those data to the worldwide community To inspire the public through availability and distribution of the body of knowledge reflected in the PDS data collection PDS is a federation of heterogeneous nodes including science and support nodes

About the Planetary Data System NASA’s official archive for Planetary Science Data Missions are required to archive data with PDS Data is peer reviewed as part of the archiving of data PDS works to support planetary science R&A Federation of nodes Science discipline nodes provide scientific and data management expertise Central engineering (JPL) addresses PDS-wide software and standards Management node (GSFC) provides management support to PDS Governed by the PDS Management Council which is formed from the node managers Standard archiving processes and tools PDS supports diversity of data types but promotes a homogeneous architecture for archives

PDS Engineering Challenges All data online and distributed across the PDS enterprise Single point of access to all data through an integrated infrastructure Substantial increases in data volumes International coordination and interoperability Shared standards Sharing of science products Satisfying US data sharing regulations Supporting diverse user needs Replacing aging technology, tools and processes

Why is PDS4 Necessary? The PDS data standards were developed in the late 1980’s to define the concepts and terms needed for archiving science data in the planetary science domain. Even though the data standards were innovative for their time, ambiguity and many assumptions have crept in over almost two decades of use. This situation has caused significant problems for PDS operations, data providers, and end-users. In 2006 the International Planetary Data Alliance (IPDA) was formed. Charter stated that the PDS data standards, as the de-facto planetary standards, should reviewed and the core elements be adopted. The PDS4 task was initiated to formalize the PDS data standards and address the issues.

PDS4 Key Goals Enable a stable and usable long-term archive. Enable more efficient archive preparation for data providers. Enable services for the data consumer to find the specific data they need and provide the formats they require.

PDS4 Design Principles The data model: is defined in a formal language is independent of implementation defines a few fundamental data structures that do not evolve over time is extensible enabling it to handle more complex data formats The archive data formats is being designed independent of data provider and data consumer formats. The data architecture shall include a standard data dictionary model.

PDS 2010 Architecture 8

PDS4 Data Product Model Components Registry Object & Web Resource Product Description Combinations Data Object Description Structure Data Classification

Data Product Concept Map 10 Web & Registry View Basic I/O ViewProgrammer View User/Designers View

The Model Design Process Master model constrains data design and defines validation. Master model and data dictionary are tightly coupled. Document writer translates modeling information into target languages based on grammar and munging rules. Updates to master model are reflected quickly in the documents.

PDS4 Data Standards Deliverables PDS4 Information Model The Information Model defines PDS object classes. This includes data structures, formats, and products as well as data sets, documents, missions. PDS4 Data Dictionary Model The Data Dictionary Model provides the schema for the PDS data dictionary. The data dictionary documents the data elements used in the PDS4 Information Model. PDS Standards Reference V4.0 The PDS Standards Reference V4.0 will be written in the format and tone of a standards reference document. Grammar Options The Grammar is used to capture PDS archive metadata for product labels. 12

Benefits of the PDS4 Data Model The data model is managed in a data modeling tool. The model is formally defined. The model can be validated and tested. Defines a few simple fundamental data structures. Fundamental data structures may be extended and combined to form more complex data formats The overall architecture is model driven. Disentangles the model from its implementation. Model can evolve over time as research domain changes. Drives the generation of documentation, label schema, and other model dependent artifacts. The data dictionary uses a standard data dictionary model.

Backup 14

PDS4 Product Label Creation Label 15 Model Generic Label Schema Design Populate Extract Specific Label Schema Mission Specific Data Dictionary Ingest/ Update Ingest Label Product Labels Modeling Tool A Design Data Engineer Modeling Tool B Load Edit Validate Generate Current using editor Current using Oxygen/Future Design Tool Proposed Ingest DDWG

Topics The View from the Field What are other disciplines and agencies doing for preservation/stewardship? How do they deal with databases, collections of files, physical objects, ad-hoc services such as work flows? How do they deal with provenance? Any lessons to be learned and incorporated into earth science practice Copyright 2009 California Institute of Technology Government sponsorship acknowledged

PDS4 Data Design Accomplishments - General Project and Project Members Defined Principles and Drivers updated General Data Model (Draft) Product Data Model (Draft) Data Dictionary Model (Draft) Grammar Options PDS Standards Reference (Outline) Data Dictionary Model (Final) Grammar Decision PDS Standards Reference (Draft) PDS and community wide review General Data Model (Final) Product Data Model (Final) PDS Standards Reference (Final) 17 Done Substantial Progress Next 3 Months Due 9/31/10

Proposed IPDA Data Standards Project 18 Identify the core elements of the PDS4 data standards Develop a process for maintaining alignment between the IPDA and the PDS Unique Namespace

Charter for the IPDA Data Architecture Standards 19 “The data standards within the IPDA, including the data models and derived dictionaries, are based on the NASA Planetary Data System (PDS) standard that is the de-facto standard for all planetary data at the time of the IPDA founding”. Charter of the International Planetary Data Alliance, 3rd Draft, May 24, 2007

PDS4 Data Design Accomplishments - Details 20 Four basic data structures Homogeneous N-dimensional array of scalars – Array_Base Heterogeneous repeating record structure of scalars – Table_Base Unencoded byte stream Encoded byte stream Identifiable Digital Product Data Product Product_Image_Grayscale …Image_3D …Movie …Table_Character …Table_Binary …Table_Binary_Grouped Document Set Software_Set Non-Digital Product Mission Instrument Resource Data Type Binary Data Type Decimal Integer … Character Literals Character Integer … Array_Base Array_2D Image_Grayscale Spectrum_2D Array_3D Image_3D Movie Table_Base Table_Character Table_Binary Table_Binary_Grouped Draft PDS Data Dictionary Model Grammar – ODL+, PVL, and XML Labels

Example Results Image _Grayscale Concept Map UML Class Diagram XML Schema PVL Label Template Class Definition Table <xs:element name="axes_order" type="axes_order_Type" /* ******* Label Template - Product_Image_Grayscale Object = Product_Image_Grayscale; Object = Image_Grayscale; local_identifier = ${local_identifier};

PDS4 Product Label Creation Label 22 Model Generic Label Schema Design Populate Extract Specific Label Schema Mission Specific Data Dictionary Ingest/ Update Ingest Label Product Labels Modeling Tool A Design Data Engineer Modeling Tool B Load Edit Validate Generate Current using editor Current using Oxygen/Future Design Tool Proposed Ingest DDWG

Query Model A formal model consisting of classes, attributes, and relations that are appropriate for use as search constraints Types of query models include Data Set Product Data Product Document Product Software Product Any PDS4 Identifiable The query models are subsets of the archive model Can be augmented with external metadata (e.g. LDD) Example query constraints general parameters - time, target, mission, instrument host, instrument any metadata defined in the archive model any association between two classes e.g. documents associated with data sets class type and hierarchy geometry - (lat, lon), (az, el), (ra, dec), (x,y,z).