Todd King. The first requirement for the Information Model [IM2008] is: 1. The Information Model shall be developed and maintained independent from any.

Slides:



Advertisements
Similar presentations
UCD++: XML-based UCDs Motivation –following on discussion of atomizing UCDs: PCD, PAD, … –2 approaches to tagging metadata 1.XML-based: … good for registries.
Advertisements

Chapter 10: Designing Databases
XML Data Validation An Open QA Framework February 28, 2005 The Exchange Network Node Mentoring Workshop.
Embrace the Elephant A few provocative questions….
EAD Revision: Technical Considerations Terry Catapano EAD Roundtable Meeting
Copyright 2008 Tieto Corporation Database merge. Copyright 2008 Tieto Corporation Table of contents Please, do not remove this slide if you want to use.
ITHAKA Preservation Metadata 2.0: Revising the Event Model A last-minute presentation on work currently in progress Evan Owens VP, Content Management ITHAKA.
Query Evaluation. An SQL query and its RA equiv. Employees (sin INT, ename VARCHAR(20), rating INT, age REAL) Maintenances (sin INT, planeId INT, day.
Query Evaluation. SQL to ERA SQL queries are translated into extended relational algebra. Query evaluation plans are represented as trees of relational.
Improving code generation. Better code generation requires greater context Over expressions: optimal ordering of subtrees Over basic blocks: Common subexpression.
Lecture 14 XML Validation. a simple element containing text attribute; attributes provide additional information about an element and consist of a name.
Major Exam II Reschedule 5:30 – 7:30 pm in Tue Dec 5 th.
Overview of Software Requirements
IRS XML Standards & Tax Return Data Strategy For External Discussion June 30, 2010.
File and Database Design; Logic Modeling Class 24.
2005 W3C XML Schema Workshop Path Forward Soumitra Sengupta Microsoft.
1 Data Strategy Overview Keith Wilson Session 15.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Software Evolution Planning CIS 376 Bruce R. Maxim UM-Dearborn.
Lecture 15 XML Validation. a simple element containing text attribute; attributes provide additional information about an element and consist of a name.
Ertan Deniz Instructor.  XML Schema  Document Navigation (Xpath)  Document Transformation (XSLT)
PDS4 Build 3b Status and Readiness Steve Hughes MC Face-to-Face Columbia, Maryland April 2-3, 2013.
February 1 & 31 Csci 2111: Data and File Structures Week4, Lectures 1 & 2 Fundamental File Structure Concepts & Managing Files of Records.
Fundamental File Structure Concepts & Managing Files of Records
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
Concepts and Terminology Introduction to Database.
Introduction to MDA (Model Driven Architecture) CYT.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
Exchange Design Best Practices Tools for Successful Flow Design and Implementation 1.
Global Justice XML Data Model Naming and Design Rules 17 August 2005.
ISetup – A Guide/Benefit for the Functional User! Mohan Iyer January 17 th, 2008.
US Army Corps of Engineers BUILDING STRONG ® Local Data Requirements and Definitions USACE SDSFIE Training Prerequisites: Creating a Data Dictionary for.
BizTalk Flat File Parsing Annotations. Flat File Parsing = LL(k) Parser The flat file parser is entirely grammar driven and is implemented as an LL(k)
KFS Data Mapping Leveraging a new KFS 3.0 feature.
ETL Extract. Design Logical before Physical Have a plan Identify Data source candidates Analyze source systems with data- profiling tools Receive walk-through.
1 - A View from the Field - The Next Generation Data Standards For the PDS - PDS4 - ESIP Federation Meeting July 8, 2009 J. Steven Hughes JPL Copyright.
New ITS Investigation NHS CfH Research Report Grahame Grieve, Laura Sato, Charlie McCay.
FILES AND DATABASES. A FILE is a collection of records with similar characteristics, e.g: A Sales Ledger Stock Records A Price List Customer Records Files.
1 Overview of XSL. 2 Outline We will use Roger Costello’s tutorial The purpose of this presentation is  To give a quick overview of XSL  To describe.
Recall The Team Skills 1. Analyzing the Problem (with 5 steps) 2. Understanding User and Stakeholder Needs 3. Defining the System 4. Managing Scope 5.
Software Testing and Quality Assurance Practical Considerations (4) 1.
PDS M/C August 2012 PPI Node Report PDS Management Council August 28-29, 2012 R. J. Walker S. P. Joy T.A. King J. Mafi.
1 Quality Attributes of Requirements Documents Lecture # 25.
PDS Atmospheres Node Plans for PDS4 User Roll-out 8/28/12PDS4 Roll-outStatus 1 Reta Beebe Lyle Huber Lynn Neakrase Jim Murphy Nancy Chanover Joni Johnson.
XML eXtensible Markup Language. XML A method of defining a format for exchanging documents and data. –Allows one to define a dialect of XML –A library.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Consultative process for finalizing the Guidance Document to facilitate the implementation of the clearing-house mechanism regional and national nodes.
Data Standards Development August 29, Topics 1.Current Status 2.What was delivered for Build 2c 3.How was IPDA supported 4.What mission support.
Case Study on Redlining at the ISO Chandi Perera
(1) Test Driven Development Philip Johnson Collaborative Software Development Laboratory Information and Computer Sciences University of Hawaii Honolulu.
1 Exchange Network Shared Schema Components. 2 Shared Schema Components Topics: Introduction to Shared Schema Components Purpose/value of using Shared.
March 1, 2004CS WPI1 CS 509 Design of Software Systems Lecture #6 Monday, March 1, 2004.
Software Requirements Specification Document (SRS)
Technical Steering Committee La Jolla, January 2003 Paul Kiel, HR-XML.
XML Validation. a simple element containing text attribute; attributes provide additional information about an element and consist of a name value pair;
VOTable agenda Current VOTable status Current VOTable status News from Applications News from Applications Questions about VOTable schema Questions about.
T Project Review RoadMappers I2 Iteration
PDS4 Project Report PDS MC F2F University of Maryland Dan Crichton March 27,
Requirement Elicitation Review – Class 8 Functional Requirements Nonfunctional Requirements Software Requirements document Requirements Validation and.
Interface Concepts Modeling Core Team
Essential Needs of Software Test Automation
Writing Product Requirements
Software Specification Tools
The Features of a Product or System
Project Charter I want to design a project
Embedded Development Application Note: Endian-Independence
Analysis models and design models
HR Data Dictionary Overview Dallas, October 2003
Writing Product Requirements
Lecture 20: Representing Data Elements
Presentation transcript:

Todd King

The first requirement for the Information Model [IM2008] is: 1. The Information Model shall be developed and maintained independent from any specific technology choices, implementations, or expressions. That is: The IM is implementation neutral.

 Early on it was decided that the target implementation was to be XML.  XML Schema was used to express the IM  It was realized that constraining content (by nodes and missions) would lead to a proliferation of XML schema.  In 2012 Schematron was adopted so that content constraints could be expressed as rules.

 As with past builds we performed detailed reviews of the IM.  Test Coverage: Approx. 80% of the model (excludes 8 Array classes): Product_Observation, Identification_Area, File_Area_Observational, Table_Binary (Record_Binary, Field_Binary), Table_Character (Record_Character, Field_Character), Table_Delimited (Record_Delimited, Field_Delimited), Group_Field_Binary, Group_Field_Character, Group_Field_Delimited, Time_Coordinates, Investigation_Area, Observing_System, Target_Identification, Mission_Area, Discipline_Area, Internal_Reference, External_Reference, Product_Collection, File_Area_Inventory, Product_Bundle, Bundle_Member_Entry, Context_Area, Reference_List, Product_Browse, File_Area_Browse, Product_Document, Product_Context, Agency, Facility, Instrument_Host, Instrument,Mission, Target, Node, Investigation, Target

Be consistent  Apply naming conventions and formation rules across the board. Refactor to Simplest Form  Reducing to common denominators (see Field_*)  Removing elements with single values. (see record_delimiter in Table_Character) Eliminate redundant information by:  Removing counts of things that can be counted (see fields in Record_*)  Bytes in data_type where length in bytes is an attribute. Don't allow the undesirable  Don't accommodate bad practices to make "migration" a literal transform. (see data_type and allowing the changing of byte order field by field) Use technology effectively  What is possible with XML schema + Schematron cannot be done with XML Schema alone. Some principals adopted in the Model where based on XML schema alone.

During our build 3b review it was realized that the trifurcation of Table could be addressed with Schematron rules. And that other constraints in the model can be expressed as rules. Simplification and no loss of rigor.

From Table_Binary Record_Binary Field_Binary Table_Character Record_Character Field_Character Table_Delimited Record_Delimited Field_Delimited To Table Record Field The attribute pds:encoding_type must be equal to on of the following values 'Binary', 'Character', 'Delimited'. The attribute pds:record_delimiter is not necessary when pds:encoding_type is 'Binary'. The attribute pds:field_delimiter is not necessary when pds:encoding_type is ' '. The attribute pds:record_delimiter is required and must be equal one of the following values 'carriage_return_line_feed'. The attribute pds:field_delimiter is required and must be equal to one of the following values 'comma', 'horizontal_tab', 'semicolon', 'vertical_bar'.

 Concise It is possible to reduce Table to a much simpler form and eliminate 14 classes in the XML schema.  Consistent Fewer variants can make documentation, training and applications more concise.  Effective It makes the most effective use of the chosen implementation technologies (XML Schema and Schematron).

 If applied to the "Array" class  9 classes can be consolidated to 1.  If applied to "File_Area“  13 classes can be consolidated to 1  If applied to the "Product“  31 classes can be consolidated to 1.  And XPath to elements become more uniform. /Product/Identification_Area/logical_identifier

 Documents  Makes them shorter and more concise.  Software  External - None.  Internal - harvest, registry.  Schedule  Minimal.  To meet immediate needs it can be done with hand edits on the XML Schema+Schematron, later it can be automated.

With so much time, effort and resources invested the IM should be self-consistent, concise, and optimized. We want the IM model to be:  Impressive  The wow factor.  Community impression is important.  An improvement  XML is good, but not enough.  Model should be a refinement.  Innovative  Do something to shift the paradigm.  XML Schema + Schematron is new!

 The simplification is an editorial pass  Like a page limit in a proposal.  Can you say the same thing with fewer words?  Are you using a minimal set of terms?  We’re about 80% there.  Some areas of the IM are ideal. i.e., Product_Context is very well formed.  Others still need work.  Focus on the deliverables  XML Schema and Schematron is what will be used.  The IM spec is an internal document.