September 2003ERPANET - Marburg 2003View # 1 Data preservation Using CCSDS and ISO Standards Denis MINGUILLON

Slides:



Advertisements
Similar presentations
XML - Beyond the Hype Everything you must know to succeed in implementing XML Lisa Shreve - Syscom Strategies Mike Rawlins - Rawlins EDI Consulting.
Advertisements

Efficient XML Interchange What is it? Why is it? How does it fit in?
Introduction to the BinX Library eDIKT project team Ted Wen Robert Carroll
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
CESG, Fall 2011, 5 th November 2011 Stuart Fowell, SciSys Device Virtualisation and Electronic Data Sheets.
1 CEOS/WGISS20 – Kyiv – September 13, 2005 Paul Kopp SIPAD New Generation: Dominique Heulet CNES 18, Avenue E.Belin Toulouse Cedex 9 France
SRDC Ltd. 1. Problem  Solutions  Various standardization efforts ◦ Document models addressing a broad range of requirements vs Industry Specific Document.
® IBM Software Group © 2008, 2009 IBM Corporation OSLC Reporting POC implementation Dragos Cojocari March 2010.
ILDG File Format Chip Watson, for Middleware & MetaData Working Groups.
Toulouse School of Graduate Studies Theses and Dissertations ETDs - Why We Do them –We at UNT believe that electronic theses and dissertations enhance.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
CS 425/625 Software Engineering Software Requirements
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
ACG 4401 Introduction to XBRL. What is a Supply Chain?
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
Chapter 12 Creating and Using XML Documents HTML5 AND CSS Seventh Edition.
Linux Operations and Administration
Software Engineering Muhammad Fahad Khan
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
A Domain-Specific RM&IG Solution Designed to Support the Implementation of ISAD(G) Arian Rajh, PhD, Assist. Prof., FFZG Krešimir Meze, Omega software d.o.o.
XML The Overview. Three Key Questions What is XML? What Problems does it solve? Where and how is it used?
ITEC224 Database Programming
CS 360 Lecture 3.  The software process is a structured set of activities required to develop a software system.  Fundamental Assumption:  Good software.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
C. Huc/CNES, D. Boucon/CNES-SILOGIC Specification for the Formal Definition and Transfer Phase of a Producer-Archive Interface.
What is XML?  XML stands for EXtensible Markup Language  XML is a markup language much like HTML  XML was designed to carry data, not to display data.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
Current Applications of the OAIS Model David Giaretta.
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
The european ITM Task Force data structure F. Imbeaux.
1 Introduction  Extensible Markup Language (XML) –Uses tags to describe the structure of a document –Simplifies the process of sharing information –Extensible.
C. Huc/CNES, D. Boucon/CNES-SILOGIC, D.M. Sawyer/NASA/GSFC, J.G. Garrett/NASA-Raytheon Producer-Archive Interface Methodology Abstract Standard PAIMAS.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003.
WGISS /09/2015 DATA PRESERVATION – CNES APPROACH B. Chausserie-Laprée.
CCSDS Meeting CNES Report, 05/12/2004 CNES 1 CCSDS - MOIMS Area Information Packaging and Registry WG CNES Report on activity November April 2005.
CCSDS Meeting CNES Report, 2/11/2004 CNES 1 CCSDS - MOIMS Area Data Archive ingestion WG Information Packaging and Registry WG CNES Report on activity.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
Overview of Previous Lesson(s) Over View  A program must be translated into a form in which it can be executed by a computer.  The software systems.
Chapter 1 Introduction Major Data Structures in Compiler
XML A Language Presentation. Outline 1. Introduction 2. XML 2.1 Background 2.2 Structure 2.3 Advantages 3. Related Technologies 3.1 DTD 3.2 Schemas and.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
CCSDS Meeting data Archive Ingest - June 2006 CNES 1 CCSDS - MOIMS Area Data Archive Ingest WG CNES Report Rome meeting – June 2006 Claude Huc.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Topic 4 - Database Design Unit 1 – Database Analysis and Design Advanced Higher Information Systems St Kentigern’s Academy.
System Requirements Specification
CCSDS Meeting data Archive Ingest - January 2007 CNES 1 CCSDS - MOIMS Area Data Archive Ingest WG CNES Report Colorado Springs meeting – January 2007 Claude.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
Chapter – 8 Software Tools.
1 Integration of the LCP Reporting Into the E-PRTR Scope and Technical proposal November 3rd.
Supporting the design of interactive systems a perspective on supporting people’s work Hans de Graaff 27 april 2000.
Eurostat Sharing data validation services Item 5.1 of the agenda.
CASE Tools and their Effect on Software Quality
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
CNES: CCSDS - MOIMS Area Data Archive ingestion WG Information Packaging and Registry WG CNES Report on activity May-October 2004 Toulouse meeting.
Distribution and components
Tools of Software Development
Part of the Multilingual Web-LT Program
Tutorial 7 – Integrating Access With the Web and With Other Programs
GT Portal v. 2.0 Data Delivery
Presentation transcript:

September 2003ERPANET - Marburg 2003View # 1 Data preservation Using CCSDS and ISO Standards Denis MINGUILLON

September 2003ERPANET - Marburg 2003View # 2 The problem to solve Data is often expensive to produce –Bits must not be lost (preserved medium) –Bits must remain accessible (known syntax) –Bits must remain understandable (known semantics) This presentation focuses on point 2 and 3

September 2003ERPANET - Marburg 2003View # 3 The syntactic aspect How to master the bit organisation –2 possible solutions Define exchange formats –Better done by user communities (ex : FITS for astronomy) –Not generic enough Define a standard way for describing any data –Was the CNES proposal to CCSDS –Was entrusted to CNES (definition and tools) by CCSDS

September 2003ERPANET - Marburg 2003View # 4 History CNES proposed a description language –EAST for data syntax –CCSDS recommendation since May 1997 CCSDS B-2 Issue 2 November 2000 –ISO standard since 2000 ISO CCSDS recommendations

September 2003ERPANET - Marburg 2003View # 5 EAST principles EAST is a standard language –Enhanced Ada SubseT inherits from a huge standardisation work enhanced with a few features missing for data description EAST has not to be mastered to be used thanks to the tools Data syntax description

September 2003ERPANET - Marburg 2003View # 6 EAST principles (2) Assumption : any kind* of digital data (available as an ordered set of bits) can be described as a tree. The minimal tree is a rake * note : optimal for binary data, interesting for raw ASCII format, of no use for XML data To describe such a tree, EAST proposes : composite types for the branches elementary types for the leaves Data syntax description

September 2003ERPANET - Marburg 2003View # 7 EAST data types Composite types are : records arrays lists Elementary types are : integers and reals enumerated characters Data syntax description

September 2003ERPANET - Marburg 2003View # 8 Optionality Some branches or leaves of the data tree may be optional. –EAST offers a way to indicate which data entities are optional and under which condition they exist –Example : Housekeeping data exists if telemetry mode is set to « housekeeping » Data syntax description

September 2003ERPANET - Marburg 2003View # 9 Variable size Some repetitions of data entities may have a variable number of elements –EAST offers a means to indicate how many of those elements are present in the current data Data syntax description

September 2003ERPANET - Marburg 2003View # 10 EAST use conclusion All the features described in the previous slides have been sufficient to describe almost every data encountered since 1997 –Exceptions : variable size not carried by the data variable size provided after the variable size field Data syntax description

September 2003ERPANET - Marburg 2003View # 11 History (2) CNES has also been in charge of writing the recommendation for data semantics description : –DEDSL (D ata E ntity D ictionary S pecification L anguage ) –XML/DTD syntax –CCSDS recommendation since January 2002 CCSDS B-1 Issue 1 January 2002 –ISO standard since 2003 ISO : 2003 CCSDS recommendations

September 2003ERPANET - Marburg 2003View # 12 DEDSL principles The data semantics description can be kept as a set of attribute values. The recommendation defines standard attributes : name, definition, units…. The recommendation gives a standard way to define user attributes to be added to the standard ones if needed Data semantics description

September 2003ERPANET - Marburg 2003View # 13 DEDSL principles (2) A data description consists in the tree organisation (as seen on the sysntactic aspect) completed by all the fields (nodes or leaves) caracteristics given as attributes values. Example of the descripon of a leaf of such a tree description.Example

September 2003ERPANET - Marburg 2003View # 14 OASIS OASIS is the tool to describe data (syntax and semantics) in a friendly way (without any requested knowledge of the EAST or DEDSL syntax) A dedicated Man-Machine interface offers the requested features (those listed previously) to describe data A tool to describe data

September 2003ERPANET - Marburg 2003View # 15 OASIS (2) Example :Example a data tree as built and shown by OASIS A tool to describe data

September 2003ERPANET - Marburg 2003View # 16 OASIS (3) Once the tree and each of its nodes and leaves described, the descriptions can be produced just by a click ( EAST and DEDSL-XML files ) Once saved, the EAST description can be used by the EAST tools to read and write the described data Once saved, the XML dictionary can be published as an html or word document A tool to describe data

September 2003ERPANET - Marburg 2003View # 17 EAST tools Application Programming Interface (API) GENERATOR : for data writing INTERPRETER : for data reading –Both available for C, C++, FORTRAN and Ada Standalone tools (based on the API) data_checker : data vs description compliance ascii_dump : display any data in ascii values DUW : interactive data generator other utilities… Tools for EAST described data

September 2003ERPANET - Marburg 2003View # 18 EAST tools (2) EAST tools are available on : –PC (DOS, NT, Linux), –HP-UX, –SUN-Solaris, –Dec true64 (OSF1), –VAX-VMS –other on request…. Tools for EAST described data

September 2003ERPANET - Marburg 2003View # 19 Interface Control Documents The DEDS-XML OASIS output can be transformed in an html or Word document A tool (named SCRIBE) is being developed for that purpose. Documenting the interfaces

September 2003ERPANET - Marburg 2003View # 20 Synthesis Data dictionary DEDSL/XML XML Data EAST Apis Html or Word document DCI generation File descriptions XML schema Read Write Generate Check ‘RAW’ Data OASIS tool File descriptions EAST Read Write Generate Check XML applications

September 2003ERPANET - Marburg 2003View # 21 Main advantages Only one consistent reference for syntactic description used for input/output and for semantic description (documentation for understanding) Reusable and reliable generic input/output software available User application easier to maintain due to less impact from changes of data formats Help in describing data and producing documentation The project conclusions

September 2003ERPANET - Marburg 2003View # 22 EAST/DEDSL-XML in action Many projects at CNES –SPOT, SSALTO, SIPAD, HELIOS, PHARAO… ESA projects –AMS, DEBAT NASA –Science data rehabilitation Use in other projects

September 2003ERPANET - Marburg 2003View # 23 EAST/DEDSL-XML feed back PROs : Easier sub-system integration –description accuracy (even after long time!!!) –early data simulation –assisted data checking Easier ICD evolutions –full top-down process Helpful in heterogeneous platforms context –automatic conversions Helpful in multi-missions context Projects feedback

September 2003ERPANET - Marburg 2003View # 24 EAST/DEDSL-XML feed back (2) CONs : Performances to be again improved –reading in time constrained context –writing in any case User type sharing to be added –to facilitate evolutions Documentation production to be improved –project customisation –add as much filters as possible to allow adaptation of the document to its target Projects feedback

September 2003ERPANET - Marburg 2003View # 25 Current actions Documentation tool in progress generic tool to transform XML-DED in Word or html documents 2nd prototype delivered Main lacks taken into account by the DEBAT project (ESA/ESRIN) Support to NASA tests on data migration - good first results Support to XML schema production On going activity

September 2003ERPANET - Marburg 2003View # 26 CNES policy for the EAST tools OASIS and EAST tools are freely available They can be downloaded from a CNES server Tools are maintained by a CNES team and a sub-contractor team (CS-SI) Policy : if CNES stops founding the maintenance the source code would be made available Information

September 2003ERPANET - Marburg 2003View # 27 Links For any support request : ( address) For information and documentation east.cnes.fr (web server) To download the software : logiciels.cnes.fr (web server) To get the standards (web server) –then under publications, recommendations and reports, information access and interchange systems. Information