DFDL WG Session 1 Summary of Status Mike Beckerle Ascential Software.

Slides:



Advertisements
Similar presentations
© 2006 Open Grid Forum Data Format Description Language (DFDL) Progress Update Mike Beckerle, Oco, Inc.
Advertisements

XMLSchema to TTCN-3 Mapping Importing XML schema based data types into TTCN-3.
Introduction to the BinX Library eDIKT project team Ted Wen Robert Carroll
Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh Alan Chappell PNNL
Datatypes for OGSA Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:
EC-211 DATA STRUCTURES LECTURE 2. EXISTING DATA STRUCTURES IN C/C++ ARRAYS – A 1-D array is a finite, ordered set of homogeneous elements – E.g. int a[100]
Managing XML and Semistructured Data Lecture 12: XML Schema Prof. Dan Suciu Spring 2001.
Problem Solving by Searching Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 3 Spring 2007.
XML Language Family Detailed Examples Most information contained in these slide comes from: These slides are intended.
4 XML Schema.
1 Web Data Management XML Schema. 2 In this lecture XML Schemas Elements v. Types Regular expressions Expressive power Resources W3C Draft:
IT253: Computer Organization Lecture 6: Assembly Language and MIPS: Programming Tonga Institute of Higher Education.
Chapter 4 Enterprise Modeling.
SYSTEM ANALYSIS & DESIGN (DCT 2013)
Systems Analysis and Design 9th Edition
CSE 636 Data Integration XML Schema. 2 XML Schemas W3C Recommendation: Generalizes DTDs Uses XML syntax Two documents: structure.
XML Simple Types CSPP51038 shortcourse. Simple Types Recall that simple types are composed of text-only values. All attributes are of simple type Elements.
Hash Tables1 Part E Hash Tables  
XML Schemas and Namespaces Lecture 11, 07/10/02. BookStore.dtd.
Software Engineering Recitation 2 Suhit Gupta. Today we will be covering… XML II Sockets, Server – Client relationships, Servers capable of handling multiple.
State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.
About the Data-Flow Complexity of Web Processes Jorge Cardoso Department of Computer Science, University of Madeira Funchal Portugal.
Introducing Hashing Chapter 21 Copyright ©2012 by Pearson Education, Inc. All rights reserved.
Chapter 4.
Mapping Physical Formats to Logical Models to Extract Data and Metadata Tara Talbott IPAW ‘06.
Data Formats CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook.
Avro Apache Course: Distributed class Student ID: AM Name: Azzaya Galbazar
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
CE 525. SPATIAL ANALYSIS Using Spatial Analyst Point Density (Data Management) Clipping Cell size - smoothness Raster Calculator (Map Algebra) Combined.
IS432 Semi-Structured Data Lecture 3: XSchema Dr. Gamal Al-Shorbagy.
Dr. Azeddine Chikh IS446: Internet Software Development.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Constraint Satisfaction Problems (CSPs) CPSC 322 – CSP 1 Poole & Mackworth textbook: Sections § Lecturer: Alan Mackworth September 28, 2012.
Prepared By Ms.R.K.Dharme Head Computer Department.
SDPL 2005Notes 2.5: XML Schemas1 2.5 XML Schemas n Short introduction to XML Schema –W3C Recommendation, 1 st Ed. May, 2001; 2 nd Ed. Oct, 2004: »XML Schema.
DFDL WG Session 1 Summary of Status WG Co-Chairs:  Mike Beckerle, Ascential Software Alan Chappell, PNNL.
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
Overview and Applications
Can’t provide fast insertion/removal and fast lookup at the same time Vectors, Linked Lists, Stack, Queues, Deques 4 Data Structures - CSCI 102 Copyright.
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 5 XML Schema (Based on Møller and Schwartzbach,
Sheet 1XML Technology in E-Commerce 2001Lecture 2 XML Technology in E-Commerce Lecture 2 Logical and Physical Structure, Validity, DTD, XML Schema.
1 CCSDS Information Architecture Working Group Daniel J. Crichton, Chair NASA/JPL 14 September 2005.
Advanced Computer Science Lesson 4: Reviewing Loops and Arrays Reading User Input.
1. 2 Preface In the time since the 1986 edition of this book, the world of compiler design has changed significantly 3.
Chapter 4 enterprise modeling
XML Evidence Record Syntax
ESA UNCLASSIFIED – For Official Use SOIS EDS & Toolchain ESA YGT Study F. Torelli & P. Skrzypek CCSDS Fall Meeting /10/2013.
Primer on XML Schema CSE 544 April, XML Schemas Generalizes DTDs Uses XML syntax Two parts: structure and datatypes Very complex –criticized –alternative.
© 2004Kizoom 1 Structured Travel Incidents & Situations SIRI - Service Interface for Real Time Information (CEN-OO ) CENTC278.
Systems Analysis and Design 8th Edition
© 2006 Open Grid Forum Data Format Description Language (DFDL) DFDL 1.0 Public Comment Suman Kalia, IBM
Lecture 0 W3C XML Schema. Topics Status Motivation Simple type vs. complex type.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
 Data Type is a basic classification which identifies different types of data.  Data Types helps in: › Determining the possible values of a variable.
Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh
© 2010 Open Grid Forum Data Format Description Language (DFDL) DFDL 1.0 – Proposed Recommendation Steve Hanson, IBM
Data Format Description Language (DFDL) WG Martin Westhead EPCC, University of Edinburgh
Apache Avro CMSC 491 Hadoop-Based Distributed Computing Spring 2016 Adam Shook.
Binary Data Format (BDF) ARCH/VCDE Small Working Group 03/28/2008
DFDL WG Session 3 Mike Beckerle Ascential Software Two note-takers please?
Washington D.C. 1 DFDL Data Format Description Language Overview & Summary of Status WG Co-Chairs: Mike Beckerle,
Introduction toData structures and Algorithms
DFDL WG Session 2 Mike Beckerle Ascential Software
A DFDL Proposal based on Commercial Data Processing Requirements
Data Modeling II XML Schema & JAXB Marc Dumontier May 4, 2004
Dictionaries 9/14/ :35 AM Hash Tables   4
Software Connectors – A Taxonomy Approach
Chapter 9 Web Services: JAX-RPC, WSDL, XML Schema, and SOAP
THE DATATYPES OF XML SCHEMA A Practical Introduction
CS2013 Lecture 7 John Hurley Cal State LA.
Presentation transcript:

DFDL WG Session 1 Summary of Status Mike Beckerle Ascential Software

DFDL-WG Session 1 Summary of Status Agenda Presentation: primer material (45 mins) Review of purposes/goals XML / XSD impact: Data Model Examples from primer General discussion (40 mins) Overview of other sessions (5 mins)

Data Interchange Formats Prescriptive: Put your data in this format! XML – textual Binary – ASN.1, XDR, NetCDF, HDF... Descriptive: What format is your data in? Various commercial products. Nothing standard.  DFDL

Why Descriptive? Allows us to achieve two goals simultaneously: 1. Interoperability Modern and Legacy data formats 2. Performance! Density Fewest bytes to represent data without resorting to compression Optimized I/O Seekable random access Memory mapped, aligned Without sacrificing general access

Why the GGF for DFDL? Grids are about big-data and big- computation problems Simplistic solutions like “use XML” won’t cut it! Grids are about universal data interchange

Related Standards Efforts Prescriptive systems: W3C binary XML ( Formed, but discussion group has no items. Descriptive systems: None known

XML Synergy Use XSD to describe the logical data Use annotations within the XSD to describe the representation of it.

XSD Types Elements A.k.a. fields Sequence groups, All groups All = unordered group Choice A.k.a. union, redefine, Vectors Use element with minOccurs, maxOccurs. Nillability A.k.a. Nullable values

XML/XSD – basic types anySimpleType stringQNameNOTATIONfloatdoubledecimalbooleanbase64BinaryhexBinaryanyURI normalizedString token languageNameNMTOKEN NMTOKENSNCName ID IDREFENTITY IDREFSENTITIES integer long nonPositiveIntegernonNegativeInteger negativeInteger positiveIntegerunsignedLong unsignedInt unsignedShort unsignedByte int short byte datetimedateTimegYeargYearMonthgMonthgMonthDaygDayduration

Example 1: XML E E8

Example 1: XSD

Example 1 DFDL - binary e8c 169a 54dd 0a1b 4a3f ce29 46f6

Example 1 DFDL - binary

Example 1 DFDL - textual “5, , 8.6E-200, -7.1E8”

Example 1 DFDL - textual

The pieces of the puzzle Primitive types: XSD Data structure: XSD Mappings – representations to/from primitives: DFDL mappings Composition – modular composition of basic mappings: DFDL mapping composition References – associate primitives in a structure with correct mapping: DFDL mapped types N.B. Problem parts have some level independence

Mappings Named black boxes Implementations know how to call them Semantics not described Name Range type Domain type Directionality <mapping name="dfdl:data-bytes" rangeType="dfdl:data" domainType="dfdl:bytes.unbounded" direction="bidirectional"/> <mapping name="dfdl:bytes-int" rangeType="dfdl:bytes.4" domainType="xs:int" direction="bidirectional" argumentType="dfdl:binaryProperties"/>

Composing mappings Simple linear composition (from primer) May be too restricted more complex composition (mapping trees) possible.

Mapped types Associate a new type with a composed mapping Use a trick, null restriction with annotation New type is a valid XML type Simple composition could be included in annotation

About USE For convenience instead of textInt or binaryInt, you create a “use” association Means: All uses of type “xs:int” in the scope of this declaration mean “dfdl:binaryInt”

Example 1 – binary with ‘use’

Next Steps Tuesday session Open Issues Richer data formats: stored length, choice/unions Layered translation, modularity Wednesday session Continuation on Open Issues XML/XSD concerns