The Role of XML in Cloud Data Integration Presenter: David RR Webber, Oracle Corporation October 15th, 2010.

Introduction
- Cloud services introduce new challenges for information sharing. Some aspects are familiar yet still unresolved, but new techniques can now be applied. A canonical approach underpinning Cloud data exchanges is vital to ensure consistent understanding and enhanced interoperability.
- Developers face many challenges and complexities in using today’s industry standards for information exchanges. How can we simplify this and rapidly develop consistent and conforming information exchanges in the Cloud?
- Open source tools will be discussed, and example exchanges from government and industry will be presented.

Agenda
- Challenges, New and Old
  - Why a canonical approach?
  - Adaptive, agile and context aware infrastructure
  - Avoiding the n(n-1)/2 dilemma
- Ensuring Simplicity at the Foundation
  - Never underestimate the ability of engineers to add complexity
- Open source and open standard solutions
  - Examples from emergency management domain with NIEM*
- Summary and Q&A
* National Information Exchange Model (NIEM) approach

Challenges, New and Old
- Why a Canonical Approach?
- Adaptive, agile and context aware infrastructure
- Avoiding the n(n-1)/2 dilemma

Why a Canonical Approach?
- Traditional XML information exchange has been schema driven, which raises many issues for Cloud integration:
  - W3C XML Schema is inflexible, static, brittle, localized and expensive
- Canonical dictionaries exploit the Cloud approach with distributed availability, flexible collaboration, and dynamic updating and referencing
  - The Amazon Web Services (AWS) catalog is an example of the dynamic approach
- Canonical dictionaries provide the components that underpin the exchanges while leaving the precise exchange formatting open for implementers: the “what”, not the “how”. Content can change hourly on AWS!
- Neutral XML-based syntax is future proof

Canonical XML dictionary
- A collection of distinct components that represent discrete business information for an application domain
- Includes singleton components and combinations of related components together as sub-assemblies
- Information is represented in a simple neutral conceptual data format that captures the critical concepts about the data, e.g. name, description, content type, contextual usage pattern, hierarchy
- Wikipedia definition:

Baking in Interoperability
- Using consistent component definitions dramatically improves interoperability and reuse
- Having formal design methods makes development faster, easier, predictable and repeatable
- Aligning local practice to an industry domain dictionary can reduce complexity and reinforce best practices
- Dictionary definitions can be automatically evaluated for common mistakes, which reduces the opportunity for errors during the design phase
- Generating software artifacts from neutral dictionary definitions ensures reliable information exchange results across user communities and their particular systems, platforms and tools
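
The automated evaluation of dictionary definitions mentioned above can be pictured with a minimal sketch. The three checks here (UpperCamelCase names, non-empty definitions, no names that differ only by letter case) are hypothetical examples of "common mistakes"; they are not taken from the NIEM NDR or from the CAM toolkit, and the entry format (name mapped to definition text) is an assumption.

```python
import re
from typing import Dict, List

# Hypothetical naming rule: one or more capitalized words, e.g. PersonName.
UPPER_CAMEL = re.compile(r"[A-Z][a-z0-9]+(?:[A-Z][a-z0-9]+)*")


def check_entries(entries: Dict[str, str]) -> List[str]:
    """Return human-readable findings for a name -> definition mapping."""
    findings = []
    seen = {}  # lower-cased name -> first spelling encountered
    for name, definition in entries.items():
        if not UPPER_CAMEL.fullmatch(name):
            findings.append(f"{name}: name is not UpperCamelCase")
        if not definition.strip():
            findings.append(f"{name}: definition is empty")
        key = name.lower()
        if key in seen:
            findings.append(f"{name}: duplicates {seen[key]} apart from letter case")
        else:
            seen[key] = name
    return findings


sample = {
    "PersonName": "A name of a person.",
    "personname": "",
    "Last_Name": "A family name.",
}
for finding in check_entries(sample):
    print(finding)
```

Running checks like these before any schema or template is generated is what keeps such errors out of the later design steps.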

Neutral Content Model Representation
- Neutral representations allow business stakeholders to participate in dictionary development without technology barriers
- Concise neutral formats can be viewed as simple spreadsheets as they have no special syntax dependencies
- Based on open public standard specifications, semantic concepts and leading knowledge domain techniques
- Neutral representation prevents lock-in by vendor, syntax, tooling or platforms
- Maximizes flexibility and future proofing of dictionary definitions

Linguistic and Semantic Alignment
- Formal community domain naming and design rules provide consistency of definitions
- Consistency of definitions minimizes duplication and overlapping of dictionary components
- Dictionaries allow collaboration on component development to improve the overall results
- Formal component content detail drives alignment
- Design best practices ensure logical self-contained components that can be selected contextually
- Avoids explosion of complexity and excessive over-definition (e.g. “kitchen-sink” schema)

What is a Canonical Approach?
- There are several flavors of canonical approach, some more complex than others, e.g. UBL vs. OAGi vs. CCTS
- Avoid dependence on W3C XML Schema mechanisms
- Core Components Technical Specification (CCTS): simple components with basic hierarchy
  - Parent components with child entities and/or components
  - Associated attributes that denote context and related factors
- In CCTS parlance these are ABIE, BBIE and ASBIE:
  - Parent = Aggregate Business Information Entity
  - Child = Basic Business Information Entity
  - Attribute = Association Business Information Entity

Conceptual Information Model
Figure: a Canonical Components Dictionary (XML) that follows Naming and Design Rule (NDR) principles and guidelines. It holds Parent (ABIE) items, one for each compound component; Child (BBIE) items, one for each atomic component; and Attribute (ASBIE) entries, the optional attributes of a component.
ebXML CCTS* terms (ABIE, BBIE, ASBIE): Parent = Aggregate Business Information Entity; Child = Basic Business Information Entity; Attribute = Association Business Information Entity.
* CCTS – Core Components Technical Specification

Example – Person Name
- Person Name (ABIE)
  - Language Code (ASBIE)
  - Verified Details? (ASBIE)
  - Has Alias? (ASBIE)
  - First Name (BBIE)
  - Middle Name (BBIE)
  - Last Name (BBIE)
  - Previous Name? (ASBIE)
Language Code may exist independently of Person Name.
Verified Details and Previous Name are flags that denote additional information about the entity they are associated with.
Component items have three aspects: structure relationships, content rules, and definitions.
Naming and Design Rules (NDR) are also important in ensuring shorter names that are not tied to a specific context, e.g. compare PersonName to IncidentPersonName.
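
As a rough illustration, the Person Name component above can be written out as a small XML fragment. The element and attribute names used here (`item`, `attribute`, `name`, `type`) are invented for the sketch; they are not the actual CAM dictionary syntax, which the slides do not show.

```python
import xml.etree.ElementTree as ET


def build_person_name() -> ET.Element:
    """Build a parent (ABIE) with its attributes (ASBIE) and children (BBIE)."""
    abie = ET.Element("item", {"name": "PersonName", "type": "ABIE"})
    # Associated attributes, as listed on the slide.
    for attr in ("LanguageCode", "VerifiedDetails", "HasAlias", "PreviousName"):
        ET.SubElement(abie, "attribute", {"name": attr, "type": "ASBIE"})
    # Atomic child components.
    for child in ("FirstName", "MiddleName", "LastName"):
        ET.SubElement(abie, "item", {"name": child, "type": "BBIE"})
    return abie


person_name = build_person_name()
print(ET.tostring(person_name, encoding="unicode"))
```

The names follow the short, context-free style the slide recommends (PersonName rather than IncidentPersonName), so the same fragment could be reused in any exchange that needs a person's name.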

Methods for creating Canonical Dictionary
- Harvest from collection of domain exchange schema
- Export from SQL database to schema; harvest; rename
- Export from modelling tool to schema; harvest; rename
- Create manually in XML or spreadsheet
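
The "harvest from schema" route can be sketched in a few lines. This is only an illustration under simplifying assumptions: the sample XSD is made up, only inline `xs:element` declarations with a `name` attribute are considered, and an element is classed as a parent (ABIE) when it contains nested element declarations and as a child (BBIE) otherwise. A real harvesting tool would also need to handle named types, references and imports.

```python
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# Made-up exchange schema used only for this sketch.
XSD = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="PersonName">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="FirstName" type="xs:string"/>
        <xs:element name="LastName" type="xs:string"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>"""


def harvest(xsd_text: str) -> dict:
    """Map each declared element name to a provisional component type."""
    schema = ET.fromstring(xsd_text)
    components = {}
    for element in schema.iter(XS + "element"):
        name = element.get("name")
        if name is None:
            continue  # skip ref= declarations in this simplified sketch
        has_children = element.find(f".//{XS}element") is not None
        components[name] = "ABIE" if has_children else "BBIE"
    return components


harvested = harvest(XSD)
print(harvested)
```

The "rename" step named on the slide would follow, bringing harvested names in line with the community's naming and design rules.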

Sample Dictionary Building Processes
Figure: two routes to a dictionary of exchange components that is ebXML CCTS compatible (ABIE, BBIE, ASBIE).
- Option 1, from an Enterprise Data Model (EDM): export the collection of objects from the model as components in XSD syntax (step 1), then import the XSD and refactor it for use with OASIS CAM as a CAM template.
- Option 2, derive from existing exchange XSD schema: import each XSD as a CAM template and merge into the CAM dictionary (step 2).
- In both options, apply Naming and Design Rule (NDR) checks and edits with the NDR evaluation, refactor and renaming tool (step 4), then generate the standard components dictionary XML (Option 1) or merge and generate the dictionary XML (Option 2).
Legend: automated, manual, analyst review.

Ensuring Simplicity at the Foundation
- Never underestimate the ability of engineers to add complexity

Adaptive, agile and context aware infrastructure
- XML validation framework that is configurable dynamically through the use of XML templates and rules.
- “In today's complex information exchanges with XML and associated large XSD schema, coupled with an array of trading partners, it becomes a significant challenge to support and maintain accurate handling of all incoming transactions.”
- “With a more adaptive and fault tolerant process, the application is able to handle a wider variation in content and, hence, more easily support a broad set of interaction partners with reduced support and maintenance costs.”
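
A minimal sketch of the idea of validation that is configured by an XML template rather than compiled from a schema. The template vocabulary here (`template`, `rule`, `path`, `required`, `mask`) is hypothetical and is not the OASIS CAM template syntax; the point is only that the rules are data, so they can be swapped at run time, and that content the rules do not mention is tolerated rather than rejected.

```python
import re
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical rule template; each rule names a path inside the message.
TEMPLATE = """<template root="Order">
  <rule path="OrderId" required="true" mask="[0-9]{6}"/>
  <rule path="Buyer/Name" required="true"/>
  <rule path="Notes" required="false"/>
</template>"""


def validate(message_xml: str, template_xml: str) -> List[str]:
    """Check a message against the rules in a template; return the errors."""
    template = ET.fromstring(template_xml)
    message = ET.fromstring(message_xml)
    errors = []
    if message.tag != template.get("root"):
        return [f"unexpected root element: {message.tag}"]
    for rule in template.findall("rule"):
        path = rule.get("path")
        node = message.find(path)
        if node is None:
            if rule.get("required") == "true":
                errors.append(f"missing required item: {path}")
            continue
        mask = rule.get("mask")
        if mask and not re.fullmatch(mask, (node.text or "").strip()):
            errors.append(f"content of {path} does not match mask {mask}")
    return errors


good = "<Order><OrderId>123456</OrderId><Buyer><Name>Acme</Name></Buyer><Extra>x</Extra></Order>"
bad = "<Order><OrderId>12A</OrderId></Order>"
print(validate(good, TEMPLATE))
print(validate(bad, TEMPLATE))
```

The first message passes even though it carries an `Extra` element that no rule covers, which is the kind of tolerance for partner variation that the quoted passage describes.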

Avoiding the n(n-1)/2 dilemma
- New XML validation framework
- Automotive parts repair with STARBOD example
- Utilizing validation framework with singleton validation templates that are context rule driven
Source:
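
The count in the slide title is read here as n(n-1)/2, the number of distinct pairs among n systems, each of which needs its own mapping when partners integrate point to point. The comparison with one mapping per system onto a shared canonical dictionary is an interpretation of the slide, not a figure taken from it.

```python
def point_to_point_mappings(n: int) -> int:
    """Distinct pairs among n systems: n(n-1)/2."""
    return n * (n - 1) // 2


def canonical_mappings(n: int) -> int:
    """One mapping per system onto the shared canonical dictionary."""
    return n


for n in (3, 10, 50):
    print(n, point_to_point_mappings(n), canonical_mappings(n))
```

For 10 partners that is 45 pairwise mappings against 10 canonical ones, and the gap widens quadratically as partners are added.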

Agile Solution Components
Figure: components of the agile solution. Labels shown include: Canonical Dictionaries (industry dictionary formatted as XML, domain dictionary formatted as XML, definitions repository in XML); Exchange Designer Tool (blueprint toolkit, user interface, review, structure assembly, pick components, wantlist, exchange structure, content hints); Agile Validation Engine (templates, build, CAMV engine, business context rules); XML Schema; WSDL actions (optional); realistic XML exchange test examples; unit test harness; interchanges; domain applications.
Legend: automated, manual.

Leveraging Cloud Deployment strengths
- Collaboration tools for sharing canonical component dictionaries
- Repositories of templates and code lists
- Fault tolerant deployment architectures with redundancy
- Machine accessible APIs to allow real time updates and propagation of changes
- Standards based implementations that provide open access
- Open source resources for shared implementation support

Open Source and Open Standard solutions
- Examples from Emergency Management domain with NIEM, OASIS EDXL, LEXS

Example Emergency Management Scenario
- Emergency Response Services Workflow using OASIS EDXL exchanges
- Haiti demonstrated the need for agile exchanges that cope rapidly with unfamiliar scenarios and environment changes
- Cloud-based sharing of open adaptive common infrastructure components

Top Down Solution Approach
Figure: top down flow from dictionaries to target applications. Labels shown include: industry dictionaries formatted as XML; local domain dictionary formatted as XML; components definition (XML); Enterprise Data Model (EDM), imported and refactored for use with CAM; dictionary repository; Exchange Blueprint Designer user interface (pick components, structure outline, blueprint); exchange generator tools (CAM) with expand structure, build and exchange structure; exchange components; exchange package; target applications.
Legend: automated, manual.

Assembling Components from dictionaries
- Determine your business information exchange components at conceptual level
- Search and locate candidate components from appropriate domain dictionary collections
- Catalogue the parts to be used
- Dictionary components can be referenced individually or as collections by an assembly blueprint that puts them all together to create a complete information exchange
- Components can be selected from multiple dictionaries
- Note any new extension pieces as needed
- Select components from multiple physical dictionary files
- Blueprints themselves also have high re-use value
- Can be sub-assemblies and patterns not just exchange models
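
The assembly step above can be sketched as a blueprint that references components by dictionary and name and is then expanded into a full structure. The dictionaries, component names and the `(dictionary, component)` reference format are all invented for this illustration and do not reflect the CAM blueprint file format.

```python
from typing import Dict, List, Tuple

# Hypothetical dictionaries: component name -> child item names.
DICTIONARIES: Dict[str, Dict[str, List[str]]] = {
    "core": {"PersonName": ["FirstName", "MiddleName", "LastName"]},
    "emergency": {"Incident": ["IncidentId", "IncidentDate"]},
}

# Blueprint: ordered references of the form (dictionary, component).
BLUEPRINT: List[Tuple[str, str]] = [
    ("emergency", "Incident"),
    ("core", "PersonName"),
    ("local", "ShelterCapacity"),  # not in any dictionary: an extension piece
]


def expand(blueprint, dictionaries):
    """Resolve each reference; collect unresolved ones as extensions."""
    exchange, extensions = {}, []
    for source, component in blueprint:
        children = dictionaries.get(source, {}).get(component)
        if children is None:
            extensions.append(component)
            exchange[component] = []
        else:
            exchange[component] = list(children)
    return exchange, extensions


exchange, extensions = expand(BLUEPRINT, DICTIONARIES)
print(exchange)
print("extensions to define:", extensions)
```

Because the blueprint holds only references, the same blueprint can be re-expanded after a dictionary is updated, which is one reason blueprints have re-use value of their own.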

Example Assembly Blueprint Outlines
Figure: two blueprint outlines, the LEXS messaging blueprint and the OASIS EDXL HAVE message. Callouts: reusable messaging envelope constructs; message handling, delivery and control; payload goes here; business functional components; top level sets of business information components; individual component.
These examples are available from the CAM editor install package ~ CAMeditor\eclipse\workspace\CAMEditor\dictionary\blueprints\
LEXS – Law Enforcement eXchange System –

Exchange Development Process Tools
Figure: numbered steps in the exchange development process.
1. Blueprint Designer
2. Search Tools (web tool, Excel)
3. Insert Dictionary Parent Components
4. Expander Tool (industry dictionary, domain dictionary, component definitions)
5. Completed Exchange Template

Summary and Q&A
- Review
- Resource links

Summary
- Canonical XML component dictionaries
- Neutral representation of components
- Deployment to target environments and architectures
- Collaborative development and open source
- Uses open public standards and government guidelines (NIEM)
- Available resources and tools
- Illustrative use cases
- Leverage strengths of cloud-based collaboration resources

Resources
- Resource links
- Supporting supplemental slides

Links and Resources
- DOWNLOADS
  - CAM Toolkit download
- SUPPORTING MATERIALS
  - NIEM Naming and Design Rules (NDR) 1.3
- RESOURCES
  - UN/CEFACT Core Components Technical Specification
  - Tutorials - wiki.oasis-open.org/cam/CAM_Tutorials
  - Specifications - docs.oasis-open.org/cam
  - NIEM site -
  - LEXS site –

Available XML Dictionaries
- LEXS dictionary
- OASIS EDXL dictionary
- OASIS EML dictionary
- NIEM 2.1 dictionaries
  - CBRN dictionary
  - Emergency dictionary
  - Family dictionary
  - Immigration dictionary
  - Infrastructure dictionary
  - Intelligence dictionary
  - Justice dictionary
  - Maritime dictionary
  - Screening dictionary
  - Trade dictionary
  - NIEM core dictionary
- Immigration blueprint
Available from download site direct link: XML, plus spreadsheets and sample blueprint.
Packaged with CAM editor, see dictionary folder of install: XML, plus spreadsheet and blueprint samples.
Note: Those marked in bold are model style dictionaries with recursive components.

Conceptual Information View
Figure: CAM toolkit processing; apply tools in the desktop CAM toolkit editor. A CAM Template holds Structure, Rules and Definitions for the domain data components. Dictionary components are Items (ABIE, BBIE, ASBIE), each described by: Properties (Name, Unique ID, Component Type, Cardinality, Content Type, Content Mask); Children (Group Structure); Context (Where from); Definition; Rules; Language, Label, Notes.
* Required items in Blue

XML View of Dictionary Content
Figure: XML listing of dictionary items, with callouts for Name, Unique ID, Component Type, Cardinality, Content Type and Content Mask, and for the Parent / Child linkage where referenced.
* See slide notes for explanation

Excel Spreadsheet View
Figure: spreadsheet with one item per row and its properties as columns, including Type (ABIE, BBIE) and children.

Mapping to Dictionaries
- You can compare a template of components to a dictionary
  - check within a domain for alignment to dictionary
  - check between domains for interoperability
  - merge new/existing components with dictionary
- Matches on physical names
- Reports matching items and details
- Reports statistics and percentages of matching
- Generates crosswalk XML file
- Compatible with Microsoft Excel
- Report can be used to do spell checking
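
A minimal sketch of the comparison described above: match a template's component names against a dictionary by exact physical name, report the match percentage, and emit the result as XML. The `crosswalk` and `item` element names and their attributes are hypothetical and do not reproduce the report format the CAM toolkit generates.

```python
import xml.etree.ElementTree as ET


def crosswalk(template_names, dictionary_names) -> ET.Element:
    """Compare names and return a small XML report with match statistics."""
    dictionary = set(dictionary_names)
    root = ET.Element("crosswalk")
    matched = 0
    for name in template_names:
        hit = name in dictionary
        matched += hit
        ET.SubElement(root, "item", {"name": name, "matched": str(hit).lower()})
    total = len(template_names)
    percent = 100.0 * matched / total if total else 0.0
    root.set("matched", str(matched))
    root.set("total", str(total))
    root.set("percent", f"{percent:.1f}")
    return root


template = ["PersonName", "FirstName", "LastNme", "IncidentDate"]
dictionary = ["PersonName", "FirstName", "LastName", "IncidentDate"]
report = crosswalk(template, dictionary)
print(ET.tostring(report, encoding="unicode"))
```

The misspelled `LastNme` appears as the only unmatched item, which shows how a report of this kind can double as a spell check on component names.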

Example cross-reference spreadsheet
Figure: formatted view in Microsoft Excel of the imported cross-reference report details (from the generated XML file), showing matched details: item and alignment, definition.