WG2 Tutorial ISO/JTC1/SC32 Larry Fitzwater (202) SDC JE-4029
WG 2 Areas of Focus
- Metadata Registries.
- IT Enablement.
- Metamodel Framework.
- Conceptual Graphs and Knowledge Interchange Format.

Topics to be Covered
- Registries: a common interest of SC 32 WGs.
  - Registry types.
    - Functionality and content issues.
  - Registry content examples.
    - Metadata management issues.
  - A proposed solution: registry interoperation.
  - Registry standards efforts.
  - Conference 2003: demonstrate interoperability.
    - Standards, software, practice.

Registries
- Using the term loosely, there are several types of registries in the area of data management and interchange.
- The registries have some common, overlapping content, which is extended and used in different ways.
- The registries vary according to their intended purpose, the granularity of their contents, and the level of semantic management.

Types of Registries
[Figure: OASIS/ebXML registries, ISO registries, ontological registries, database catalogs, CASE tool repositories, UDDI registries, and software component registries, each holding common content.]

Types of Registries
- OASIS/ebXML (Organization for the Advancement of Structured Information Standards/electronic business XML) XML Registries: XML artifacts.
  - Register XML artifacts, including schemas and DTDs. XML schemas relate to a paper form, EDI document, or reporting requirement.
  - Emphasis on syntactic (structure) information.
- ISO Metadata Registries: data semantics.
  - Register data elements, components of data elements, and groups of data elements. For example, country codes for customer place of residence. Includes data element concepts, data elements (including representation), value domains, and (multiple) taxonomies.
  - Emphasis on semantic information such as definitions of data elements and value meanings, and stewardship responsibilities.
- UDDI (Universal Description, Discovery, and Integration) Registries: Web-based business services.
  - A white pages directory of the business activities of firms.
  - Emphasis on interface specifications.

Types of Registries (continued)
- Database System Registries (system catalogs, data dictionaries, repositories): schema, integrity, and operational information.
  - Include all elements of database schemas: data elements, relations, integrity constraints.
  - Emphasis on the information required to make database systems work for queries, etc., not on semantic management.
- CASE Tool Registries (encyclopedias, repositories): data model and application program logic.
  - Include the information needed to create a database and potentially the program code for a system. Contain database schemas.
  - Emphasis not on semantic management.
- Ontological Registries: concept structures.
  - Include relations among concepts (subsumption, inheritance, ...) and axioms for inference among concepts, e.g., temporal/spatial reasoning.
  - Emphasis is on semantics.

Types of Registries (continued)
- Software Component Registries: software components.
  - Reuse software components built on specific vendor platforms such as EJB (Enterprise JavaBeans), COM, and CORBA IDL (Common Object Request Broker Architecture Interface Definition Language).
  - Include basic common elementary objects and object patterns.
  - Emphasis on the reusability of software components to build flexible and standardized business systems.

Registries: Example of Common Content
[Figure: the same set of registries (OASIS/ebXML, ISO, ontological, CASE tool repositories, UDDI, software component registries, database catalogs), with "Country Name" shown as an example of the common content they share.]

Registries: Management of Common Content
- Overlapping content presents the usual multiple entry, synchronization, and maintenance problems for users.
- Data management must also accommodate differences in data representation.

Example of Common Content: Country Identifier

Data Element Concept
  Name: Country Identifiers
  Unique ID: 5769
  Other attributes shown: Context, Definition, Conceptual Domain, Maintenance Org., Steward, Classification, Registration Authority, Others.

Data Elements
  Unique ID: 4572
  Other attributes shown: Name, Context, Definition, Value Domain, Maintenance Org., Steward, Classification, Registration Authority, Others.

Representations of the same countries (an ISO Numeric Code column also appears on the slide):

  ISO 3166 English Name   ISO 3166 French Name   ISO Alpha Code (2-letter)   ISO Alpha Code (3-letter)
  Algeria                 L'Algérie              DZ                          DZA
  Belgium                 Belgique               BE                          BEL
  China                   Chine                  CN                          CHN
  Denmark                 Danemark               DK                          DNK
  Egypt                   Egypte                 EG                          EGY
  France                  La France              FR                          FRA
  ...                     ...                    ...                         ...
  Zimbabwe                Zimbabwe               ZW                          ZWE

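The country example can be sketched in code: one data element concept, several value domains that represent the same countries differently, and a translation between representations through the shared value meaning. This is only an illustrative sketch, not the registry's actual model. The class names, the `translate` helper, the data element name, and the pairing of Unique ID 4572 with the 2-letter value domain are assumptions; only the codes, the concept name, and the two IDs come from the slide.

```python
from dataclasses import dataclass


@dataclass
class ValueDomain:
    name: str
    values: dict  # permissible value -> value meaning (English country name)


@dataclass
class DataElementConcept:
    unique_id: int
    name: str


@dataclass
class DataElement:
    unique_id: int
    name: str  # hypothetical; the slide leaves the name blank
    concept: DataElementConcept
    value_domain: ValueDomain


ALPHA2 = ValueDomain("ISO 3166 2-letter alpha code", {
    "DZ": "Algeria", "BE": "Belgium", "CN": "China", "DK": "Denmark",
    "EG": "Egypt", "FR": "France", "ZW": "Zimbabwe",
})
ALPHA3 = ValueDomain("ISO 3166 3-letter alpha code", {
    "DZA": "Algeria", "BEL": "Belgium", "CHN": "China", "DNK": "Denmark",
    "EGY": "Egypt", "FRA": "France", "ZWE": "Zimbabwe",
})

COUNTRY_IDENTIFIERS = DataElementConcept(5769, "Country Identifiers")

# Assumption: ID 4572 is paired here with the 2-letter representation.
COUNTRY_CODE = DataElement(4572, "Country Code (2-letter)",
                           COUNTRY_IDENTIFIERS, ALPHA2)


def translate(value, source, target):
    """Map a permissible value to another value domain via its value meaning."""
    meaning = source.values[value]
    for candidate, candidate_meaning in target.values.items():
        if candidate_meaning == meaning:
            return candidate
    raise KeyError(f"{target.name} has no value meaning {meaning!r}")
```

Because both value domains hang off the same concept and share value meanings, a consumer that stores `DZ` and one that stores `DZA` can be reconciled without either changing its representation.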
Data Element List: Address Group (ISO Registry)

  Name:                      Alice Wilson
  Street Address:            161 North Street
  City, State Postal Code:   Happy Valley MO
  Country:                   USA

OASIS/ebXML XML Registry
[Figure: Collaboration Partner #1 and Collaboration Partner #2 exchange XML instances through a repository. Labels: JPN, VI304, List Price, Unit Price, Currency, Dollars, X12, EDIFACT, DISA schema or template, and the sample address (Alice Wilson, 161 North Street, Happy Valley MO, USA).]

UDDI Registry
[Figure: business-to-business use of the sample address (Alice Wilson, 161 North Street, Happy Valley MO, USA).]

CASE Tool Repository

Database Catalog

Ontological Registry
[Figure: concept hierarchy. Labels: Concept, Geographic Area, Geographic Sub-Area, Country, Country Identifier, Country Name, Country Code, Short Name, Long Name, ISO Character Code, ISO Numeric Code, FIPS Code, Distributor Country Name, Mailing Address Country Name.]

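A minimal sketch of how an ontological registry might answer subsumption questions over concepts like these. The parent links below are an assumption: the concept names come from the slide, but the exact arrangement of the hierarchy is a guess, and `ancestors` and `subsumes` are hypothetical helper names.

```python
# Assumed child -> parent links; the slide's exact layout is not recoverable.
PARENT = {
    "Geographic Area": "Concept",
    "Geographic Sub-Area": "Geographic Area",
    "Country": "Geographic Sub-Area",
    "Country Identifier": "Concept",
    "Country Name": "Country Identifier",
    "Country Code": "Country Identifier",
    "Short Name": "Country Name",
    "Long Name": "Country Name",
    "ISO Character Code": "Country Code",
    "ISO Numeric Code": "Country Code",
    "FIPS Code": "Country Code",
}


def ancestors(concept):
    """Return the chain of broader concepts, nearest first."""
    chain = []
    while concept in PARENT:
        concept = PARENT[concept]
        chain.append(concept)
    return chain


def subsumes(general, specific):
    """True if `general` is the same as, or broader than, `specific`."""
    return general == specific or general in ancestors(specific)
```

Under these assumed links, `subsumes("Country Identifier", "FIPS Code")` holds while `subsumes("Country Name", "FIPS Code")` does not, which is the kind of inference a purely syntactic registry cannot make.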
An Example of a Software Component Registry

Needs for Registry Interoperation
[Figure: registries and catalogs used across business activities. Registries: PDES/STEP Registry, UDDI, Component Registry, GCI Registry, EAN Registry, CPFR Registry, RosettaNet Registry, ebXML Registry, Vendor Catalog, Shipper Catalog. Activities: System Development, Manufacturing, Business Negotiation, Business Category, Shipping, Customer Support, CALS, Production Planning, Consumer Profile.]

Consolidation or Interoperation?
- Does one registry acquire all of the key functionalities?
- Do distinct capabilities require multiple registries with interoperability?

Consolidation or Interoperation? (continued)
[Figure: UDDI, OASIS/ebXML XML Registry, ISO Metadata Registry, Ontological Registry, Database Catalog (e.g., SQL), and CASE Tool Encyclopedia.]

Interoperation of Registries
[Figure: OASIS/ebXML registries, ISO registries, ontological registries, database catalogs, CASE tool repositories, UDDI registries, and software component registries, each holding common content and joined by interoperation.]

A Look at the Development Efforts
How can we work toward interoperability?
- Standards developers.
- Software developers.
- Practitioners.

XML Registry Efforts
- OASIS Registry/Repository Specification.
  - OASIS and ebXML have merged into a single OASIS Technical Committee chaired by NIST.
  - The strength of the OASIS specification is its classification capabilities. Associations among objects make it easy to compare versions of similar objects.
  - OASIS allows organizations to register, classify, capture associations and dependencies, and record additional metadata for their objects of choice.
  - The XML interface provides easy access. Members can exchange information with one another.
  - A major advantage is that the interface can remain stable for each interchange partner.

XML Registry Efforts (continued)
- OASIS Registry/Repository Specification (continued).
  - Registered objects are associated with two pieces of information:
    - An electronic file consisting of a specific digital instance. The file containing the registered object is maintained by a repository.
    - Metadata for naming, describing, and locating the registered object and identifying its associations and relationships with other objects. The metadata is maintained by the registry.
  - Registry and repository are tied together in that the metadata includes a globally unique locator for a file.
  - Benefits of storing metadata in a registry include reuse of data and metadata, efficient version control, unified understanding of registered objects, and promotion of consistency across organizational areas.
  - May be a driving force in commercial implementations that are interoperable between trading partners and meet government needs.

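The registry/repository split described above can be sketched as two small classes: the repository keeps the files, the registry keeps only metadata, and the two are tied together by a unique locator. This is an illustrative sketch under stated assumptions, not the OASIS API; the class and method names, the `urn:uuid:` locator scheme, and the sample schema names are all hypothetical.

```python
import uuid


class Repository:
    """Holds the registered files themselves, keyed by a unique locator."""

    def __init__(self):
        self._files = {}

    def store(self, content: bytes) -> str:
        locator = f"urn:uuid:{uuid.uuid4()}"  # hypothetical locator scheme
        self._files[locator] = content
        return locator

    def fetch(self, locator: str) -> bytes:
        return self._files[locator]


class Registry:
    """Holds only metadata: description, classifications, associations, locator."""

    def __init__(self, repository: Repository):
        self._repository = repository
        self._entries = {}

    def register(self, name, description, content, classifications=()):
        locator = self._repository.store(content)
        self._entries[name] = {
            "description": description,
            "classifications": set(classifications),
            "associations": [],
            "locator": locator,
        }
        return locator

    def associate(self, source, association_type, target):
        self._entries[source]["associations"].append((association_type, target))

    def find_by_classification(self, classification):
        return sorted(name for name, entry in self._entries.items()
                      if classification in entry["classifications"])

    def retrieve(self, name) -> bytes:
        # Follow the locator in the metadata to the file in the repository.
        return self._repository.fetch(self._entries[name]["locator"])


repository = Repository()
registry = Registry(repository)
registry.register("address-v1.xsd", "Address schema, version 1",
                  b"<schema version='1'/>", ["address"])
registry.register("address-v2.xsd", "Address schema, version 2",
                  b"<schema version='2'/>", ["address"])
registry.associate("address-v2.xsd", "supersedes", "address-v1.xsd")
```

Keeping classifications and associations in the registry means versions of similar objects can be found and compared without opening the files themselves.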
Speaking the Same Language
1. Company X publishes an XML Schema in a public repository.
2. Company Z retrieves the XML Schema from the repository and generates an XML document from it.
3. Company Z sends the XML document to Company X over a shared communications link.
4. Company X verifies the correctness of the received XML document by checking it against the well-known schema.
XML Schema provides cross-organizational XML document interoperability.

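The publish, retrieve, generate, and verify exchange can be sketched with Python's standard library. Note the substitution: the standard library has no XML Schema validator, so the "schema" here is a toy stand-in (a root element name plus an ordered list of required child elements), not a real XSD. The function names, the in-memory `PUBLIC_REPOSITORY`, and the element names are hypothetical.

```python
import xml.etree.ElementTree as ET

PUBLIC_REPOSITORY = {}  # stands in for the public XML Schema repository


def publish_schema(name, root, required_children):
    """Company X publishes a (toy) schema."""
    PUBLIC_REPOSITORY[name] = {"root": root, "children": list(required_children)}


def build_document(schema_name, values):
    """Company Z generates an XML document from the retrieved schema."""
    schema = PUBLIC_REPOSITORY[schema_name]
    root = ET.Element(schema["root"])
    for child in schema["children"]:
        ET.SubElement(root, child).text = values[child]
    return ET.tostring(root, encoding="unicode")


def is_valid(schema_name, xml_text):
    """Company X checks a received document against the published schema."""
    schema = PUBLIC_REPOSITORY[schema_name]
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return (root.tag == schema["root"]
            and [child.tag for child in root] == schema["children"])


publish_schema("Address", "Address",
               ["Name", "StreetAddress", "City", "State", "Country"])
document = build_document("Address", {
    "Name": "Alice Wilson",
    "StreetAddress": "161 North Street",
    "City": "Happy Valley",
    "State": "MO",
    "Country": "USA",
})
```

A production exchange would validate against a real XML Schema with a schema-aware parser; the point of the sketch is only that both parties rely on the same published definition rather than on a private agreement.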
Metadata Registry Efforts
- ISO/IEC Metadata Registries: background.
  - Specifies basic aspects of the kind and quality of metadata necessary to describe data, and the management and administration of that metadata in a metadata registry.
  - Applies to the formulation of data representations, meanings, and relationships between them to be shared, independent of the organization that produces the data.
  - Focuses on semantics (the meaning of data) and management of semantic change.
  - Does not apply to the physical representation of data as bits and bytes; metadata refers to descriptions of other data.

Metadata Registry Efforts (continued)
- Major aspects of the ISO/IEC family of international standards apply to activities including:
  - The definition, specification, and contents of data element dictionaries or data registries, including interchanging or referencing among various collections of data elements.
  - The design and specification of application-oriented data models, databases, and message types for data interchange.
  - The actual use of data in communications and information processing systems.
- Major implementations of ISO/IEC exist for environment, healthcare, intelligent transportation systems, aviation, demographics, and other programmatic areas.
- The desire is to make these metadata registries interoperate with XML registries and with the registries handling deeper levels of semantic management.

Business Registry Efforts
- The UDDI effort defines a set of specifications intended to help businesses accelerate their use of B2B and commerce over the Internet.
- This direct application-to-application integration is a core building block of the digital economy and will help reduce many configuration and compatibility problems.
- UDDI is based on existing Internet standards and is platform and implementation neutral.
- A formal standard is expected. The specification may be relevant to XML and metadata registries, which could themselves be registered in UDDI registries.

Database Catalog Efforts
- SQL has the concept of a catalog.
  - Has the information that a database needs to operate.
    - Not necessarily sharable.
- SQL has the concept of a schema.
  - Metadata accessible by SQL.
- Relational systems may have a "dictionary," but it is not specified by a standard.
- Area of possible coordination between SC 32 WGs.

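As a small illustration of "metadata accessible by SQL," the sketch below reads schema information back out of a database using queries. It uses SQLite through Python's `sqlite3` module, which exposes its catalog through the `sqlite_master` table and `PRAGMA table_info` rather than the SQL standard's `INFORMATION_SCHEMA` views; the `customer` table and its columns are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer ("
    " id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " country_code TEXT NOT NULL)"
)

# The catalog lists the tables the database knows about.
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]

# PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
columns = [(row[1], row[2], bool(row[3]))
           for row in conn.execute("PRAGMA table_info(customer)")]

print(tables)
for column_name, column_type, not_null in columns:
    print(column_name, column_type, "NOT NULL" if not_null else "")
```

The catalog records names, types, and constraints, which is what the database needs to operate. It says nothing about what `country_code` means or who maintains it, which is the gap the semantic registries address.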
CASE Tool Repository Efforts
- There are several proprietary encyclopedia/repository products.
- A common model is difficult to specify.
- SC 7/WG 11 and the CASE Data Interchange Format (CDIF) committee have established a common core of content and an interchange specification.
- Object Management Group (OMG): the Unified Modeling Language (UML) meta-model describes UML objects, the Meta Object Facility (MOF) provides a container, and XMI provides for metadata transfer.
- OMG: UML Profile (a set of stereotypes for a particular domain).

Software Component Registry Efforts
- Many commercial software component registries are available on the market, for example Flashline.com, ComponentAA, and ComponentSquar.com.
- These components are sometimes called Business Objects.
- Software components are developed and reused within specific vendor platforms, such as EJB, COM, and CORBA, each in its own way.
- So far, no common framework is available for sharing components among registries.

Interoperability
- Interoperability approaches.
  - Standards developers.
    - Liaison between standards committees.
  - Software developers.
    - Interoperability conformance testing.
  - Practitioners.
    - Single points of data entry, interchange between software.

Four General Registry Areas: A Work in Progress
Work with NIST and others in the XML Working Group to prepare a report covering registries, including descriptions of:
- Discovery mechanisms.
- Repositories of XML artifacts.
- Semantic registries: business content mapping.
- Application Program Interfaces (APIs).

OASIS/ebXML Integration Approach
[Figure: an ISO Registry holding data semantic content (classifications, associations) connects through an extractor and an XML interface service, over the ebXML transport layer, to the ebXML interface service of an ebXML Registry holding XML syntactic content (classifications, associations). Annotations: marked content domain for extraction; classification synchronization.]

UDDI and Metadata Registry Interoperability
[Figure: publisher and subscriber sharing a joint common catalog (ISO/IEC and other) at build time and run time. Labels: White Pages, Yellow Pages, Green Pages, Brown Pages; Information Metadata (i.e., ontologies, taxonomies, models, etc.); Access Metadata (i.e., security, required technologies, APIs, etc.).]

What is it we need to interoperate?
- A business object framework for the common content and how the registries might interoperate:
  - ebXML (electronic business XML).
  - UN/CEFACT (United Nations Centre for Trade Facilitation and Electronic Business).
  - OMG (Object Management Group).
  - CBOP (Consortium for Business Object Promotion).
  - SC 32.

For the Interoperability of Registries
[Figure: a meta-model framework (Meta-model FW) linking the common content of OASIS/ebXML registries, ISO registries, ontological registries, database catalogs, CASE tool repositories, UDDI registries, the business object registry, and the software component registry.]

Close Look at the Meta-model Framework for Interoperability
[Figure: a meta-meta model above a meta-model for mapping and a meta-model for registering constructs, applied to the common content (terminologies, codes, ontologies, object patterns) of ontological registries, CASE tool repositories, UDDI registries, OASIS/ebXML registries, ISO registries, database catalogs, and software component registries.]

Meta-model Framework
[Figure: a user accessing a federation of Registry A, Registry B, and Registry C, each with a local meta-model, through a meta-model framework. Labels: Meta-meta-model, Common Meta-models, Standard Modeling Facility, Standard Modeling Constructs, Standard Identification & Classification Scheme, Standard Value Domain, Terminology & Ontology.]

Conference 2003 Open Forum
- Major topic: interoperability of registries.
- Participants:
  - Standards developers.
  - Software developers.
  - Practitioners.
- Related topics:
  - Tutorials on the standards.
  - Tracks for application areas: Environment, Transportation, Healthcare, Energy, Aeronautics & Space, Demographics, ...
The Conference intends to encourage progress toward interoperability and showcase progress made.

Discussion Larry Fitzwater SDC JE-4029