Impact of OASIS UIMA Standard on Apache UIMA OASIS Unstructured Information Management Architecture (UIMA) TC

Slides:



Advertisements
Similar presentations
The REPOX system Nuno Freire -
Advertisements

Pierre-Johan CHARTRE Java EE - JAX-RS - Pierre-Johan CHARTRE
Service Oriented Architecture Reference Model
2005 Adobe Systems Incorporated. All Rights Reserved.Adobe Confidential Duane Nickull Adobe ® Service Oriented Architecture Reference Model (SOA RM)
IATI Technical Advisory Group Technical Proposals Simon Parrish IATI Technical Advisory Group, DIPR March 2010.
1 An Update on XML.org Registry and Repository Una Kearns Documentum, Inc.
Requirements. UC&R: Phase Compliance model –RIF must define a compliance model that will identify required/optional features Default.
XPointer and HTTP Range A possible design for a scalable and extensible RDF Data Access protocol. Bryan Thompson draft Presented to the RDF.
© 1998, Progress Software Corporation 1 Migration of a 4GL and Relational Database to Unicode Tex Texin International Product Manager.
OASIS OData Technical Committee. AGENDA Introduction OASIS OData Technical Committee OData Overview Work of the Technical Committee Q&A.
CTS2 DEVELOPMENT FRAMEWORK CTS2 Overview. Schedule What is it? Why a framework? What does this do for me? Plugins Implementations available now CTS2 Compliance.
Service Description: WSDL COMP6017 Topics on Web Services Dr Nicholas Gibbins –
Reliable Interoperation between Open Office & MS office by UOML Alex Wang Chair/OASIS UOML TC Chairman / Sursen Co.
31242/32549 Advanced Internet Programming Advanced Java Programming
Using UML, Patterns, and Java Object-Oriented Software Engineering Chapter 2, Modeling with UML, Part 4 UML 2 Metamodel.
INTER-OPERABILITY IN THE NEW ZEALAND EDUCATION SECTOR USING A SECTOR DATA MODEL DRIVEN METHODOLOGY Presented on April at the New Zealand State.
Common Terminology Services 2 (CTS2)
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
What iS RMI? Remote Method Invocation. It is an approach where a method on a remote machine invokes another method on another machine to perform some computation.
Information Retrieval in Practice
Annotation Types for UIMA Edward Loper. UIMA Unified Information Management Architecture Analytics framework –Consists of components that perform specific.
Apache Axis: A Set of Java Tools for SOAP Web Services.
CMU’s TTO3 Tasks Analyze annotation manuals and annotated samples from all TTO3 teams –Status: Formal UML models have been constructed and merged together.
Basic Definitions Data Structures: Data Structures: A data structure is a systematic way of organizing and accessing data. Or, It’s the logical relationship.
Common Mechanisms in UML
Principle of Functional Verification Chapter 1~3 Presenter : Fu-Ching Yang.
The Role of Modeling in Systems Integration and Business Process Analysis © Sparx Systems Pty Ltd 2011 Ben Constable Sparx Systems.
UIMA Introduction SHARPn Summit June 11, 2012
Overview of Search Engines
Roles and Responsibilities Jahangheer Shaik. Service Specification Specification requires development of three inter-related documents CIM, PIM and PSM.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
WSDL Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
Web Services Experience Language Web Services eXperience Language Technical Overview Ravi Konuru e-Business Tools and Frameworks,
MDA and QVT  Tom Gullion, Director of Product Management, Together Products.
Astronomical Data Query Language Simple Query Protocol for the Virtual Observatory Naoki Yasuda 1, William O'Mullane 2, Tamas Budavari 2, Vivek Haridas.
95-843: Service Oriented Architecture 1 Master of Information System Management Service Oriented Architecture Lecture 10: Service Component Architecture.
XML in Development of Distributed Systems Tooling Programming Runtime.
Copyright © Orbeon, Inc. All rights reserved. Erik Bruchez Applications of XML Pipelines XML Prague, June 16 th, 2007.
3 April SOA: Services Oriented Architecture MDA: Model Driven Architecture.
Introduction to MDA (Model Driven Architecture) CYT.
UIMA SHARP 4 - NLP May 25, Outline UIMA Terminology (not just TLAs) Parts of a UIMA pipeline Running a pipeline Viewing annotations Creating a new.
A language to describe software texture in abstract design models and implementation.
The LDAP Schema Registry and its requirements on Slapd development OpenLDAP Developers' Day San Francisco 21 March 2003 Peter Gietz, DAASI International.
Common Terminology Services 2 CTS 2 Submission Team Status Update HL7 Vocabulary Working Group May 17, 2011.
A Component Platform for Experimenting with Autonomic Composition A component framework for supporting composition of autonomic services and bio-inspired.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE User Forum, Manchester, 10 May ‘07 Nicola Venuti
Combining GATE and UIMA Ian Roberts. University of Sheffield NLP 2 Overview Introduction to UIMA Comparison with GATE Mapping annotations between GATE.
Service Component Architecture (SCA) Policy TC … Face to Face Agenda – Jan 24,
Reviews Crawler (Detection, Extraction & Analysis) FOSS Practicum By: Syed Ahmed & Rakhi Gupta April 28, 2010.
SHIWA Desktop Cardiff University, Budapest, 3 rd July 2012.
Ontologies Reasoning Components Agents Simulations An Overview of Model-Driven Engineering and Architecture Jacques Robin.
Combining GATE and UIMA Ian Roberts. 2 Overview Introduction to UIMA Comparison with GATE Mapping annotations between GATE and UIMA.
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
Model Driven Architecture MDA SE-548 Lale Doğan
Information Retrieval in Practice
Extending Model-Driven Engineering in Tango
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
SysML v2 Formalism: Requirements & Benefits
CORBA Alegria Baquero.
OASIS UIMA Technical Committee Specification Overview (DRAFT v0.2)
CORBA Alegria Baquero.
Instance Model Structure
Constructing MDA-based Application Using Rational XDE for .NET
Combining GATE and UIMA
QoS Metadata Status 106th OGC Technical Committee Orléans, France
Task Manager & Profile Interface
Software Architecture & Design
Presentation transcript:

Impact of OASIS UIMA Standard on Apache UIMA OASIS Unstructured Information Management Architecture (UIMA) TC February 19, 2008

Impact on Apache UIMA Well Aligned –CAS Data Representation: Apache UIMA already uses XMI –Abstract Interfaces: Very similar to the Apache UIMA interfaces Minor Differences –Type System Language Apache UIMA supports Ecore import/export, but still uses its own native type system language, and there are a few minor mismatches in semantics. Many constraints expressible in Ecore are not enforced. –Processing Element Metadata: Apache UIMA has some things not in the proposed standard: Indexes Configuration Parameter Groups –SOAP Interfaces: Would need to be implemented as new Apache UIMA service adapters. Apache UIMA is designed to make this relatively easy to do. Major Differences –Type System Base Model: significant differences Apache UIMA Annotation Base Model is different from proposed standard Apache UIMA Views are 1-1 with Sofas –Behavioral Metadata: Apache UIMA has limited behavioral metadata and it lacks precise semantics (and therefore cant be automatically converted to the proposed standard format).

Type System Base Model Differences Naming differences –Apache: uima.tcas.Annotation –OASIS: org.oasis-open.uima.TextAnnotation Differences in begin, end offsets? –Apache: UTF-16 code units (works well for Java) –OASIS: Unicode characters? (better for interoperability across platforms) Differences in what can be a Sofa –Apache: Annotation points to an object of type Sofa, which contains the data –OASIS: Annotation points to an object of type SofaReference, which then points to another object in the CAS that holds the actual Sofa data OASIS allows separate Regional Reference object. Differences in Views –Apache: Every Sofa has exactly one View, and every View must have a Sofa. –OASIS: Views are general collections of objects. A View may be linked to a Sofa, but this is not required. More than one View may be linked to the same Sofa.

Behavioral Metadata Differences Apache UIMA semantics are not well-defined –Allows specifying inputs and output but… –Not clear whether an input is required or optional –Not clear how input and output types relate to input & output Sofas Apache UIMA allows specifying multiple sets of capabilities –Allows specifying that different outputs may be produced depending on what inputs are received –Rarely Used

Service-Level Compliance Apache UIMA service adapters –Apache UIMA Analysis Engines can be deployed as UIMA- Standard-Compliant services. Implementing getMetadata operation –Type System converted to Ecore using existing converters (also need to convert base type system!) –PE Metadata can be serialized to standard-compliant format Some things such as configuration groups would not be supported –Issue How Can Apache UIMA Capabilities be published as OASIS UIMA Behavioral Metadata?

Service-Level Compliance Implementing process operation – need to map OASIS base type system. Not trivial but possible. –Convert Type Names –Convert character offsets –Create a Sofa object for everything thats pointed to by a SofaReference –Separate RegionalReference objects might not be supported (I think the spec may not require them to be) –For non-anchored Views could create a dummy Sofa –Not Clear: What to do with an incoming CAS where more than one View points to the same Sofa??

Deeper Compliance Over time Apache UIMA could internalize more of the UIMA standard representations. –Type System could natively use Ecore, and provide additional enforcement of Ecore constraints. –Descriptor formats could support UIMA-standard metadata XML. –Behavioral Metadata could move to the OASIS standard and get away from the underspecified representation currently supported. –The standard Type System Base Model could be supported natively in the CAS.