LIRICS Mid-term Review 1 LIRICS WP2 – NLP Lexica Monica Monachini CNR-ILC - Pisa 23rd May 2006.

Presentation transcript:

Slide 1: LIRICS WP2 – NLP Lexica. Monica Monachini, CNR-ILC, Pisa, 23rd May 2006.

Slide 2: Summary of the presentation
- Overview of WP2; first-year objectives
- Main results in T2.1 and T2.2
- Work done
- Synergies with other LIRICS WPs, ISO activities, meetings
- Priorities for future activities

Slide 3: WP2 overall objective
Define a "family" of standards for NLP lexicons, organized on two levels:
- High-level specifications provide the structural elements, i.e. the lexical classes and the relations between them (the meta-model).
- Low-level specifications provide standardized constants, i.e. the data categories used to "adorn" the lexical classes (ISO 12620).
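The split between structural classes and data categories can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the registry contents, the category names and the `validate` helper are hypothetical and are not taken from ISO 12620 or from the LIRICS deliverables.

```python
from typing import Dict, List, Set

# Hypothetical mini-registry: data category name -> permitted values.
# A real registry would be far larger and maintained outside the lexicon.
REGISTRY: Dict[str, Set[str]] = {
    "partOfSpeech": {"noun", "verb", "adjective"},
    "grammaticalNumber": {"singular", "plural"},
}


def validate(adornments: Dict[str, str]) -> List[str]:
    """Check the data categories attached to one lexical class instance.

    Returns a list of problems; an empty list means every adornment
    is a registered category with a permitted value.
    """
    problems: List[str] = []
    for name, value in adornments.items():
        if name not in REGISTRY:
            problems.append(f"unknown data category: {name}")
        elif value not in REGISTRY[name]:
            problems.append(f"value {value!r} not allowed for {name}")
    return problems


print(validate({"partOfSpeech": "noun", "grammaticalNumber": "plural"}))
print(validate({"partOfSpeech": "nuon"}))
```

The point of the design is that the structural level never hard-codes linguistic constants: a lexicon can change its descriptors by pointing at a different registry without touching the class structure.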

Slide 4: WP2 T2.1 overview and objectives
- Drawing on past and ongoing standardization activities, gather the linguistic information considered relevant for lexical description, to be combined with the layers of the lexical model.
- Provide coherent input to the revision of the ISO Data Category Registry.

Slide 5: WP2 T2.1 results
- Proposal for a unified set of lexical information and unified descriptors, as a draft set of Data Categories.
- Maximal set of candidate lexical data categories, subdivided along the layers of linguistic description: morphosyntax, syntax and semantics.
- The Data Categories shared between WP2 and WP3 that are relevant to morphosyntactic description have been incorporated in the Syntax Tool as the Morphosyntactic Profile.

Slide 6: WP2 T2.1 deliverables
[Timeline chart: months M1 to M30 across the 1st, 2nd and 3rd years, with rows for WP2, T2.1, T2.2 and T2.3.]
- D.2.1 Survey and evaluation of existing standards for lexica
- D.2.1 Survey and evaluation of existing standards for lexica (revision)
- D.2.1 Survey and evaluation of existing standards for lexica (version foreseen in conjunction with the Data Categories, to be issued together with the data model in T2.2)

Slide 7: WP2 T2.2 overview and objectives
- Define a lexical framework: a general, abstract meta-model consisting of a set of structural nodes relevant for lexical description, which enables specific implementations on the basis of common Data Categories.
- Define the common set of related Data Categories.

Slide 8: WP2 T2.2 results
- Formulation of a high-level lexical meta-model, the Lexical Markup Framework (LMF), a flexible environment for user-defined mark-up languages.
- Proofs of concept: mapping exercises of well-known NLP lexicon practices against the model.

Slide 9: WP2 T2.2 deliverables
[Timeline chart: months M1 to M30 across the 1st, 2nd and 3rd years, with rows for WP2, T2.1, T2.2 and T2.3.]
- NLP Lexica standard for CD ballot (submitted at the beginning of 2006)
- NLP Lexica standard for ISO DIS ballot
- Internal milestone for internal quality control

Slide 10: WP2 activities, meetings, synergies
LIRICS WPs: bilateral and trilateral working meetings
- CNR-ILC – MPI: PAROLE-SIMPLE lexical architecture and the LEXUS tool
- WP2 internal meeting: basic structure of the meta-model for lexicons (core model + extensions)
- CNR-ILC – DFKI: convergences between morpho-syntactic and syntactic data; issues for the submission of the NWI on Syntax (SynAF) to ISO
- Pisa, WP2 internal meeting: basic structure of the meta-model for the representation of multiword expressions
LIRICS meetings
- Paris: progress of work within WP2; presentation of the standard core model for lexicons and the extensions for NLP lexicons
- Barcelona: LIRICS Industrial Advisory Board meeting
- Barcelona: presentation of the first bulk of information relevant for lexical description
- Nancy, WP4 TDG3 workshop: connections between lexico-semantic representation and semantic roles in the lexicon
ISO meetings
- Berlin: ISO TC37/SC4 WG4 meetings
- Warsaw: plenary meeting of ISO TC37/SC4; task force for designating generic data category sets for alignment with the level of the metamodel; task force on the representation of MWEs
- Rome, UNI-DIAM Commission: candidature of Italy as P-member of ISO TC37/SC4 (CNR-ILC reference expert)

Slide 11: What is LMF for?
- Provide a common model for the creation and use of lexical resources.
- Manage the exchange of data between and among these resources.
- Enable the merging of electronic resources to form extensive global resources.
Range of topics: monolingual, bilingual and multilingual lexical resources.
- Scalability: the same specifications are to be used for both small and large lexicons.
- Coverage: linguistic description ranges from morphology, syntax and semantics to multilingual representation; languages are not restricted to European languages; the range of targeted NLP applications is not restricted.

Slide 12: Future activities, priorities, plans
Data Categories
- Deliver revision 2 of D2.1: the candidate data categories will receive the necessary adjustments after discussion.
- Extend the ISO Registry to cover further layers of linguistic description: do we need an ISO Syntactic Profile (Beijing)?
LMF model
- Refine the NLP multilingual and MWE extensions.
- Define an XML representation of LMF linguistic objects, to allow unified access to LMF-conformant lexicons through APIs.
- Provide an implementation of test-suite lexical entries: the PAROLE-SIMPLE lexicons are ready to be described according to LMF (LEXUS), put on the LMF server and made accessible via the web.

Slide 13: Structure of LMF
- Core package: structural skeleton, with the basic hierarchy of information in a lexical entry.
- Extensions: extend a subset of the core-model classes; conform to the core model; cannot be used independently of the core model.
- The LMF specifications comply with UML modeling principles.

Slide 14: Core package
- Container for managing the top-level language components.
- The number of words or MWEs in a lexicon is equal to its number of lexical entries.
- Form: a text string that represents a single word or a multi-word expression.
- Sense: specifies or disambiguates the meaning and context of a form.
- Representation Frame: one or more Representation Frames can be associated with a Form; each contains a form and data categories that specify the orthographic type and name of the word.
- A cross-reference pivot that can link to many Lexical Entries within or across Lexicons.
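As a rough illustration of the core hierarchy, the following Python sketch models a lexicon holding lexical entries, each with forms and senses. The class and attribute names are assumptions chosen for readability, the sample data is invented, and the Representation Frame and cross-reference classes are left out; it is not the normative LMF class diagram.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Form:
    # Text string for a single word or a multi-word expression.
    written_form: str
    # Data categories adorning the form (hypothetical names and values).
    data_categories: Dict[str, str] = field(default_factory=dict)


@dataclass
class Sense:
    # Identifier and gloss are illustrative only.
    sense_id: str
    gloss: str = ""


@dataclass
class LexicalEntry:
    forms: List[Form] = field(default_factory=list)
    senses: List[Sense] = field(default_factory=list)


@dataclass
class Lexicon:
    language: str
    entries: List[LexicalEntry] = field(default_factory=list)

    def size(self) -> int:
        # One lexical entry per word or MWE, so counting entries
        # counts the words and MWEs of the lexicon.
        return len(self.entries)


lexicon = Lexicon(language="en")
lexicon.entries.append(
    LexicalEntry(
        forms=[Form("bank", {"partOfSpeech": "noun"})],
        senses=[Sense("bank_1", "financial institution"),
                Sense("bank_2", "side of a river")],
    )
)
# A multi-word expression is stored as an ordinary lexical entry.
lexicon.entries.append(LexicalEntry(forms=[Form("kick the bucket")]))
print(lexicon.size())
```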

Slide 15: Package for extensional morphology
First strategy: describe the morphology by explicitly representing all inflections.

Slide 16: Package for inflectional paradigm
Second strategy: declare an inflectional paradigm and use the inflectional paradigm extension to define it.
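The two morphology strategies can be contrasted with a short Python sketch. The dictionary layout, the paradigm name `regular_verb`, the feature labels and the `expand` helper are all hypothetical, and the suffix-only paradigm is a deliberate simplification of what a real inflectional paradigm extension would need.

```python
from typing import Dict, List, Tuple

# Strategy 1 (extensional): every inflected form is listed in the entry.
extensional_entry: Dict[str, object] = {
    "lemma": "walk",
    "inflected_forms": [
        ("walk", "base"),
        ("walks", "thirdPersonSingular"),
        ("walked", "past"),
        ("walking", "presentParticiple"),
    ],
}

# Strategy 2 (paradigm): the entry only names a paradigm that is
# defined once and shared by all words inflecting the same way.
PARADIGMS: Dict[str, List[Tuple[str, str]]] = {
    "regular_verb": [
        ("", "base"),
        ("s", "thirdPersonSingular"),
        ("ed", "past"),
        ("ing", "presentParticiple"),
    ],
}
paradigm_entry: Dict[str, str] = {"lemma": "walk", "paradigm": "regular_verb"}


def expand(entry: Dict[str, str]) -> List[Tuple[str, str]]:
    """Generate the inflected forms of a paradigm-based entry."""
    rules = PARADIGMS[entry["paradigm"]]
    return [(entry["lemma"] + suffix, tag) for suffix, tag in rules]


# Both strategies describe the same set of inflected forms.
print(expand(paradigm_entry) == extensional_entry["inflected_forms"])
```

The extensional form is simpler to query, while the paradigm form keeps the lexicon compact when many entries share an inflection pattern.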

Slide 17: Package for NLP syntax
- Syntactic Behavior: represents one of the behaviors of one or more senses.
- Construction: describes one syntactic construction and can be shared by all words with the same syntactic behavior.
- Self: refers to the head lexical entry and describes its syntactic properties.
- Syntactic Argument: describes a syntactic actant.
- ConstructionSet: groups various syntactic Constructions and factorizes syntactic descriptions, so that the lexicon holds a minimum of syntactic behavior elements.
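The factorization achieved by sharing constructions can be sketched in Python as follows. The attribute names, the sample verbs and the `frames` helper are assumptions made for illustration, and the Self class is omitted to keep the sketch short.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SyntacticArgument:
    function: str   # e.g. "subject" (illustrative label)
    category: str   # e.g. "NP" (illustrative label)


@dataclass
class Construction:
    name: str
    arguments: List[SyntacticArgument] = field(default_factory=list)


@dataclass
class ConstructionSet:
    name: str
    constructions: List[Construction] = field(default_factory=list)


@dataclass
class SyntacticBehavior:
    sense_ids: List[str]               # one behavior may cover several senses
    construction_set: ConstructionSet  # shared, not copied, between words


transitive = Construction(
    "transitive",
    [SyntacticArgument("subject", "NP"), SyntacticArgument("object", "NP")],
)
intransitive = Construction("intransitive", [SyntacticArgument("subject", "NP")])
optional_object = ConstructionSet("optionalObjectVerb", [transitive, intransitive])

# Two verbs with the same behavior point at the same ConstructionSet.
eat = SyntacticBehavior(["eat_1"], optional_object)
read = SyntacticBehavior(["read_1", "read_2"], optional_object)


def frames(behavior: SyntacticBehavior) -> List[List[str]]:
    """List the argument functions of each construction of a behavior."""
    return [[arg.function for arg in construction.arguments]
            for construction in behavior.construction_set.constructions]


print(frames(eat))
```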

Slide 18: XML representation

Slide 19: Package for NLP semantics
- Predicative Representation: describes the link between a Sense and a Semantic Predicate.
- Semantic Predicate: describes an abstract meaning.
- Semantic Argument: describes a semantic actant and is linked with its syntactic counterpart.
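A minimal Python sketch of these three classes is given below. The attribute names, the role and function labels, the sample predicate and the `role_for` helper are hypothetical; the sketch only shows how a sense can point to a shared predicate whose arguments carry a link to syntax.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class SemanticArgument:
    role: str                                 # semantic actant, e.g. "agent"
    syntactic_function: Optional[str] = None  # link to the syntactic counterpart


@dataclass
class SemanticPredicate:
    label: str                                # abstract meaning
    arguments: List[SemanticArgument] = field(default_factory=list)


@dataclass
class PredicativeRepresentation:
    sense_id: str                             # the Sense being described
    predicate: SemanticPredicate              # the predicate it is linked to


transfer = SemanticPredicate(
    label="transfer",
    arguments=[
        SemanticArgument("agent", "subject"),
        SemanticArgument("theme", "directObject"),
        SemanticArgument("recipient", "indirectObject"),
    ],
)

# Two senses of different words can share one abstract predicate.
links = [
    PredicativeRepresentation("give_1", transfer),
    PredicativeRepresentation("donate_1", transfer),
]


def role_for(predicate: SemanticPredicate, function: str) -> Optional[str]:
    """Return the semantic role realized by a given syntactic function."""
    for argument in predicate.arguments:
        if argument.syntactic_function == function:
            return argument.role
    return None


print(role_for(transfer, "directObject"))
```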

Slide 20: Package for NLP semantics (cont.)

Slide 21: XML representation

Slide 22: Package for NLP semantics (cont.)

Slide 23: Package for multilingual representation
- Sense Axis Relation: describes the link between two different Sense Axis instances.
- Source Test and Target Test: express conditions on the translation on the source-language and target-language side, respectively.
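The following Python sketch shows one way the multilingual classes could interact. It assumes that a Sense Axis groups senses of different languages, and it models the source and target tests as plain predicates over a context dictionary. The attribute names, the `translate` method, the relation label and the sample data are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

Context = Dict[str, str]
Test = Callable[[Context], bool]


@dataclass
class SenseAxis:
    axis_id: str
    senses: Dict[str, str]              # language code -> sense identifier
    source_test: Optional[Test] = None  # condition on the source-language side
    target_test: Optional[Test] = None  # condition on the target-language side

    def translate(self, source: str, target: str, context: Context) -> Optional[str]:
        """Return the target sense if both languages are on the axis and all tests pass."""
        if source not in self.senses or target not in self.senses:
            return None
        for test in (self.source_test, self.target_test):
            if test is not None and not test(context):
                return None
        return self.senses[target]


@dataclass
class SenseAxisRelation:
    label: str              # kind of link between the two axes
    source_axis: SenseAxis
    target_axis: SenseAxis


# "river" maps to "fleuve" only when the river flows into the sea.
to_sea = SenseAxis(
    "A1",
    {"en": "river_1", "fr": "fleuve_1"},
    source_test=lambda ctx: ctx.get("flows_into") == "sea",
)
general = SenseAxis("A2", {"en": "river_1", "fr": "riviere_1"})
relation = SenseAxisRelation("moreGeneral", to_sea, general)

print(to_sea.translate("en", "fr", {"flows_into": "sea"}))
print(general.translate("en", "fr", {}))
```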

Slide 24: Package for multiword expressions