Lirics mid-term review

Slides:



Advertisements
Similar presentations
Using OLIF, The Open Lexicon Interchange Format Susan McCormick OLIF2 Consortium October 1, 2004.
Advertisements

DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
Alan Edwards European Commission 5 th GEO Project Workshop London, UK 8-9 February 2011 * The views expressed in these slides may not in any circumstances.
IUFRO International Union of Forest Research Organizations Eero Mikkola Description of WP2 – NEFIS Metadata and Controlled Vocabularies Standards - work.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. ISO TC 37 Tutorial Gerhard.
ACTS Programme M obile I ntelligent A gents for M anaging the Information I nfrastructure ACTS Programme AC338.
1 European Standardisation and the Identification of ICT Technical Specifications 13th XBRL Europe Day Rome, 6 May 2014 Antonio Conte, Project Manager.
ISOcat introduction 19 June 20121CLARIN-NL ISOcat workshop.
The Language Archive – Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Metadata Component Framework Possible Standardization Work.
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information ISO TC37 SC4 WG Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary,
Linked Data as an enabler of cross-media and multilingual content analytics for enterprises across Europe A.Gómez-Pérez (UPM) Project Coordinator.
LIRICS International Standards in Lexicography Gerhard Budin University of Vienna August 2005.
“The preparation of annual financial reports in a single electronic reporting format will be mandatory as from January 1, 2020” XBRL Europe Working Group:
Barcelona Meeting 21/06/05 MM 1 LIRICS WP2 LIRICS WP2 NLP LEXICA Task Leader: ILC-CNR (Pisa) presented by: Monica Monachini.
1 Proposed PLCS TC Organization and Functional Responsibilities Revision
/21LIRICS IAG Meeting Barcelona LIRICS IAG Meeting /21 Universitat Pompeu Fabra Barcelona Introduction Gerhard Budin.
CLARIN web services and workflow Marc Kemps-Snijders.
APEC-TPT Intermodal & ITS Group Action Plan (Proposed Format) May 2006 (updated – Ha Noi) CONGESTION AHEAD.
Copyright OASIS, 2002 OASIS Topic Maps Technical Committees Standards Update Presentation Knowledge Technologies Conference Seattle , March 11 Bernard.
►Thierry Declerck (DFKI GmbH, LT Lab. Saarbrücken, Germany) Standards and Infrastructures for Language Resources.
LIRICS mid-term review 1 LIRICS WP3: Morpho-syntactic and syntactic annotations Thierry Declerck DFKI-LT - Saarbrücken 23rd May 2006.
LIRICS Mid-term Review 1 LIRICS WP2 – NLP Lexica Monica Monachini CNR-ILC - Pisa 23rd May 2006.
24 Jan 2005 Kick off meeting (Luxembourg) 1 LIRICS Linguistic Infrastructure for Interoperable Resources and Systems ►Kick off meeting presentation ►Proposal.
ISLE: International Standards for Language Engineering A European/US joint project Martha Palmer University of Pennsylvania Tides Kickoff March 22, 2000.
24 Jan 2005 Kick off meeting (Luxembourg) 1 LIRICS Linguistic Infrastructure for Interoperable Resources and Systems ►Kick off meeting presentation ►Proposal.
Reference Information Specifications for Europe Exploitation Guidelines Jörgen Hartnor
“ BIRD Project“ 1 Broadband Access, Innovation & Regional Development” Broadband Access, Innovation & Regional Development” Project Description Ulrich.
The European Localisation Exchange Centre Karl Kelly Event Coordinator LRC electonline.org.
ISOcat introduction 20 March 20121CLARIN-NL ISOcat workshop.
CLARIN work packages. Conference Place yyyy-mm-dd
EVA Workshop, 26 March 2003, Florence, Italy1 COINE Cultural Objects In Networked Environments Anthi Baliou University of Macedonia,Library Thessaloniki,
IGNITE: The Interoperability Standard Verification Initiative For Localisation International Localisation Standards Convention Dublin, November
EuroRoadS A pan-European Road Data Solution Project within the eContent programme.
Towards a roadmap for standardization in language technology Laurent Romary & Nancy Ide Loria-INRIA — Vassar College.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
SEMIC.EU Semantic Interoperability Centre Europe Open Days Workshop eGovernment for Regions Aldo Laudi 7th October 2008.
1 Future Circular Collider Study Preparatory Collaboration Board Meeting September 2014 R-D Heuer Global Future Circular Collider (FCC) Study Goals and.
Co-funded by the European Union Ref. number: LLP FI-ERASMUS-ENW WP2: Identification of Industrial Needs for Open innovation Education in.
ISOcat introduction 10 May /20111CLARIN-NL ISOcat workshop.
Co-ordinated by aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT Business Development activities after project completion Ruben.
Kick Off Meeting Largs, Scotland
eContentplus 2008 Work Programme
WP4 Models and Contents Quality Assessment
Components People Technology Policies Standards Spatial Data.
Usage scenarios, User Interface & tools
FUTURE EVOLUTION OF SHORT-TERM ECONOMIC STATISTICS
CRE8TIVE KO Meeting, Rome Italy Quality Assurance
Trilateral Research EUROPEAN COMMISSION
Online platforms Brussels, September 2016.
Standing Committee Knowledge & Technology
Implementing the ESS Vision 2020
CEPMC Executive Board and General Assembly EC standardisation package
Krister Lindén and Ville Oksanen FINCLARIN / University of Helsinki
ESS Vision 2020: ESS.VIP Validation
CSSSPEC6 SOFTWARE DEVELOPMENT WITH QUALITY ASSURANCE
How will the future European standard (EN) on Electronic Invoicing benefit public entities and service & solution providers? December 1, 2016 Andrea Caccia,
O2’s 3rd Party Developer Programme
LOSD Publication Deirdre Lee
Open Archival Information System
Infrastructrural Language Resources and International Cooperation
Jørgen Friis, ETSI VP SES
SDMX : General introduction H. Linden, Eurostat, Unit B5
Project overseer Song Yanqin May 11, 2004 Honolulu
REFIT Platform 20/02/2019 Diversity Europe Group.
A Global Consensus Process
Reinhard Scholl, GTSC-7 Chairman
Workshop on Structured Information and Implementation Frameworks (SIIF) in Slovenia concerning Urban Waste Water Treatment Directive (91/271/EEC) 3. Ongoing.
APENet and EUROPEANA: Digitization Issues in the European Context
SDI from a technological perspective: Standards
- Kick-off meeting - ERANET Cofund BlueBio WP4 (Leader: AEI)
Presentation transcript:

Lirics mid-term review LIRICS Linguistic Infrastructure for Interoperable Resources and Systems ►mid-term review presentation: aim and work carried out ►Proposal N° 22.236 ►Presented by Laurent Romary (INRIA, France, chair of ISO-TC37/SC4) 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Scope Europe being a mosaic of languages, the processing of multilingual linguistic data concerns a lot of people in Europe And the recent expansion to 10 new EU members intensifies this task (and 2 new EU members Bulgary+Romania next year) Of course, various linguistic data already exist all over Europe But today there exists no established standard to enable interoperability and re-use of multilingual data 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Scope (cont.) And these data need to be improved, extended, processed, merged, used and re-used Of course, translation is directly concerned And to address the whole European population, localised tools regarding to various markets and languages are also concerned But at present, these tasks form a timely and costly part of daily work of Europe’s industry 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Objectives To lower this cost, LIRICS will: Provide Europe with a set of industry validated standards for language resource management ratified within the project lifetime Facilitate the acceptance of these standards by providing an open-source reference implementation platform, related web services and test suites Gain full industry support and input to the standards development via the Industry Advisory group and demonstration workshops Provide a pay-per-use business model for use by industry validated during the project 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Consortium The LIRICS consortium bring together leading experts in the field of Natural Language Processing via participation in ISO committees INRIA (F) specialist in standardisation DFKI (D) sp. in morpho-syntax & syntax processing USFD (UK) provider of the GATE open source platform CNR-ILC (I) sp. in language resources & standardisation UW (A) sp. in terminology management & language codes Util (NL) sp. in computational semantics MPI (D) sp. in meta-data Unis (UK) sp. in language resources IULA-UPF (E) sp. in lexicons & grammars 23rd May 2006 Genoa Lirics mid-term review

Industry advisory group For the standards to have impacts, LIRICS ensures their usability by consulting with a group of industrial users The Industry advisory group is consulted to identify priorities and requirements 21 members: ► NLP solution providers like Systran, Sinequa, Temis or Morphologic ► Lexicon publishers like Longman-Pearson ► End users like EADS-CCR, British Telecom, Telefonica Invest-Des. or HP Membership will be expanded 23rd May 2006 Genoa Lirics mid-term review

Description of the work The deliverables are direct inputs to the ISO ballots WP1: Infrastructure for standard development & quality assurance - to guarantee that the documents produced within the project are designed in accordance with ISO - to guarantee that they reach maturity, soundness and adequacy with the market - attendance at ISO meeting & submission of LIRICS deliverables to ISO 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Desc. of the work (cont.) WP2: Lexicons (connected to ISO TC37/SC4/WG4) - efforts to address standardization have been already undertaken in the past: GENELEX, EAGLES, PAROLE-SIMPLE & ISLE constitute a valuable point for LIRICS - LIRICS relies on the experience accumulated at each centre and capitalises on results of the above mentioned projects, together with European and non European national projects - high compatibility ensured by the formulation of data categories (ISO 12620) - following the ISO milestones, a Lexical Markup Framework ISO-TC37 committee has been submitted to ISO ballot in March 2006. And DIS ballot is foreseen at M27 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Desc. of the work (cont.) WP3: morpho-syntactic & syntactic annotations (connected to ISO TC37/SC4/WG2) - valuable recommendations, best practices and guidelines have been proposed, on which WP3 bases its work (e.g. Eagles, Multext-East) - LIRICS will benefit from ongoing work at the ISO level - check the consistency with legacy data from existing Treebanks (e.g. Penn treebank) and with existing grammars (e.g. Matrix framework from EU project Deep-thought) - morpho-syntactic annotation framework (=MAF) => call for CD ballot in August 2005, on the way for DIS - syntactic annotation framework (=SynAF) => NWIP accepted production of WD-rev-2 in January 2006 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Desc. of the work (cont.) WP4: semantic content (connected to ISO 12620 DCR) - developing standards for all aspects of the semantic content is beyond the scope of LIRICS - but, analysis of recent and emerging systems for the representation and annotation of semantic content - a useful step is the identification of a range of data categories such as temporal-spatial information, verb subcat, reference annotation, word sense information and quantification - data category compilation to be endorsed by the ISO TC37/SC4 Thematic Domain Committee (semantic group) - let’s note that a NWIP will be proposed during the next plenary ISO meeting in Beijing with a LIRICS member involved 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Desc. of the work (cont.) WP5: reference implementation platform - all LIRICS defined ISO standards will be defined on the basis of web services in order to support distributed NLP resources - support « try before you buy » paradigm which enables NLP companies to give temporary access and charge on per-usage basis - provide open-source reference implementation of wrappers for lexicons, morphological analysers, syntactic parsers and semantic annotators 23rd May 2006 Genoa Lirics mid-term review

Lirics mid-term review Desc. of the work (cont.) WP6: dissemination & exploitation - a requirement workshop (M6) to identify priorities and essential characteristics from the Ind. Adv. Gr. has been held in Barcelon - eContent workshop with the existing eContent projects to make language standards known in all relevant areas of industry and economy - a web site and a mailing list is set up and managed 23rd May 2006 Genoa Lirics mid-term review