ISLE: International Standards for Language Engineering A European/US joint project Martha Palmer University of Pennsylvania Tides Kickoff March 22, 2000
Outline Background on EAGLES Recent discussions on EU/US cooperation ISLE project under NSF funding Current ISLE structure Next meeting
History of EUROPEAN EAGLES B egan 1992 with agreement among –NREC, ET7, ACQUILEX, MULTILEX, GENELEX, SAM,TEI EAGLES launched in 1993 –European Advisory Group on Language Engineering Standards
Underpinnings Recognition of the strategic role of Linguistic Resources in HLT Also the need for a shared platform of large-coverage LR as a common infrastructure Must serve the community –all EU languages
Standards are needed for Interoperability of systems Reuse and re-integration of components Training based on Gold Standards Evaluation based on accepted criteria Transition from prototypes to LE products
Gave rise to coordinated development of Linguistic Resources Definition of technical standards –LE EAGLES Creation of LRs for all EU languages –LE PAROLE, LE SIMPLE – EuroWordNet, SPEECHDAT 1&2 Distribution of LRs –LE ELRA
EAGLES Structure: 1 st Phase ( )
EAGLES Structure: 2 nd Phase ( )
Methodology for each WG State-of-the-art survey and investigation Elaboration of Proposals –Discuss, draft, feedback, redraft –Early involvement of industry Validation of Proposals –Prototype implementations –Possible redrafting Dissemination, Promotion, Maintenance
EAGLES Guidelines Lexicons –Morpho-syntactic phenomenaMorpho-syntactic phenomena –SubcategorizationSubcategorization –Semantic encodingSemantic encoding
Discussion of EU/US cooperation LREC Conference in Granada (May, 98) Workshop on Multilingual Information Management (post LREC, Granada) National Academy of Science Workshop on US/EU Cooperation (Washington, June, 98) Post-Coling/ACL Workshop (Montreal, August, 98) SPARKLE/EAGLES Workshops (Pisa,Jan, 99)
NSF funding for EU/US joint projects Discussed at NSF HCI meeting, (Florida, Feb, 1999) html/jointannounce?OpenDocument html/jointannounce?OpenDocument ISLE proposal submitted by Pisa/Penn –EU side: with SDU (DK), ISSCO (CH) –US side: with NYU and ISI
ISLE Structure: Phase I (2000 – 2002)
Computational Multilingual Lexicons WG US feedback on EAGLES Extend to linking Multilingual Lexicons Lexical Entry Tool Implementation of Samples Evaluation
Natural Interaction and Multimodality WG Data Resources Annotation Schemata and Systems Meta-data description
Evaluation WG Quality Models for MT systems Quality Models for Spoken Dialogue Systems
Planned Workshops 2 Meetings each year for WGs First meetings – –NAACL-00, (Seattle, May 4) Planning meeting –LREC 2000, (Athens, June 3) WGs will meet