Smart Computer-Aided Translation Environment Project Progress and State of Affairs Vincent Vandeghinste.

Overview per work package
WP0 – Project management
WP1 – Translation Technology Improvements
WP2 – Evaluation of Computer-Aided Translation
WP3 – Terminology Extraction
WP4 – Speech Recognition
WP5 – Workflows and Personalized User-Interfaces
WP6 – Integration, Evaluation, Valorisation, and Dissemination
SCATE T24 Meeting

WP0. Project management
Project progress of the last six months: described in D Activity Report T24
Technical description of the second project year: D0.3.2 Technical report T24 (available upon request for IAC members)

WP1. Translation Technology Improvements
Task 1.1 Improve Fuzzy Matching
    human evaluation of previous work
Task 1.2 Improve Machine Translation
    Task Improve Transduction
    D Report on improved transduction
        preliminary version was delivered
        will be updated later
Task 1.3 Integration of TM into MT
    work has started
Presentations
    Collaborative Translation by Tom Vanallemeersch
    Syntactic Concordancing with Poly-GrETEL by Liesbeth Augustinus

WP2 Evaluation of Computer-Aided Translation
Task 2.1 Taxonomy and annotated data set of typical translation errors
    Rule-based MT errors have been added to the corpus
Task 2.2 Logging and analysis of human-machine interaction
    Presentation of Arda Tezcan
Task 2.3 Confidence estimation of MT
    currently ongoing

WP3 Terminology Extraction from Comparable Corpora
Task 3.1 Study of translators’ methods to acquire domain knowledge and terminology
    Presentation of Iulianna van der Lek
Task 3.3 Terminology extraction from comparable text
    Use of compound splitter as preprocessing for cross-lingual term extraction
    Is recent progress in neural word embedding applicable to terminology extraction?
    Presentation by Sien Moens

WP4 Speech Recognition
Task 4.2 Automatic Domain Adaptation
    Presentation of Joris Pelemans and Geert Heyman

WP5 Work Flows and Personalised User Interfaces
Task 5.2 Modelling Personalised Workflows for Translation Practices
Task 5.3 Visualisation of Translation Features
Task 5.4 Interfaces for Translation Work
Presentation by Jan Van den Bergh

WP6 Integration, Evaluation, Valorisation, and Dissemination
Task 6.1 Integration: first demonstrator
Task 6.4 Dissemination
    scientific publications
    SCATE project presentation
        at EAMT 2015 Antalya, Turkey
        at EAMT 2016 Riga, Latvia
        at LT-Innovate 2016 Brussels

Publications of the last six months
Geert Heyman, Ivan Vulic & Marie-Francine Moens (2015) C-BiLDA Extracting Cross-lingual Topics from Non-Parallel Texts by Distinguishing Shared from Unshared Content. Data Mining and Knowledge Discovery. pp
Douwe Kiela, Ivan Vulic & Stephen Clark (2015) Transferring Features from a Convolutional Neural Network to Perform Bilingual Lexicon Induction. In Proceedings of EMNLP 2015: Conference on Empirical Methods in Natural Language Processing. Lisboa, Portugal September
Lieve Macken & Arda Tezcan (in press). Dutch Compound Splitting for Bilingual Terminology Extraction. In R. Mitkov, J. Monti, G.C. Pastor & V. Seretan (editors), Multi-word Units in Machine Translation and Translation Technology. John Benjamins.
Ayla Rigouts Terryn, Lieve Macken & Els Lefever (submitted). Dutch Hypernym Detection: Does Decompounding Help? In: Proceedings of the Joint Second Workshop on Language and Ontology & Terminology and Knowledge Structures (LangOnto2 + TermiKS). LREC Portoroz, Slovenia.
Joris Pelemans, Tom Vanallemeersch, Kris Demuynck, Lyan Verwimp, Hugo Van hamme & Patrick Wambacq (accepted). Language Model Adaptation for ASR of Spoken Translations using Phrase-based Translation Models and Named Entity Models. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016). Shanghai, China, March
Joris Pelemans, Tom Vanallemeersch, Kris Demuynck, Hugo Van hamme, Patrick Wambacq (2015). Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations. In: Proceedings of Interspeech 2015 (annual conference of the International Speech Communication Association). Dresden, Germany, September pp
Arda Tezcan, Véronique Hoste, Lieve Macken (accepted). SCATE Taxonomy and Corpus of Machine Translation Errors. Translation and Interpreting Technologies. Brill Academic Publishers. Leiden, the Netherlands.
Liesbeth Augustinus, Vincent Vandeghinste & Tom Vanallemeersch (accepted). Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC). Portoroz, Slovenia.
Jan Van den Bergh, Eva Geurts, Donald Degraen, Mieke Haesen, Iulianna van der Lek-Ciudin and Karin Coninx (2015). Recommendations for Translation Environments to Improve Translators’ Workflows. Translating and the Computer 37. Asling. London, UK, November 2015 (pp ).
Iulianna van der Lek-Ciudin, Tom Vanallemeersch & Ken De Wachter (2015). Contextual Inquiries at Translators’ Workplace. In: Benjamin Phister, Carmelo Cancio, Laurence Cuzzolin (eds) Proceedings of TAO-CAT 2015, a conference on Computer Aided Translation tools. Angers, France, June pp
Ivan Vulic & Marie-Francine Moens (2016). Bilingual Distributed Word Representations from Document-Aligned Comparable Data. Journal of Artificial Intelligence Research.

Questions