Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.

Presentation transcript:

Enabling machine translation for the emerging community content paradigm. Allowing citizens across the EU better access to communities in both commercial and non-profit environments. Grant agreement : No

ACCEPT at M3 (coordinator)

The use of machine translation (MT) is becoming much more pervasive. Web 2.0 paradigms are democratising content creation, stressing the value of communities of users creating content for each other. But: MT engines, even statistical engines, cannot produce acceptable results for community content due to the extreme variability within the content. The ACCEPT project will address this issue by developing new technologies designed specifically to help MT work better in this environment. ACCEPT’s mission is to help communities share information more effectively across the language barrier.

Main Avenues:
1. Development of new paradigms for “minimally intrusive” content pre-editing.
2. Development of strategies for content post-editing, leveraging the skills of monolingual volunteer domain experts.
3. Improvement of learning and development of feedback loops to improve Statistical Machine Translation (SMT) for community data.

Development of user-friendly (minimally intrusive) strategies for pre-editing the content for statistical machine translation.
– The project will identify the most important types of corrections that need to be applied to the source content in order to attain a higher translation quality.

Development of strategies for post-editing.
– Ideally, post-editing of the translation results is done by skilled bilingual experts, but the lack of such experts is a major bottleneck.
– To overcome this bottleneck, the project will develop post-editing strategies that require proficiency only in the target language, not in the source language, thus enlarging the pool of skilled (volunteer) experts.

Improvement of learning and development of feedback loops to improve Statistical Machine Translation (SMT) for community data.
– Parallel data are sparse and heterogeneous in the user-generated content (UGC) domain.
– The project will develop innovative domain adaptation methods and will use linguistic information to cope with sparseness and heterogeneity.
– It will take into account feedback from the post-editing process to automate corrections whenever possible.
– It will seek to improve the reliability of UGC translation using Text Analytics (text classification, sentiment analysis).

Work packages and deliverables

Work packages (diagram labels): Community Development, Portal Integration, Pre-editing, Post-editing, SMT, Text Analytics, Evaluation, Dissemination.

For the first time, pre-editing, MT and post-editing will be linked together not just in a process, or workflow, but by connecting the software components together and by developing new linguistic software components specifically optimised for community content translation.
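As a rough illustration of what linking the three stages at the software level could look like, here is a minimal sketch. It is not the ACCEPT implementation: the `Segment` type, the stage functions and the toy replacement table are all hypothetical, and the `translate` stage is a stand-in that only tags the text rather than calling a real SMT engine.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Segment:
    """One piece of community content moving through the pipeline (hypothetical type)."""
    source: str
    target: str = ""
    log: List[str] = field(default_factory=list)


Stage = Callable[[Segment], Segment]


def pre_edit(seg: Segment) -> Segment:
    # Hypothetical pre-editing rule set: expand a few forum abbreviations
    # and collapse repeated whitespace before translation.
    replacements = {"pls": "please", "ur": "your"}
    words = [replacements.get(w.lower(), w) for w in seg.source.split()]
    seg.source = " ".join(words)
    seg.log.append("pre-edit")
    return seg


def translate(seg: Segment) -> Segment:
    # Stand-in for an SMT engine: it only tags the text, it does not translate.
    seg.target = f"[MT] {seg.source}"
    seg.log.append("mt")
    return seg


def post_edit(seg: Segment) -> Segment:
    # Hypothetical monolingual post-edit: looks only at the target side
    # and makes sure the sentence ends with punctuation.
    if seg.target and seg.target[-1] not in ".!?":
        seg.target += "."
    seg.log.append("post-edit")
    return seg


def run_pipeline(text: str, stages: List[Stage]) -> Segment:
    # The stages share one Segment object, so each component can see
    # what the previous ones did (the basis for a feedback loop).
    seg = Segment(source=text)
    for stage in stages:
        seg = stage(seg)
    return seg


result = run_pipeline("pls  restart ur computer", [pre_edit, translate, post_edit])
print(result.target)
print(result.log)
```

Passing a shared record through every stage, rather than plain strings, is one way to keep the post-editing history available to later components; other designs are equally possible.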

The ACCEPT project will be addressing the challenge of removing the language barrier in two slightly different scenarios:
1. Content in a typical commercial product forum: Symantec’s security-focussed Norton forum.
2. Content in the community of volunteer translators Traducteurs sans Frontières.
More generally, the project’s effects will be useful for all those who need information instantly and reliably translated into their own language, despite linguistic imperfections.

The main expected result is the improvement of technologies for translating community content. The ACCEPT project will create sophisticated components for machine translation. Demonstrators:
– online source content checking
– online MT editing
– online content evaluation

The technology developed throughout the ACCEPT project will contribute to the take-up of MT in the burgeoning area of user-generated content. ACCEPT will allow citizens across the EU better access to communities in both commercial and non-profit environments. ACCEPT will make companies in Europe better able to engage with their customers across the language barrier, making them more competitive and allowing them to expand more quickly across Europe. ACCEPT’s scientific and technical innovation will help drive take-up of MT in this new Web 2.0 paradigm. The social impact is highlighted in the participation of Traducteurs sans Frontières, whose focus is helping European NGOs provide better services in multilingual theatres of operations.

ACCEPT at M3 Progress

– Website and logo established, dissemination and exploitation plans delivered.
– Baseline SMT engines generated.
– Pre-editing investigations and community development for post-editing in progress.
– Infrastructure stage 1 in alpha test.
