Basque language: is IT right on?

Slides:



Advertisements
Similar presentations
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Advertisements

Machine Translation II How MT works Modes of use.
Probabilistic Adaptive Real-Time Learning And Natural Conversational Engine Seventh Framework Programme FP7-ICT
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
CALTS, UNIV. OF HYDERABAD. SAP, LANGUAGE TECHNOLOGY CALTS has been in NLP for over a decade. It has participated in the following major projects: 1. NLP-TTP,
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
Multilingual Information Access in a Digital Library Vamshi Ambati, Rohini U, Pramod, N Balakrishnan and Raj Reddy International Institute of Information.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
ITCS 6010 Natural Language Understanding. Natural Language Processing What is it? Studies the problems inherent in the processing and manipulation of.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
The ECHA-term project Multilingual REACH and CLP Terminology Dieter Rummel, Translation Centre for the Bodies of the EU Luxembourg EAFT - Oslo, 11 October.
Language Technology 2005/06 Hans Uszkoreit Universität des Saarlandes
9/8/20151 Natural Language Processing Lecture Notes 1.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
1 Computational Linguistics Ling 200 Spring 2006.
26 June 2008 DG REGIO Evaluation Network Meeting Ex-post Evaluation of Cohesion Policy Programmes co-financed by the European Fund for Regional.
Language Technology I © 2005 Hans Uszkoreit Language Technology I 2005/06 Hans Uszkoreit Universität des Saarlandes and German Research Center for Artificial.
Introduction to CL & NLP CMSC April 1, 2003.
Gerrit Schutte OHIM 9th of December, 2011 Trademark terminology control.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
1 Branches of Linguistics. 2 Branches of linguistics Linguists are engaged in a multiplicity of studies, some of which bear little direct relationship.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
Natural Language Processing (NLP)
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
INTRODUCTION TO APPLIED LINGUISTICS
Chapter 11 Linguistics and Foreign Language Teaching Lecturer: Rui Liu.
Information Retrieval in Practice
Dr Anie Attan 26 April 2017 Language Academy UTMJB
Interpreting as Process
An Introduction to Linguistics
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Proposal for Term Project
The ACCEPT Project Enabling machine translation for the emerging community content paradigm. Allowing citizens across the EU better access to communities.
Addressing the grammar gap in task work
CHAPTER 5 This chapter introduces students to the study of linguistics. It discusses the basic categories and definitions used to study language, and the.
FATMA ISMED K1-09 Websites in ELT.
3.0 Map of Subject Areas.
LACONEC A Large-scale Multilingual Semantics-based Dictionary
* Conversation lessons 2017 * Teacher: Laura Fernandez.
CS416 Compiler Design lec00-outline September 19, 2018
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
SPEAKING ASSESSMENT Joko Nurkamto UNS Solo 11/8/2018.
Multilingual Information Access in a Digital Library
SPEAKING ASSESSMENT Joko Nurkamto UNS Solo 12/3/2018.
2. An overview of SDMX (What is SDMX? Part I)
Stanford University March 24-26
SECOND LANGUAGE LISTENING Comprehension: Process and Pedagogy
Communicative Competence (Canale and Swain, 1980)
Evaluation – embed within an equity framework
Communicating and Adapting Language task
Chapter 2 What speakers know.
Natural Language Processing
CS416 Compiler Design lec00-outline February 23, 2019
FCE (FIRST CERTIFICATE IN ENGLISH) General information.
To show my understanding, I can comment, with evidence, on the content and form of short and extended texts, and respond to literal, inferential and.
Developing Listening strategies
The ultimate in data organization
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
„Translation is the expression in another language (or target language of what has been expressed in another, source language, preserving semantic and.
Artificial Intelligence 2004 Speech & Natural Language Processing
FCE IES Parque de Lisboa.
Presentation transcript:

Basque language: is IT right on? BASQUE OMBUDSMAN Basque language: is IT right on? Promoting language rights through IT in public administration and courts Rafael Sainz de Rozas

Linguistic competence in the three Basque territories BASQUE OMBUDSMAN Linguistic competence in the three Basque territories

Percentage of students under 17 attending school in Basque BASQUE OMBUDSMAN Percentage of students under 17 attending school in Basque

Three key features BASQUE OMBUDSMAN Dialectization Bilingualism Diglossia Three key features

LANGUAGE TECHNOLOGY SUPPORT FOR BASQUE BASQUE OMBUDSMAN LANGUAGE TECHNOLOGY SUPPORT FOR BASQUE On line translation and interpretation schemes Multilingual text processing, speech recognition systems and language checking Machine translation or cross-lingual information retrieval Global visibility

BASQUE OMBUDSMAN Evolution of the access to househould IT equipment for the 15 and over 15-year-old population. In percentage, 1999-2016. (Eustat)

Lenguage technologies BASQUE OMBUDSMAN Multimedia & multimodality technologies Text technologies Speech technologies Lenguage technologies Knowledge technologies Lenguage Technologies

A Typical Text Processing Application Architecture: BASQUE OMBUDSMAN A Typical Text Processing Application Architecture: Gramma-tical Analysis Task-Specific Modules OUTPUT Pre-processing Semantic Analysis INPUT TEXT

Challenging particularities in Basque morphology BASQUE OMBUDSMAN Challenging particularities in Basque morphology Basque is an agglutinative and high-inflective Basque is postpositional; phrases are formed by attaching a suffix or concatenating more than one to the end of a word, according to the following scheme: root + (article) + (number) + (case(s). The auxiliary verb not only agrees with the subject, but also with the direct object and complements the speaker refers to. Basque phrase order is topic-focus: the focus directly precedes the verb phrase.

Eye have a spelling chequer, It came with my Pea Sea. BASQUE OMBUDSMAN Eye have a spelling chequer, It came with my Pea Sea. It plane lee marks four my revue Miss Steaks I can knot sea.

Statistical language model BASQUE OMBUDSMAN CORRECTION PROPOSALS Statistical language model INPUT TEXT Spelling check Grammar check Lenguage Checking (bottom: rule-based; top: statistical)

Matching and relevance BASQUE OMBUDSMAN Web Search Architecture Search Results Pre-processing Semantic processing Web pages Indexing Matching and relevance Pre-processing User query Query analysis

Simple Speech-based Dialogue Architecture BASQUE OMBUDSMAN Simple Speech-based Dialogue Architecture Signal processing Speech input Recognition Central language understanding & dialogue Speech output Speech Synthesis Phonetic lookup & Intonation planning

Machine translation (top: statistical; bottom: rule-based) BASQUE OMBUDSMAN Machine translation (top: statistical; bottom: rule-based) Text analysis (formatting, morphology, syntax, etc.) Source text Translation rules Statistical machine translation Post-editing (formatting, context, etc.) Target text

Assessment criteria for existant language related IT ressources BASQUE OMBUDSMAN Quantity Reliability Quality Coverage Maturity Sustainability Adaptability Assessment criteria for existant language related IT ressources

JUSTICE POLICE HEALTH BASQUE OMBUDSMAN Importance of the POWER over SERVICE paradigm The insistance on using the basque language by bilingual citizens tends to be perceived as a non-collaborative attitude in power-ridden administrations. Priority to EFFICIENCY over IDENTITY Putting at risk the efficiency of communication in difficult or even life-threatening situations dissuades the bilingual citizens from using basque in health, judiciary or police settings.

little snail climb Mt Fuji inch by inch BASQUE OMBUDSMAN little snail climb Mt Fuji inch by inch Yoshi Mikami Issa (Kobayashi Nobuyuki 1762-1826)