SYRCoSE-2011 APPLICATION OF THE FUNCTIONAL PROGRAMMING TOOLS IN THE TASKS OF LANGUAGE AND INTERLANGUAGE STRUCTURES REPRESENTATION Ermakov Peter, Kozhunova.

Slides:



Advertisements
Similar presentations
LOGO 1. Task: Know something about life online 2. Study Points: some phrases about online; enable Ss to talk about the.
Advertisements

Compilers Course 379K, TTH 9:30-11:00 Instructor: Dr. Doron A. Peled Office Hours: Mon 11:00-12:00.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Formal Languages: main findings so far A problem can be formalised as a formal language A formal language can be defined in various ways, e.g.: the language.
Development of a German- English Translator Felix Zhang.
Chapter Chapter Summary Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output Language Recognition Turing.
Bilingual Dictionaries
1/7 INFO60021 Natural Language Processing Harold Somers Professor of Language Engineering.
INTRODUCTION TO TRANSLATION STUDIES
CSE (c) S. Tanimoto, 2008 Natural Language Understanding 1 Natural Language Understanding Outline: Motivation Structural vs Statistical Approaches.
C SC 520 Principles of Programming Languages 1 C SC 520: Principles of Programming Languages Peter J. Downey Department of Computer Science Spring 2006.
Applying Software Model Checking to Automatic Text Summarization SSSEV2011 Irina Shoshmina, Nasrin Mostafazadeh, Omid Bakhshandeh, Alexey Belyaev, and.
LANGUAGE AND GRAMMARS © University of LiverpoolCOMP 319slide 1.
Models of Generative Grammar Smriti Singh. Generative Grammar  A Generative Grammar is a set of formal rules that can generate an infinite set of sentences.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Introduction Syntax: form of a sentence (is it valid) Semantics: meaning of a sentence Valid: the frog writes neatly Invalid: swims quickly mathematics.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
1 October 2, October 2, 2015October 2, 2015October 2, 2015 Azusa, CA Sheldon X. Liang Ph. D. Computer Science at Azusa Pacific University Azusa.
Syntactically annotated corpora of Estonian Heli Uibo Institute of Computer Science University of Tartu
Interpreting Dictionary Definitions Dan Tecuci May 2002.
Syntax: 10/18/2015IT 3271 Semantics: Describe the structures of programs Describe the meaning of programs Programming Languages (formal languages) -- How.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
Introduction to CL & NLP CMSC April 1, 2003.
Learning English: Language-nutrition method Olga Kozhunova Institute for the Informatics Problems, The Russian Academy of Sciences (Moscow, Russia) CML-2011,
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
Copyright © Curt Hill Languages and Grammars This is not English Class. But there is a resemblance.
Parsing Introduction Syntactic Analysis I. Parsing Introduction 2 The Role of the Parser The Syntactic Analyzer, or Parser, is the heart of the front.
Head-driven Phrase Structure Grammar (HPSG)
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
Center for PersonKommunikation P.1 Background for NLP Questions brought up by N. Chomsky in the 1950’ies: –Can a natural language like English be described.
Daisy Arias Math 382/Lab November 16, 2010 Fall 2010.
1 / 5 Zdeněk Žabokrtský: Automatic Functor Assignment in the PDT Automatic Functor Assignment (AFA) in the Prague Dependency Treebank PDT : –a long term.
1 Compiler Design (40-414)  Main Text Book: Compilers: Principles, Techniques & Tools, 2 nd ed., Aho, Lam, Sethi, and Ullman, 2007  Evaluation:  Midterm.
Introduction CPSC 388 Ellen Walker Hiram College.
CSE467/567 Computational Linguistics Carl Alphonce Computer Science & Engineering University at Buffalo.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
Compiler Introduction 1 Kavita Patel. Outlines 2  1.1 What Do Compilers Do?  1.2 The Structure of a Compiler  1.3 Compilation Process  1.4 Phases.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
Grammars A grammar is a 4-tuple G = (V, T, P, S) where 1)V is a set of nonterminal symbols (also called variables or syntactic categories) 2)T is a finite.
CSC312 Automata Theory Lecture # 26 Chapter # 12 by Cohen Context Free Grammars.
Institute for Information Problems of the Russian academy of Sciences and its linguistic research Olga Kozhunova CML-2008, Becici, 6-13 September.
C Sc 132 Computing Theory Professor Meiliu Lu Computer Science Department.
Formal grammars A formal grammar is a system for defining the syntax of a language by specifying sequences of symbols or sentences that are considered.
Compiler Design (40-414) Main Text Book:
English-Korean Machine Translation System
Approaches to Machine Translation
Chapter 1 Introduction.
Introduction to Formal Languages
Lecture – VIII Monojit Choudhury RS, CSE, IIT Kharagpur
Formal Language & Automata Theory
Natural Language Processing - Formal Language -
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Chapter 1 Introduction.
FORMAL LANGUAGES AND AUTOMATA THEORY
Formal Language Theory
Compiler Lecture 1 CS510.
CSE322 Chomsky classification
Formal Language.
Classification of Matter Review!
A method for WSD on Unrestricted Text
Approaches to Machine Translation
Teori Bahasa dan Automata Lecture 9: Contex-Free Grammars
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Discrete Maths 13. Grammars Objectives
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
COMPILER CONSTRUCTION
Statistical NLP Winter 2009
Presentation transcript:

SYRCoSE-2011 APPLICATION OF THE FUNCTIONAL PROGRAMMING TOOLS IN THE TASKS OF LANGUAGE AND INTERLANGUAGE STRUCTURES REPRESENTATION Ermakov Peter, Kozhunova Olga The Institute of the Informatics Problems, The Russian Academy of Sciences

Natural text representation and formalisms  Machine text analysis – what to choose?  Linguistic resources: electronic dictionaries, syntactic parsers, language ontologies, generators of syntactic trees (WordNet, EuroWordNet, Ontolingua, etc.)  Growing demands to the language models: a search of new approaches to language structures representation and hybridization of the well- functioning old methods

Usage of the natural text analysis  Automatic translators  Data mining  Natural language interface  Parallel text analysis and comparison

Formal grammars for language analysis and representation  Noam Chomsky – the pioneer  Formal grammar definition  Grammar classes ( Chomsky classification ):  - unrestricted  - context-sensitive  - context-free  - regular

Finite state automata disadvantages  More rules – more states  Weak natural language grammar handling  No problem-oriented analytical engine is available

Functional approach to the grammar rules representation  Grammar rules – functions in mathematical sense  Mapping from set A to set B  Difference:

Functional approach advantages  Functional programming tools usage  n-tuples usage  Higher-order functions usage

Computational and Functional Grammar (FCG)

Set of parametric n-tuples The symbol «_» is suggested to denote an n- tuple element which value one may ignore when defining the transformation function.

Atoms  Attributive characteristics of natural language words  Specific instrument for simplifying of natural language structures analysis  So, all possible attributive characteristics of language structures are defined in the Grammar

FCG Example #1 Atoms = {noun, singular, plural} func({X, noun, singular}) → X func({X, noun, plural}) → X ++ «s»

FCG Example #2 Atoms = {noun, verb, plural, singular, ok, not ok} f({noun, _}) → {ok} f({verb, _}) → {not ok}

Parallel texts analysis and comparison  Task: interlanguage structures transfer from one language into the other  Example: patent claims (in chemical technologies) in German and English

Parallel texts example  Claim in German: Verfahren zur Epoxidierung einer organischen Verbindung mit wenigstens einer C C-Doppelbindung mit Wasserstoffperoxid in Gegenwart wenigstens einer katalytisch aktiven Verbindung und wenigstens eines Lösungsmittels, dadurch gekennzeichnet, dass ein Produktgemisch umfassend a-Hydroperoxyalkohole unter Einsatz wenigstens eines Reduktionsmittels reduziert wird.  Claim in English: A process for the epoxidation of an organic compound having at least one C-C double bond by means of hydrogen peroxide in the presence of at least one catalytically active compound and at least one solvent, wherein a product mixture comprising [alpha]-hydroperoxyalcohols is reduced using at least one reducing agent.

Parallel texts transformations  (a) Verfahren zur Epoxidierung → A process for the epoxidation N [verb, nom, neutr, sg] + Prep [zu+der, dat, comp, fem, sg] + N [dat, fem, sg] → Art [indef, sg] + N [com, sg] + Prep + Art [def, 0] + N [com,sg]  (b) ein Produktgemisch → a product mixture Art [indef, masc, nom, sg] + N [comp, nom, neutr, sg] → Art [indef, sg] + N [com, sg] + N [com,sg]  (c) dadurch gekennzeichnet → wherein Pron + Part [II f, masc, sg] → Adv

(b) ein Produktgemisch → a product mixture v({«ein»,art,indef,masc,nom,sg}) → «a», v({«Productgemisch»,noun,comp,nom,neutr,sg}) → «product mixture»; fgerman- english({X1,art,indef,masc,nom,sg},{X2,noun,comp,nom,neutr, sg}) → v({X1,art,indef,masc,nom,sg}) ++ v({X2,noun,comp,nom,neutr,sg});

Conclusions  Approach to natural language grammar representation as functions in mathematical sense  Opportunities of applying functional programming tools to building systems of transfer  Practical application of the approach: parallel texts analysis and comparison

Next steps  Customizing of the existing representations of the natural language grammars to functional form  Creation of problem-oriented system of functional programming  Enhancement of functional programming tools taking into account needs and tasks of computer linguistics

THANK YOU FOR YOUR ATTENTION!