Encoding UNL Expressions: Some Problems and Proposals Igor Boguslavsky UNL Russia

Slides:



Advertisements
Similar presentations
Unit 9 Saving the earth Grammar--Inversion.
Advertisements

Report Writing.
CODE/ CODE SWITCHING.
Linguistic Theory Lecture 11 Explanation.
Unit A4 Translation shifts
Relational Database Design UNIT II 1. 2 Advantages of Using Database Systems Centralized control of a firm’s data Redundancy can be reduced (avoid keeping.
Semantics (Representing Meaning)
The Meaning of Language
Modality Lecture 10. Language is not merely used for conveying factual information A speaker may wish to indicate a degree of certainty to try to influence.
Knowledge Representation
Lecture 17: Modification. 1. What is modification?  Modification is an important grammatical device for description and sentence expansion. We have already.
Introduction to phrases & clauses
Pronouns.
Introduction to Linguistics and Basic Terms
CS 330 Programming Languages 09 / 13 / 2007 Instructor: Michael Eckmann.
Syntax Lecture 4.
Uses of Language Tests.
Universal Networking Language
Its Grammatical Categories
MORAL THEORY: INTRODUCTION PHILOSOPHY 224. THE ROLE OF REASONS A fundamental feature of philosophy's contribution to our understanding of the contested.
PARTS OF SPEECH General Survey. The problem of parts of speech causes great controversies both in general linguistic theory and in the analysis of separate.
Copyright © Cengage Learning. All rights reserved.
Style, Grammar and Punctuation
Tuesday, December 2 GUM 5.7 Composition 3.7 Literary Analysis and Composition
323 Morphology The Structure of Words 1.1 What is Morphology? Morphology is the internal structure of words. V: walk, walk+s, walk+ed, walk+ing N: dog,
CAS LX 502 Semantics 3a. A formalism for meaning (cont ’ d) 3.2, 3.6.
Computer System Analysis Chapter 10 Structuring System Requirements: Conceptual Data Modeling Dr. Sana’a Wafa Al-Sayegh 1 st quadmaster University of Palestine.
©Silberschatz, Korth and Sudarshan6.1Database System Concepts Chapter 6: Integrity and Security Domain Constraints Referential Integrity Assertions Triggers.
MECHANICS OF WRITING C.RAGHAVA RAO.
Copyright © Cengage Learning. All rights reserved.
Syntax Lecture 8: Verb Types 1. Introduction We have seen: – The subject starts off close to the verb, but moves to specifier of IP – The verb starts.
Relative clauses Chapter 11.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Proposal for a new UNECE regulation on recyclability of motor vehicles Informal Document GRPE Reply to the Comments of the Russian Federation Informal.
Ferenc Havas Tallinn, Introduction to the project: Uralic Typology Database Project website:
IV. SYNTAX. 1.1 What is syntax? Syntax is the study of how sentences are structured, or in other words, it tries to state what words can be combined with.
Modifier (grammar) Definition: A word, phrase, or clause that functions as an adjective oradverb to provide additional information about another word or.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 1 Proposals for solving some problems in UNL encoding International Conference on.
Deeper Sentiment Analysis Using Machine Translation Technology Kanauama Hiroshi, Nasukawa Tetsuya Tokyo Research Laboratory, IBM Japan Coling 2004.
Semantic Construction lecture 2. Semantic Construction Is there a systematic way of constructing semantic representation from a sentence of English? This.
Rules, Movement, Ambiguity
ICUKL-2002, Nov, Goa, India Universal Word and UNL Knowledge Base Meiying Zhu Hiroshi Uchida UNL Center UNDL Foundation.
Honors English 9 INTEGRATING QUOTATIONS. The Situation: KWPQ You want to use the following line from Chapter 6:  “And yet, against their will, they had.
Chapter 6 - Basic Similarity Topics
UML Part 1: Class Diagrams. Introduction UML stands for Unified Modeling Language. It represents a unification of the concepts and notations presented.
SYNTAX.
Machine Translation Divergences: A Formal Description and Proposed Solution Bonnie J. Dorr University of Maryland Presented by: Soobia Afroz.
11/23/00UNU/IAS/UNL Centre1 The Universal Networking Language United Nations University Institute of Advanced Studies United Networking Language ® UNU/IAS.
UNL Document Summarization Virach Sornlertlamvanich, Tanapong Potipiti and Thatsanee Charoenporn Information Research and Development Division National.
Sentences In Arabic NOMINAL AND VERBAL SENTENCES.
Plagiarism Miss H. 2008/2009. The entire content of this presentation comes from TurnItIn.com Turnitin allows free distribution and non-profit use of.
How do I begin to prepare ????. Why Praxis ?? 35 states in US currently use Praxis for assessment purposes in CT - used for assessing skills of future.
Parts of speech English Grade 9 Kaleena Ortiz PARTS OF SPEECH Noun Pronoun Adjective AdverbVerbPreposition Conjunction Interjection Click here for this.
Lec. 10.  In this section we explain which constituents of a sentence are minimally required, and why. We first provide an informal discussion and then.
Writing 2 ENG 221 Norah AlFayez. Lecture Contents Revision of Writing 1. Introduction to basic grammar. Parts of speech. Parts of sentences. Subordinate.
Lecture IV. Basic Translation Theories Plan 1. The Transformational Approach 2. The Denotative Approach 3. The Communicational Approach.
Voice Lecture 9. Forms and Meanings Voice is a grammatical category of the verb, which reflects the semantic role of the verbal subject. This category.
The theory of word classes in modern grammar studies
4TH Grade ELA Standards.
Non-finite forms of the verb
Syntax Lecture 9: Verb Types 1.
Semantics (Representing Meaning)
GREEK ADJECTIVES
Chapter Eight Syntax.
Revising and editing Week 3.
BBI 3212 ENGLISH SYNTAX AND MORPHOLOGY
How to use a dictionary effectively
The Classical Approach to Categorization
Representations & Reasoning Systems (RRS) (2.2)
Presentation transcript:

Encoding UNL Expressions: Some Problems and Proposals Igor Boguslavsky UNL Russia

Generals UNL: an interlingua or not? No ambiguity UNL vs. natural language –“at least as powerful as any NL”? –semantics vs. KB Correct UNL vs. adequate UNL

Adequacy conditions An adequate UNL expression should: preserve the meaning of the source text; be convenient for prospective applications, including deconversion.

Possibility of inverse generation? Necessary but insufficient: invitation of the president mod(invitation, president)? –the president invites somebody –somebody invites the president Russian: He received the shower (= took the shower)

Tentative procedure How to develop a definite and common view on what UNL expressions are adequate? 1. UNL from LCs to UNLC 2. Comments from LCs to UNLC 3. Feedback from UNLC to LCs 4. Update of UNL by LCs

Universal Words Headwords Restrictions Attributes

Headwords Multi-word UWs Support verbs

Multi-word UWs They should be avoided, if the their meaning is representable as a combination of meanings of words they are composed of: UW to be avoided: «Ministery of foreign affairs» UW to be preferred:

Why so? If any free word combination can be made a UW, one can never hope that other partners will have matching UWs in their dictionaries.

Appropriate multi-word UWs Non-compositional phrases: –«look for(agt>thing,obj>thing)» –«look like(aoj>thing,obj>thing)»

A convenient compromise To account for the fact that a phrase is considered as denoting a single concept, the UNL expression can be enclosed in a scope:

Another possibility (Ch. Boitet) Postulate one UW having the internal structure: foreign)»

Restrictions in UW/KB (1) Semantic function (2) Knowledge Base function (3) Argument frame function

(1) Semantic function Restricting the meaning - needed, in particular, to ensure –disambiguation of the head word –selection of the translation equivalent

(2) KB function Locating the UW in the KB - needed, in particular, to ensure –choice of a nearest UW in the case the direct equivalent is absent in the UW dictionary (replacement ability) –semantic inference

(3) Agrument frame function Presenting the argument frame.

Correlation between the semantic and the KB functions Semantically- and KB-oriented restrictions do not necessarily coincide: –semantic restriction: book(icl>thing) –KB restriction: titmouse{(icl>bird)}

How to select semantic restrictions September{(icl>month>date)} answer(icl>do) (for cases like answer questions) – answer(icl>be) (for cases like answer expectations) – answer(icl>thing) (for cases like know the answer) Ru: zhenit’sja – marry(agt>male), vyxodit’ zamuzh – marry(agt>female).

VERY IMPORTANT! Semantic restrictions should effectively distinguish the meaning we restrict from all other relevant meanings of the same English headword. They should NOT be equally applicable to more than one meaning. They should be easily understandable.

Example operator - all the meanings denote a thing WRONG (in the sense ‘inadequate’): –operator(icl>thing) CORRECT –operator(icl>human) –operator(icl>abstract thing)

Relations useful for disambiguation icl iof equ ant (disappeared from the specs?) –poor(icl>bad): poor quality –??? poor(icl>having little money) “having little money” is a bad UW –poor(ant>rich): poor people

Needs to be emphasized again UNL News1: build global knowledge –build(agt>thing,obj>thing) Does not differentiate between different meanings of the headword: –build a railway (a house): build(agt>thing,obj>concrete thing) –build plans (knowledge): build(agt>thing,obj>abstract thing)

KB function UW: September MD: September{(icl>month>date)}

What remains unclear-1: KB  semantics. Links between related concepts of different semantic categories are missing. There is no way to express the relationship between “dance” (as a verbal concept) and “dance” (as a nominal concept)), “government” and “governmental”, etc. –dance({icl>do(}agt>person{)}) –dance(icl>action{>event})

What remains unclear-2: Status of UWs within the restrictions: propose(agt>thing,gol>thing,obj>thing) –They proposed to the president that a special committee should be set up «set up» does not fall under «thing». But where then?

What is an argument-1? A is an argument of L --> A is integral to the meaning of L.

What is an argument-2? A is semantically obligatory: L cannot be semantically defined without A being mentioned. A is not always syntactically obligatory: it can remain unmentioned in a sentence.

Example: buy buy has 4 arguments: a buyer, an object, a seller, the money paid. All of them are semantically obligatory: “buying” cannot exist without any of them. None of them is syntactically obligatory: –I bought a book (the seller and the money are not mentioned). –To buy is more pleasant than to sell (no arguments are mentioned).

Semantic roles vs. predicate- argument relations UNL does not mark predicate-argument relations in a systematic way. Assumption: arguments can be reliably identified based on their semantic role.

It does not work. Why? Too many «difficult» cases. Only a part of semantic relations between the words can be reliably interpreted in terms of semantic roles. Too many mismatches. Assignment of semantic roles cannot be done in a consistent way (especially in the UNL multi-lingual and multi-cultural environment).

The reason: Numerous mismatches in the representation of the same or similar phenomena are rooted in the fundamental impossibility of a consistent interpretation of ALL argument relations in terms of a fixed SMALL number of semantic roles.

Examples Nothing (obj) prevents the members (ben) from discussing (gol) this problem –why beneficiary (ben)? –why finite state (gol)? protect nature from pollution (?) familiarize students with India (?)

Difference between arguments and non-arguments-1 Any nominal concept can have a purpose, e.g. –a stone for driving nails Therefore {pur>uw} is assigned to UW «thing» and is inherited by all UWs lying below.

Difference between arguments and non-arguments-2 Purpose is NOT an argument of “stone”: a stone has no obligatory conceptual link with the purpose. Purpose IS an argument of “method”: a method cannot exist without a purpose. –a method for calculating taxes

Another example: borrow –X borrows Y from Z for W = Z owns Y, X makes Z to give him Y, X promises Z to give Y back after W expires –borrow cannot exist without 4 participants: agent, object, source, duration

Difference between arguments and non-arguments-3 Each action has a certain duration. Therefore {dur>time} is assigned to UW «do» and is inherited by all UWs lying below. Besides this, borrow has a semantic argument with the role ‘duration’

Argument vs. non-argument (1) John borrowed money for 3 years –Argument W: term of the loan. John promised to return money after 3 years (2) John has been borrowing money for 3 years – Non-argument: the situation ‘John is borrowing money’ lasted for three years (the term of each loan is not specified)

Why important? - For semantic processing (1) can answer the question on the terms of the loan - (2) cannot do so. mod(invitation, president) –the president invites somebody (arg. 1) –somebody invites the president (arg. 2) –the invitation has an unspecified connection with the president (non-arg.) NB: the specs do not allow to draw this distinction!

Why important? - For deconversion The arguments and non-arguments are very often encoded differently: –dur: Ru «borrow on 3 years» vs. during 3 years –rsn: afraid of darkness, tremble with fear not: *afraid because of darkness –scn: In [scn] this country the relations between the nations are based on [scn- arg] mutual respect

How to differentiate? The distinction between the arguments and the non-arguments should be drawn both in the UWs and in UNL expressions.

Proposal for UWs Restrictions corresponding to arguments should be systematically and exhaustingly represented in KB. They can either be included into the UW, or be inherited from upper concepts.

Proposal for UWs They should be formally opposed to non-argument restrictions. One of the ways: capitalization. –thing{(and>thing,…,pur>uw,…)} –method(icl>abstract thing,Pur>uw) –do{and>do,…,dur>period,…)} –borrow(icl>do,…Dur>period) Another possibility: –dur vs.

Proposal for UNL expressions Mark argument relations relation for (1): year) relation for (2): –dur(borrow, year)

Sample UW dictionary entry Current UW: responsible(aoj>thing,obj>thing) It is proposed to introduce a comment: responsible (Aoj>thing,Obj>thing,Gol>*) ;he (aoj) is responsible to me (gol) for his actions (obj) (example) ;IB_Ru, 29/11/02 (author and date)

Comments in MD Not only for illustrating argument frames, but also for clarifying concepts. Specs modification is needed.

Attributes Dictionary of attributes (explanation of the attribute, examples) Procedure for introduction new attributes should be set up.

Issues concerning KB Adjectival concepts

Adjectival concepts: mod vs. aoj Two major classes of adjectives: –(aoj>thing) vs. (mod<thing) Specs: “For an adjectival concept, (aoj>thing) or (mod thing) is for expressing a predicative concept, whereas (mod<thing) is for expressing restrictive concept”.

We should distinguish between: (a) a syntactic property: whether the adjective is used predicatively (Greeks are wise) or attributively (the wise Greeks); (b) a semantic property: what does the adjective mean when used attributively: –restriction; –qualification. Only (b) should interest us.

Restrictive vs. non-restrictive Wise Greeks diluted wine with water –restrictive: Those Greeks who were wise diluted wine with water. Silly ones didn’t. –non-restrictive (qualificative): Greeks were wise. They diluted wine with water. Non-attributive (predicative) adjective does not restrict the noun: –Greeks are wise.

Adjectives Some adjectives can only be restrictive: –Many dogs have curly hair Some adjectives can only be non- restrictive: –Get those damned dogs out of the room! –Dear colleagues! Most of the adjectives can be both restrictive and non-restrictive.

Non-adjectives The old people in the street were very tired. –Those who were in the street were tired; others were not tired. –The old people were very tired. They were in the street. There is no UW to which a restriction can be assigned!

What is more… In some languages restricive vs. non- restrictive interpretations of relative clauses are marked by punctuation (English, Spanish).

Restrictive No commas are allowed: –The old people who came a long way were tired. –Los viejos que habían venido de muy lejos estaban cansados.

Non-restrictive Commas are needed: –The old people, who came a long way, were tired. –Los viejos, que habían venido de muy lejos, estaban cansados.

Proposal Renounce from the division of adjectives into (aoj>*) and (mod<*). In order to account for this difference, introduce two restr) which can be added to any modifier (an adjective, a prepositional phrase, a clause), if the author wishes to mark the restrictive or non-restrictive interpretation.

Why attributes are better than restrictions? Attributes reflect the point of view of the speaker in the current situation and not the permanent property of the word. Attributes are optional and may be not assigned, if the author does not wish to specify his point of view.

The strongest argument The restrictive vs. non-restrictive opposition is relevant not only for adjectives but also for other types of noun modifiers. These modifiers cannot be assigned restrictions but can easily take an attribute.