1 Natural Language Processing (2b) Zhao Hai 赵海 Department of Computer Science and Engineering Shanghai Jiao Tong University 2010-2011

Slides:



Advertisements
Similar presentations
Artificial Intelligence
Advertisements

1 Knowledge Representation Introduction KR and Logic.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Computational language: week 10 Lexical Knowledge Representation concluded Syntax-based computational language Sentence structure: syntax Context free.
 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Feature Structures and Unification.
Introducing Formal Methods, Module 1, Version 1.1, Oct., Formal Specification and Analytical Verification L 5.
Semantics (Representing Meaning)
Syntax-Semantics Mapping Rajat Kumar Mohanty CFILT.
CPSC 322 Introduction to Artificial Intelligence November 5, 2004.
Introduction: The Chomskian Perspective on Language Study.
Albert Gatt LIN1180/LIN5082 Semantics Lecture 2. Goals of this lecture Semantics -- LIN 1180 To introduce some of the central concepts that semanticists.
PSY 369: Psycholinguistics Some basic linguistic theory part2.
Toward Linguistically Grounded Ontologies by Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek (Ireland, Netherlands, Germany) presented.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio LECTURE 6 The Cognitive Perspective: Generative Lexicon.
Linguistic Theory Lecture 8 Meaning and Grammar. A brief history In classical and traditional grammar not much distinction was made between grammar and.
Chapter 17. Lexical Semantics From: Chapter 17 of An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, by.
Introduction to Linguistics and Basic Terms
Sag et al., Chapter 4 Complex Feature Values 10/7/04 Michael Mulyar.
Artificial Intelligence 2005/06 From Syntax to Semantics.
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
Type shifting and coercion Henriëtte de Swart November 2010.
Conceptual modelling. Overview - what is the aim of the article? ”We build conceptual models in our heads to solve problems in our everyday life”… ”By.
Meaning and Language Part 1.
Today  What is syntax?  Grammaticality  Ambiguity  Phrase structure Readings: 6.1 – 6.2.
Syntax and Semantics Dr. Walid Amer, Associate Professor of linguistics The Islamic university of Gaza February, 2009.
Models of Generative Grammar Smriti Singh. Generative Grammar  A Generative Grammar is a set of formal rules that can generate an infinite set of sentences.
Generative Grammar(Part ii)
NLU: Frames Frame KR is a good way to represent common sense –can define stereotypical aspects of some domain we are interested in analyzing –sentences.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
IV. SYNTAX. 1.1 What is syntax? Syntax is the study of how sentences are structured, or in other words, it tries to state what words can be combined with.
The Current State of FrameNet CLFNG June 26, 2006 Fillmore.
Postgraduate Diploma in Translation Lecture 1 Computers and Language.
Lexical Semantics Chapter 16
Efficiently Computed Lexical Chains As an Intermediate Representation for Automatic Text Summarization H.G. Silber and K.F. McCoy University of Delaware.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Lecture 3 - Race against Time 1 Three points for today Sensory memory (SM) contains highly transient information about the dynamic sensory array. Stabilizing.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 4.
Culture , Language and Communication
Semantic Construction lecture 2. Semantic Construction Is there a systematic way of constructing semantic representation from a sentence of English? This.
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
CTM 2. EXAM 2 Exam 1 Exam 2 Letter Grades Statistics Mean: 60 Median: 56 Modes: 51, 76.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Rules, Movement, Ambiguity
Artificial Intelligence: Natural Language
Towards Linguistically Grounded Ontologies Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek Proceedings of the 6 th European Semantic Web.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 2.
SYNTAX.
Lexical Semantics Fall Lexicon Collection of Words Collection of Words Mental store of information about words and morphemes Mental store of information.
November 9, Lexicon (An Interacting Subsystem in UG) Part-I Rajat Kumar Mohanty IIT Bombay.
Yule: “Words themselves do not refer to anything, people refer” Reference and inference Pragmatics: Reference and inference.
 2003 CSLI Publications Ling 566 Oct 17, 2011 How the Grammar Works.
Syntax.
An Introduction to Semantic Parts of Speech Rajat Kumar Mohanty rkm[AT]cse[DOT]iitb[DOT]ac[DOT]in Centre for Indian Language Technology Department of Computer.
1 LIN 1310B Introduction to Linguistics Prof: Nikolay Slavkov TA: Qinghua Tang CLASS 11, Feb 9, 2007.
Meaning and Language Part 1. Plan We will talk about two different types of meaning, corresponding to two different types of objects: –Lexical Semantics:
MENTAL GRAMMAR Language and mind. First half of 20 th cent. – What the main goal of linguistics should be? Behaviorism – Bloomfield: goal of linguistics.
Chapter 3 Language Acquisition: A Linguistic Treatment Jang, HaYoung Biointelligence Laborotary Seoul National University.
Lec. 10.  In this section we explain which constituents of a sentence are minimally required, and why. We first provide an informal discussion and then.
SYNTAX.
Articulating Your Practice C3 - Session #3
Semantics (Representing Meaning)
Language, Logic, and Meaning
International Research and Development Institute Uyo
Principles and Parameters (I)
Structure of a Lexicon Debasri Chakrabarti 13-May-19.
Testing Schedule.
Presentation transcript:

1 Natural Language Processing (2b) Zhao Hai 赵海 Department of Computer Science and Engineering Shanghai Jiao Tong University

2  Lexicons and Lexical Analysis  Generative Lexicon (1) Outline

3 Lexicons and Lexical Analysis (31) Generative Lexicon (1) Overview  This lexicon deals with a novel and exciting theory of lexical semantics that addresses the problem of the “multiplicity of word meaning”, that is, how we are able to give an infinite number of senses to words with finite means.  The inventor, James Pustejovsk, proposed that the lexicon becomes an active and central component in the linguistic description.

4 Lexicons and Lexical Analysis (32) Generative Lexicon (2) Functions of the Lexicon The essence of his theory is that the lexicon functions generatively,  by providing a rich and expressive vocabulary for characterizing lexical information;  by developing a framework for manipulating fine-grained distinctions in word descriptions;  by formalizing a set of mechanisms for specialized composition of aspects of such descriptions of words, as they occur in context, extended and novel senses are generated.

5 Lexicons and Lexical Analysis (33) Generative Lexicon (3) Problems on Lexical Semantics Formal theories of natural language semantics have done little to explore two important issues:  The creative use of words in novel contexts;  An evaluation of lexical semantic models on the basis of compositionality.

6 Lexicons and Lexical Analysis (34) Generative Lexicon (4) Computational Lexical Semantics  a clear notion of semantic well-formedness will be necessary in order to characterize a theory of possible word meaning;  lexical semantics must look for representations that are richer than thematic role descriptions;  lexical semantics must study all syntactic categories in order to characterize the semantics of natural language.

7 Lexicons and Lexical Analysis (35) Generative Lexicon (5) Sense Enumeration Lexicon (SEL) A lexicon L is a SEL if and only if for every word w in L, having multiple senses s 1, s 2, …, s n associated with that word, then the lexical entries expressing these senses are stored as {w s 1, w s 2, …, w s n }. That means it is to allow the lexicon to have multiple listings of words, each annotated with a separate meaning or lexical sense.

8 Lexicons and Lexical Analysis (36) Generative Lexicon (6) Lexical Descriptions (Nouns) bank 1 CAT = count_noun GENUS = financial_institution bank 2 CAT = count_noun GENUS = shore

9 Lexicons and Lexical Analysis (37) Generative Lexicon (7) Lexical Descriptions (Verbs) The bank will lend the money to the customer. lend 1 CAT = verb SEM = R 0 (  1,  2,  3 ) ARG1 = np [+financial_institution] ARGSTR = ARG2 = np [+money] ARG3 = np [+human]

10 Lexicons and Lexical Analysis (38) Generative Lexicon (8) Contrastive Ambiguity [Weinreich, 1964] (1) A lexical item accidently carries two distinct and unrelated meanings. Ex. 1: a) Drop me a line when you are in Boston. (a short personal letter) b) We built a fence along the property line. (a boundary)

11 Lexicons and Lexical Analysis (39) Generative Lexicon (9) Contrastive Ambiguity [Weinreich, 1964] (2) Ex. 2: a) The discussion turned on the feasibility of the scheme. (depend upon) b) The bull turned on the matador. (become hostile towards)

12 Lexicons and Lexical Analysis (40) Generative Lexicon (10) Complementary Polysemies [Weinreich, 1964] (1) The lexical senses clearly show the same basic meaning of a word as it occurs in different contexts. Ex. 1: a) The bank raised its interest rates yesterday. (an institution) b) The store is next to the newly constructed bank. (a building)

13 Lexicons and Lexical Analysis (41) Generative Lexicon (11) Complementary Polysemies [Weinreich, 1964] (2) Ex. 2: a) The farm will fail unless we receive the subsidy promised. (workplace) b) To farm this land would be both foolish and without reward. (working mode)

14 Lexicons and Lexical Analysis (42) Generative Lexicon (12) Complementary Polysemies [Weinreich, 1964] (3) Note: The logical polysemy denotes complementary ambiguity where there is no change in lexical category, and the multiple senses of the word have overlapping, dependent, or shared meanings.

15 Lexicons and Lexical Analysis (43) Generative Lexicon (13) New Definition for SEL A lexicon L is a SEL if and only if for every word w in L, having multiple senses s 1, s 2, …, s n associated with that word, then: a)if s 1, s 2, …, s n are contrastive senses, the lexical entries expressing these senses are stored as w s1, w s2, …, w sn. b)if s 1, s 2, …, s n are complementary senses, the lexical entry expressing these senses is stored as w {s1, s2, …, sn}.

16 Lexicons and Lexical Analysis (44) Generative Lexicon (14) Complementary Senses Ex. a) John ate lamb for breakfast. b) The lamb is running in the field. lamb SENSE 1 = CAT = mass_noun GENUS = meat SENSE 2 = CAT = count_noun GENUS = animal

17 Lexicons and Lexical Analysis (45) Generative Lexicon (15) Limitations of SEL  The creative use of words: Words assume new senses in novel contexts;  The permeability of word senses: Word senses are not atomic definitions but overlap and make reference to other senses of the word;  The expression of multiple syntactic forms: A single word sense can have multiple syntactic realization.

18 Lexicons and Lexical Analysis (46) Generative Lexicon (16) Creative Use of Words (1) Ex. 1.The island authorities sent out a fast little government boat, the Culpeper, to welcome us: a boat driven quickly or a boat that is inherently fast 2.a fast typist: a person who performs the act of typing quickly 3.Rackets is a fast game: the motions involved in the game are rapid and swift 4.a fast book: one that can be read in a short time

19 Lexicons and Lexical Analysis (47) Generative Lexicon (17) Creative Use of Words (2) fast 1 to move quickly Ex. 1 fast 2 to perform some acts quickly Ex. 2, 3 fast 3 to do something that takes little time Ex. 4

20 Lexicons and Lexical Analysis (48) Generative Lexicon (18) Creative Use of Words (3) Ex. 5.The Autobahn is the fastest motorway in Germany. 6.I need a fast garage for my car, since we leave on Saturday. Ex. 5: fast 4 : the ability of vehicles on the motorway to sustain high speed (new sense for novel context) Ex. 6: fast 2 + fast 3 : the length of time needed for a repair by the garage

21 Lexicons and Lexical Analysis (49) Generative Lexicon (19) Creative Use of Words (4) Conclusion: Enumeration is unable to exhaustively list the senses that adjectives or verbs assume in new context!

22 Lexicons and Lexical Analysis (50) Generative Lexicon (20) Permeability of Word Senses (1) Even if we were to assume that sense enumeration were adequate as a descriptive mechanism, it is not always obvious how to select the correct word sense in any given context. Ex. 1.John baked the potatoes. (change-of-state) 2.Mary baked a cake. (creation)

23 Lexicons and Lexical Analysis (51) Generative Lexicon (21) Permeability of Word Senses (2) Ex. 1.Mary cooked a meal. 2.Mary cooked the carrots. 3.John fried an omelet. 4.John fried an egg.

24 Lexicons and Lexical Analysis (52) Generative Lexicon (22) Permeability of Word Senses (3) Problems: 1.There is too much overlap in the core semantic components of the different readings; 2.It is lacks any appropriate or natural level of abstraction.

25 Lexicons and Lexical Analysis (53) Generative Lexicon (23) Expression of Multiple Syntactic Forms (1) The following sentences show that the syntactic realization of the verb’s complement determines how the proposition is interpreted semantically. 1.Madison Avenue is apt to forget that most folks aren’t members of the leisure class. (factive)

26 Lexicons and Lexical Analysis (54) Generative Lexicon (24) Expression of Multiple Syntactic Forms (2) 2.But like many others who have made the same choice, he forgot to factor one thing into his plans: Caliphobia. (non- factive) 3.What about friends who forget the password or never got it? (concealed question)

27 Lexicons and Lexical Analysis (55) Generative Lexicon (25) Expression of Multiple Syntactic Forms (3) forget 1 forget 2 CAT = verb SEM = R 2 (  1,  2 [-FACTIVE]) SEM = R 2 (  1,  2 [+FACTIVE]) ARGSTR = AGR1 = np AGR2 = vp[+inf] AGR2 = s[+tns]

28 Lexicons and Lexical Analysis (56) Generative Lexicon (26) Expression of Multiple Syntactic Forms (4) forget 3 CAT = verb SEM = R 3 (  1,  2 ) ARGSTR = AGR1 = np AGR2 = np The proper expression is to have one definition for “forget” which could generate all the allowable readings by suitable composition with the different complement types.

29 Lexicons and Lexical Analysis (57) Generative Lexicon (27) Expression of Multiple Syntactic Forms (5) The range of subjects possible with causative and experiencer verbs: Ex. 1.Driving a car in Shanghai frightens me. 2.Driving frightens me. 3.John’s driving frightens me. 4.Cars frightens me.

30 Lexicons and Lexical Analysis (58) Generative Lexicon (28) Expression of Multiple Syntactic Forms (6) The syntactic argument to a verb is not always the same logical argument in the semantic relation.

31 Lexicons and Lexical Analysis (59) Generative Lexicon (29) Description Languages for Lexical Semantics (1) 1.Monomorphic Languages (MLs)  The lexical items and complex phrases are provided a single type and denotation.  Lexical ambiguity is handled by having a multiple listing of the word. Such as the description language used in SEL.

32 Lexicons and Lexical Analysis (60) Generative Lexicon (30) Description Languages for Lexical Semantics (2) 2.Unrestricted Polymorphic Languages (UPLs)  No restriction on the type that a lexical item may assume.  No operational distinction between subclasses of polymorphic transformations. For example, the theory of Searle (1979).

33 Lexicons and Lexical Analysis (61) Generative Lexicon (31) Description Languages for Lexical Semantics (3) 3.Weakly Polymorphic Languages (WPLs)  All lexical items are semantically active, and have a richer typed semantic representation than conventionally assumed;  Semantic operations of lexically-determined type changing operate under well-defined constraints;

34 Lexicons and Lexical Analysis (62) Generative Lexicon (32) Description Languages for Lexical Semantics (4)  Different subclasses of polymorphic operations are defined, each with independent properties and conditions on their applications.

35 Lexicons and Lexical Analysis (63) Generative Lexicon (33) Description Languages for Lexical Semantics (5) What natural language data seem to require is a semantic system falling outside of ML, but well below the language of UPL: ML  WPL  UPL It is important to look more at the generative or compositional aspects of lexical semantics, rather than decomposition into a specified number of primitives (senses).

36 Lexicons and Lexical Analysis (64) Generative Lexicon (34) References (1) J. Pustejovsky Issues in Computational Lexical Semantics. In the Proc. of the Fourth European ACL Conference, Manchester, England. J. Pustejovsky The Generative Lexicon. Computational Linguistics, Vol. 17, pages J. Pustejovsky & B. Boguraev Lexical Knowledge Representation and Natural Language Processing. Artificial Intelligence, Vol. 63, pages

37 Lexicons and Lexical Analysis (65) Generative Lexicon (35) References (2) J. Pustejovsky & B. Boguraev Lexical Semantics in Context. Journal of Semantics, Vol. 12, pages 1-14.

38 Lexicons and Lexical Analysis (66) Assignments (3) 1.Please define some lexical items (nouns or verbs) for the following words in a SEL: begin, plane, pilot 2.Based on the above SEL, give a sentence example for each lexical item.