1 Syntagmatic Preferences Patrick Hanks Masaryk University In honour of Yorick Wilks BCS, London, June 22, 2007.

Slides:



Advertisements
Similar presentations
Innovation in Assessment? Why? Poor student feedback regarding feedback timeliness and usefulness Staff workloads Student lack of awareness as to what.
Advertisements

1 Lexical Semantics for the Semantic Web Patrick Hanks Masaryk University, Brno Czech Republic UFAL, Mathematics Faculty, Charles University.
Grammar: Meaning and Contexts * From Presentation at NCTE annual conference in Pittsburgh, 2005.
Task 3 - Style 1.Comment on aspects of the essay style you liked or disliked and give reasons why. 2. In what ways is her style similar or different from.
The nature of Sign and sign/symbol distinction
1 The Generative Lexicon (GL) meets Corpus Pattern Analysis (CPA) Patrick Hanks Institute of Formal and Applied Linguistics, Charles University in Prague,
Mapping meaning onto use: a Pattern Dictionary of English Verbs Patrick Hanks Faculty of Informatics, Masaryk University, Brno, Czech Republic
1 Why do CPA? Patrick Hanks Research Institute for Information and Language Processing, University of Wolverhampton; Bristol Centre for Linguistics, University.
CL Research ACL Pattern Dictionary of English Prepositions (PDEP) Ken Litkowski CL Research 9208 Gue Road Damascus,
Introduction to corpus session General corpora Rosamund Moon: lexicography, polysemy data Alice Deignan Specialised corpora Elena Semino Andreas Musolff.
1 Elliptical Arguments Patrick Hanks Institute of Formal and Applied Linguistics, Charles University in Prague, Czech Republic ***
Statistical Methods and Linguistics - Steven Abney Thur. POSTECH Computer Science NLP Lab Shim Jun-Hyuk.
Albert Gatt LIN3021 Formal Semantics Lecture 5. In this lecture Modification: How adjectives modify nouns The problem of vagueness Different types of.
1 Computing Real Language Meaning for the Semantic Web Patrick Hanks Masaryk University, Brno Czech Republic UFAL, Mathematics Faculty,
Linguistic Theory Lecture 8 Meaning and Grammar. A brief history In classical and traditional grammar not much distinction was made between grammar and.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
August 23, 2010 Grammars and Lexicons How do linguists study grammar?
The Language of Theories Linking science directly to ‘meanings’
Amirkabir University of Technology Computer Engineering Faculty AILAB Efficient Parsing Ahmad Abdollahzadeh Barfouroush Aban 1381 Natural Language Processing.
Talking about your homework News story? –What made you choose…? One of your words? –What made you choose…? (Give your vocabulary books to another student.
EE 399 Lecture 2 (a) Guidelines To Good Writing. Contents Basic Steps Toward Good Writing. Developing an Outline: Outline Benefits. Initial Development.
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
Machine Translation Prof. Alexandros Potamianos Dept. of Electrical & Computer Engineering Technical University of Crete, Greece May 2003.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
What is a good length of string? –Depends on its use How do you design a good length of string? –Can be determined by a process What is a good user interface?
Corpora and Language Teaching
Meaning and Language Part 1.
1. Introduction Which rules to describe Form and Function Type versus Token 2 Discourse Grammar Appreciation.
Albert Gatt LIN 3098 Corpus Linguistics. In this lecture Some more on corpora and grammar Construction Grammar as a theoretical framework Collostructional.
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
Communicative Language Teaching Vocabulary
The DVC project: Disambiguation of Verbs by Collocation ____ an introduction to the linguistic theory of norms and exploitations Patrick Hanks Research.
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
MOOD CHOICES. INTERPERSONAL METAFUCTION OFFER US: Resources for interacting with language. Resources for giving and demanding information or good and.
1 Statistical NLP: Lecture 10 Lexical Acquisition.
1 How to Compute the Meaning of Natural Language Utterances Patrick Hanks, Research Institute of Information and Language Processing, University of Wolverhampton.
Word senses Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of Leeds, Sussex.
Issues in Paraphrasing Postgraduate In-sessional Writing: 4 John Morgan.
© 2003, Carla Ellis Experimentation in Computer Systems Research Why: “It doesn’t matter how beautiful your theory is, it doesn’t matter how smart you.
Lecture 3 THE KEY SKILLS TESTED IN A DISSERTATION.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
D.Hafordpsychlotron.org.uk How to deal with a whole lot of words.
AP STATS: PROJECT Take 5-10 minutes to brainstorm ideas for your final project. Proposals will be due on Friday (rough drafts will be peer edited on Thursday).
Dualism: epiphenomenalism
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
Supporting Researchers and Institutions in Exploiting Administrative Databases for Statistical Purposes: Istat’s Strategy G. D’Angiolini, P. De Salvo,
User Interface Design & Usability for the Web Card Sorting You should now have a basic idea as to content requirements, functional requirements and user.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Cognitive Processes PSY 334 Chapter 11 – Language Structure June 2, 2003.
QUALITATIVE RESEARCH What is the distinction between Inductive and Deductive research? Qualitative research methods – produces observations that are not.
Presentation about pragmatic concepts Implicatures Presuppositions
Introduction Chapter 1 Foundations of statistical natural language processing.
WORDS The term word is much more difficult to define in a technical sense, and like many other linguistic terms, there are often arguments about what exactly.
Practicing Problems and Thinking About Linguistics Billy Clark, Middlesex University, UK Linguistics Olympiad Summer Course Corpus Christi.
Adjusting to University Life and the Challenges of KAC Don’t panic. We did this too!
Revision Lecture Cognitive Science. Past papers What is the answer to the question? The answer will nearly always involve: “How amazing it is that people.
In this lecture, we will learn about: Translation.
On Probabilities in Personalised Medicine: ‘The Problem of Untestable Treatments’ Prof. Darrell P. Rowbottom Philosophy (Head), Lingnan University
Extended Definition Essay. What is a definition essay? It is an extended explanation of an abstract idea, a complicated idea or a controversial word or.
Lec. 10.  In this section we explain which constituents of a sentence are minimally required, and why. We first provide an informal discussion and then.
Figure and Ground Part 2 APLNG 597C LEJIAO WANG 03/16/2015.
Learning and Development HR1007 Session 2 Learning.
SEMANTICS DEFINITION: Semantics is the study of MEANING in LANGUAGE Try to get yourself into the habit of careful thinking about your language and the.
Semantic Roles and Ontologies Ontologies Growing interest in the data structures known as ontologies Language expressions covering the.
Use of Concordancers A corpus (plural corpora) – a large collection of texts, written or spoken, stored on a computer. A concordancer – a computer programme.
PSYC 206 Lifespan Development Bilge Yagmurlu.
Statistical NLP: Lecture 9
RESEARCH BASICS What is research?.
Introduction to Semantics
Presentation transcript:

1 Syntagmatic Preferences Patrick Hanks Masaryk University In honour of Yorick Wilks BCS, London, June 22, 2007

2 What's so important about “My car drinks gasoline”? Violation of “selection restrictions” is normal. So selectional restrictions aren't restrictions at all –They are, in fact selectional preferences –Different combinations of selectional preferences activate different senses Yorick's insights of the 1970s deserve to be followed up more vigorously and systematically than they have been.

3 A language is a double helix Start from the bottom up: –Let’s look at what the words do. –How do people use words to make meanings? A natural language is a system of norms and exploitations: –Norms: Animals drink water, people drink beverages –Exploitations: My car drinks gasoline Syntagmatic rules governing normal linguistic behaviour systematically interact with exploitation rules governing how those norms are exploited

Patterns of linguistic behaviour Normal linguistic behaviour is highly patterned. Words in isolation have meaning potential, not meaning –A meaning potential is a more or less vague cluster of possibilities – e.g. what does fire mean? –A burning process? (and if so is it a good thing – in a house, under control – or a bad thing, raging out of control in a forest?) An electric heater? A sense of enthusiasm? Dismiss from employment? Operate a gun? Shoot an arrow? Cause to enthuse? Bake? –All of these and more. –Much overlap. –Sense enumeration doesn’t get it (cf Pustejovsky’s lexical conceptual paradigms) In context, the range of possible interpretations of a word is severely limited: –People firing guns, ideas that fire people with enthusiasm, employers firing their staff, firing pottery in a kiln

Word Use, Meaning, and Linguistic Theory The normal uses of a word can be grouped into patterns, and meanings can be associated with the patterns (rather than the word in isolation) So far they haven’t been. Why not? –Lack of evidence Lexical analysis can only be done effectively with large corpora –Tradition and intuition direttissimo assaults on word meaning No one thought to go the long way round, via patterns –The tyranny of “all and only” Lexicographers aimed to cover all possible uses, not just all normal uses NLP and linguistic theory focused on boundary cases –Syntactocentrism in linguistic theory misses the point about syntagmatics –Lack of a suitable theory Aha! Preference Semantics provides the basis for such a theory We should take PS seriously and ally it with other relevant theoretical work (Wittgenstein, Putnam, Rosch, Sinclair, Hoey, Pustejovsky, …)

6 Why is a Pattern Dictionary Necessary? Standard dictionaries do not provide the contexts that distinguish one sense of a word from another. –very poor syntagmatic information –give equal prominence to normal and merely possible senses –definitions (and senses) are not mutually exclusive WordNet: synsets ≠ word senses! FrameNet: frames ≠ word senses!

7 Identifying norms is hard... and boring –The painful rediscovery of the obvious, –which is only obvious when pointed out Only by painstaking corpus analysis is identifying norms possible. What counts as a normal use of any verb? – e.g. drink

8 Norms for 'drink', v. 1.55% [[Human]] drink [[{Liquid = Water} | Beverage]] 2. 4% [[Animal]] drink [[Liquid = Water]] 3.39% [[Human]] drink [NO OBJ] 4. 1% [[Human]] drink [[Experience]] {in} 5. 1% [[Human]] drink ([[Liquid = Beverage]]) {up}

9 Some Exploitations of 'drink' A metaphor (or literary allusion): The child of a nonconformist father learnt to drink deep of the Catholic tradition. –Owen Chadwick, Michael Ramsey: a life. A coercion: ` He knows them all, ' she says adoringly, ` and they all drink shampoo -- nearly every night. –The Guardian, 1989.

10 How pervasive is ambiguity? Not as pervasive as you might think. –If we attach meanings to patterns, not to words, most “ambiguities” don't get a chance to rear their ugly heads. But here's one: He drank. Could be a null-object alternation of “he drank [[Beverage]]” or it could mean that he had a problem with alcohol (pattern 2)

11 Getting the right level of generalization is hard “John fired at a line of stags” Corpus evidence shows that fire at does not prefer ANIM in the prepositional object slot. Any PHYSOBJ will do. Building a pattern dictionary is a constant struggle to get “the right level” (or at least an acceptable level ) of generalization Art is required to choose a level. There are no right answers (no absolutes). –But plenty of wrong ones!

12 Semantic Types and Semantic Roles fire at assigns the semantic role “Target” to words of semantic type [[Physical Object]] Semantic types are the intrinsic prototypical values of nouns – their essences Semantic roles are assigned by context

13 Word Meaning: a complex linguistic Gestalt In the mind of an English speaker, the verb land is primed for any or all of the following: –passengers land from a plane – the pilot lands the plane – the plane lands – we landed at Heathrow – passengers land from a boat (but more probably they are soldiers) – a commander lands his troops (but not from a plane) – a boat lands its cargo – a trawler lands its catch – an angler lands a fish – Yorick landed the role of Caliban – He landed a job in Sheffield – someone else may land in trouble – or be landed with a problem – and someone may even land a blow on your nose

14 Imposing order on chaos In the Pattern Dictionary: Verbs are sorted into patterns Exploitations are flagged for later analysis Nouns (“lexical sets”) are clustered into an ontology The ontology is “distorted” by usage Lexical sets “shimmer”

15 Lexical Sets “shimmer” [[Human]] attend [[Event]] –Lexical set [[Event]] = { meeting, conference, funeral, ceremony, course, school, seminar, lecture, session, class, rally, dinner, hearing, briefing, reception, workshop, wedding, inquest, summit, concert, event, premiere, …} [[Human]] participate {in [[Event]]} –Lexical set [[Event]] = {debate, election, exercise, coup, demonstration, activity, process, conference, consultation, selection, meeting, …} [[Human]] hail [[Event]] –Lexical set [[Event]] = {victory, success, agreement, vote, opening, development, result, start, resurgence, …}

16 Patterns are contrastive 2% [[Human]] launch [[Boat]] 7% [[Human]] launch [[Projectile]] 58% [[Human | Institution]] launch [[Activity | Plan]] 24% [[Institution]] launch [[{Artifact = Product} | {Activity = Service}]]

17 What is a Pattern Dictionary? a inventory of all normal patterns of verb use –not all possible uses. an ontology of “shimmering” lexical sets (clusters of nouns according to semantic type and argument roles) an inventory of semantically motivated syntagmatic distinctions

18 Tools needed to build a Pattern Dictionary A balanced corpus of the language (i.e. general language) A theory –An initial lexical architecture that guides clustering Wilks, Pustejovsky, Sinclair, … –A lexical model that distinguishes norms from exploitations A methodology: Corpus Pattern Analysis –Hanks 2004, Hanks and Pustejovsky 2005 –Including statistical corpus analysis Church and Hanks 1989, Kilgarriff et al. 2004, 2005 A shallow ontology –A hierarchical organization of semantic types, reflecting word groupings, not scientific conceptualization of the universe A suite of corpus tools: Manatee, Bonito, Word Sketch Engine Kilgarriff, Rychlý

19 CPA procedure Create a sample concordance (KWIC index) for a word: –250 examples of actual uses of the word Identify the typical syntagmatic patterns. Assign each line of the sample to one of the patterns. Take further samples if necessary. –Introspection is used to interpret data, but not to create data. Store the pattern in the entry manager.

20 In CPA, every line in the sample must be classified The choices are: Norms Exploitations Alternations Names (Midnight Storm: name of a horse, not a storm) Mentions (to mention a word or phrase is not to use it) Errors (e.g. learned mistyped as leaned) Unassignables –See Proceedings of the Eleventh EURALEX International Congress, pages 105–116, Lorient, France, 2004.

21 How normal are norms? How frequent are exploitations? Roughly 75% of all clauses activate “primary norms” About 20% activate secondary norms –including conventional metaphors –and some expressions that may once have been exploitations themselves About 4% of all clauses involve exploitations of various sorts –dynamic metaphors, other tropes, coercions, ellipsis, etc. About 1% of all clauses are unclassifiable

22 Browsing and Feedback The English Pattern Dictionary Browse the first 50 verbs at –Login and password are both “guest” –Click on the pattern number to see the whole pattern –Click on “lines” to see supporting corpus evidence 50 verb entries have been completed and released –Feedback, please! 400 additional entries have been analysed, awaiting release –A shallow ontology has been drafted and is being edited –But not populated with nouns yet –6500 verbs remain to be analysed EPD will not include rare words like saltate or saccharify