FrameNet What, how and why C. J. Fillmore C.F.Baker.

Slides:



Advertisements
Similar presentations
Grammar and Sentences “It is impossible ..to teach English grammar in the schools for the simple reason that no one knows exactly what it is” Government.
Advertisements

CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
CODE/ CODE SWITCHING.
Linguistic Theory Lecture 11 Explanation.
Why study grammar? Knowledge of grammar facilitates language learning
Grade 2 Common Core I Can Statements… 1. Second Grade Common Core… The Next Generation Strand: Reading: Literature RL.2.1 –
Jing-Shin Chang National Chi Nan University, IJCNLP-2013, Nagoya 2013/10/15 ACLCLP – Activities ( ) & Text Corpora.
Introduction: A discourse perspective on grammar
The SALSA experience: semantic role annotation Katrin Erk University of Texas at Austin.
Introduction to phrases & clauses
Albert Gatt LIN3021 Formal Semantics Lecture 5. In this lecture Modification: How adjectives modify nouns The problem of vagueness Different types of.
Steven Schoonover.  What is VerbNet?  Levin Classification  In-depth look at VerbNet  Evolution of VerbNet  What is FrameNet?  Applications.
Linguistic Theory Lecture 8 Meaning and Grammar. A brief history In classical and traditional grammar not much distinction was made between grammar and.
Running Records.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Semantic Frames: FrameNet. What is FrameNet? FrameNet is an ongoing project at the International Computer Science Institute located in Berkeley California.
Do you suffer from judgement creep? A group moderation session will soon put you right!
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Test Taking Tips How to help yourself with multiple choice and short answer questions for reading selections A. Caldwell.
Albert Gatt LIN 3098 Corpus Linguistics. In this lecture Some more on corpora and grammar Construction Grammar as a theoretical framework Collostructional.
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Introduction to English Syntax Level 1 Course Ron Kuzar Department of English Language and Literature University of Haifa Chapter 2 Sentences: From Lexicon.
Linguistics, Pragmatics & Natural Grammar
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
1 How to Compute the Meaning of Natural Language Utterances Patrick Hanks, Research Institute of Information and Language Processing, University of Wolverhampton.
Probabilistic Parsing Reading: Chap 14, Jurafsky & Martin This slide set was adapted from J. Martin, U. Colorado Instructor: Paul Tarau, based on Rada.
How conversation works Conversational English Compiled by Victor Nickolz Grand Lyceum 2004 For classes 7-11.
Unit #1: Chapters 1 & 2 The Secret Life of a High School Grammar Snob.
FrameNet The work. The product. The applications. CJFillmore - ICSI/Berkeley.
Interpreting Dictionary Definitions Dan Tecuci May 2002.
Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Introduction to Philosophy Lecture 3 Formalizing an argument By David Kelsey.
The Current State of FrameNet CLFNG June 26, 2006 Fillmore.
UNIT 7 DEIXIS AND DEFINITENESS
Unit 5 : PREDICATES.
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
11 Chapter 19 Lexical Semantics. 2 Lexical Ambiguity Most words in natural languages have multiple possible meanings. –“pen” (noun) The dog is in the.
Rhetorical Framework Purpose Audience Situation Persona/Ethos Message.
Artificial Intelligence: Natural Language
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Wordnet - A lexical database for the English Language.
Rules, Movement, Ambiguity
F.I.N.A.L R.E.V.I.S.I.O.N. Today’s Agenda- Warm- up : Tea Party game- review grammar Review vocabulary/writing – PPT- (Turn into passive voice) Call my.
Introduction Chapter 1 Foundations of statistical natural language processing.
UNIT 2 - IMPLICATURE.
WORDS The term word is much more difficult to define in a technical sense, and like many other linguistic terms, there are often arguments about what exactly.
Communicative and Academic English for the EFL Professional.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
Thursday, September 16, Announcements Movie night, Friday September 25, at 6:00 pm. Bring your family and friends. There will be vocabulary review.
© 2015 albert-learning.com How to talk to your boss How to talk to your boss!!
This I Believe Writing Workshop Notes. Personal Writing Personal writing: –Communicates a central idea that has a deep personal meaning to the writer.
Lec. 10.  In this section we explain which constituents of a sentence are minimally required, and why. We first provide an informal discussion and then.
Use of Concordancers A corpus (plural corpora) – a large collection of texts, written or spoken, stored on a computer. A concordancer – a computer programme.
Lecture IV. Basic Translation Theories Plan 1. The Transformational Approach 2. The Denotative Approach 3. The Communicational Approach.
Natural Language Processing Vasile Rus
LANGUAGE DESCRIPTIONS
Vocabulary Module 2 Activity 5.
Traditional Grammar VS. Generative Grammar
Structure of a Lexicon Debasri Chakrabarti 13-May-19.
Information Retrieval
Presentation transcript:

FrameNet What, how and why C. J. Fillmore C.F.Baker

Who pays us?  International Computer Science Institute  National Science Foundation  Defense Advanced Research Projects Agency

Related Projects  WordNet (Princeton) – words grouped into “synsets” – relations between synsets  Proposition Bank (Pennsylvania) – Penn TreeBank – semantic annotation  FrameNet – words related to “frames” – valence descriptions uniting frame info and syntax

Background assumption Hypothesis: People understand things by performing mental operations on what they already know. Such knowledge is describable in terms of information packets called frames.

Lexicon Building  FrameNet is a lexicon-building project for English, relating words to their meanings (via the “frames” that underlie them), recording the ways in which phrases and sentences are built around them, using the evidence found in large samples of modern English usage.

The core work of FrameNet 1.characterize frames 2.find words that fit the frames 3.develop descriptive terminology 4.extract sample sentences 5.annotate selected examples 6.derive "valence" descriptions

FrameNet “Frames”  Intuitively, the frames we mainly work with stand for more-or-less fine-grained situation types, and the concepts we use in describing them are defined relative to the individual frame.  Here are some examples of frames and their constituent frame elements.

Finding words that belong in a given frame  We look for words in the language that bring to mind the individual frames.  We say that the words evoke the frames.

“Words”?  But first there’s an enemy we have to deal with: polysemy, lexical ambiguity, multiple meanings of a single “word”.  Instead of words, we work with lexical units (LUs), each of these being a pairing of a word with a sense.  In WordNet different LUs belong to different synsets; in FrameNet different LUs (typically) belong to different frames.

Polysemy  FrameNet is at the “splitting” end of the “splitting” versus “lumping” continuum when it comes to the monosemy/polysemy.  It’s generally assumed that for IR purposes both WN and FN make too many distinctions.  Here are some examples of how we think when trying to determine the separateness of lexical units with the same form?

Discernible meaning differences. If a word communicates different meanings in different contexts – and the difference isn’t explained by the contexts – maybe the word has more than one meaning. 1.She earns a lot less than she deserves. 2.I made a lot of money, but I earned it. The second sentence conveys the idea that the amount of money earned was appropriate.

How many meanings for replace?  ‘put (sth) back where it belongs’  ‘occupy a position formerly occupied by (sth,sbd)’  ‘put something in a position formerly occupied by (sth,sbd)’

John replaced me.

John replaced the telephone.

Just having different argument types in grammatical positions isn’t enough.  Subject as Speaker: Mom explained …, you complained …, she said …, I insist …, the dean informed us …  Subject as Medium: chapter 2 explains …, your letter complains …, the Bible says …, the law insists …, the editorial informed …

Those don’t require separate senses.  The “Medium-as-Subject” examples can be thought of as Metonymy. Thus:  Chapter 2 explains … = The author explains in Chapter 2 that …  Your letter complains that … = You complain in your letter that …

Here’s a different situation: Some - but not all - “verbs of speaking” have a “cognitive” use, identifying sources of beliefs or belief-attitudes, with no actual communicating implied.  The heavy winds explain the number of windmills around here. (*explicate)  These facts argue in favor of your hypothesis. (*reason)(*quarrel)  His repeated absence at meetings suggests that he’s not happy with the job. (*hints)

That is, we take the fact that some but not all words in a particular semantic class have special meaning elaborations argues for a polysemy interpretation in those cases.

Different Complementation Complementation patterns should go with particular meanings of a word.  Medical sense of complain: the patient complained [of back pains]  Official act sense of complain: we complained [to the manager] [about X] she complained [that her checks were late]

Argument omissibility  We would argue that the ordinary sense of give and the ‘contribute’ sense of give should be separated, since they differ in argument omissibility: – Do you want to meet the Red Cross representative? - I already gave. – Did you remember a present for your daughter’s birthday? - *I already gave.

Nominalization differences  If a verb has two different event noun derivatives, and they have different meanings that are also found in the verb, the verb itself should also be described as polysemous.

Nominalization Differences  adhere to a belief: adherence adhere to your skin:adhesion  observe a rule: observance observe the process: observation  commit to a cause: commitment commit sb to an asylum:commitment commit a crime: commission  deliver a package: delivery deliver sb. from danger: deliverance

Support verb differences with nominalizations  argue : quarrel sense associated with have an argument; reasoning sense with make an argument  commit : dedication sense associated with make a commitment; crime/sin sense & incarceration sense, no support verb  complain : symptom report: present a complaint; kvetch: no support verb; official: file a complaint, register a complaint, lodge a complaint

FN work: characterizing frames Let’s work through the Revenge frame.

The Revenge frame The Revenge concept involves a situation in which a)A has done something to harm B and b)B takes action to harm A in turn c)B's action is carried out independently of any legal or other institutional setting

Vocabulary for Revenge  Nouns: revenge, vengeance, reprisal, retaliation, retribution  Verbs: avenge, revenge, retaliate (against), get back (at), get even (with), pay back  Adjectives: vengeful, vindictive  V+N Phrases: take revenge, exact retribution, wreak vengeance

FN work: choosing FE names  We develop a descriptive vocabulary for the components of each frame, called frame elements (FEs).  We use FE names in labeling the constituents of sentences exhibiting the frame.

FEs for Revenge  Frame Definition: Because of some injury to something-or-someone important to an avenger (maybe himself), the avenger inflicts a punishment on the offender. The offender is the person responsible for the injury.  FE List: – avenger, – offender, – injury, – injured_party, – punishment.

Semantic Roles  Notice that we use such situation-specific notions as injury, offender, etc., rather than limiting ourselves to some standard list of thematic roles, like agent, patient, goal, etc.  First, there aren’t enough of those to go around, and if we had to squeeze all the distinctions we find into such a list, – we would waste too much time finding criteria to do the mapping, – and we would have to remember what decisions we’d made.

Collecting examples  We extract from our corpus examples of sentences showing the uses of each word in the frame. We depend on corpus data rather than existing dictionaries or our “intuitions” about the language.  Our main corpus is the British National Corpus; we have recently added lots of newswire text from the Linguistic Data Consortium. The total is about 200M running words.

FN work: annotating examples  We select sentences showing all major syntactic contexts, giving preference to those with common collocations.  Using the names assigned to FEs in the frame, we label the constituents of sentences that express these FEs.  The next slides show what our annotation software looks like.

The list of frames - including “Revenge” CLICK ON “Revenge

List of FEs: CORE: Av, InjP, Inj, Off, Pun List of LUs CLICK ON “avenge” Avenger Offender Injury Injured Party Punishment

List of salient contexts for “avenge” CLICK ON “rcoll-death”

Sentences with “avenge”... “death” CLICKED ON sentence 1

Annotater’s workspace with sentence 1

List of FEs for Revenge CLICK ON “GF”

Streamlined list of grammatical functions CLICK ON “PT”

Two Kinds of “Targets”  Predicates – words that evoke frames, create contexts for fillers of information about frame instances  Fillers – words that (serve as the heads of constituents which) satisfy semantic roles of frames evoked by predicates – many of these evoke frames of their own

Separate kinds of annotation  When targets are predicates: – find the arguments  When targets are fillers: – find the governor – find the enclosing phrase – identify the frame and the FE of that phrase

Valence Variation  We typically find that different words in the same frame show variation in how the frame elements are grammatically realized.

Communication Speaker Addressee Topic Message

Communication Speaker Addressee Topic Message We spoke to Harrison about the crisis.

Communication Speaker Addressee Topic Message We informed Harrison of the crisis.

Communication Speaker Addressee Topic Message We told Harrison there was a crisis.

Communication Speaker Addressee Topic Message What did you talk about ?

Encoding Speaker Message_type Manner

Encoding Speaker Message_type Manner She expressed her request rudely.

Encoding Speaker Message_type Manner She phrased her answer in this way.

Encoding Speaker Message_type Manner How should we word our complaint ?

Back to Revenge

I avenged my brother.

I avenged my brother’s death.

Querying the data: ask about the form given the meaning Through various viewers built on the FN database we can, for example, ask how particular FEs get expressed in sentences evoking a given frame.

By what syntactic means is offender realized?  Sometimes as direct object: – we'll pay you back for that  Sometimes with the preposition on – they'll take vengeance on you  Sometimes with against – we'll retaliate against them  Sometimes with with – she got even with me  Sometimes with at – they got back at you

By what syntactic means is offender realized?  Sometimes as direct object: – we'll pay you back for that  Sometimes with the preposition on – they'll take vengeance on you  Sometimes with against – we'll retaliate against them  Sometimes with with – she got even with me  Sometimes with at – they got back at you It's these word-by- word specializations in FE-marking that make automatic FE recognition difficult.

Querying the data: ask about the meaning given the form Or, going from the grammar to the meaning, we can choose particular grammatical contexts and ask which FEs get expressed in them.

What FE is expressed by the object of avenge?  Sometimes it's the injured_party – I've got to avenge my brother .Sometimes it's the injury – My life goal is to avenge my brother's murder.

Coverage  Lexical coverage. We want to get all of the important words associated with each frame.  Combinatorics. We want to get all of the syntactic patterns in which each word functions to express the frame.

Frequency data  We do not ourselves collect frequency data. That will wait until methods of automatic tagging get perfected.  In any case, the results will differ according to the type of corpus - financial news, children's literature, technical manuals, etc.

What do we end up with?  Frame descriptions  Lexical entries  Annotations

What do we end up with?  Frame descriptions – (which some use for situation ontologies)  Lexical entries – (which some use for lexicon building)  Annotations – (which some use as training corpora for machine learning)

Outreach  Other activities – Success in automating frame analysis of raw text would be valuable for IE, MT, NLU; various groups are experimenting with FN for such purposes.  Other languages – There are FrameNets or FrameNet-like projects for Spanish, German, Japanese, Swedish, Chinese, and apparently Hindi, Romanian, and a few others.