FrameNet, PropBank, VerbNet Rich Pell

FrameNet, PropBank, VerbNet
- When syntactic information is not enough
- Lexical databases
- Annotate a natural language corpus with semantic information
- Largely manual classification efforts

Outline
- FrameNet
  - Frame Semantics
  - Overview and Demo
  - Applications
- PropBank
  - Overview and Demo
  - Applications
- VerbNet
  - Levin Classes
  - Frame Demo
- Conclusion

FrameNet

Frame Semantics
- Sell (v) – to exchange an item for money or its equivalent
- Vce (n) – estimator of reactivity delta due to voids in moderator
- A definition is of little use without background knowledge about the word:
  - Entities involved: buyer, seller, item, money
  - Relationships between those entities:
    - Buyer gives money to seller
    - Seller gives item to buyer
    - Buyer believes value of item >= monetary amount
    - Seller believes value of item <= monetary amount

Semantic Frame
- Description of an event, relation, or entity and its participants
- Captures the ‘essential knowledge’ of a given word sense
- Developed by Charles Fillmore

FrameNet Overview
- Attempt to represent frame semantics in a database readable by both humans and machines
- Developed by Charles Fillmore at Berkeley’s International Computer Science Institute
- Founded in 1997
- Funded by National Science Foundation and DARPA
- Freely available via web interface or download

FrameNet Overview
- Set of semantic frames
- Each frame is composed of frame elements (FEs): roles within the frame
- Words that evoke a frame are called lexical units (LUs); each LU represents one sense of a given word
- Frame: Commerce_sell
  - FEs: buyer, seller, item, money, place, reason…
  - LUs: auction.v, retail.v, vend.v…
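The frame / FE / LU relationship above can be pictured as a small data structure. This is only an illustrative sketch: the `Frame` class and `frames_evoked_by` helper are hypothetical names, not part of FrameNet's own distribution, and the FE and LU sets contain just the items listed on the slide.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Frame:
    """A semantic frame: a name, its frame elements, and its lexical units."""
    name: str
    frame_elements: Set[str] = field(default_factory=set)
    lexical_units: Set[str] = field(default_factory=set)


# Populated only with the examples given on the slide (the real frame has more).
commerce_sell = Frame(
    name="Commerce_sell",
    frame_elements={"buyer", "seller", "item", "money", "place", "reason"},
    lexical_units={"auction.v", "retail.v", "vend.v"},
)


def frames_evoked_by(lexical_unit: str, frames: List[Frame]) -> List[str]:
    """Return the names of all frames that list the given lexical unit."""
    return [f.name for f in frames if lexical_unit in f.lexical_units]


print(frames_evoked_by("vend.v", [commerce_sell]))
```

A lexical unit is written as lemma plus part of speech (`vend.v`), so the same lemma can evoke different frames in different senses.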

Frames
- Definition
- Core/non-core frame elements
  - Definition and examples
- Frame-frame relations
- Lexical Units

Frame-Frame Relations
- Inheritance – IS-A relation
  - Child frame is subtype of parent frame
  - Each frame element in parent has corresponding frame element in child
  - Revenge inherits from Rewards_and_punishments
- Using – child frame presupposes parent frame as background
  - Speed presupposes Motion
  - No one-to-one correspondence between FEs
- Subframe – child frame is subevent of complex event represented by parent
  - Criminal_process -> Arrest, Arraignment, Trial, Sentencing
- Perspective-on – child frame presents a neutral parent frame from one participant's point of view
  - Commerce_sell provides a perspective on Commerce_goods_transfer
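One way to picture these relations is as labeled edges between frames. The sketch below is an assumption-laden illustration, not FrameNet's storage format: the triple layout and the `children` / `parents` helpers are hypothetical, and only the Inheritance, Using, and Subframe examples from the slide are included.

```python
from typing import List, Tuple

# (child, relation, parent) triples taken from the slide's examples.
RELATIONS: List[Tuple[str, str, str]] = [
    ("Revenge", "Inheritance", "Rewards_and_punishments"),
    ("Speed", "Using", "Motion"),
    ("Arrest", "Subframe", "Criminal_process"),
    ("Arraignment", "Subframe", "Criminal_process"),
    ("Trial", "Subframe", "Criminal_process"),
    ("Sentencing", "Subframe", "Criminal_process"),
]


def children(parent: str, relation: str) -> List[str]:
    """Frames linked to `parent` as children under the given relation."""
    return [c for c, r, p in RELATIONS if p == parent and r == relation]


def parents(child: str, relation: str) -> List[str]:
    """Frames linked to `child` as parents under the given relation."""
    return [p for c, r, p in RELATIONS if c == child and r == relation]


print(children("Criminal_process", "Subframe"))
print(parents("Revenge", "Inheritance"))
```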

Text Annotation
- [Seller Bob] auctioned [Goods the clock] [Buyer to John]
- [Item Colgate’s stock] rose [Difference $3.64] [Final_value to $49.94]
- reduction [Item of debt levels] [Value_2 to $665 million] [Value_1 from $2.6 billion]
- [Sleeper They] [Copula were] asleep [Duration for hours]
- He took a packet of Woodbines out of the breast pocket of [Wearer his] [Material cotton] [Garment shirt] and lit one.
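The bracketed notation used in these examples is easy to read mechanically. The following is a minimal sketch, assuming every annotation has the form `[Role filler text]` with no nested brackets; the `parse_annotation` function is a hypothetical helper, not a FrameNet tool.

```python
import re
from typing import List, Tuple

# "[Role filler]": the role name is one token, the filler runs to the closing bracket.
ANNOTATION = re.compile(r"\[(\w+) ([^\]]+)\]")


def parse_annotation(text: str) -> Tuple[str, List[Tuple[str, str]]]:
    """Return the sentence without brackets and the list of (role, filler) pairs."""
    roles = [(m.group(1), m.group(2)) for m in ANNOTATION.finditer(text)]
    plain = ANNOTATION.sub(lambda m: m.group(2), text)
    return plain, roles


plain, roles = parse_annotation(
    "[Seller Bob] auctioned [Goods the clock] [Buyer to John]"
)
print(plain)
print(roles)
```

The unbracketed word (`auctioned`) is the target lexical unit that evokes the frame; everything in brackets fills one of the frame's elements.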

Development
- Characterize frames
- Find words that fit the frames (lexical units)
- Extract sample sentences
  - British National Corpus (editorials, sermons, textbooks, advertisements, novels)
  - Linguistic Data Consortium (US newswire texts)
  - American National Corpus
  - ~200 million words
- Annotate selected examples

Progress
- 1000 linked semantic frames comprising:
  - 10,000 lexical units
  - 170,000 manually annotated sentences
- Ports to other languages
  - Spanish, German, Chinese, Japanese

Uses
- Semantic role assignment
- Natural language understanding
- Machine translation
- Part of speech tagging
- Textual entailment
- Information extraction
- NLP applications where a syntactic parse will not suffice

PropBank

- Adds a semantic layer to Penn Treebank
- Attempts to capture accurate predicate-argument structure by annotating predicates and the semantic roles of their arguments
- Annotates predicates (verbs) and their arguments:
  - John broke the window -> broke(arg0 = John, arg1 = the window)
  - The window broke -> broke(arg1 = the window)
- Developed in 2001 at the University of Pennsylvania
  - Martha Palmer, Paul Kingsbury
- Free, open-source, downloadable
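The two "broke" examples show why the numbered arguments matter: the window keeps the same role whether it is the syntactic object or the subject. A small sketch of that idea follows; the `Proposition` class and `shared_roles` helper are hypothetical names for illustration and do not reflect PropBank's file format.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Proposition:
    """A predicate (rel) with its labeled arguments."""
    rel: str
    args: Dict[str, str]

    def __str__(self) -> str:
        inner = ", ".join(f"{role} = {filler}" for role, filler in self.args.items())
        return f"{self.rel}({inner})"


def shared_roles(a: Proposition, b: Proposition) -> Dict[str, str]:
    """Roles that carry the same filler in both propositions."""
    return {role: filler for role, filler in a.args.items()
            if b.args.get(role) == filler}


transitive = Proposition("broke", {"arg0": "John", "arg1": "the window"})
intransitive = Proposition("broke", {"arg1": "the window"})

print(transitive)
print(intransitive)
print(shared_roles(transitive, intransitive))
```

A purely syntactic analysis would label "the window" as object in one sentence and subject in the other; the argument labels make the shared role explicit.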

PropBank Structure
- PropBank is a set of frame files
- Each frame file contains one or more PropBank verb senses (aka frameset or roleset ID)
- Each verb sense is annotated with:
  - Semantic roles for each argument of a predicate
  - Examples
  - Links to other lexical tools (FrameNet, VerbNet)

PropBank Arguments
- Standardized as much as possible
  - Arg0 = agent
  - Arg1 = patient
  - Arg2 = instrument/attribute
  - Arg3 = starting point/attribute
  - Arg4 = ending point
  - ArgM = modifier
- Obama met him privately in the White House, on Thursday.
  - Rel: met
  - Arg0: Obama
  - Arg1: him
  - ArgM-MNR: privately
  - ArgM-LOC: in the White House
  - ArgM-TMP: on Thursday

PropBank Example
…the campaign is drawing fire from anti-smoking advocates…
- Arg0: the campaign
- Rel: drawing
- Arg1: fire
- Arg2-from: anti-smoking advocates

PropBank Example
- They-1 have *trace*-1 to sell when things look like they're falling.
  - *trace* sell when things look like they're falling
- A painting by August Strindberg sold at auction in Stockholm.
  - A painting by August Strindberg sold at auction in Stockholm

Differences From FrameNet
- Verb-specific
  - Each verb is its own predicate
- Closer to syntactic parse
- More thorough but simpler annotation of corpus

PropBank Progress
- 3500 verbs annotated
- Work on translating to Dutch, Arabic
- Applications:
  - Semantic role labeling
  - Knowledge discovery
  - Semantic parsing

VerbNet

- Lexicon of English verbs
- Groups verbs based upon shared syntactic behavior
- 5800 verbs in 270 verb classes
- Based on Levin classes and their extensions
- Developed by Karin Kipper-Schuler at University of Pennsylvania via NSF and DARPA grants
- Free, open source, downloadable

Levin Classes
- English Verb Classes and Alternations, Beth Levin, 1993
- Syntactic behavior of a verb is based upon its meaning
- Verbs can be grouped syntactically, by how they combine with specific subjects, objects, and prepositions, and the verbs in a group can be expected to share some semantic similarity
- e.g. Locative alternation – involves moving something into or onto a location
  - Verbs of placement and covering
  - Scatter, pump, hang, drizzle, cram, load
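The grouping idea can be sketched as bucketing verbs by the set of alternations they take part in. The table below is a toy assumption made for illustration (three verbs from the slide plus "break" as a contrast case), and `group_by_alternations` is a hypothetical helper, not Levin's procedure or data.

```python
from collections import defaultdict
from typing import Dict, FrozenSet, List, Set

# Toy, hand-written table: verb -> alternations it participates in (illustrative only).
VERB_ALTERNATIONS: Dict[str, Set[str]] = {
    "load": {"locative"},
    "cram": {"locative"},
    "scatter": {"locative"},
    "break": {"causative_inchoative"},
}


def group_by_alternations(
    verb_alternations: Dict[str, Set[str]]
) -> Dict[FrozenSet[str], List[str]]:
    """Put verbs with an identical alternation set into the same group."""
    groups: Dict[FrozenSet[str], List[str]] = defaultdict(list)
    for verb, alternations in sorted(verb_alternations.items()):
        groups[frozenset(alternations)].append(verb)
    return dict(groups)


for signature, verbs in group_by_alternations(VERB_ALTERNATIONS).items():
    print(sorted(signature), verbs)
```

Verbs that land in the same bucket behave alike syntactically, which is the cue Levin uses to hypothesize a shared meaning component.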

VerbNet Roles
- Groups verbs based upon Levin classes
- Adds semantic role labels to Levin classes, e.g.
  - Agent – actor in an event who carries out the event
  - Theme – undergoer that is central to an event or state and has no control over the way the event occurs
  - Destination – goal that is a concrete, physical location
  - …
  - 23 roles in total
- Illustrate the “who what how when where” information contained in a sentence
- Analogous to FrameNet’s frame elements or PropBank’s numbered arguments

VerbNet Classes
- Set of member verbs
- Thematic roles used in predicate-argument structure of verbs in the class
- Selectional restrictions on the roles
  - “Sam drank a coffee.”
  - “Sam drank a car.”
- Set of frames:
  - Brief description
  - Example
  - Syntactic description
  - Set of semantic predicates, including a temporal function indicating when a predicate is true
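Selectional restrictions can be read as required features on whatever fills a role, which is why "Sam drank a car" sounds wrong. The sketch below assumes hypothetical feature names, role names, and a hand-written noun lexicon; it does not use VerbNet's actual restriction inventory.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical features for a few nouns (illustrative, not from VerbNet).
NOUN_FEATURES: Dict[str, Set[str]] = {
    "Sam": {"animate"},
    "coffee": {"liquid"},
    "car": {"concrete"},
}

# Hypothetical restrictions for a "drink"-like class: role -> required features.
DRINK_RESTRICTIONS: Dict[str, Set[str]] = {
    "Agent": {"animate"},
    "Patient": {"liquid"},
}


def violations(
    restrictions: Dict[str, Set[str]],
    fillers: Dict[str, str],
    features: Dict[str, Set[str]],
) -> List[Tuple[str, str, List[str]]]:
    """List (role, filler, missing features) for every unmet restriction."""
    problems = []
    for role, required in restrictions.items():
        filler = fillers.get(role)
        if filler is None:
            continue  # role not expressed in this sentence
        missing = required - features.get(filler, set())
        if missing:
            problems.append((role, filler, sorted(missing)))
    return problems


print(violations(DRINK_RESTRICTIONS, {"Agent": "Sam", "Patient": "coffee"}, NOUN_FEATURES))
print(violations(DRINK_RESTRICTIONS, {"Agent": "Sam", "Patient": "car"}, NOUN_FEATURES))
```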

VerbNet applications
- Verbs typically convey the main idea of a sentence
- Maps the syntactic nature of PropBank predicate/argument parses into a richer semantic context
- Machine translation
- Document classification
- Word sense disambiguation
- Semantic role labeling
- 3D animation (parameterized action representations)
- Planning
- Automatic verb acquisition

Automatically Extending VerbNet
- Semantic information for several verbs at a time captured in VerbNet classes
- Can automatically add new candidate verbs to a class by testing against pre-defined class specifications
- Removes need for exhaustive manual encodings

Automatically Extending VerbNet
- Apply k-means clustering to some other resource:
  - PropBank
  - WordNet
  - FrameNet
- Observe the clusters to see if they correspond to any VerbNet class
- If so, do they contain any verbs not in the existing VerbNet class?
- Able to add 47 verbs
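The comparison step (after clustering has already been run) might look roughly like the sketch below. It assumes the clusters are given as sets of verbs; the overlap measure, the 0.5 threshold, the `candidate_verbs` function, and the toy class names are all assumptions made here, not the method or data from the cited work.

```python
from typing import Dict, Optional, Set, Tuple


def candidate_verbs(
    cluster: Set[str],
    classes: Dict[str, Set[str]],
    min_overlap: float = 0.5,
) -> Tuple[Optional[str], Set[str]]:
    """Match a cluster to the class covering most of it; return that class
    and the cluster verbs the class does not yet contain."""
    if not cluster:
        return None, set()
    best_name, best_score = None, 0.0
    for name, members in classes.items():
        score = len(cluster & members) / len(cluster)  # fraction of cluster in class
        if score > best_score:
            best_name, best_score = name, score
    if best_name is None or best_score < min_overlap:
        return None, set()  # cluster does not correspond to any class
    return best_name, cluster - classes[best_name]


# Toy classes with made-up names and memberships.
toy_classes = {
    "toy_fill_class": {"load", "cram", "scatter", "spray"},
    "toy_break_class": {"break", "shatter"},
}
print(candidate_verbs({"load", "cram", "scatter", "stuff"}, toy_classes))
```

Any verb returned this way is only a candidate; it would still need to be checked against the class's frames and roles before being added.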

Summary
- FrameNet, PropBank, VerbNet all annotate an NL corpus with semantic information:
  - FrameNet – defines a set of semantic frames annotating additional semantic information needed to capture meaning of a word
  - PropBank – annotates propositions and their arguments in a structured fashion
  - VerbNet – groups verbs into syntactically and semantically similar classes
- All are used when a syntactic parse is not enough
- Highly linked:
  - Unified Verb Index
  - SemLink

Questions
