The Current State of FrameNet CLFNG June 26, 2006 Fillmore.

Slides:



Advertisements
Similar presentations
Silent Launch Expectations This activity should be… Silent Independent Work until I say stop Be ready to share your answersExpectations This activity should.
Advertisements

REPORTING STUDENT LEARNING. GCO or General Curriculum Outcomes GCO’s are outcomes that all students are expected to meet. The General Curriculum Outcomes.
CODE/ CODE SWITCHING.
Jing-Shin Chang National Chi Nan University, IJCNLP-2013, Nagoya 2013/10/15 ACLCLP – Activities ( ) & Text Corpora.
CL Research ACL Pattern Dictionary of English Prepositions (PDEP) Ken Litkowski CL Research 9208 Gue Road Damascus,
The SALSA experience: semantic role annotation Katrin Erk University of Texas at Austin.
Introduction to phrases & clauses
Linking Dictionary and Corpus Adam Kilgarriff Lexicography MasterClass Ltd Lexical Computing Ltd University of Sussex UK.
Words Words Words! Helping ELL Students Develop Vocabulary.
Steven Schoonover.  What is VerbNet?  Levin Classification  In-depth look at VerbNet  Evolution of VerbNet  What is FrameNet?  Applications.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Semantic Frames: FrameNet. What is FrameNet? FrameNet is an ongoing project at the International Computer Science Institute located in Berkeley California.
Application Activity: Content Analysis The purpose of this PowerPoint presentation is to offer a more detailed assignment description than I offer in class.
Artificial Intelligence 2005/06 From Syntax to Semantics.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Meaning and Language Part 1.
FrameNet What, how and why C. J. Fillmore C.F.Baker.
Listening Task Purpose of the test:
Building the Valency Lexicon of Arabic Verbs Viktor Bielický Otakar Smrž LREC 2008, Marrakech, Morocco.
Albert Gatt LIN 3098 Corpus Linguistics. In this lecture Some more on corpora and grammar Construction Grammar as a theoretical framework Collostructional.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Peer Review Expectations Practice With Sentence Types.
Lexical patterning in academic talk
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
FrameNet The work. The product. The applications. CJFillmore - ICSI/Berkeley.
The Impact of Grammar Enhancement on Semantic Resources Induction Luca Dini Giampaolo Mazzini
SALSA The Saarbrücken Lexical Semantics Annotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado,
Interpreting Dictionary Definitions Dan Tecuci May 2002.
1 Read All About It! Helping Your Child Become an Independent Reader.
Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun.
ELD-Animals Descriptions 3 Objective: Students will be able to describe a different animal’s physical attributes and actions using the pronoun “it” and.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
WELCOME! 1. PLEASE GET A DOT – AND PLACE IT ON THE KNOWLEDGE OF ELP STANDARDS CONTINUUM 2. HAVE A SEAT AT THE APPROPRIATE TABLE GROUP 1 GROUP 2 3. TURN.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
MASC The Manually Annotated Sub- Corpus of American English Nancy Ide, Collin Baker, Christiane Fellbaum, Charles Fillmore, Rebecca Passonneau.
1 Word senses: a computational response Adam Kilgarriff Auckland 2012Kilgarriff: Word senses: a computational response.
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
II. LANGUAGE AND COMMUNICATION DOMAIN I can answer questions and talk with my teacher and friends. I can follow directions. Listening Comprehension Skill.
Spanish FrameNet Project Autonomous University of Barcelona Marc Ortega.
Artificial Intelligence: Natural Language
Linguistic Essentials
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
1 Word senses: a computational response Adam Kilgarriff.
Introduction Chapter 1 Foundations of statistical natural language processing.
Topic and the Representation of Discourse Content
Unit 8 Syntax. Syntax Syntax deals with rules for combining words into sentences, as well as with relationship between elements in one sentence Basic.
1 STO A Lexical Database of Danish for Language Technology Applications Anna Braasch Center for Sprogteknologi Copenhagen SPINN Seminar, October 27, 2001.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
SYNTAX.
Building Sub-Corpora Suitable for Extraction of Lexico-Syntactic Information Ondřej Bojar, Institute of Formal and Applied Linguistics, ÚFAL.
Definition Essay WIT Comp 2. Definition A definition essay is an essay that defines a word, term, or concept. In this essay you should not define a term.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
Benjamin Rifkin The College of New Jersey.  Background  Development  ACTFL and ILR  Modalities  Levels and sublevels.
Essay Questions. Two Main Purposes for essay questions 1. to assess students' understanding of and ability to think with subject matter content. 2. to.
Language and Cognition Colombo, June 2011 Day 2 Introduction to Linguistic Theory, Part 3.
SOAPSTONE & STRATEGIES Annotation Notes. SOAPS Speaker Occasion Audience Purpose Subject.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Welcome Opening Prayer. Content Objectives: 1.I will review the definition of texts and the teacher’s responsibility in choosing classroom materials.
#1 Make sense of problems and persevere in solving them How would you describe the problem in your own words? How would you describe what you are trying.
Running Records Feedback… What is a running record?
Use of Concordancers A corpus (plural corpora) – a large collection of texts, written or spoken, stored on a computer. A concordancer – a computer programme.
Automatic Writing Evaluation
Vocabulary Module 2 Activity 5.
An Introduction to the Government and Binding Theory
Mapping it Out! Practical Tools to Use Assessment Well
A CASUAL CONTRIBUTOR’S LEARNING AID FOR DITA STRUCTURING
The Meaning of Texts Inherent to the interpreting process is the goal of determining the meaning of a source language message and the equivalency of.
Presentation transcript:

The Current State of FrameNet CLFNG June 26, 2006 Fillmore

The Starting Challenge A standard dictionary often does not give us –the background to the meanings of words for specialist vocabulary nobody doubts the need of such background frame semantics proposes that ALL (content) words, including the ones that everybody knows, are best understood in terms of their background frames –a description of a word’s combinatory affordances (valences) this requires a description of the elements of each frame – the things that can be relevantly talked about within phrases structurally related to the word being described – and a classification of relevant syntactic forms and functions –examples illustrating its common uses and these should be determined from examples from a large corpus

The Starting Solution Describe a situation type (frame) that holds for one or more words in the language. –[Revenge] Find the frame elements for this frame. –[Avenger, Offender, Injury,Injured_Party, Punishment] Choose words that belong to the frame. –[avenge, retaliate] Create valence descriptions as sets of triples {frame elements, grammatical function, phrase type} for each word. –one possibility for avenge: {{Av,Subj,NP}, {Inj,Obj,NP}, {Pun, Dep, PPing[by]}}

Do this by: Having trained humans annotate “good” example sentences by assigning FE labels to the syntactic dependents of words being described. Having automatic processes assign syntactic form and function properties to the labeled phrases. Having automatic processes derive the valences from the annotations. Noticing whether the number of syntactic phrases found with the head word matches the number of frame elements known to belong to the frame, and figure out what to make of discrepancies when you find them.

The Product Frame Descriptions Lexical Entries specifying –frame membership –valences –access to examples Annotated Sentences Map of Frame-to-Frame Relations

Immediately Emerging Issues For our lexicographic purposes we wanted to select only simple clear examples. –First, that’s not possible. –Second, we need to be able to annotate full texts. There are cases where the major frame in a construction was evoked by a syntactic dependent, not a syntactic head –support verbs or prepositions with dependent frame-bearing nouns say a prayer, swear an oath, make a promise –transparent nouns “selected by” their complements (unitizers, etc.) stick of gum, kernel of rice, There are cases of mismatches between the number of syntactic constituents we find in a sentence and the number of frame elements that we might expect to see represented. For example: –null representation of some FEs (for which we have stories) –multiple or discontinuous representations of some FEs

More Recently Emerging Issues Not all FE are of equal status. We proposed distinguishing –Core, –Peripheral and –Extrathematic FEs, if only because we wanted to say SOMETHING about all phrases that were in syntactically appropriate positions relative to each target LU. –We bombed their village yesterday in retaliation. The set of frames is not a flat list. There are frame-to-frame relations, of various kinds, importantly –inheritance –composition –perspective

Issues that don’t go away Public Relations –Some of our lexicography and NLP friends object that since we work one frame at a time,rather than one word at a time, it’ll take forever before we can provide the research community with any serious kind of systematic polysemy analysis – hence we’re useless for WSD. Speed –We’ll always be slow, but a current grant providing collaboration with Adam Killgarriff and the use of his WordSketch tool for FrameNet “vanguarding” shows promise. (Vanguarding is what you do before you turn material over to the annotators: selecting correct senses of polysemous words, sorting and selecting samples that can together show the variety of syntactic patterns found with a given word, choosing the most relevant collocations, etc.. Collin will talk about this.)

Continuing Ambitions 1.A speeding up of the whole process (the WordSketch connection) 2.Collaborative alignment with WordNet and SUMO 3.Full sample FN analysis of some portions of the ANC 4.Some open-source way of enhancing the growth of the FrameNet lexicon 5.Success in funding 1-4.

Current FN Visitors Carlos Subirats Jan Scheffsczyk Thomas Schmidt Kyoko Hirose Fran Valverde