Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun.

Slides:



Advertisements
Similar presentations
Re-reading Annotate the text for the following: Draw a line where the Introduction ends Draw a line where the Conclusion begins Underline the line that.
Advertisements

CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
MEANT: semi-automatic metric for evaluating for MT evaluation via semantic frames an asembling of ACL11,IJCAI11,SSST11 Chi-kiu Lo & Dekai Wu Presented.
Automatic Identification of Cognates, False Friends, and Partial Cognates University of Ottawa, Canada University of Ottawa, Canada.
Max-Margin Matching for Semantic Role Labeling David Vickrey James Connor Daphne Koller Stanford University.
Maurice Hermans.  Ontologies  Ontology Mapping  Research Question  String Similarities  Winkler Extension  Proposed Extension  Evaluation  Results.
Word sense disambiguation and information retrieval Chapter 17 Jurafsky, D. & Martin J. H. SPEECH and LANGUAGE PROCESSING Jarmo Ritola -
Sentiment Analysis An Overview of Concepts and Selected Techniques.
Steven Schoonover.  What is VerbNet?  Levin Classification  In-depth look at VerbNet  Evolution of VerbNet  What is FrameNet?  Applications.
Semantic Frames: FrameNet. What is FrameNet? FrameNet is an ongoing project at the International Computer Science Institute located in Berkeley California.
Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.
Shallow semantic parsing: Making most of limited training data Katrin Erk Sebastian Pado Saarland University.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
The syntax of language How do we form sentences? Processing syntax. Language and the brain.
Information Extraction with Unlabeled Data Rayid Ghani Joint work with: Rosie Jones (CMU) Tom Mitchell (CMU & WhizBang! Labs) Ellen Riloff (University.
Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.
Language and Thought.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
Ronan Collobert Jason Weston Leon Bottou Michael Karlen Koray Kavukcouglu Pavel Kuksa.
Comments on “Building a resource for studying translation shifts” (by Lea Cyrus) Hans C. Boas University of Texas at Austin
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
PropBank, VerbNet & SemLink Edward Loper. PropBank 1M words of WSJ annotated with predicate- argument structures for verbs. –The location & type of each.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
The Current State of FrameNet CLFNG June 26, 2006 Fillmore.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Constructing Knowledge Graph from Unstructured Text Image Source: Kundan Kumar Siddhant Manocha.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
Annotation for Hindi PropBank. Outline Introduction to the project Basic linguistic concepts – Verb & Argument – Making information explicit – Null arguments.
Semantic Frames to Predict Stock Price Movement. Example.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Event-Centric Summary Generation Lucy Vanderwende, Michele Banko and Arul Menezes One Microsoft Way, WA, USA DUC 2004.
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
Ameeta Agrawal Nikolay Yakovets 01 Dec …Prime Minister Vladimir V. Putin, the country's paramount leader, cut short a trip to Siberia, returning.
Lecture 1 Lec. Maha Alwasidi. Branches of Linguistics There are two main branches: Theoretical linguistics and applied linguistics Theoretical linguistics.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Knowledge Representation
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
FILTERED RANKING FOR BOOTSTRAPPING IN EVENT EXTRACTION Shasha Liao Ralph York University.
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
Knowledge Structure Vijay Meena ( ) Gaurav Meena ( )
SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Final Review  Syntax  Semantics/Pragmatics  Sociolinguistics  FINAL will be part open book, and part closed book  Will use similar tasks as Problem.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Machine Learning Lecture 1: Intro + Decision Trees Moshe Koppel Slides adapted from Tom Mitchell and from Dan Roth.
Text Summarization using Lexical Chains. Summarization using Lexical Chains Summarization? What is Summarization? Advantages… Challenges…
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Automatically Labeled Data Generation for Large Scale Event Extraction
INAGO Project Automatic Knowledge Base Generation from Text for Interactive Question Answering.
Load Balancing: List Scheduling
Identifying functions and using function notation
Nov. 29, 2001 Ontology Based Recognition of Complex Objects --- Problems to be Solved Develop Base Object Recognition algorithms that identify non-decomposable.
Load Balancing: List Scheduling
Presentation transcript:

Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun By Efrat Hazani

Frame Semantics Frame Semantics is a theory that relates linguistic semantics to knowledge and experience The meaning of words depend on contexted experiences

For example: “I always eat cereal for breakfast” – breakfast : first meal of the day “Breakfast is served at any time” – breakfast: a particular combination of foods typically eaten at breakfast

What is a frame? Structured representation of concept Identifies the experience as a type, and gives structure and meaning to the relationships, objects and events within the experience A word represents a category of experience, and thus evokes a frame of semantic knowledge A word can evoke different frames

Frame Elements Frame evoking element (FEE) – the word (or lexical unit) which evokes the frame Frame elements (FEs) – words which have semantic roles in the frame Semantic roles describe the relations between a predicate and its arguments Semantic roles are independent from syntactic relations

For example: “Lee punched John in the eye” Agent CAUSE_HARM VictimBody_part “She was frying eggs on a camp stove” Agent Heating_instrument APPLY_HEAT Food

FrameNet A project building a lexical database of English that is both human- and machine-readable, based on annotating examples of how words are used in actual texts.

FrameNet – what is it good for? Provide a unique training dataset for semantic role labeling, used in applications such as: information extraction machine translation event recognition sentiment analysis

The Goal of the Project Create a larger collection of annotated sentences

The General Idea Input: ◦ A set L of sentences labeled with frames and roles (seed corpus) ◦ A set U of unlabeled sentences (expansion corpus) For every unlabeled sentence u : ◦ Find the most similar labeled sentence l ◦ Annotate u according to l ’s annotation. For every labeled sentence l: ◦ Return the k best newly annotated sentences according to l.

Measuring Similarity Sentences are represented by dependency graphs An alignment between two graphs: ◦ A partial injective function σ : M → N ∪ {} ◦ Domain M - a labeled graph ◦ Range N - an unlabeled graph ◦ x ∈ M is aligned to x’ ∈ N by σ, iff σ (x) = x’ Find a graph M with the best alignment to the unlabeled graph N

Measuring Similarity An example: “His back thudded against the wall” “The rest of his body thumped against the front of the cage” Impactor IMPACT Impactee

Calculating score for alignment σ : - the lexical similarity between x and σ (x) (a value between 0 and 1) - is the grammatical relation between x1 and x2 equal to the grammatical relation between σ (x1) and σ (x2) (0 or 1) Measuring Similarity

Calculating score for alignment σ : α - the relative weight of syntactic similarity compared to lexical similarity (optimal value ≈ 0.55) C - normalizing factor Measuring Similarity

Summary We want more sentences labeled with semantic roles Expand the set of annotated sentences by: ◦ For every unlabeled sentence, finding an optimal alignment with some labeled sentence ◦ Projecting annotation from the labeled sentence to the unlabeled sentence