Shallow semantic parsing: Making most of limited training data Katrin Erk Sebastian Pado Saarland University.

Slides:



Advertisements
Similar presentations
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Advertisements

Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
The SALSA experience: semantic role annotation Katrin Erk University of Texas at Austin.
Semantic Role Labeling Abdul-Lateef Yussiff
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
CS 4705 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised –Dictionary-based.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Regular Expressions and Automata Chapter 2. Regular Expressions Standard notation for characterizing text sequences Used in all kinds of text processing.
1 Statistical NLP: Lecture 13 Statistical Alignment and Machine Translation.
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
1 CS546: Machine Learning and Natural Language Preparation to the Term Project: - Dependency Parsing - Dependency Representation for Semantic Role Labeling.
Automatic Extraction of Opinion Propositions and their Holders Steven Bethard, Hong Yu, Ashley Thornton, Vasileios Hatzivassiloglou and Dan Jurafsky Department.
Evaluation in NLP Zdeněk Žabokrtský. Intro The goal of NLP evaluation is to measure one or more qualities of an algorithm or a system Definition of proper.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
The Impact of Grammar Enhancement on Semantic Resources Induction Luca Dini Giampaolo Mazzini
SALSA The Saarbrücken Lexical Semantics Annotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado,
Interpreting Dictionary Definitions Dan Tecuci May 2002.
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.
A Language Independent Method for Question Classification COLING 2004.
2014 EMNLP Xinxiong Chen, Zhiyuan Liu, Maosong Sun State Key Laboratory of Intelligent Technology and Systems Tsinghua National Laboratory for Information.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
AQUAINT Workshop – June 2003 Improved Semantic Role Parsing Kadri Hacioglu, Sameer Pradhan, Valerie Krugler, Steven Bethard, Ashley Thornton, Wayne Ward,
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
A Cascaded Finite-State Parser for German Michael Schiehlen Institut für Maschinelle Sprachverarbeitung Universität Stuttgart
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
A Cross-Lingual ILP Solution to Zero Anaphora Resolution Ryu Iida & Massimo Poesio (ACL-HLT 2011)
Transformation-Based Learning Advanced Statistical Methods in NLP Ling 572 March 1, 2012.
The interface between model-theoretic and corpus-based semantics
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
GermaNet-WS II A WordNet “Detour” to FrameNet Aljoscha Burchardt Katrin Erk Anette Frank* Saarland University, DFKI* Saarbrücken
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Hendrik J Groenewald Centre for Text Technology (CTexT™) Research Unit: Languages and Literature in the South African Context North-West University, Potchefstroom.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Supertagging CMSC Natural Language Processing January 31, 2006.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Towards the Use of Linguistic Information in Automatic MT Evaluation Metrics Projecte de Tesi Elisabet Comelles Directores Irene Castellon i Victoria Arranz.
Toward an Open Source Textual Entailment Platform (Excitement Project) Bernardo Magnini (on behalf of the Excitement consortium) 1 STS workshop, NYC March.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
8 December 1997Industry Day Applications of SuperTagging Raman Chandrasekar.
SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
LECTURE 5 Scanning. SYNTAX ANALYSIS We know from our previous lectures that the process of verifying the syntax of the program is performed in two stages:
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Semantic Role Labelling Using Chunk Sequences Ulrike Baldewein Katrin Erk Sebastian Padó Saarland University Saarbrücken Detlef Prescher Amsterdam University.
Approaches to Machine Translation
Two Discourse Driven Language Models for Semantics
Statistical NLP: Lecture 13
Statistical NLP: Lecture 9
Approaches to Machine Translation
Statistical NLP : Lecture 9 Word Sense Disambiguation
Presentation transcript:

Shallow semantic parsing: Making most of limited training data Katrin Erk Sebastian Pado Saarland University

Introduction Frame semantics: –“ Who does what to whom ” analysis: senses and roles –Cross-lingual appeal (Boas 2005) Prerequisite for use in NLP: Automatic, robust, accurate methods for analysis of free text Predominant machine learning paradigm: Supervised classification –Learn relation between features and classes from training corpus; guess classes in test corpus –Gildea and Jurafsky (2002) and many since

Frame-semantic analysis Step 1: Frame disambiguation –WSD-style classification of predicate in terms of frames Step 2: Role assignment –Classification of nodes in terms of role labels

Frame-semantic analysis Creeping in its shadow I reached a point whence I could look straight through the uncurtained window. (A. Conan Doyle, The Hound of the Baskervilles)

Problems of supervised learning setting Coverage: –lemmas may be missing –frames may be missing Languages other than English: –Training data may not be available –Can we take advantage of existing resources for English?

Today ’ s talk Shalmaneser: a system for automatic frame-semantic analysis Unknown sense detection: dealing with missing frames Annotation projection for cross-lingual data creation Summary

Shalmaneser: Automatic frame-semantic analysis Assignment of –senses (frames) to predicates –semantic roles Aim: easy use, for exploring applications of frame-semantic analysis –Input: plain text –Syntactic preprocessing integrated –Visualization with SALTO tool

Shalmaneser: Automatic frame-semantic analysis Semantic analysis as supervised learning tasks –Pre-trained classifiers available for English (FrameNet) and German (SALSA) Performance of English models: –Frame assignment: accuracy 0.93, baseline 0.89 High baseline because some senses are missing –Role assignment: Role recognition F-score 0.75 Role labeling Accuracy 0.78 –Not top-scoring, but okay. Focus on ease of use and on flexibility.

Shalmaneser: Flexibiliby Processing steps linked only by interface format: Salsa/Tiger XML (Erk & Pado 04) –Adding a module: just needs to speak Salsa/Tiger XML Model features specified in experiment file, can be changed easily Adding new parser by instantiating an interface class New language: only syntactic preprocessing changes

Today ’ s talk Shalmaneser: a system for automatic frame-semantic analysis Unknown sense detection: dealing with missing frames Annotation projection for cross-lingual data creation Summary

Detecting unknown word senses (frames) Conan Doyle, The Hound of the Baskervilles. Syntax: Collins parser Semantics: Shalmaneser Unseen senses  normal WSD approach will assign wrong sense Automatically detect senses we haven ’ t seen before?

Unknown sense detection as outlier detection Outlier detection: detect occurrences of previously unseen events (overview articles: Markou & Singh 2003a,b) –training data: positive cases only. Derive model of “ normal ” cases –test data: positive and negative cases training items test items

A Nearest Neighbor-based outlier detection method Tax and Duin (2000): simple method, easy to implement Given test point and its nearest training neighbor : Is closer to than ‘ s nearest neighbor? –Test point x, nearest training neighbor t, nearest neighbor t ’ of t, (Euclidean) distances d: Accept x if p NN (x) is below a given threshold yes no

Unknown sense detection: Results Evaluation (Erk NAACL 2006): –Use FrameNet data –Treat one sense of a lemma as pseudo-unknown (iterate over all senses) Results (assignment of label “ unknown ” ): –Tax&Duin ’ s method, one lemma at a time: Prec 0.70, Rec 0.35 –More data: all data for a frame, not just that of one lemma Prec 0.77, Rec 0.82

Results What features are important? 1. Best: just context words 2. Almost as good: features of 1, 3, 4 together 3. Just the subcategorization frame: high precision, low recall 4. Subcat frame, plus headwords of arguments: inbetween 3 and 2, but obviously too sparse

Unknown sense detection as outlier detection: The bigger picture Why assume missing word senses in the sense inventory and in the training data? –Growing, unfinished resources, like FrameNet –Domain-specific senses may be missing from general-purpose sense inventories Outlier detection method presented here: applicable to any resource that groups words into senses, e.g. WordNet Using outlier detection to detect occurrences of nonliteral use?

Today ’ s talk Shalmaneser: a system for automatic frame-semantic analysis Unknown sense detection: dealing with missing frames Annotation projection for cross-lingual data creation Summary

Motivation Definitions, Role set: Language-independent Predicate classes: Language-specific Annotated Sentences: Specific, too

Agenda For new language, induce: 1.Frame-semantic predicate classification 2.Corpus with frame-semantic annotation Method: Annotation projection in parallel corpus –Word alignments approximate semantic equivalence Corresponding word pairs (predicates) Corresponding constituents Evaluation: Study on EUROPARL corpus (De/En/Fr)

An idealised example Peter comes homePierre revient à la maison Arriving

Frame-semantic classes Idea: For each frame, construct list of predicates in new language occurring aligned to predicates of this frame => FEEs for new languages Main obstacle: Translational divergence –Corresponding predicates don ’ t evoke same frame Address by shallow, language-independent filtering (Pado and Lapata AAAI 2005) –Important: Distributional patterns Evaluation: Can obtain predicate classes for German and French with precision of 65-70% –Main remaining problem: English polysemy not covered by FrameNet

Role annotations (I) Idea: For each sentence, transfer semantic role annotation onto translated sentence Obstacle 1: Frame divergence –Role projection only sensible if frames match –Good news: In En-De test corpus (Pado and Lapata HLT/EMNLP 2005), 70% of frames match Obstacle 2: Role divergence –Even if frames are parallel, do roles match? –Good news: In En-De test corpus, matching frames show 90% role matches Remaining cases mostly elisions (e.g. passive)

Role annotations (II) Obstacle 3: Errors/omissions in automatically induced word alignments –Can be overcome by using bracketing information (chunks / constituents) –Induction of cross-lingual correspondences as graph optimisation problem (Pado and Lapata ACL 2006) Evaluation (all exact match F-score): –Word-based projection: 0.50 –Constituent-based: 0.75 –Upper limit: 0.85 Remaining errors mostly parsing-related

Summary Frame-semantic analysis potentially interesting for many NLP applications –Goal of Shalmaneser: flexible and easy-to-use system Address incompleteness in resources –Unknown sense detection as outlier detection Porting Frame Semantics to new languages –Parallel corpora for automatic annotation projection