A transformation-based approach to argument labeling Derrick Higgins Educational Testing Service

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Chunking: Shallow Parsing Eric Atwell, Language Research Group.
Advertisements

FMZ Elaborazione del linguaggio naturale Fabio Massimo Zanzotto.
SEMANTIC ROLE LABELING BY TAGGING SYNTACTIC CHUNKS
Progress update Lin Ziheng. System overview 2 Components – Connective classifier Features from Pitler and Nenkova (2009): – Connective: because – Self.
Search-Based Structured Prediction
CONSTRAINED CONDITIONAL MODELS TUTORIAL Jingyu Chen, Xiao Cheng.
Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.
Sequence Classification: Chunking Shallow Processing Techniques for NLP Ling570 November 28, 2011.
Part-Of-Speech Tagging and Chunking using CRF & TBL
Semantic Role Labeling Abdul-Lateef Yussiff
Probabilistic Detection of Context-Sensitive Spelling Errors Johnny Bigert Royal Institute of Technology, Sweden
A Joint Model For Semantic Role Labeling Aria Haghighi, Kristina Toutanova, Christopher D. Manning Computer Science Department Stanford University.
Chapter 4: Linear Models for Classification
Shallow Parsing CS 4705 Julia Hirschberg 1. Shallow or Partial Parsing Sometimes we don’t need a complete parse tree –Information extraction –Question.
Rate-Distortion Optimal Skeleton-Based Shape Coding Haohong Wang, Aggelos K. Katsaggelos, and Thrasyvoulos N. Pappas Image Processing, Proceedings.
Jun Zhu Dept. of Comp. Sci. & Tech., Tsinghua University This work was done when I was a visiting researcher at CMU. Joint.
Stemming, tagging and chunking Text analysis short of parsing.
SRL using complete syntactic analysis Mihai Surdeanu and Jordi Turmo TALP Research Center Universitat Politècnica de Catalunya.
DS-to-PS conversion Fei Xia University of Washington July 29,
Ch 10 Part-of-Speech Tagging Edited from: L. Venkata Subramaniam February 28, 2002.
Two-Phase Semantic Role Labeling based on Support Vector Machines Kyung-Mi Park Young-Sook Hwang Hae-Chang Rim NLP Lab. Korea Univ.
Page 1 Generalized Inference with Multiple Semantic Role Labeling Systems Peter Koomen, Vasin Punyakanok, Dan Roth, (Scott) Wen-tau Yih Department of Computer.
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
LING 388 Language and Computers Take-Home Final Examination 12/9/03 Sandiway FONG.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
1 I256: Applied Natural Language Processing Marti Hearst Sept 25, 2006.
A Memory-Based Approach to Semantic Role Labeling Beata Kouchnir Tübingen University 05/07/04.
CS 4705 Lecture 11 Feature Structures and Unification Parsing.
Semantic Role Labeling using Maximum Entropy Model Joon-Ho Lim NLP Lab. Korea Univ.
1 Sequence Labeling Raymond J. Mooney University of Texas at Austin.
Feature Selection for Automatic Taxonomy Induction The Features Input: Two terms Output: A numeric score, or. Lexical-Syntactic Patterns Co-occurrence.
English Syntax. Arguments against traditional prescriptive grammar teaching L1 grammar is acquired without explicit instruction or corrective feedback.
Robert Hass CIS 630 April 14, 2010 NP NP↓ Super NP tagging JJ ↓
Overview of Machine Learning for NLP Tasks: part II Named Entity Tagging: A Phrase-Level NLP Task.
A Survey of NLP Toolkits Jing Jiang Mar 8, /08/20072 Outline WordNet Statistics-based phrases POS taggers Parsers Chunkers (syntax-based phrases)
Spring /22/071 Beyond PCFGs Chris Brew Ohio State University.
Partial Parsing CSCI-GA.2590 – Lecture 5A Ralph Grishman NYU.
IV. SYNTAX. 1.1 What is syntax? Syntax is the study of how sentences are structured, or in other words, it tries to state what words can be combined with.
Ling 570 Day 17: Named Entity Recognition Chunking.
CSA2050: Introduction to Computational Linguistics Part of Speech (POS) Tagging II Transformation Based Tagging Brill (1995)
1 CS546: Machine Learning and Natural Language Latent-Variable Models for Structured Prediction Problems: Syntactic Parsing Slides / Figures from Slav.
AQUAINT Workshop – June 2003 Improved Semantic Role Parsing Kadri Hacioglu, Sameer Pradhan, Valerie Krugler, Steven Bethard, Ashley Thornton, Wayne Ward,
CS774. Markov Random Field : Theory and Application Lecture 19 Kyomin Jung KAIST Nov
A Cascaded Finite-State Parser for German Michael Schiehlen Institut für Maschinelle Sprachverarbeitung Universität Stuttgart
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
Transformation-Based Learning Advanced Statistical Methods in NLP Ling 572 March 1, 2012.
Prototype-Driven Learning for Sequence Models Aria Haghighi and Dan Klein University of California Berkeley Slides prepared by Andrew Carlson for the Semi-
Deep Learning for Efficient Discriminative Parsing Niranjan Balasubramanian September 2 nd, 2015 Slides based on Ronan Collobert’s Paper and video from.
Fact Extraction Ontology Ontological- Semantic Analysis Text Meaning Representation (TMR) Fact Repository (FR) Text Sources Lexicons Grammars Static Knowledge.
Supertagging CMSC Natural Language Processing January 31, 2006.
February 2007CSA3050: Tagging III and Chunking 1 CSA2050: Natural Language Processing Tagging 3 and Chunking Transformation Based Tagging Chunking.
Intra-Chunk Dependency Annotation : Expanding Hindi Inter-Chunk Annotated Treebank Prudhvi Kosaraju, Bharat Ram Ambati, Samar Husain Dipti Misra Sharma,
Shallow Parsing for South Asian Languages -Himanshu Agrawal.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
LING/C SC/PSYC 438/538 Lecture 20 Sandiway Fong 1.
Logistic Regression William Cohen.
1 Principles & Parameters Approach in Linguistics II Bibhuti Bhusan Mahapatra.
Modified from Diane Litman's version of Steve Bird's notes 1 Rule-Based Tagger The Linguistic Complaint –Where is the linguistic knowledge of a tagger?
CSA2050: Introduction to Computational Linguistics Part of Speech (POS) Tagging II Transformation Based Tagging Brill (1995)
A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning Ronan Collobert Jason Weston Presented by Jie Peng.
LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 17 th.
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
Semantic Role Labelling Using Chunk Sequences Ulrike Baldewein Katrin Erk Sebastian Padó Saarland University Saarbrücken Detlef Prescher Amsterdam University.
Embedded Clauses in TAG
Tracking parameter optimization
Optimal UAV Flight Path Selection
CSCI 5832 Natural Language Processing
Word embeddings (continued)
Artificial Intelligence 2004 Speech & Natural Language Processing
“Traditional” image segmentation
Presentation transcript:

A transformation-based approach to argument labeling Derrick Higgins Educational Testing Service

General approach Word-by-word SRL Modified IOB scheme for indicating role boundaries Start from simplistic baseline labeling TBL rules re-label words based on contextual features

Data representation Modified IOB Words Argument labeling Input formatIOB2Modified IOB They(A0*A0)B-A0 left(V*V)B-V their(A1*B-A1 jobs*A1)I-A1I on(AM-TMP*B-AM-TMP Friday*AM-TMP)I-AM-TMPI.*OO

Features Fairly standard set; role label of word depends on: –Target verb –Target verb POS –Target verb passive –Word –POS –Chunk tag –NE tag –L/R of target word –Clause embedding –PP feature –PP head –NP head –Path Values for current word and surrounding words No use made of PB frames

Transformational rules 130 total Minimum number of applications = 3 (Mostly) local rules –Local syntactic features + [path, target V, NP head, etc] Rules using context –Smoothing rules –Long-distance rules

Results Overtraining is an issue Core arguments easier than modifiers

Results

Error analysis Pros/cons of TBL –Pro: easy conditioning on many factors –Con: Little control over trade-off between rule frequency and rule type in selecting rules –Con: Predictive features which are correlated with one another may not be used jointly –Con: No real probabilistic framework Problems with low-freq. roles

Error analysis Dependency on length

Potential improvements Phrase-by-phrase labeling Using ‘official’ baseline Rules in ordered sets? Global optimization Additional features