1 A Pipeline Model for Bottom-Up Dependency Parsing Ming-Wei Chang, Quang Do, Dan Roth Computer Science Department University of Illinois, Urbana-Champaign.

Slides:



Advertisements
Similar presentations
Differential (De)Serialization for Optimized SOAP Performance Michael J. Lewis Grid Computing Research Laboratory Department of Computer Science Binghamton.
Advertisements

Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao Wei Fan Jing JiangJiawei Han University of Illinois at Urbana-Champaign IBM T. J.
Joint Parsing and Alignment with Weakly Synchronized Grammars David Burkett, John Blitzer, & Dan Klein TexPoint fonts used in EMF. Read the TexPoint manual.
OUTPUT – INPUT STABILITY Daniel Liberzon Coordinated Science Laboratory and Dept. of Electrical & Computer Eng., Univ. of Illinois at Urbana-Champaign.
Key Stage 3 National Strategy Handling data: session 2.
A question from last class: construct the predictive parsing table for this grammar: S->i E t S e S | i E t S | a E -> B.
Bottom up Parsing Bottom up parsing trys to transform the input string into the start symbol. Moves through a sequence of sentential forms (sequence of.
Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.
Sorry, I didn’t catch that! – an investigation of non-understandings and recovery strategies Dan Bohuswww.cs.cmu.edu/~dbohus Alexander I. Rudnickywww.cs.cmu.edu/~air.
Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.
1 University of York Department of Health Sciences Computer Skills Review By Ian Cole Lecturer in C&IT.
Page 1 Generalized Inference with Multiple Semantic Role Labeling Systems Peter Koomen, Vasin Punyakanok, Dan Roth, (Scott) Wen-tau Yih Department of Computer.
Link Grammar ( by Davy Temperley, Daniel Sleator & John Lafferty ) Syed Toufeeq Ahmed ASU.
The classification problem (Recap from LING570) LING 572 Fei Xia, Dan Jinguji Week 1: 1/10/08 1.
CSC3315 (Spring 2009)1 CSC 3315 Lexical and Syntax Analysis Hamid Harroud School of Science and Engineering, Akhawayn University
Inquiry and IBL pedagogies What do inquiry tasks look like in science? Tool IH-3: Comparing approaches.
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Advanced Data Mining May 4, 2010 Growing Parallel Paths for Entity-Page.
Aspect Guided Text Categorization with Unobserved Labels Dan Roth, Yuancheng Tu University of Illinois at Urbana-Champaign.
WIEN Stereo-based Image and Video Analysis for Multimedia Applications Application: “Computer-generated Stereoscopic Paintings“ M. Gelautz, E. Stavrakis,
Computer Science Department UoC. Outline Project Teams Key Points description Suggested Task Delegation Files Needed & previous work.
Computer Science: A Structured Programming Approach Using C1 5-3 Multiway Selection In addition to two-way selection, most programming languages provide.
Dependency Parser for Swedish Project for EDA171 by Jonas Pålsson Marcus Stamborg.
Parallel XML Parsing Using Meta-DFAs Yinfei Pan 1, Ying Zhang 1, Kenneth Chiu 1, Wei Lu 2 1 State University of New York (SUNY) Binghamton 2 Indiana University.
2007 Science of Design (SoD) PI Meeting – Project Nuggets NSF SoD Award No: NSF SoD-HCER Project Title: Learning Based Programming Investigator.
A Tool for Constructing Syntax-Directed Editors Yung-Shen Chang and Nai-Wei Lin Department of Computer Science and Information Engineering National Chung-Cheng.
Multilingual prototype GCMD Portal JAXA/EORC Kengo Aizawa KEIO UNIVERSITY Hiromichi Fukui Kazuyoshi Kunisawa March 8, 2005.
Direct Instruction Stages 1.REVIEW ( Motivation, Hook, Activating Prior Learning) -Review homework -Review relevant previous learning -Review prerequisite.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
Forwarding Programming in Protocol- Oblivious Instruction Set Author : Jingzhou Yu, Xiaozhong Wang, Jian Song, Yuanming Zheng, Haoyu Song Conference: 2014.
Supervisor Creates Performance Plan for Employee via EmpowHR.
Dependency Parsing Niranjan Balasubramanian March 24 th 2016 Credits: Many slides from: Michael Collins, Mausam, Chris Manning, COLNG 2014 Dependency Parsing.
CS 614: Theory and Construction of Compilers Lecture 4 Fall 2002 Department of Computer Science University of Alabama Joel Jones.
The University of Illinois System in the CoNLL-2013 Shared Task Alla RozovskayaKai-Wei ChangMark SammonsDan Roth Cognitive Computation Group University.
Lecture 7 Syntax Analysis (5) Operator-Precedence Parsing
Predictors of Memory in Healthy Aging: Polyunsaturated Fatty Acid Balance and Fornix White Matter Integrity K. Zamroziewicz Marta 1, 2, 3 ;J. Paul Erick.
Predictors of Memory in Healthy Aging: Polyunsaturated Fatty Acid Balance and Fornix White Matter Integrity K. Zamroziewicz Marta 1, 2, 3 ;J. Paul Erick.
PRESENTED BY: PEAR A BHUIYAN
What’s next for search engine
Programming Languages Translator
Predictors of Memory in Healthy Aging: Polyunsaturated Fatty Acid Balance and Fornix White Matter Integrity Marta K. Zamroziewicz 1, 2, 3 ;Erick J. Paul.
Predictors of Memory in Healthy Aging: Polyunsaturated Fatty Acid Balance and Fornix White Matter Integrity K. Zamroziewicz Marta 1, 2, 3 ;J. Paul Erick.
On Dataless Hierarchical Text Classification
Predictors of Memory in Healthy Aging: Polyunsaturated Fatty Acid Balance and Fornix White Matter Integrity Marta K. Zamroziewicz 1, 2, 3 ;Erick J. Paul.
Natural Language Processing (NLP)
Predictors of Memory in Healthy Aging: Polyunsaturated Fatty Acid Balance and Fornix White Matter Integrity Marta K. Zamroziewicz 1, 2, 3 ;Erick J. Paul.
SEMINAR 1. Title : Physics in contacts between probes and quantum world 2. Speaker : Myung-Ho Bae (Korea Research Institute of Standards and Science)
Improving a Pipeline Architecture for Shallow Discourse Parsing
“RDM Service Categories” by OCLC Research, from The Realities of Research Data Management. Part One: A Tour of the Research Data Management (RDM) Service.
Jiawei Han Department of Computer Science
Next-Generation Search Engines -Perspective and challenges
Principles of GIS Fundamental database concepts Shaowen Wang
Introduction to TIMAN: Text Information Managemetn & Analysis
Topics discussed in this section:
Jaya Krishna, M.Tech, Assistant Professor
Word Embeddings with Limited Memory
Real-world SIP Interoperability: Still an Elusive Quest
Subject: Language Processor
Topics discussed in this section:
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
Knowledge Transfer via Multiple Model Local Structure Mapping
Parsing Bottom-Up.
Natural Language Processing (NLP)
Fall 2018, COMP 562 Poster Session
Modified at -
LANGUAGE EDUCATION.
Dan Roth Department of Computer Science
The Chinese University of Hong Kong
Visual Grounding.
Natural Language Processing (NLP)
Presentation transcript:

1 A Pipeline Model for Bottom-Up Dependency Parsing Ming-Wei Chang, Quang Do, Dan Roth Computer Science Department University of Illinois, Urbana-Champaign Tenth Conference on Natural Language Learning, Shared Task New York, USA, 2006

2 Dependency Parsing Approach: Summary Modified Shift-Reduced parser. Actions are selected via a classifier + Extended action set + Look ahead search Control Policy: Left to right, with step back Dependency Types: A separate multiclass classifier Multilingual: Convert non-projective languages to projective [Nirve and Nilsson, 2005]

3 Motivation Shift-Reduced Parsing as a pipeline model: A classifier is used to determine which action to take The decision at each stage depends on previous decisions. + Making a decision can rely on information acquired in previous stages - Making a decision can rely on incorrect information acquired in previous stages. Viewed this way, we want to: 1. Reduce the number of decisions 2. Make local decisions more robust

4 Parsing from left to right; considering pair of (currently) consecutive words (a,b) (with a<b) For the pair (a, b), to become the child of a, b must be a complete subtree. Standard action set: Left, Right, Shift [Yamada and Matsumoto, 03], Left: a is the parent of b Right: b is the parent of a Shift: the action is not Left or Right. + We split Shift into Shift, WaitLeft, WaitRight WaitLeft: a is the parent of b, but b is the parent of other nodes. Action is deferred. WaitRight (?): not needed ! Control policy: Step Back Provably, allows parsing in one pass over the sentence [ACL06] Reduces the number of decisions Parsing Algorithm ab c WL ab L R ab a b x S bacdab c d

5 A Pipeline Model with Look Ahead Search Pipilining decisions may result in error accumulation: The correct dependencies  If the algorithm decides w  x before x  y and x  z, we cannot recover the correct parent for y and z. Correct early decisions are crucial A look ahead search algorithm takes into account future predicted actions Local decisions are more robust wxyz wyz x

6 A Pipeline Model with Look Ahead – cont’ The search algorithm performs a search of length depth. Additive scoring is used to score the sequence The first action in this sequence is performed. depth=1 a0a0 a1a1 keep this action depth=2 a0a0 a1a1 a2a2 keep this action depth=3 a0a0 a1a1 a2a2 a3a3 keep this action

7 Experiments (for Swedish) * The effect of the new action. The effect of look ahead search. * For other languages, please refer to our paper.

8 Analysis WaitLeft and the Look Ahead Search improve the parsing results. Results can be improved by: Selecting features and parameters more carefully Currently we use exactly the same set of features and the same parameters for all languages. Using the FEAT column properly The result for languages with FEAT column is generally worse than the languages without FEAT column

9 Thank you !

10 It is not tractable to find the global optimal predicted sequence in the pipeline model with the large depth. In the pipeline framework, the feature vector of current decision depends on every previous predictions. The FEAT column Average difference between our system with the best systems With Feats 4.5%, Without Feats 3.4%

11 Labeling the Dependency Type A post-task after predicting the head for the tokens in the sentence. This is a multi-class classification. Consider every edge of the tree Classify the edge into several classes The parents of the tokens which were labeled in the first phase will be used as the features.

12 a b x ab abc ab L R WL

13

14

15

16 Experiments We show the effect of the new action (on Swedish).

17 Experiments The effect of look ahead search (on Swedish).