A Lightweight and High Performance Monolingual Word Aligner Xuchen Yao, Benjamin Van Durme, (Johns Hopkins) Chris Callison-Burch and Peter Clark (UPenn)

Slides:



Advertisements
Similar presentations
Joint Parsing and Alignment with Weakly Synchronized Grammars David Burkett, John Blitzer, & Dan Klein TexPoint fonts used in EMF. Read the TexPoint manual.
Advertisements

COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Large-Scale Entity-Based Online Social Network Profile Linkage.
1 Extended Gloss Overlaps as a Measure of Semantic Relatedness Satanjeev Banerjee Ted Pedersen Carnegie Mellon University University of Minnesota Duluth.
Baselines for Recognizing Textual Entailment Ling 541 Final Project Terrence Szymanski.
Deciding entailment and contradiction with stochastic and edit distance-based alignment Marie-Catherine de Marneffe, Sebastian Pado, Bill MacCartney, Anna.
Automatic Identification of Cognates, False Friends, and Partial Cognates University of Ottawa, Canada University of Ottawa, Canada.
Maurice Hermans.  Ontologies  Ontology Mapping  Research Question  String Similarities  Winkler Extension  Proposed Extension  Evaluation  Results.
Measures of Text Similarity
LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.
Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.
CS4705 Natural Language Processing.  Regular Expressions  Finite State Automata ◦ Determinism v. non-determinism ◦ (Weighted) Finite State Transducers.
Confidence Estimation for Machine Translation J. Blatz et.al, Coling 04 SSLI MTRG 11/17/2004 Takahiro Shinozaki.
1 Quasi-Synchronous Grammars  Based on key observations in MT: translated sentences often have some isomorphic syntactic structure, but not usually in.
Midterm Review CS4705 Natural Language Processing.
A Phrase-Based Model of Alignment for Natural Language Inference Bill MacCartney, Michel Galley, and Christopher D. Manning Stanford University 26 October.
PZ02A - Language translation
Inducing Information Extraction Systems for New Languages via Cross-Language Projection Ellen Riloff University of Utah Charles Schafer, David Yarowksy.
Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.
Slide 1 EE3J2 Data Mining EE3J2 Data Mining - revision Martin Russell.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
NATURAL LANGUAGE TOOLKIT(NLTK) April Corbet. Overview 1. What is NLTK? 2. NLTK Basic Functionalities 3. Part of Speech Tagging 4. Chunking and Trees 5.
An Information Theoretic Approach to Bilingual Word Clustering Manaal Faruqui & Chris Dyer Language Technologies Institute SCS, CMU.
Named Entity Recognition and the Stanford NER Software Jenny Rose Finkel Stanford University March 9, 2007.
Large-Scale Cost-sensitive Online Social Network Profile Linkage.
STRUCTURED PERCEPTRON Alice Lai and Shi Zhi. Presentation Outline Introduction to Structured Perceptron ILP-CRF Model Averaged Perceptron Latent Variable.
Feature Selection for Automatic Taxonomy Induction The Features Input: Two terms Output: A numeric score, or. Lexical-Syntactic Patterns Co-occurrence.
Natural Language Processing Lab Northeastern University, China Feiliang Ren EBMT Based on Finite Automata State Transfer Generation Feiliang Ren.
Machine translation Context-based approach Lucia Otoyo.
ICS611 Introduction to Compilers Set 1. What is a Compiler? A compiler is software (a program) that translates a high-level programming language to machine.
1 The Hidden Vector State Language Model Vidura Senevitratne, Steve Young Cambridge University Engineering Department.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
Graphical models for part of speech tagging
An Integrated Approach for Arabic-English Named Entity Translation Hany Hassan IBM Cairo Technology Development Center Jeffrey Sorensen IBM T.J. Watson.
Deriving Paraphrases for Highly Inflected Languages from Comparable Documents Kfir Bar, Nachum Dershowitz Tel Aviv University, Israel.
2010 Failures in Czech-English Phrase-Based MT 2010 Failures in Czech-English Phrase-Based MT Full text, acknowledgement and the list of references in.
Paper Review by Utsav Sinha August, 2015 Part of assignment in CS 671: Natural Language Processing, IIT Kanpur.
1 Co-Training for Cross-Lingual Sentiment Classification Xiaojun Wan ( 萬小軍 ) Associate Professor, Peking University ACL 2009.
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
1 Intelligente Analyse- und Informationssysteme Frank Reichartz, Hannes Korte & Gerhard Paass Fraunhofer IAIS, Sankt Augustin, Germany Dependency Tree.
Prototype-Driven Learning for Sequence Models Aria Haghighi and Dan Klein University of California Berkeley Slides prepared by Andrew Carlson for the Semi-
Cluster-specific Named Entity Transliteration Fei Huang HLT/EMNLP 2005.
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
Prior Knowledge Driven Domain Adaptation Gourab Kundu, Ming-wei Chang, and Dan Roth Hyphenated compounds are tagged as NN. Example: H-ras Digit letter.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Supertagging CMSC Natural Language Processing January 31, 2006.
Subjectivity Recognition on Word Senses via Semi-supervised Mincuts Fangzhong Su and Katja Markert School of Computing, University of Leeds Human Language.
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
Large Vocabulary Data Driven MT: New Developments in the CMU SMT System Stephan Vogel, Alex Waibel Work done in collaboration with: Ying Zhang, Alicia.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
A Syntax-Driven Bracketing Model for Phrase-Based Translation Deyi Xiong, et al. ACL 2009.
Hui Fang (ACL 2008) presentation 2009/02/04 Rick Liu.
Discriminative Word Alignment with Conditional Random Fields Phil Blunsom & Trevor Cohn [ACL2006] Eiji ARAMAKI.
Learning to Generate Complex Morphology for Machine Translation Einat Minkov †, Kristina Toutanova* and Hisami Suzuki* *Microsoft Research † Carnegie Mellon.
A German Corpus for Similarity Detection
Inferences on Two Samples Summary
Zhifei Li and Sanjeev Khudanpur Johns Hopkins University
CS4705 Natural Language Processing
WordNet WordNet, WSD.
Ariel Schwartz, Anna Divoli, and Marti Hearst
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
CS4705 Natural Language Processing
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
Presentation transcript:

A Lightweight and High Performance Monolingual Word Aligner Xuchen Yao, Benjamin Van Durme, (Johns Hopkins) Chris Callison-Burch and Peter Clark (UPenn) (Vulcan)

ACL 2013, Sofia2 monolingual word alignment Aligning one sentence pair from RTE2 Premise: Linda Johnson, who lives with her husband, Charles, and two cats in..., said Katrina has... Hypothesis: Linda Johnson is married to Charles alignment contributed by Brockett (2007)

ACL 2013, Sofia3 monolingual vs. bilingual aligment less training data (labeled or unlabeled), but more lexical resources semantic relatedness: cued by distributional word similaries the same grammar shared by source/target sentences

ACL 2013, Sofia4 monolingual vs. bilingual aligment less training data (labeled or unlabeled), but more lexical resources semantic relatedness: cued by distributional word similaries the same grammar shared by source/target sentences

ACL 2013, Sofia5 monolingual vs. bilingual aligment less training data (labeled or unlabeled), but more lexical resources semantic relatedness: cued by distributional word similaries the same grammar shared by source/target sentences

ACL 2013, Sofia6 a discriminative model first proposed by Blunsom and Cohn (2006): s, t: source (observation), target sentence a: target word indices (0 to target length), state 0 is NULL state for deletion. f(): feature functions

ACL 2013, Sofia7 a discriminative model first proposed by Blunsom and Cohn (2006): s, t: source (observation), target sentence a: target word indices (0 to target length), state 0 is NULL state for deletion. f(): feature functions

ACL 2013, Sofia8 a discriminative model first proposed by Blunsom and Cohn (2006): s, t: source (observation), target sentence a: target word indices (0 to target length), state 0 is NULL state for deletion. f(): feature functions

ACL 2013, Sofia9

ACL 2013, Sofia10 desired Viterbi decoding path

ACL 2013, Sofia11 a discriminative model first proposed by Blunsom and Cohn (2006): s, t: source (observation), target sentence a: target word indices (0 to target length), state 0 is NULL state for deletion. f(): feature functions

ACL 2013, Sofia12 features string similarity –Jaro Winkler, Dice Sorensen, Hamming, Jaccard, Levenshtein, NGram overlapping and common prefix matching POS tags matching WordNet –hypernym, hyponym, synonym, derived form, entailing, causing, members of, have member, substances of, have substances, parts of, have part

ACL 2013, Sofia13 features string similarity –Jaro Winkler, Dice Sorensen, Hamming, Jaccard, Levenshtein, NGram overlapping and common prefix matching POS tags matching WordNet –hypernym, hyponym, synonym, derived form, entailing, causing, members of, have member, substances of, have substances, parts of, have part

ACL 2013, Sofia14 features string similarity –Jaro Winkler, Dice Sorensen, Hamming, Jaccard, Levenshtein, NGram overlapping and common prefix matching POS tags matching WordNet –hypernym, hyponym, synonym, derived form, entailing, causing, members of, have member, substances of, have substances, parts of, have part

ACL 2013, Sofia15 features positional –offset difference between src/tgt word context –whether neighboring words are similar –helps to align functional words distortion (Markov feature) –how far apart are two aligned target words

ACL 2013, Sofia16 features positional –offset difference between src/tgt word context –whether neighboring words are similar –helps to align functional words distortion (Markov feature) –how far apart are two aligned target words

ACL 2013, Sofia17 features positional –offset difference between src/tgt word context –whether neighboring words are similar –helps to align functional words distortion (Markov feature) –how far apart are two aligned target words

ACL 2013, Sofia18 Implementation: jacana-align source code at lightweight: only used a POS tagger and WordNet written in Scala, optimize with LBFGS platform independent, compiles to a.jar file, fully interoperable with Java high performance? -> evaluation

ACL 2013, Sofia19 Baselines GIZA++ Tree Edit Distance (with stem/wordnet matching) MANLI –MacCartney, B.; Galley, M. & Manning, C. D., A Phrase-Based Alignment Model for Natural Language Inference, EMNLP 2008 MANLI-constraint (decoding with ILP) –Thadani, K. & McKeown, K. Optimal and syntactically-informed decoding for monolingual phrase-based alignment. ACL 2011

ACL 2013, Sofia20 Baselines GIZA++ Tree Edit Distance (with stem/wordnet matching) MANLI –MacCartney, B.; Galley, M. & Manning, C. D., A Phrase-Based Alignment Model for Natural Language Inference, EMNLP 2008 MANLI-constraint (decoding with ILP) –Thadani, K. & McKeown, K. Optimal and syntactically-informed decoding for monolingual phrase-based alignment. ACL 2011

ACL 2013, Sofia21 Baselines GIZA++ Tree Edit Distance (with stem/wordnet matching) MANLI –MacCartney, B.; Galley, M. & Manning, C. D., A Phrase-Based Alignment Model for Natural Language Inference, EMNLP 2008 MANLI-constraint (decoding with ILP) –Thadani, K. & McKeown, K. Optimal and syntactically-informed decoding for monolingual phrase-based alignment. ACL 2011

ACL 2013, Sofia22 Baselines GIZA++ Tree Edit Distance (with stem/wordnet matching) MANLI –MacCartney, B.; Galley, M. & Manning, C. D., A Phrase-Based Alignment Model for Natural Language Inference, EMNLP 2008 MANLI-constraint (decoding with ILP) –Thadani, K. & McKeown, K. Optimal and syntactically-informed decoding for monolingual phrase-based alignment. ACL 2011

ACL 2013, Sofia23 performance in F1 10.3%

ACL 2013, Sofia24 performance in F1 0.8% 3.3%

ACL 2013, Sofia25 performance in speed (seconds per sentecne) when sentences are more balanced, jacana- align is about 20x faster corpussentence pair length MANLI-approx.MANLI-exactjacana-align RTE229/111.67s0.08s0.025s FUSION27/ s2.45s0.096s 20x

ACL 2013, Sofia26 performance in speed (seconds per sentecne) the speed of jacana-align is not as sensitive to sentence length increase corpussentence pair length MANLI-approx.MANLI-exactjacana-align RTE229/111.67s0.08s0.025s FUSION27/ s2.45s0.096s 30x 4x

ACL 2013, Sofia27 Conclusion state-of-the-art monolingual word aligner –in accuracy –in speed open source, use it and hack it!

thank you with a demo