Named Entity Recognition based on Bilingual Co-training Li Yegang School of Computer, BIT.



Outline
– Introduction and Related Work
– Bilingual Co-training
– Corrective NE Projection Annotation
– Experimental Results

1. Introduction
Related work
– Das et al., 2011: use parallel English-foreign language data, a high-quality NER tagger for English, and projected annotations for the foreign language.
– Burkett et al., 2010: parallel data has also been used to improve existing monolingual taggers or other analyzers in both languages.

Introduction
Disadvantages
– Current NE alignment methods are not accurate enough, and considerable noise can be introduced during the word alignment stage.
– Manual annotation is usually available for only a few domains, which has a negative effect on supervised statistical learning methods.

Introduction
Complementarity
(a) Results from Chinese name tagger: 金庸新 小说
(b) Results from English name tagger: the new novels of Jin Yong
(c) Name tagging after using bilingual co-training: 金庸 新小说
Both “金庸新” and “金庸” can be a PER name in Chinese, but the English translation “Jin Yong” indicates that “金庸” is more likely to be the PER name than “金庸新”.

Introduction
Complementarity
(a) Results from English name tagger: The captain of a ferry boat who works on 〈PER〉Lake Constance〈/PER〉...
(b) Results from Chinese name tagger: 在〈LOC〉康斯坦茨湖〈/LOC〉工作的一艘渡船的船长...
(c) Name tagging after using bilingual co-training: The captain of a ferry boat who works on 〈LOC〉Lake Constance〈/LOC〉...
“Lake” in English can be part of either a PER or a LOC name, but the Chinese translation “康斯坦茨湖” indicates that “Lake Constance” is more likely to be a LOC name.

2. Bilingual Co-training
Co-training
– Starting with a set of labeled data, co-training algorithms attempt to increase the amount of annotated data by using large amounts of unlabeled data. The process may continue for several iterations.

Bilingual Co-training
– Treats the two sides of parallel Chinese-English sentences as weakly independent views for NE identification.

Bilingual Co-training

Algorithm of Bilingual Co-training
Given:
– A set Ls of source labeled examples
– A set Lt of target labeled examples
– A set Us of source unlabeled examples
– A set Ut of target unlabeled examples

Algorithm of Bilingual Co-training
1. Classifiers
– Use Ls to train the source classifier Classifier(s)
– Use Lt to train the target classifier Classifier(t)
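The transcript preserves only the inputs and the first step of the algorithm, so the loop below is a minimal sketch of how a bilingual co-training round could proceed, not the author's exact procedure. It assumes that Us and Ut are stored as aligned sentence pairs with word alignments, that each tagger returns a confidence, and that confident labels from one side are projected onto the other side through the alignment. The names `train_lexicon_tagger`, `project`, `bilingual_co_training`, and the `threshold` parameter are hypothetical, and the lexicon tagger is only a toy stand-in for a real NE classifier.

```python
def train_lexicon_tagger(labeled):
    """Toy stand-in for Classifier(s)/Classifier(t): memorizes token -> label."""
    lexicon = {}
    for tokens, labels in labeled:
        for tok, lab in zip(tokens, labels):
            if lab != "O":
                lexicon[tok] = lab

    def tag(tokens):
        labels = [lexicon.get(t, "O") for t in tokens]
        # Assumed confidence: 1.0 if any entity token was found, else 0.0.
        conf = 1.0 if any(l != "O" for l in labels) else 0.0
        return labels, conf

    return tag


def project(labels, alignment, out_len):
    """Copy entity labels across word-alignment links (i -> j)."""
    out = ["O"] * out_len
    for i, j in alignment:
        if labels[i] != "O":
            out[j] = labels[i]
    return out


def bilingual_co_training(Ls, Lt, U, rounds=2, threshold=0.5):
    """Ls, Lt: lists of (tokens, labels). U: list of (src, tgt, alignment)."""
    Ls, Lt, pool = list(Ls), list(Lt), list(U)
    for _ in range(rounds):
        tag_s = train_lexicon_tagger(Ls)  # step 1: train on Ls
        tag_t = train_lexicon_tagger(Lt)  # step 1: train on Lt
        remaining = []
        for src, tgt, align in pool:
            lab_s, conf_s = tag_s(src)
            lab_t, conf_t = tag_t(tgt)
            used = False
            if conf_s >= threshold:
                # Source view teaches the target view.
                Lt.append((tgt, project(lab_s, align, len(tgt))))
                used = True
            if conf_t >= threshold:
                # Target view teaches the source view.
                reverse = [(j, i) for i, j in align]
                Ls.append((src, project(lab_t, reverse, len(src))))
                used = True
            if not used:
                remaining.append((src, tgt, align))
        pool = remaining
    return train_lexicon_tagger(Ls), train_lexicon_tagger(Lt)


# Illustrative data only, loosely based on the Jin Yong example in the slides.
Ls = [(["Jin", "Yong", "writes"], ["PER", "PER", "O"])]
Lt = [(["他", "写"], ["O", "O"])]
U = [(["the", "new", "novels", "of", "Jin", "Yong"],
      ["金庸", "新", "小说"],
      [(4, 0), (5, 0), (1, 1), (2, 2)])]
tag_s, tag_t = bilingual_co_training(Ls, Lt, U)
```

In this toy run the English tagger already knows "Jin Yong", so its projected label gives the Chinese tagger a new training example that it could not have produced on its own.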

3. Corrective NE Projection Annotation
Projection NE Candidate
– For each word in the source-language NE, we find all possible projection words in the target language through the word alignment. These projection words serve as the “seed” data. With an open-ended window around each seed, all sequences located within the window are considered possible candidates for NE projection. Their lengths range from 1 to the empirically determined length of the window. To select the best candidate, the NE alignment model described next is applied to search for the best projection NE.
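The candidate generation step can be sketched as follows. This is an illustration under stated assumptions rather than the author's implementation: the alignment is taken to be a list of (source index, target index) links, a candidate is taken to be any contiguous target span of length 1 to `window` that covers a seed position, and the function name `projection_candidates` is hypothetical.

```python
def projection_candidates(ne_positions, alignment, tgt_len, window=3):
    """Enumerate candidate target spans for one source NE.

    ne_positions: source token indices that belong to the NE.
    alignment:    list of (source_index, target_index) links.
    tgt_len:      number of tokens in the target sentence.
    window:       assumed maximum span length (set empirically in the slides).
    Returns sorted (start, end) spans, with end exclusive.
    """
    # Seeds: every target word aligned to some word of the source NE.
    seeds = sorted({j for i, j in alignment if i in ne_positions})
    candidates = set()
    for seed in seeds:
        for length in range(1, window + 1):
            # Slide a span of this length over every position that covers the seed.
            for start in range(seed - length + 1, seed + 1):
                end = start + length
                if start >= 0 and end <= tgt_len:
                    candidates.add((start, end))
    return sorted(candidates)


# "Jin Yong" occupies source positions 4 and 5; both align to target token 0.
spans = projection_candidates({4, 5}, [(4, 0), (5, 0), (1, 1), (2, 2)], tgt_len=3, window=2)
```

Each returned span is then a candidate to be scored by the NE alignment model.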

3. Corrective NE Projection Annotation
NE Alignment Model
– The model combines three features: a translation feature, a co-occurrence feature of the source and target NEs, and a length feature of the NE pair.

3. Corrective NE Projection Annotation
Translation Feature (Brown et al., 1993)

3. Corrective NE Projection Annotation
Co-occurrence Feature

3. Corrective NE Projection Annotation
Length Feature (Church, 1993)
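The feature formulas did not survive in the transcript, so the sketch below only illustrates how three feature values might be combined to pick the best projection candidate. The weighted log-linear combination, the weights, and the names `alignment_score` and `best_projection` are assumptions made for illustration; the feature values are placeholders, not results from the presentation.

```python
import math


def alignment_score(features, weights):
    """Assumed log-linear combination of feature values in (0, 1]."""
    return sum(weights[name] * math.log(max(value, 1e-12))
               for name, value in features.items())


def best_projection(candidates, feature_fn, weights):
    """Return the candidate span with the highest combined score."""
    return max(candidates, key=lambda c: alignment_score(feature_fn(c), weights))


# Placeholder feature values for two candidate spans (illustrative only).
toy_features = {
    (0, 1): {"translation": 0.8, "cooccurrence": 0.6, "length": 0.9},
    (0, 2): {"translation": 0.4, "cooccurrence": 0.3, "length": 0.5},
}
weights = {"translation": 1.0, "cooccurrence": 1.0, "length": 1.0}
best = best_projection(list(toy_features), toy_features.get, weights)
```

Working in log space keeps the combination a simple weighted sum, and the weights could be tuned on held-out data if the features are not equally reliable.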

Experimental Results
Baseline F-Measure (%) of NE Tagging
Columns: NE Type, Chinese (F), English (F)
Rows: PER, LOC, ORG, ALL

Experimental Results
Co-training Model F-Measure (%) of NE Tagging
Columns: NE Type, Chinese (F), English (F)
Rows: PER, LOC, ORG, ALL
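For reference, the F-measure reported in both tables is the harmonic mean of precision and recall. The sketch below computes it at the entity level, assuming each entity is represented as a (start, end, type) tuple and counts as correct only on an exact match; the function name `f_measure` and this matching criterion are assumptions, since the slides do not state the scoring details.

```python
def f_measure(gold, predicted):
    """Return (precision, recall, F1) over sets of (start, end, type) entities."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)  # exact span and type match
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Illustrative entities only: one correct PER, one LOC mislabeled as PER.
gold = {(0, 2, "PER"), (5, 6, "LOC")}
predicted = {(0, 2, "PER"), (5, 6, "PER")}
p, r, f = f_measure(gold, predicted)
```

Multiplying the returned F1 by 100 gives the percentage form used in the table headings.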

Thank you!