Dialogue Act Tagging Using TBL Sachin Kamboj CISC 889: Statistical Approaches to NLP Spring 2003 September 14, 2015September 14, 2015September 14, 2015.

Slides:



Advertisements
Similar presentations
Three Basic Problems Compute the probability of a text: P m (W 1,N ) Compute maximum probability tag sequence: arg max T 1,N P m (T 1,N | W 1,N ) Compute.
Advertisements

Machine Learning Approaches to the Analysis of Large Corpora : A Survey Xunlei Rose Hu and Eric Atwell University of Leeds.
Automatic Timeline Generation from News Articles Josh Taylor and Jessica Jenkins.
University of Sheffield NLP Module 4: Machine Learning.
Three Basic Problems 1.Compute the probability of a text (observation) language modeling – evaluate alternative texts and models P m (W 1,N ) 2.Compute.
Linear Classifiers (perceptrons)
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Acoustic Model Adaptation Based On Pronunciation Variability Analysis For Non-Native Speech Recognition Yoo Rhee Oh, Jae Sam Yoon, and Hong Kook Kim Dept.
Linear Model Incorporating Feature Ranking for Chinese Documents Readability Gang Sun, Zhiwei Jiang, Qing Gu and Daoxu Chen State Key Laboratory for Novel.
Confidence Measures for Speech Recognition Reza Sadraei.
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Feature Selection Presented by: Nafise Hatamikhah
Part II. Statistical NLP Advanced Artificial Intelligence Part of Speech Tagging Wolfram Burgard, Luc De Raedt, Bernhard Nebel, Lars Schmidt-Thieme Most.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Aki Hecht Seminar in Databases (236826) January 2009
CS Catching Up CS Porter Stemmer Porter Stemmer (1980) Used for tasks in which you only care about the stem –IR, modeling given/new distinction,
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Transformation-based error- driven learning (TBL) LING 572 Fei Xia 1/19/06.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Course Summary LING 572 Fei Xia 03/06/07. Outline Problem description General approach ML algorithms Important concepts Assignments What’s next?
Maximum Entropy Model LING 572 Fei Xia 02/07-02/09/06.
Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
A New Approach for Cross- Language Plagiarism Analysis Rafael Corezola Pereira, Viviane P. Moreira, and Renata Galante Universidade Federal do Rio Grande.
Module G Variables Sampling Accounting 4081Module G.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Part II. Statistical NLP Advanced Artificial Intelligence Applications of HMMs and PCFGs in NLP Wolfram Burgard, Luc De Raedt, Bernhard Nebel, Lars Schmidt-Thieme.
EMNLP’01 19/11/2001 ML: Classical methods from AI –Decision-Tree induction –Exemplar-based Learning –Rule Induction –T ransformation B ased E rror D riven.
Short Introduction to Machine Learning Instructor: Rada Mihalcea.
Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.
A Multi-Domain Evaluation of Scaling in Soar’s Episodic Memory Nate Derbinsky, Justin Li, John E. Laird University of Michigan.
Distributional Part-of-Speech Tagging Hinrich Schütze CSLI, Ventura Hall Stanford, CA , USA NLP Applications.
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
CSA2050: Introduction to Computational Linguistics Part of Speech (POS) Tagging II Transformation Based Tagging Brill (1995)
1 CS546: Machine Learning and Natural Language Discriminative vs Generative Classifiers This lecture is based on (Ng & Jordan, 02) paper and some slides.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
Benk Erika Kelemen Zsolt
Discriminative Models for Spoken Language Understanding Ye-Yi Wang, Alex Acero Microsoft Research, Redmond, Washington USA ICSLP 2006.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Transformation-Based Learning Advanced Statistical Methods in NLP Ling 572 March 1, 2012.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
PIER Research Methods Protocol Analysis Module Hua Ai Language Technologies Institute/ PSLC.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Using Text Mining and Natural Language Processing for.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-14: Probabilistic parsing; sequence labeling, PCFG.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Presented By- Shahina Ferdous, Student ID – , Spring 2010.
Discourse & Dialogue CS 359 November 13, 2001
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Automatic Grammar Induction and Parsing Free Text - Eric Brill Thur. POSTECH Dept. of Computer Science 심 준 혁.
Machine Learning Tutorial-2. Recall, Precision, F-measure, Accuracy Ch. 5.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Discriminative Frequent Pattern Analysis for Effective Classification By Hong Cheng, Xifeng Yan, Jiawei Han, Chih- Wei Hsu Presented by Mary Biddle.
DESIGNING AN ARTICLE Effective Writing 3. Objectives Raising awareness of the format, requirements and features of scientific articles Sharing information.
BOOTSTRAPPING INFORMATION EXTRACTION FROM SEMI-STRUCTURED WEB PAGES Andrew Carson and Charles Schafer.
An evolutionary approach for improving the quality of automatic summaries Constantin Orasan Research Group in Computational Linguistics School of Humanities,
Learning Analogies and Semantic Relations Nov William Cohen.
CSA2050: Introduction to Computational Linguistics Part of Speech (POS) Tagging II Transformation Based Tagging Brill (1995)
Conversational role assignment problem in multi-party dialogues Natasa Jovanovic Dennis Reidsma Rutger Rienks TKI group University of Twente.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
TEXT CLASSIFICATION AND CLASSIFIERS: A SURVEY & ROCCHIO CLASSIFICATION Kezban Demirtas
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Data Mining (and machine learning)
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Presentation transcript:

Dialogue Act Tagging Using TBL Sachin Kamboj CISC 889: Statistical Approaches to NLP Spring 2003 September 14, 2015September 14, 2015September 14, 2015

Outline Dialogue Acts Transformation Based Learning (TBL) Introduction Training Phase Example Motivation for use of TBL in Dialogue Limitations of TBL Monte Carlo Version of TBL Other Extensions References

Dialogue Acts For a computer system to understand (and participate in) a dialogue, the system should know the intentions of the speaker. Defined as: A concise abstraction of the intentional function of a speaker. #SpeakerUtteranceDialogue Act 1JohnHello.Greet 2JohnI’d like to meet you on Tuesday at 2:00Suggest 3MaryThat’s no good for me,Reject 4MaryBut I am free at 3:00Suggest 5JohnThat sounds fine to meAccept 6JohnI’ll see you then.Bye

Transformation-based Learning Uses machine learning to generate a sequence of rules to use in tagging data The first rules are general, making sweeping generalization and several errors Subsequent rules are more specific and usually correct specific errors

TBL: Training Phase START For each incorrect tag, generate ALL rules to fix this tag Compute a score for each rule Apply this rule to the corpus Find the highest Scoring rule Repeat? END No Yes Label each utterance with an initial tag

TBL: Example #Condition (s)New Dialogue Act 1NoneSUGGEST 2Includes “see” and “you”BYE 3Includes “sounds”ACCEPT 4Length < 4 words Previous tag is noneGREET 5Includes “no” Previous tag is SUGGESTREJECT Utterance Correct Tag Rule # Hello.GSSSGG I’d like to meet you on Tuesday at 2:00SSSSSS That’s no good for me,RSSSSR But I am free at 3:00SSSSSS That sounds fine to meASSAAA I’ll see you then.BSBBBB

TBL: Motivation for use in Dialogue Computing Dialogue Acts is similar to computing POS tags: POS tags depend on surrounding words Dialogue Acts depend on surrounding utterances. Advantages over other ML methods used for Dialogue Act Tagging: Generates intuitive rules: Allows researchers to gain insights into discourse theory Leverage learning: Can use tags that have already been computed. Does not overtrain

Limitations TBL does not assign a confidence measure to the tags TBL is highly dependent on the rule templates. Rule templates need to be manually selected in advance However, its difficult to find only and all the relevant templates. Omissions would result in handicapping the system Solution: Allow the system to learn which templates are useful Allow an overly general set of templates TBL is capable of discarding irrelevant rules…

Limitations (cont.) Problem with the solution: The system becomes intractable with more than 30 templates The system must generate and evaluate O( ip(v + 1) (2d + 1)f ) rules, where: i= no of instances p= no of passes f = no of features d = max. feature distance v = avg. no. of distinct values for each feature

A Monte Carlo Version of TBL Allows the system to consider a huge number of templates while still maintaining tractability System does not perform an exhaustive search through the space of possible rules. Instead only R randomly selected templates are instantiated at each pass Hence, system only considers O ( ipR ) rules. System still finds the best rules: Because the system has many opportunities to find the best rules

Other Extensions Committee-Based Sampling method to assign confidence measures Automatic generation of “Cue Phases”

Reference Sources Samuel, Ken and Carberry, Sandra and Vijay-Shanker, K Dialogue Act Tagging with Transformation-Based Learning. In Proceedings of the 17th International Conference on Computational Linguistics and the 36th Annual Meeting of the Association for Computational Linguistics. Montreal, Quebec, Canada Samuel, Ken and Carberry, Sandra and Vijay-Shanker, K An Investigation of Transformation-Based Learning in Discourse. In Machine Learning: Proceedings of the Fifteenth International Conference. Madison, Wisconsin Samuel, Ken and Carberry, Sandra and Vijay-Shanker, K Automatically Selecting Useful Phrases for Dialogue Act Tagging. In Proceedings of the Fourth Conference of the Pacific Association for Computational Linguistics. Waterloo, Ontario, Canada. Samuel, Ken and Carberry, Sandra and Vijay-Shanker, K Computing Dialogue Acts from Features with Transformation- Based Learning. In Applying Machine Learning to Discourse Processing: Papers from the 1998 AAAI Spring Symposium. Stanford, California

Questions?