Discovery of Manner Relations and their Applicability to Question Answering Roxana Girju 1,2, Manju Putcha 1, and Dan Moldovan 1 University of Texas at.

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 2 (06/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Part of Speech (PoS)
Large-Scale Entity-Based Online Social Network Profile Linkage.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Word Sense Disambiguation for Machine Translation Han-Bin Chen
Data preprocessing before classification In Kennedy et al.: “Solving data mining problems”
A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts 04 10, 2014 Hyun Geun Soo Bo Pang and Lillian Lee (2004)
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
What is Statistical Modeling
Automatic Metaphor Interpretation as a Paraphrasing Task Ekaterina Shutova Computer Lab, University of Cambridge NAACL 2010.
Assuming normally distributed data! Naïve Bayes Classifier.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
1 SIMS 290-2: Applied Natural Language Processing Marti Hearst Sept 22, 2004.
Markov Model Based Classification of Semantic Roles A Final Project in Probabilistic Methods in AI Course Submitted By: Shlomit Tshuva, Libi Mann and Noam.
Learning Subjective Adjectives from Corpora Janyce M. Wiebe Presenter: Gabriel Nicolae.
CS 4705 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised –Dictionary-based.
Learning syntactic patterns for automatic hypernym discovery Rion Snow, Daniel Jurafsky and Andrew Y. Ng Prepared by Ang Sun
Duyu Tang, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, Bing Qin
Measuring Language Development in Children: A Case Study of Grammar Checking in Child Language Transcripts Khairun-nisa Hassanali and Yang Liu {nisa,
Automatic Extraction of Opinion Propositions and their Holders Steven Bethard, Hong Yu, Ashley Thornton, Vasileios Hatzivassiloglou and Dan Jurafsky Department.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
A Feedback-Augmented Method for Detecting Errors in the Writing of Learners of English Ryo Nagata et al. Hyogo University of Teacher Education ACL 2006.
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Distributional Part-of-Speech Tagging Hinrich Schütze CSLI, Ventura Hall Stanford, CA , USA NLP Applications.
2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.
Text Classification, Active/Interactive learning.
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Efficiently Computed Lexical Chains As an Intermediate Representation for Automatic Text Summarization H.G. Silber and K.F. McCoy University of Delaware.
Recognizing Names in Biomedical Texts: a Machine Learning Approach GuoDong Zhou 1,*, Jie Zhang 1,2, Jian Su 1, Dan Shen 1,2 and ChewLim Tan 2 1 Institute.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
A Cascaded Finite-State Parser for German Michael Schiehlen Institut für Maschinelle Sprachverarbeitung Universität Stuttgart
1 Statistical NLP: Lecture 7 Collocations. 2 Introduction 4 Collocations are characterized by limited compositionality. 4 Large overlap between the concepts.
Copyright R. Weber Machine Learning, Data Mining INFO 629 Dr. R. Weber.
Prototype-Driven Learning for Sequence Models Aria Haghighi and Dan Klein University of California Berkeley Slides prepared by Andrew Carlson for the Semi-
Minimally Supervised Event Causality Identification Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign 1 EMNLP-2011.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Disambiguation Read J & M Chapter 17.1 – The Problem Washington Loses Appeal on Steel Duties Sue caught the bass with the new rod. Sue played the.
CSKGOI'08 Commonsense Knowledge and Goal Oriented Interfaces.
Artificial Intelligence 8. Supervised and unsupervised learning Japan Advanced Institute of Science and Technology (JAIST) Yoshimasa Tsuruoka.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Automatic recognition of discourse relations Lecture 3.
KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.
Learning Subjective Nouns using Extraction Pattern Bootstrapping Ellen Riloff School of Computing University of Utah Janyce Wiebe, Theresa Wilson Computing.
Automatic acquisition for low frequency lexical items Nuria Bel, Sergio Espeja, Montserrat Marimon.
Discovering Relations among Named Entities from Large Corpora Takaaki Hasegawa *, Satoshi Sekine 1, Ralph Grishman 1 ACL 2004 * Cyberspace Laboratories.
Labeling protein-protein interactions Barbara Rosario Marti Hearst Project overview The problem Identifying the interactions between proteins. Labeling.
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
Improving the Classification of Unknown Documents by Concept Graph Morteza Mohagheghi Reza Soltanpour
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Twitter as a Corpus for Sentiment Analysis and Opinion Mining
KNN & Naïve Bayes Hongning Wang
Text Classification and Naïve Bayes Formalizing the Naïve Bayes Classifier.
TRUE or FALSE? Syntax= the order of words in a sentence.
A Brief Introduction to Distant Supervision
Automatic Hedge Detection
Improving a Pipeline Architecture for Shallow Discourse Parsing
Statistical NLP: Lecture 9
A method for WSD on Unrestricted Text
Automatic Detection of Causal Relations for Question Answering
Logistic Regression [Many of the slides were originally created by Prof. Dan Jurafsky from Stanford.]
Extracting Why Text Segment from Web Based on Grammar-gram
Statistical NLP : Lecture 9 Word Sense Disambiguation
Presentation transcript:

Discovery of Manner Relations and their Applicability to Question Answering Roxana Girju 1,2, Manju Putcha 1, and Dan Moldovan 1 University of Texas at Dallas 1 Baylor University 2 ACL Workshop on Multilingual Summarization and Question Answering, 2003

Manner Relation An example: We want to work together to build our new economy, creating jobs by investing in technology so America can continue to lead the world in growth and opportunity. Q: How do Democrats want to build the economy? A: by creating jobs Q: How do Democrats want to create jobs? A: by investing in technology

Manner Relation (Cont.) In WordNet, the manner relation is defined as a way of acting or behaving. There are different ways of expressing manner. A possible way to check: answer to “ In what manner/how ? ” “ He runs quickly. ” vs. “ How hoes he run? ” but not to “ Where ? ”, or “ When ? ” “ He runs on the field. ”, “ He runs quite often. ”

Lexico-Syntactic Patterns Expressing Manner The most frequently occurring form of manner is as a semantic role (Quirk et al., 1985). Manner is encoded as a relationship between a verb and one of its arguments. One of the most frequently used patterns expressing manner is verb-adverb. Adverbs can be further classified as follows:

Manner Expressing (Cont.) Verb-Adverb Patterns Adverbs of manner that end in “ -ly ” Adverbs of manner that do not end in “ -ly ” Quality description adverbs: fast, good, well Adverbial expressions as much as 60% faster, louder than ever Compound adverbs of manner radio-style, tax-free Foreign adverbial expressions a la Gorbachev, en masse

Manner Expressing (Cont.) Other forms of manner relations (1) complex nominals (fast car), (2) verbs of implicit manner (for example whisper is a manner of speaking), (3) verb-PP (I took your coat by mistake) (4) verb-NP (He breathed a deep breath) (5) verb clauses (I cook vegetables as Chinese do)

Manner Expressing (Cont.) All these lexico-syntactic patterns are ambiguous. In this paper we focus only on the discovery of manner semantic roles expressed as verb-adverb pairs. The method, however, is extendable to many other manner forms and even to other semantic relations.

Approach Supervised learning Naive Bayes Classifier approach Adverb Dictionary (as reference) Adverbs in WordNet with a gloss like “ in a ~~ manner ” Adverbs that are annotated in TreeBank as MNR adverb-verb pairs

Features

Features (Cont.) (1) Specific adverb statistics Whether the adverb is in Adverb Dictionary or not. Statistics is the occurrences in training data. (2) Parent phrase type The syntactic tag of the parent of the adverb (3) Whether or not Adverb is present in the Dictionary The same as (1), but the statistics is the probability of being a manner adverb in the Adverb Dictionary.

Features (Cont.) (4) Distance between verb and adverb Counted as words between them (5) Component before the adverb The syntactic roles before the adverb. (6) Component after the adverb The syntactic roles after the adverb. (7) Adverb ends in “ ly ” True or false

Estimating Probabilities Class prior probabilities: Portion of positive (V j =1) or negative (V j =0) examples in the training data Class conditional probability (of each feature): pos_freq: occurrences of the feature in positive class VOCAB, TEXT: the distinct and total numbers of instances for the feature

Naive Bayes Classifier V nb is the output of the Naive Bayes Classifier, v j is the class in the target set V={+,-} f i are the individual features from the set F of the seven features. P(f i /+)=Prob(+) for f i P(f i /-)=Prob(-) for f i The classifier will generate a Look-Up table for classification use.

Training and Test Corpus Positive examples: ADVP-MNR annotations from Treebank2 (UPenn ’ s) Training:Testing=3:1

First Experiment Training = (1176 Positive Negative) = 3722 examples Testing = (507 Positive Negative) =1690 examples. Output of the program: Prior Positive Probability = Prior Negative Probability = Precision = 191/242 = 78.92% Recall = 191/507 = 37.62%

Second Experiment Some adverbs which always indicate in one class were removed. Negative: moreover, then, thus Positive: much, very, so

Application to Question Answering For questions tagged as MNR First locate the paragraph where the potential answer is in, Then identify the MNR tag in that paragraph. Q: How did Bob Marley die? A: Bob Marley died [of Melanoma][MNR]. Q: How was little Johnny dressed last night? A: Dressed [in a cowboy style][MNR], Johnny walked proudly on the street.

Conclusion The method presented in this paper for the detection and validation of manner relations is automatic and novel. Main flaw: Naive Bayes Classifier assumes feature independence But most features here are dependent on each other (except 1 and 4).