Detecting a Continuum of Compositionality in Phrasal Verbs Diana McCarthy & Bill Keller & John Carroll University of Sussex This research was supported.

Slides:

Advertisements

Similar presentations

A Human-Centered Computing Framework to Enable Personalized News Video Recommendation (Oh Jun-hyuk)

Advertisements

CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.

2013/12/10.  The Kendall’s tau correlation is another nonparametric correlation coefficient  Let x 1, …, x n be a sample for random variable x and.

LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.

Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.

Automatic Metaphor Interpretation as a Paraphrasing Task Ekaterina Shutova Computer Lab, University of Cambridge NAACL 2010.

Extracting an Inventory of English Verb Constructions from Language Corpora Matthew Brook O’Donnell Nick C. Ellis Presentation.

January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.

Lecture 21: Spectral Clustering

Statistics MP Oakes (1998) Statistics for corpus linguistics. Edinburgh University Press.

Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.

Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University

Statistical Evaluation of Data

Statistical Treatment of Data Significant Figures : number of digits know with certainty + the first in doubt. Rounding off: use the same number of significant.

Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.

1 COMM 301: Empirical Research in Communication Kwan M Lee Lect3_1.

Nonparametric or Distribution-free Tests

Inferential Statistics

Feature Selection for Automatic Taxonomy Induction The Features Input: Two terms Output: A numeric score, or. Lexical-Syntactic Patterns Co-occurrence.

Latent Semantic Analysis Hongning Wang VS model in practice Document and query are represented by term vectors – Terms are not necessarily orthogonal.

Exploiting Wikipedia as External Knowledge for Document Clustering Sakyasingha Dasgupta, Pradeep Ghosh Data Mining and Exploration-Presentation School.

2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.

1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.

Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.

Types of Data in FCS Survey Nominal Scale – Labels and categories (branch, farming operation) Ordinal Scale – Order and rank (expectations, future plans,

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.

Association between 2 variables

Chapter 14 Nonparametric Tests Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social & Behavioral.

Investigating the Relationship between Scores

Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.

Latent Semantic Analysis Hongning Wang Recap: vector space model Represent both doc and query by concept vectors – Each concept defines one dimension.

A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:

인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.

Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.

Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.

1 Statistical NLP: Lecture 7 Collocations. 2 Introduction 4 Collocations are characterized by limited compositionality. 4 Large overlap between the concepts.

LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.

Summarization Focusing on Polarity or Opinion Fragments in Blogs Yohei Seki Toyohashi University of Technology Visiting Scholar at Columbia University.

MULTI-WORD ITEMS IN ENGLISH COLLOCATIONS RESTRICTED COLLOCATION WHERE CERTAIN WORDS OCCUR ALMOST ENTIRELY IN THE CO-TEXT OF ONE OR TWO OTHER WORDS IS A.

Authors: Marius Pasca and Benjamin Van Durme Presented by Bonan Min Weakly-Supervised Acquisition of Open- Domain Classes and Class Attributes from Web.

Instance-based mapping between thesauri and folksonomies Christian Wartena Rogier Brussee Telematica Instituut.

Evgeniy Gabrilovich and Shaul Markovitch

Collocations and Terminology Vasileios Hatzivassiloglou University of Texas at Dallas.

Evaluating Models of Computation and Storage in Human Sentence Processing Thang Luong CogACLL 2015 Tim J. O’Donnell & Noah D. Goodman.

Elang 273: Statistics. Review: Scientific Method 1. Observe something 2. Speculated why it is so and form hypothesis 3. Test hypothesis by getting data.

Supertagging CMSC Natural Language Processing January 31, 2006.

Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.

Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -

Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.

1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.

1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.

Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.

ENHANCING CLUSTER LABELING USING WIKIPEDIA David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab SIGIR’09.

Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.

Semantic Evaluation of Machine Translation Billy Wong, City University of Hong Kong 21 st May 2010.

Data Analysis: Statistics for Item Interactions. Purpose To provide a broad overview of statistical analyses appropriate for exploring interactions and.

Finding Predominant Word Senses in Untagged Text Diana McCarthy & Rob Koeling & Julie Weeds & Carroll Department of Indormatics, University of Sussex {dianam,

©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 3 Investigating the Relationship of Scores.

2016/9/301 Exploiting Wikipedia as External Knowledge for Document Clustering Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, and Xiaohua Zhou Proceeding.

Exploring Group Differences

Statistical NLP: Lecture 7

Web News Sentence Searching Using Linguistic Graph Similarity

Spearman’s Rank Correlation Test

Vector-Space (Distributional) Lexical Semantics

Statistical Inference for Managers

Semantics and discourse: Collocations, keywords and reliability of manual coding Brezina, V. (2018). Statistics in Corpus Linguistics: A Practical Guide.

Automatic Methods to Detect the Compositionality of Multiwords

Enriching Taxonomies With Functional Domain Knowledge

Descriptive Statistics

Presentation transcript:

Detecting a Continuum of Compositionality in Phrasal Verbs Diana McCarthy & Bill Keller & John Carroll University of Sussex This research was supported by: the RASP project (EPSRC) and the MEANING project (EU 5 th framework)

Overview Phrasal verbs Motivation for detecting compositionality Related research Using an automatically acquired thesaurus Evaluation Results Comparison –With some statistics used for multiword extraction –With entries in man-made resources Conclusions, problems and future directions

Phrasals: syntax and semantics Syntax, e.g. particle movement, adverbial placement Some productive combinations better handled in grammar (Villavicencio and Copestake, 2002) Want different treatment depending on compositionality e.g. fly up, eat up, step down, blow up, cock up use neighbours from thesaurus to indicate degree of compositionality Compare to compositionality judgements – on an ordinal scale Cut-off points to be determined by application

Motivation Selectional Preference Acquisition (eat & eat up vs blow & blow up) Word Sense Disambiguation –importance of identification depends on degree of compositionality, and granularity of sense distinctions Multiword Acquisition – relate phrasal sense to senses of simplex verb, how related are they?

RASP Parser Output Phrasal Verb e.g point out the hotel (|ncmod| _ |point:16_VV0| |out:17_RP|) (|dobj| _ |point:16_VV0| |hotel:19_NN1|) vs Prepositional verb e.g. refer to the map (|iobj| |to:12_II| |refer:11_VV0||map:14_NN1|)

Parser Evaluation For verb and particle constructions identified as such in the WSJ Use phrasal lists (such as in ANLT ) to improve parser performance Precision Recall RASP RASP with ANLT list MINIPAR

Related Research Extraction –Blaheta and Johnson (2001) Phrasality and `good collocation’ correlated with opaqueness –Baldwin and Villavicencio (2002) Compositionality –Lin (1999) thesaurus filtered with Log-likelihood ratio, used to obtain substitutes, test significance of difference in mutual information of substitute MW to original. –Schone and Jurafsky (2001) LSA for multiword induction –Bannard et al. thesaurus and LSA, evaluation for verb and particle contribution –Baldwin et al. LSA compared with WordNet based scores

Acquiring the Thesaurus Thesaurus acquired from RASP parses of the written portion of the BNC data Phrasal verbs (blow up) and their simplex counterpart (blow) listed with all subjects and direct objects Thesaurus obtained following Lin (1998) Output: top 500 nearest neighbours listed (with similarity score)

Using the Thesaurus: –climb+down: clamber+up.248 slither+down.206 creep+down.183 … –climb: walk jump.148 go+up.147… Position and similarity score of simplex verb within phrasal neighbours Overlap of neighbours of simplex with neighbours of phrasal How often the same particle occurs in neighbours Evaluation – no cut off, see correlation between measures and ranks from human judges

Evaluation 100 phrasal verbs selected randomly from 3 partitions of the frequency spectrum, + 16 verbs selected manually 3 judges : native English speakers List of 116 verbs, score between 0 and 10 (fully compositional) Removed any verbs with don’t know category (5 such verbs) Scores treated as ranks, look at correlation of ranks Average ranks used as a gold-standard

Inter-Rater Agreement Kendall Coefficient of Concordance (Siegel and Castellan, 1988) useful for 3 or more judges giving ordinal judgements linear relationship to the average Spearman Rank- order Correlation Coefficient taken over all possible pairs of rankings highly significant W = 0.594, χ 2 = probability of this value by chance <=

Measures simplexasneighbour X = 500 rankofsimplex X=500 scoreofsimplex The similarity score of the simplex in top X = 500 neighbours overlap of first X neighbours, where X= 30, 50, 100, and 500 overlapS of first X neighbours, where X= 30, 50, 100, and 500, with simplex form of neighbours in phrasal neighbours sameparticle number of neighbours with same particle as phrasal X=500 sameparticle-simplex as above minus the number of simplex neighbours with the same particle X = 500

Overlap

OverlapS

For Comparison Statistics –Log-likelihood ratio test (Dunning, 1993) –Mutual Information (point-wise) Church and Hanks (1990) – χ2 (chi-squared) Man-Made resources –WordNet –ANLT lists (phrasal and prepositional verbs)

Results Overlap r s Z scorep under H 0 X = X = X = X = OverlapS X = < X = < X = X =

Results continued… X=500 statisticZ scorep under H 0 sameparticler s= < sameparticle- simplex r s= < simplexasneighbour MW simplexrankr s= simplexscorer s=

Correlations of GS with man-made resources and statistics statistic Z score P under H 0 LLRr s = χ2χ2 r s = MIr s = Phrasal freqr s = Simplex freqr s = WordNet MW ANLT phrasals MW ANLT prep n sMW

Correlation of measures with man-made resources In WordNetIn ANLT phrasals MI sameparticle- simplex

Conclusions, Problems and Future Directions Thesaurus measures worked better than statistics, especially looking for neighbours having the same particle Straight overlap of neighbours – not as good as hoped, Overlap taking particles into account helps. May help to use similarity scores or ranks of neighbours. Polysemy is a problem for both methods and evaluation. Continuum of compositionality useful for exploring relationship – still need cut-offs for application