Measures of Coincidence Vasileios Hatzivassiloglou University of Texas at Dallas.

Slides:



Advertisements
Similar presentations
Hypothesis testing Another judgment method of sampling data.
Advertisements

Chapter 6 Sampling and Sampling Distributions
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Hypothesis Testing Steps in Hypothesis Testing:
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Visual Recognition Tutorial
Chapter 8 Estimation: Additional Topics
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chapter 7 Sampling and Sampling Distributions
Chapter 6 The Normal Distribution
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
CHI-SQUARE GOODNESS OF FIT TEST u A nonparametric statistic u Nonparametric: u does not test a hypothesis about a population value (parameter) u requires.
DEPENDENT SAMPLES t-TEST What is the Purpose?What Are the Assumptions?How Does it Work?
Class 3: Estimating Scoring Rules for Sequence Alignment.
CONFIDENCE INTERVALS What is the Purpose of a Confidence Interval?
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Chapter 5 Continuous Random Variables and Probability Distributions
Chapter 7 Estimation: Single Population
7-2 Estimating a Population Proportion
Experimental Evaluation
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
An Introduction to Logistic Regression
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Today Concepts underlying inferential statistics
PSY 307 – Statistics for the Behavioral Sciences
5-3 Inference on the Means of Two Populations, Variances Unknown
Statistics for Managers Using Microsoft® Excel 7th Edition
PSY 307 – Statistics for the Behavioral Sciences
Choosing Statistical Procedures
AS 737 Categorical Data Analysis For Multivariate
AM Recitation 2/10/11.
Chapter 13: Inference in Regression
Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Education 793 Class Notes T-tests 29 October 2003.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Chapter 8: Confidence Intervals
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Classifier Evaluation Vasileios Hatzivassiloglou University of Texas at Dallas.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Multinomial Distribution
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Comp. Genomics Recitation 3 The statistics of database searching.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Statistical Estimation Vasileios Hatzivassiloglou University of Texas at Dallas.
1 6. Mean, Variance, Moments and Characteristic Functions For a r.v X, its p.d.f represents complete information about it, and for any Borel set B on the.
Multiple Sequence Alignment Vasileios Hatzivassiloglou University of Texas at Dallas.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chapter 6 Sampling and Sampling Distributions
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
I. ANOVA revisited & reviewed
Data Analysis for Two-Way Tables
Chapter 9 Hypothesis Testing.
Continuous Random Variable Normal Distribution
Chapter 9 Estimation: Additional Topics
Presentation transcript:

Measures of Coincidence Vasileios Hatzivassiloglou University of Texas at Dallas

A study of different measures Smadja, McKeown, and Hatzivassiloglou (1996): Translating Collocations for Bilingual Lexicons: A Statistical Approach Use aligned parallel corpora (Hansards) Task: Find translation for a word group across languages

Sketch of algorithm Start with set of collocations in French Find candidate single word translations according to association between original collocation and translation Measure association between source collocation and pairs of candidate words Expand iteratively to triplets, etc. by recalculating association

Dice vs. SI Dice depends on conditional probabilities only SI depends on the marginals: logP(X|Y)-logP(X) SI depends on how rare X is Limit behavior

Asymmetry Many kinds of asymmetry –Between X and Y –Between X=1 and X=0 –1-1 matches versus 0-0 matches Adding 0-0 matches does not change Dice Adding 0-0 matches always increases SI

Effect of asymmetry Hypothetical scenario on 100 sentences A,B appear together twice, by themselves three times each Dice: 2×2 / (5+5) = 0.4 SI: log (0.02 / (0.05×0.05)) = 3 bits MI: bits

Reversing one and zeroes Now replace every 1 with 0 and vice versa New variables A′, B′ occur together 92 times, each occurs by itself three times Dice: 2×92 / ( ) = MI: Unchanged ( bits) SI: log(0.92 / (0.95×0.95)) = bits

Explaining the behavior Limit effect as P(X) decreases with P(X|Y) constant P(X) eventually dominates SI Makes SI (and MI) more sensitive to estimation errors

Bounds and testing purpose No upper bound for SI and MI Dice is always between 0 and 1 Easy to test SI/MI for independence Easy to test Dice for correlation

Empirical comparison How to compare without redoing the entire experiment? Solution: Use competing measure in the last round Test cases where the correct solution is available Provide lower bound on competitor error

Empirical results 45 French collocations 2 did not produce any candidate translation Dice resulted in 36 correct, 7 incorrect translations SI resulted in 26 correct, 17 incorrect translations

Re-examining contingency tables Ted Dunning, “Accurate Methods for the Statistics of Surprise and Coincidence”, Computational Linguistics, Problem: Asymptotic normality assumptions How much data is enough? Are researchers aware of the need for statistical validity analysis?

Rarity of words Empirical counts on words show that 20–30% of words appear less than 1 in 50,000 words Estimating binomial as normal: Good as long as np(1-p) > 5 Significance overestimated by 20% for np=1, 40 for np=0.1, for np=0.01

Likelihood in parameter spaces Parametric model (known except for parameter values) Likelihood function H(ω;k) Hypothesis represented by a point ω 0

Likelihood ratio Test statistic: -2logλ Rapidly approaches χ 2 distribution for binomial H

Comparing to chi-square Leads to same formula as Pearson’s chi- square statistic when approximating with normal distribution Diverges significantly from chi-square for low np Closely follows chi-square distribution

Experimental results 32,000 words of financial text from Switzerland Find highly correlated word pairs Observe top-ranked entries for log- likelihood and chi-square Chi-square leads to huge scores for rare pairs 2,682 of 2,693 bigrams violate assumptions