1 Lecture 8 Measures of association: chi square test, mutual information, binomial distribution and log likelihood ratio.

Slides:



Advertisements
Similar presentations
Statistical Methods Lecture 26
Advertisements

Friday: Lab 3 & A3 due Mon Oct 1: Exam I  this room, 12 pm Please, no computers or smartphones Mon Oct 1: No grad seminar Next grad seminar: Wednesday,
Chapter 18: The Chi-Square Statistic
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences CHAPTER.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
1 Text categorization Feature selection: chi square test.
Chapter 13: The Chi-Square Test
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
1 SIMS 290-2: Applied Natural Language Processing Marti Hearst Oct 23, 2006 (Slides developed by Preslav Nakov)
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
1 SIMS 290-2: Applied Natural Language Processing Preslav Nakov Sept 29, 2004.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Inferences About Process Quality
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 15 The.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Albert Gatt Corpora and Statistical Methods – Part 2.
Imagine a a bag that contained 90 white marbles and 10 black marbles. If you drew 10 marbles, how many would you expect to come up white, and how many.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Nonparametric Statistics
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
CHI SQUARE TESTS.
Copyright © 2010 Pearson Education, Inc. Slide
The two way frequency table The  2 statistic Techniques for examining dependence amongst two categorical variables.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
© Copyright McGraw-Hill 2004
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Chapter 13 Understanding research results: statistical inference.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chi-Square Chapter 14. Chi Square Introduction A population can be divided according to gender, age group, type of personality, marital status, religion,
BINARY LOGISTIC REGRESSION
Chapter 11 Chi-Square Tests.
Chapter 4. Inference about Process Quality
Active Learning Lecture Slides
Chapter 25 Comparing Counts.
Qualitative data – tests of association
Chapter 10 Analyzing the Association Between Categorical Variables
Overview and Chi-Square
15.1 Goodness-of-Fit Tests
Chapter 11 Chi-Square Tests.
Lesson 11 - R Chapter 11 Review:
Analyzing the Association Between Categorical Variables
Chapter 26 Comparing Counts.
Chi-Squared AP Biology.
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
Chapter 18: The Chi-Square Statistic
Chapter 26 Comparing Counts.
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11 Chi-Square Tests.
Presentation transcript:

1 Lecture 8 Measures of association: chi square test, mutual information, binomial distribution and log likelihood ratio

2 Experiments in Multidocument summarization (SNM’02) Summarization system based on a range of features Raises issues we have not discussed upto now Non-extractive techniques Ordering of information

3 Lead values feature Lead sentences of news articles can often make excellent brief summaries But for multi-document summaries there are several first sentences, so difficult to choose! They are information dense Can we find very informative words based on this observation Used Binomial test to decide if

4 Sample lead words

5 Verb specificity Compare “arrest” with “do” or “be” Often given subjects are very strongly associated with a verb Actors  appear in movies Singers  release an album Compute associations between subject nouns and verbs Use mutual association measure

6 Concept sets Frequency of words are not that reliable, even when stemming is used Synonyms, hypernyms and hyponyms from wordnet

7 Other features Location A negative value that penalizes sentencesthat appear late in the document. Publication Date Additional value to the most recent documents, on the assumption that users will want the most up-to-date information. Target Indicates the presence of the central personage in the document cluster, if one exists. Length A penalty for sentences that are below a minimum (15 words) and above a maximum (30 words). Short sentences are often require some introduction or reference resolution, or else are a kind of interjection. Long sentences can cover multiple thoughts that are often found elsewhere in the document cluster in single sentences. Others Indicates the presence of any named entity, weighted to the frequency of that entity across all documents. Pronoun A negative value on sentences that have pronouns in the beginning of the sentence.

8 Other issues Sentence ordering How to present the selected information? Even good choices might be hard to understand if they are presented in the wrong order Imagine a newspaper articles with all sentences randomly permuted Noun phrases Depend on the context

9 Extractive summary

10 Partly modified summary

11 Measures of associations For supervised learning, they can help us detrmine which features are predictive of the distinctions we want to make Chi square test from last lecture Words that are likely to appear in the first sentence rather than anywhere else Verbs that are strongly associated with a given subjects  A variety of measures are defined in the Chapter 5 reading

12  2 statistic (pronounced “kai square”) A commonly used method of comparing proportions. Measures the lack of independence between a term and a category  2 statistic (CHI)

13 Is “jaguar” a good predictor for the “auto” class? We want to compare: the observed distribution above; and null hypothesis: that jaguar and auto are independent  2 statistic (CHI) Term = jaguar Term  jaguar Class = auto2500 Class  auto 39500

14 Under the null hypothesis: (jaguar and auto independent): How many co-occurrences of jaguar and auto do we expect? If independent: P r (j,a) = P r (j)  P r (a) So, there would be N  P r (j,a), i.e. N  P r (j)  P r (a) occurances of “jaguar” P r (j) = (2+3)/N; P r (a) = (2+500)/N; N= N(5/N)(502/N)=2510/N=2510/10005  0.25  2 statistic (CHI) Term = jaguar Term  jaguar Class = auto2500 Class  auto 39500

15 Under the null hypothesis: (jaguar and auto independent): How many co-occurrences of jaguar and auto do we expect?  2 statistic (CHI) Term = jaguar Term  jaguar Class = auto2(0.25)500 Class  auto expected: f e observed: f o

16 Under the null hypothesis: (jaguar and auto – independent): How many co-occurrences of jaguar and auto do we expect?  2 statistic (CHI) Term = jaguar Term  jaguar Class = auto2(0.25)500(502) Class  auto 3(4.75)9500(9498) expected: f e observed: f o

17  2 is interested in (f o – f e ) 2 /f e summed over all table entries: The null hypothesis is rejected with confidence.999, since 12.9 > (the value for.999 confidence).  2 statistic (CHI) Term = jaguar Term  jaguar Class = auto2(0.25)500(502) Class  auto 3(4.75)9500(9498) expected: f e observed: f o

18 There is a simpler formula for  2 :  2 statistic (CHI) N = A + B + C + D A = #(t,c)C = #(¬t,c) B = #(t,¬c)D = #(¬t, ¬c)

19 Finding translation equivalents

20 Binomial distribution k—number of “successes” n—number of trails x—probability of success

21 Log likelihood ratio test

22 Log likelihood ratio test