Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests


Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests: Soccer Goals in European Premier Leagues, 2004

Statistical Testing Principles
Goal: Test a hypothesis concerning parameter value(s) in a larger population (or nature), based on observed sample data.
Data: Identified with respect to a (possibly hypothesized) probability distribution that is indexed by one or more unknown parameters.
Notation:

Example: English League, Total Goals per Match
Suppose we wish to test whether the mean number of goals per game (in a hypothetically infinite population of games) is equal to 3. Note: all games are of equal length (no overtime in regular-season games).
Data: Y = total number of goals in a randomly selected game
Distribution: Assume Poisson with parameter θ
Null hypothesis: H0: θ = 3
Alternative hypothesis: HA: θ ≠ 3
Joint probability mass function:
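
Under the stated Poisson assumption, the joint mass function of n independent games takes the standard form below. This is a sketch in assumed notation (n for the number of games, y_i for the goals in game i, \theta for the Poisson mean), not necessarily the notation of the original slide.

```latex
f(y_1,\dots,y_n \mid \theta)
  = \prod_{i=1}^{n} \frac{e^{-\theta}\,\theta^{y_i}}{y_i!}
  = \frac{e^{-n\theta}\,\theta^{\sum_{i=1}^{n} y_i}}{\prod_{i=1}^{n} y_i!}
```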

Likelihood Function
Another term for the joint probability density/mass function. Common notation: L(θ), L(θ, y), or L(θ | y).
Considered as a function of both the (observed) data and the (unknown) parameter values.
Used in estimating and testing parameter value(s).
The goal is to choose the parameter value(s) that maximize the likelihood function given the observed data.
We typically work with the log of the likelihood, as it is often easier to differentiate when solving for maximum likelihood (ML) estimators for many families of probability distributions.

ML Estimation of Poisson Mean
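
For a Poisson sample, the ML estimate follows from the usual steps: take the log of the likelihood, differentiate, and set the derivative to zero. The derivation below is a standard one in assumed notation (\ell for the log-likelihood, \bar{y} for the sample mean, \theta for the Poisson mean) and may differ in layout from the original slide.

```latex
\begin{aligned}
\ell(\theta) &= \ln L(\theta)
  = -n\theta + \Big(\sum_{i=1}^{n} y_i\Big)\ln\theta - \sum_{i=1}^{n}\ln(y_i!) \\
\frac{d\ell}{d\theta} &= -n + \frac{\sum_{i=1}^{n} y_i}{\theta} = 0
  \quad\Longrightarrow\quad \hat{\theta} = \frac{1}{n}\sum_{i=1}^{n} y_i = \bar{y} \\
\frac{d^{2}\ell}{d\theta^{2}} &= -\frac{\sum_{i=1}^{n} y_i}{\theta^{2}} < 0
  \quad \text{(so } \hat{\theta} \text{ is a maximum whenever } \textstyle\sum_i y_i > 0\text{)}
\end{aligned}
```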

Total Goals Data

Likelihood Ratio Test
Identify the parameter space: Ω = {θ: θ > 0}.
Identify the parameter space under H0: Ω0 = {θ: θ = θ0}.
Evaluate the maximum log-likelihood over Ω.
Evaluate the log-likelihood under H0.
Any terms not involving the parameter can be ignored, since they cancel in the difference.
Take -2 times the difference (H0 minus maximum).
Under the null hypothesis (and in large samples), the statistic is approximately chi-square with 1 degree of freedom (the number of constraints under H0).

Soccer Goals Example
We have strong evidence to conclude that the "true" mean total number of goals per game is below 3.
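
The likelihood ratio calculation can be sketched numerically. The example assumes that the English league data are the 380 games and 975 total goals listed for England in the five-country example; the function name `poisson_loglik_kernel` is purely illustrative.

```python
import math

# Assumed inputs: England's 2004 figures from the five-country example
n = 380        # number of games
total = 975    # total goals over those games
theta0 = 3.0   # mean goals per game under H0

def poisson_loglik_kernel(theta, n, total):
    """Poisson log-likelihood without the -sum(ln(y_i!)) term, which cancels in the ratio."""
    return -n * theta + total * math.log(theta)

theta_hat = total / n  # ML estimate: the sample mean

# -2 * (log-likelihood under H0 minus maximized log-likelihood)
lr_stat = -2.0 * (poisson_loglik_kernel(theta0, n, total)
                  - poisson_loglik_kernel(theta_hat, n, total))

# Upper-tail probability of a chi-square(1) variable: P(Z^2 > x) = erfc(sqrt(x / 2))
p_value = math.erfc(math.sqrt(lr_stat / 2.0))

print(f"theta_hat = {theta_hat:.4f}, LR = {lr_stat:.2f}, p = {p_value:.2e}")
```

With these inputs the sample mean is about 2.57 goals per game and the statistic is far above the 5% chi-square(1) critical value of 3.84, in line with the conclusion stated above.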

Wald Test - I
By Central Limit Theorem arguments, many estimators have sampling distributions that are approximately normal in large samples.
Then, given an estimate of the variance of the estimator, we obtain a chi-square statistic by squaring the distance between the ML estimate and the value under H0 and dividing by the estimated variance.
The estimated variance can be obtained from the second derivative of the log-likelihood: it is the negative reciprocal of that derivative, evaluated at the ML estimate.

Wald Test - II
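
For the Poisson mean, the Wald statistic can be written out as follows. This is a standard derivation in assumed notation (W for the statistic, \hat{V} for the estimated variance, \ell for the log-likelihood, \theta_0 for the value under H0), offered as a sketch and not as the slide's exact content.

```latex
\begin{aligned}
\hat{V}(\hat{\theta}) &= \left[-\frac{d^{2}\ell}{d\theta^{2}}\Big|_{\theta=\hat{\theta}}\right]^{-1}
  = \frac{\hat{\theta}^{2}}{\sum_{i=1}^{n} y_i} = \frac{\hat{\theta}}{n} \\
W &= \frac{(\hat{\theta}-\theta_0)^{2}}{\hat{V}(\hat{\theta})}
  = \frac{n\,(\bar{y}-\theta_0)^{2}}{\bar{y}}
  \;\overset{\text{approx.}}{\sim}\; \chi^{2}_{1} \quad \text{under } H_0
\end{aligned}
```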

Lagrange Multiplier (Score) Test
Obtain the first derivative of the log-likelihood, evaluated at the parameter value under H0. (This is the slope of the log-likelihood at θ0 and is called the score.)
Multiply the square of the score by the variance of the ML estimate, evaluated at θ0. This variance is the inverse of the variance of the score.
The chi-square test statistic is then computed as follows:
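
For the Poisson case, the score statistic works out as below. The symbols s(\theta_0) for the score and LM for the statistic are assumed notation, and the expression is a standard reconstruction under the Poisson model.

```latex
\begin{aligned}
s(\theta_0) &= \frac{d\ell}{d\theta}\Big|_{\theta=\theta_0}
  = -n + \frac{\sum_{i=1}^{n} y_i}{\theta_0}, \qquad
V(\hat{\theta})\Big|_{\theta=\theta_0} = \frac{\theta_0}{n} \\
LM &= s(\theta_0)^{2}\,\frac{\theta_0}{n}
  = \frac{n\,(\bar{y}-\theta_0)^{2}}{\theta_0}
  \;\overset{\text{approx.}}{\sim}\; \chi^{2}_{1} \quad \text{under } H_0
\end{aligned}
```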

Soccer Goals Example
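
A minimal numerical sketch of the Wald and score statistics for H0: mean = 3, assuming the English league data are the 380 games and 975 total goals listed for England in the five-country example. Variable names are illustrative.

```python
import math

# Assumed inputs: England's games, total goals, and the H0 value
n, total, theta0 = 380, 975, 3.0
theta_hat = total / n  # ML estimate (sample mean)

# Wald: squared distance divided by the variance estimated at theta_hat (theta_hat / n)
wald = (theta_hat - theta0) ** 2 / (theta_hat / n)

# Score: slope of the log-likelihood at theta0, squared, times the variance at theta0
score = -n + total / theta0
lm = score ** 2 * (theta0 / n)

def chi2_1_pvalue(x):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))

print(f"Wald = {wald:.2f} (p = {chi2_1_pvalue(wald):.2e})")
print(f"LM   = {lm:.2f} (p = {chi2_1_pvalue(lm):.2e})")
```

The two statistics differ only in where the variance is evaluated (at the ML estimate for Wald, at the null value for the score test), which is why they give different values from the same data.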

Generalization to Tests of Multiple Parameters
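
In the multi-parameter case, the three statistics have well-known matrix forms. The sketch below uses assumed notation: \hat{\theta} is the unrestricted ML estimate, \tilde{\theta} the ML estimate under H0, s(\theta) the score vector, I(\theta) the information matrix, and R\theta = r a set of q linear restrictions. The slide's own presentation may differ.

```latex
\begin{aligned}
LR &= -2\left[\ell(\tilde{\theta}) - \ell(\hat{\theta})\right] \\
W  &= (R\hat{\theta} - r)^{\top}\left[R\,I(\hat{\theta})^{-1}R^{\top}\right]^{-1}(R\hat{\theta} - r) \\
LM &= s(\tilde{\theta})^{\top}\,I(\tilde{\theta})^{-1}\,s(\tilde{\theta})
\end{aligned}
\qquad \text{each approximately } \chi^{2}_{q} \text{ under } H_0 \text{ in large samples}
```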

Soccer Goals Example
Premier league games in 2004 for k = 5 European countries (n_i = number of games, Y_i• = total goals):
England: n1 = 380, Y1• = 975
France: n2 = 380, Y2• = 826
Germany: n3 = 306, Y3• = 890
Italy: n4 = 380, Y4• = 960
Spain: n5 = 380, Y5• = 980

Testing Equality of Mean Goals Among Countries - I

Testing Equality of Mean Goals Among Countries - II
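
For k independent Poisson samples, the test of equal means can be set up as sketched below. The notation is assumed (\hat{\theta}_i for each country's ML estimate, \tilde{\theta} for the common estimate under H0), and the formulas follow from the Poisson log-likelihood and are not transcribed from the slides.

```latex
\begin{aligned}
H_0 &: \theta_1 = \theta_2 = \dots = \theta_k
  \qquad H_A: \text{not all } \theta_i \text{ are equal} \\
\hat{\theta}_i &= \frac{Y_{i\bullet}}{n_i}, \qquad
\tilde{\theta} = \frac{\sum_{i=1}^{k} Y_{i\bullet}}{\sum_{i=1}^{k} n_i} \\
LR &= -2\left[\ell(\tilde{\theta},\dots,\tilde{\theta}) - \ell(\hat{\theta}_1,\dots,\hat{\theta}_k)\right]
  = 2\sum_{i=1}^{k} Y_{i\bullet}\,\ln\!\frac{\hat{\theta}_i}{\tilde{\theta}}
  \;\overset{\text{approx.}}{\sim}\; \chi^{2}_{k-1}
\end{aligned}
```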

Likelihood Ratio Test
Evidence that the true population means differ (in particular, France is lower and Germany is higher than the others).
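
The likelihood ratio statistic for equal means can be computed from the league totals alone. This is a minimal sketch using the games and goals listed for the five countries; the dictionary layout and variable names are illustrative, and the closed-form p-value is specific to 4 degrees of freedom.

```python
import math

# (games, total goals) per league, as listed in the five-country example
leagues = {
    "England": (380, 975),
    "France": (380, 826),
    "Germany": (306, 890),
    "Italy": (380, 960),
    "Spain": (380, 980),
}

total_games = sum(n for n, _ in leagues.values())
total_goals = sum(y for _, y in leagues.values())
theta_pooled = total_goals / total_games  # common mean estimated under H0

# Per-league ML estimates (goals per game)
means = {name: y / n for name, (n, y) in leagues.items()}

# LR = 2 * sum_i Y_i * ln(theta_hat_i / theta_pooled); the linear terms cancel
lr_stat = 2.0 * sum(y * math.log((y / n) / theta_pooled) for n, y in leagues.values())
df = len(leagues) - 1

# Chi-square upper tail for df = 4 only: exp(-x/2) * (1 + x/2)
p_value = math.exp(-lr_stat / 2.0) * (1.0 + lr_stat / 2.0)

for name, m in means.items():
    print(f"{name:8s} mean = {m:.3f}")
print(f"pooled mean = {theta_pooled:.3f}, LR = {lr_stat:.2f}, df = {df}, p = {p_value:.2e}")
```

The per-league means show France lowest and Germany highest, matching the direction of the differences noted above.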

Wald Test

Lagrange Multiplier (Score) Test
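
The Wald and score statistics for equality of the five means can be sketched in scalar form. The Wald version below uses the inverse-variance-weighted form, which is one standard way to express the Wald test of equal means for independent estimates; it is an assumption that the slides used an equivalent form. The score version evaluates everything at the pooled mean.

```python
import math

# (games, total goals) per league, as listed in the five-country example
data = [(380, 975), (380, 826), (306, 890), (380, 960), (380, 980)]

means = [y / n for n, y in data]
pooled = sum(y for _, y in data) / sum(n for n, _ in data)  # estimate under H0

# Wald: weights are inverse estimated variances, 1 / (theta_hat_i / n_i)
weights = [n / m for (n, _), m in zip(data, means)]
weighted_mean = sum(w * m for w, m in zip(weights, means)) / sum(weights)
wald = sum(w * (m - weighted_mean) ** 2 for w, m in zip(weights, means))

# Score: variances evaluated under H0, i.e. pooled / n_i for every league
lm = sum(n * (m - pooled) ** 2 for (n, _), m in zip(data, means)) / pooled

df = len(data) - 1

def chi2_4_pvalue(x):
    """Upper-tail probability of a chi-square variable with 4 degrees of freedom."""
    return math.exp(-x / 2.0) * (1.0 + x / 2.0)

print(f"Wald = {wald:.2f}, LM = {lm:.2f}, df = {df}")
print(f"p-values: Wald {chi2_4_pvalue(wald):.2e}, LM {chi2_4_pvalue(lm):.2e}")
```

The score statistic in this setting has the same form as a Pearson chi-square comparing observed league totals with those expected under a common mean.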

Testing Goodness of Fit to Poisson Distribution
All estimation and testing so far has assumed that the number of goals follows a Poisson distribution.
To test whether that assumption is reasonable, we compare the observed distribution of goals with what we would expect under the Poisson model.
We can check whether the observed mean and variance are similar (under the Poisson model they are equal).
We can also obtain a chi-square statistic by summing (observed # - expected #)^2 / expected # over the range of goals; under the hypothesis that the model fits, this is approximately chi-square with (# in range) - 1 degrees of freedom.
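
The chi-square comparison can be sketched as a small function. The function name `poisson_gof`, its arguments, and the convention that the last category pools all larger goal counts are assumptions made for illustration; the per-league goal distributions are not reproduced here, so no data are supplied. The default degrees of freedom follow the rule stated above, and the optional `n_estimated` argument allows the common adjustment of subtracting one degree of freedom per parameter estimated from the same data.

```python
import math

def poisson_gof(counts, theta, n_estimated=0):
    """Chi-square goodness-of-fit statistic for a Poisson model.

    counts[j] is the number of games with j goals; the last entry is
    assumed to pool all games with at least len(counts) - 1 goals.
    Returns (statistic, degrees of freedom).
    """
    n_games = sum(counts)
    last = len(counts) - 1

    # Poisson probabilities for 0, 1, ..., last - 1 goals
    probs = [math.exp(-theta) * theta ** j / math.factorial(j) for j in range(last)]
    # Remaining probability goes to the pooled top category
    probs.append(1.0 - sum(probs))

    expected = [n_games * p for p in probs]
    stat = sum((o - e) ** 2 / e for o, e in zip(counts, expected))
    df = len(counts) - 1 - n_estimated
    return stat, df
```

A call such as `poisson_gof(observed_counts, theta_hat)` would return the statistic to compare against a chi-square critical value with the returned degrees of freedom; the same observed counts also give the sample mean and variance for the informal check described above.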

Distributions of Goals
All leagues except France appear to be well described by the Poisson distribution, especially England, Germany, and Spain.