Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Tests of Hypotheses Based on a Single Sample
Tests of Static Asset Pricing Models
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Statistical Estimation and Sampling Distributions
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
1 Hypothesis Testing. 2  Greene: App. C:  Statistical Test: Divide parameter space (Ω) into two disjoint sets: Ω 0, Ω 1  Ω 0 ∩ Ω 1 =  and Ω.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
458 Fitting models to data – II (The Basics of Maximum Likelihood Estimation) Fish 458, Lecture 9.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Basics of Statistical Estimation. Learning Probabilities: Classical Approach Simplest case: Flipping a thumbtack tails heads True probability  is unknown.
Today Today: Chapter 9 Assignment: 9.2, 9.4, 9.42 (Geo(p)=“geometric distribution”), 9-R9(a,b) Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Useful Statistical Distributions for Econometrics Econometrics is usually concerned with the estimation of equations of the form: The normal distribution.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Inference about a Mean Part II
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Chapter 7 Inferences Regarding Population Variances.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Random Sampling, Point Estimation and Maximum Likelihood.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Confidence intervals and hypothesis testing Petter Mostad
Tests of Hypotheses Involving Two Populations Tests for the Differences of Means Comparison of two means: and The method of comparison depends on.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Lecture 12: Linkage Analysis V Date: 10/03/02  Least squares  An EM algorithm  Simulated distribution  Marker coverage and density.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Confidence Interval & Unbiased Estimator Review and Foreword.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Brief Review Probability and Statistics. Probability distributions Continuous distributions.
© Copyright McGraw-Hill 2004
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Chapter 4. Inference about Process Quality
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Ch3: Model Building through Regression
Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests
Chapter 9 Hypothesis Testing.
Discrete Event Simulation - 4
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter 10 Analyzing the Association Between Categorical Variables
Review of Statistical Inference
Hypothesis Testing: The Difference Between Two Population Means
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Presentation transcript:

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests Soccer Goals in European Premier Leagues - 2004

Statistical Testing Principles Goal: Test a Hypothesis concerning parameter value(s) in a larger population (or nature), based on observed sample data Data – Identified with respect to a (possibly hypothesized) probability distribution that is indexed by one or more unknown parameters Notation:

Example – English League – Total Goals/Match Suppose we wish to test whether the mean number of goals (in a hypothetically infinite population) of games is equal to 3. Note: all games of equal length (no overtime in regular season games) Data: Y=Total # of goals in a randomly selected game Distribution: Assume Poisson with parameter q Null Hypothesis: H0: q = 3 Alternative Hypothesis: HA: q ≠ 3 Joint Probability Density Function:

Likelihood Function Another term for joint probability density/mass function. Common Notation: L(q) or L(q,y) or L(q|y) Considered as a function of both the (observed) data and the (unknown) parameter values Used in estimation and testing parameter value(s) Goal is to choose parameter value(s) that maximize likelihood function given the observed data. Typically work with the log of the likelihood, as it is often easier to differentiate to solve for maximum likelihood (ML) estimators for many families of probability distributions

ML Estimation of Poisson Mean

Total Goals Data

Likelihood Ratio Test Identify the parameter space: W = {q:q>0} Identify the parameter space under H0: W0 = {q:q=q0} Evaluate the maximum log-Likelihood Evaluate the log-Likelihood under H0 Any terms not involving parameter can be ignored Take -2 times difference (H0 – maximum) Under null hypothesis (and large samples), statistic is approximately chi-square with 1 degree of freedom (number of constraints under H0)

Soccer Goals Example We have strong evidence to conclude the “true” mean total number of goals is below 3.

Wald Test - I By Central Limit Theorem arguments, many estimators have sampling distributions that are approximately normal in large samples Then, if we have an estimate of the variance of the estimator, we can obtain a chi-square statistic by taking the square of the distance between the ML estimate and the value under H0 divided by the estimated variance The estimated variance can be obtained from the second derivative of the log-Likelihood

Wald Test - II

Lagrange Multiplier (Score) Test Obtain the first derivative of the log-Likelihood evaluated at the parameter under H0 (This is the slope of the log-Likelihood, evaluated at q0 and is called the score) Multiply the square of the score by the variance of the ML estimate, evaluated at q0 . This is the inverse of the variance of the score. Then chi-square test statistic is computed as follows:

Soccer Goals Example

Generalization to Tests of Multiple Parameters

Soccer Goals Example Premier League Games in 2004 for k=5 European Countries: England n1 = 380, Y1• = 975 France n2 = 380, Y2• = 826 Germany n3 = 306, Y3• = 890 Italy n4 = 380, Y4• = 960 Spain n5 = 380, Y5• = 980

Testing Equality of Mean Goals Among Countries - I

Testing Equality of Mean Goals Among Countries - II

Likelihood Ratio Test Evidence that the true population means differ (in particular: France lower, Germany higher than the others)

Wald Test

Lagrange Multiplier (Score) Test

Testing Goodness of Fit to Poisson Distribution All estimation and testing has assumed that number of goals follow Poisson distributions To test whether that assumption is reasonable, we compare the observed distributions of goals with what we would expect under the Poisson model We can check whether the observed mean and variance are similar (under Poisson model they are equal) We can also obtain a chi-square statistic by summing over range of goals: (observed#-expected#)2/expected# which under hypothesis of model fits is approximately chi-square with (# in range)-1 degrees of freedom

Distributions of Goals All leagues, except France, appear to be well described by the Poisson distribution. Especially England, Germany, and Spain