Tests Jean-Yves Le Boudec. Contents 1.The Neyman Pearson framework 2.Likelihood Ratio Tests 3.ANOVA 4.Asymptotic Results 5.Other Tests 1.

Slides:



Advertisements
Similar presentations
1 Chapter 20: Statistical Tests for Ordinal Data.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Happiness comes not from material wealth but less desire. 1.
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Economics 105: Statistics Go over GH 11 & 12 GH 13 & 14 due Thursday.
Chi Square Tests Chapter 17.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
M ATH S TAT T RIVIAL P URSUIT (S ORT OF )F OR R EVIEW ( MATH 30)
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Model Fitting Jean-Yves Le Boudec 0. Contents 1 Virus Infection Data We would like to capture the growth of infected hosts (explanatory model) An exponential.
Chapter Seventeen HYPOTHESIS TESTING
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
Chapter 7 Analysis of ariance Variation Inherent or Natural Variation Due to the cumulative effect of many small unavoidable causes. Also referred to.
Chapter 2 Simple Comparative Experiments
Tests Jean-Yves Le Boudec. Contents 1.The Neyman Pearson framework 2.Likelihood Ratio Tests 3.ANOVA 4.Asymptotic Results 5.Other Tests 1.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Chapter 11: Inference for Distributions
Inferences About Process Quality
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
An Introduction to Statistical Inference Mike Wasikowski June 12, 2008.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Non-parametric statistics
Nonparametrics and goodness of fit Petter Mostad
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Review I volunteer in my son’s 2nd grade class on library day. Each kid gets to check out one book. Here are the types of books they picked this week:
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
CHAPTER 14: Nonparametric Methods
Chapter 13 – Difference Between Two Parameters Math 22 Introductory Statistics.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics seventh edition, by Ronald M. Weiers Presentation by Priscilla Chaffe-Stengel.
Nonparametric Tests IPS Chapter 15 © 2009 W.H. Freeman and Company.
May 2004 Prof. Himayatullah 1 Basic Econometrics Chapter 5: TWO-VARIABLE REGRESSION: Interval Estimation and Hypothesis Testing.
Nonparametric Statistics
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Chapter 15 – Analysis of Variance Math 22 Introductory Statistics.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
Academic Research Academic Research Dr Kishor Bhanushali M
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 7.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 3.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 6.
ENGR 610 Applied Statistics Fall Week 7 Marshall University CITE Jack Smith.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Example The strength of concrete depends, to some extent on the method used for drying it. Two different drying methods were tested independently on specimens.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Chapter 11 Created by Bethany Stubbe and Stephan Kogitz.
Part Four ANALYSIS AND PRESENTATION OF DATA
Research Methodology Lecture No :25 (Hypothesis Testing – Difference in Groups)
Non-Parametric Tests 12/1.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/6.
Non-Parametric Tests.
Data Analysis and Interpretation
SA3202 Statistical Methods for Social Sciences
Simple Linear Regression
Chapter 24 Comparing Two Means.
Introductory Statistics
Presentation transcript:

Tests Jean-Yves Le Boudec

Contents 1.The Neyman Pearson framework 2.Likelihood Ratio Tests 3.ANOVA 4.Asymptotic Results 5.Other Tests 1

Tests Tests are used to give a binary answer to hypotheses of a statistical nature Ex: is A better than B? Ex: does this data come from a normal distribution ? Ex: does factor n influence the result ? 2

Example: Non Paired Data Is red better than blue ? For data set (a) answer is clear (by inspection of confidence interval) no test required 3

Is this data normal ? 4

5.1 The Neyman-Pearson Framework 5

Example: Non Paired Data; Is Red better than Blue ? 6

Example: Non Paired Data; Is Red better than Blue ? ANOVA Model 7

Critical Region, Size and Power 8

Example : Paired Data Is A better than B ? 9

10

Power 11

12 Grey Zone power

p-value of a test 13

p-value of a test 14

15

Tests are just tests 16

17 Grey Zone power

18

2. Likelihood Ratio Test A special case of Neyman-Pearson A Systematic Method to define tests, of general applicability 19

20

Example : Paired Data Is A better than B ? 21

22

A Classical Test: Student Test The model : The hypotheses : 23

24

Example : Paired Data Is A better than B ? 25

Here it is the same as a Conf. Interval 26

Test versus Confidence Intervals If you can have a confidence interval, use it instead of a test 27

The “Simple Goodness of Fit” Test Model Hypotheses 28

1. compute likelihood ratio statistic 29

2. compute p-value 30

Mendel’s Peas P= 0.92 ± 0.05 => Accept H 0 31

3 ANOVA Often used as “Magic Tool” Important to understand the underlying assumptions Model Data comes from iid normal sample with unknown means and same variance Hypotheses 32

33

34

The ANOVA Theorem We build a likelihood ratio statistic test The assumption that data is normal and variance is the same allows an explicit computation it becomes a least square problem = a geometrical problem we need to compute orthogonal projections on M and M 0 35

The ANOVA Theorem 36

Geometrical Interpretation Accept H 0 if SS2 is small The theorem tells us what “small” means in a statistical sense 37

38

ANOVA Output: Network Monitoring 39

The Fisher-F distribution 40

41

Compare Test to Confidence Intervals For non paired data, we cannot simply compute the difference However CI is sufficient for parameter set 1 Tests disambiguate parameter sets 2 and 3 42

Test the assumptions of the test… Need to test the assumptions Normal In each group: qqplot… Same variance 43

44

4 Asymptotic Results 45 2 x Likelihood ratio statistic

46

The chi-square distribution 47

Asymptotic Result Applicable when central limit theorem holds If applicable, radically simple Compute likelihood ratio statistic Inspect and find the order p (nb of dimensions that H1 adds to H0) This is equivalent to 2 optimization subproblems lrs = = max likelihood under H1 - max likelihood under H0 The p-value is 48

Composite Goodness of Fit Test We want to test the hypothesis that an iid sample has a distribution that comes from a given parametric family 49

Apply the Generic Method Compute likelihood ratio statistic Compute p-value Either use MC or the large n asymptotic 50

51

Is it normal ? 52

53

54

Mendel’s Peas P= 0.92 ± 0.05 => Accept H 0 55

Test of Independence Model Hypotheses 56

Apply the generic method 57

58

5 Other Tests Simple Goodness of Fit Model: iid data Hypotheses: H 0 common distrib has cdf F() H 1 common distrib is anything Kolmogorov-Smirnov: under H 0, the distribution of is independent of F() 59

60

Anderson-Darling An alternative to K-S, less sensitive to “outliers” 61

62

63

Jarque Bera test of normality (Chapter 4) Based on Kurtosis and Skewness Should be 0 for normal distribution 64

65

Robust Tests Median Test Model : iid sample Hypotheses 66

Median Test 67

Wilcoxon Signed Rank Test 68

Wilcoxon Rank Sum Test Model: X i and Y j independent samples, each is iid Hypotheses: H 0 both have same distribution H 1 the distributions differ by a location shift 69

Wilcoxon Rank Sum Test 70

Turning Point 71 the indices of X (1), X (2), X (3) form a permutation, uniform among 6 values