8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Frequent problem: Decision making based on statistical information.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Introduction to Hypothesis Testing
Goodness of fit, confidence intervals and limits Jorge Andre Swieca School Campos do Jordão, January,2003 fourth lecture.
There are two statistical tests for mean: 1) z test – Used for large samples (n ≥ 30) 1) t test – Used for small samples (n < 30)
Computing and Statistical Data Analysis / Stat 4
Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
Probability and Statistics Basic concepts II (from a physicist point of view) Benoit CLEMENT – Université J. Fourier / LPSC
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
14 Sept 2004 D.Dedovich Tau041 Measurement of Tau hadronic branching ratios in DELPHI experiment at LEP Dima Dedovich (Dubna) DELPHI Collaboration E.Phys.J.
Likelihood ratio tests
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
8. Hypotheses 8.4 Two more things K. Desch – Statistical methods of data analysis SS10 Inclusion of systematic errors LHR methods needs a prediction (from.
Presenting: Lihu Berman
BCOR 1020 Business Statistics
G. Cowan RHUL Physics Comment on use of LR for limits page 1 Comment on definition of likelihood ratio for limits ATLAS Statistics Forum CERN, 2 September,
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
Inference about a Mean Part II
Chapter 2 Simple Comparative Experiments
Inferences About Process Quality
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Hypothesis Testing.
Jump to first page HYPOTHESIS TESTING The use of sample data to make a decision either to accept or to reject a statement about a parameter value or about.
Chapter 8 Introduction to Hypothesis Testing
Claims about a Population Mean when σ is Known Objective: test a claim.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Statistical Decision Theory
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
STA Statistical Inference
Chapter 7 Hypothesis testing. §7.1 The basic concepts of hypothesis testing  1 An example Example 7.1 We selected 20 newborns randomly from a region.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
IE241: Introduction to Hypothesis Testing. We said before that estimation of parameters was one of the two major areas of statistics. Now let’s turn to.
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
G. Cowan CLASHEP 2011 / Topics in Statistical Data Analysis / Lecture 21 Topics in Statistical Data Analysis for HEP Lecture 2: Statistical Tests CERN.
A statistical test for point source searches - Aart Heijboer - AWG - Cern june 2002 A statistical test for point source searches Aart Heijboer contents:
G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1 Lecture 2 1 Probability Definition, Bayes’ theorem, probability densities and their properties,
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
G. Cowan S0S 2010 / Statistical Tests and Limits 1 Statistical Tests and Limits Lecture 1: general formalism IN2P3 School of Statistics Autrans, France.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
G. Cowan TAE 2010 / Statistics for HEP / Lecture 11 Statistical Methods for HEP Lecture 1: Introduction Taller de Altas Energías 2010 Universitat de Barcelona.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
2005 Unbinned Point Source Analysis Update Jim Braun IceCube Fall 2006 Collaboration Meeting.
1 Introduction to Statistics − Day 2 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
Calice Meeting Argonne Muon identification with the hadron calorimeter Nicola D’Ascenzo.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
Example The strength of concrete depends, to some extent on the method used for drying it. Two different drying methods were tested independently on specimens.
G. Cowan Aachen 2014 / Statistics for Particle Physics, Lecture 21 Statistical Methods for Particle Physics Lecture 2: statistical tests, multivariate.
G. Cowan Statistical methods for HEP / Freiburg June 2011 / Lecture 1 1 Statistical Methods for Discovery and Limits in HEP Experiments Day 1: Introduction,
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
G. Cowan CERN Academic Training 2010 / Statistics for the LHC / Lecture 21 Statistics for the LHC Lecture 2: Discovery Academic Training Lectures CERN,
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
Lecture Nine - Twelve Tests of Significance.
Chapter 2 Simple Comparative Experiments
Discrete Event Simulation - 4
Presentation transcript:

8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Frequent problem: Decision making based on statistical information Examples: a) Distinguishing different particle species in detector, e. g. e – π separation Different signals in detector material : specific energy loss in tracking detector E ECAL / E HCAL ratio shower shapes in calorimeter … Observables P.d.f.s of observables are often different for different hypotheses but they overlap. In this case, the aim is: large efficiency, small background H1H1 H0H0 Obs 1 Obs 2

b) Search for new particles statistical separation of similar events: background and new particles (signal) Concepts Problem: How well do observed data agree with predicted probabilities (hypotheses)? The hypothesis under consideration is traditionally called null hypothesis H 0. Often comparison with alternative hypotheses H 1, H 2,… n measured values Each hypothesis characterised through p.d.f. could be : 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 - n measurements of the same random variable (n ”events”) - n different observables of an event (E, p, …) - Combination of both - Same measurement from n different experiments

To test agreement between data and given hypothesis, one constructs a function of (x 1,..., x n ) called test statistic (usually is scalar function) For each of the hypotheses, there is a p.d.f. for the statistic t: g(t|H 0 ), g(t|H 1 ), etc 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Decision to accept or reject hypothesis H 0 by defining: critical region (reject H 0 ) acceptance region (accept H 0 ) e. g. by defining a cut value t cut : t < t cut “accepted” t > t cut “rejected”

8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Selecting t cut (decision boundary), one defines significance level of the test: meaning that there is the probability α to reject the hypothesis H 0, if H 0 is true – called an “error of the first kind” Probability to accept H 0 if an alternative hypothesis H 1 is true is given by : = “error of the second kind” 1 – β is called the power of the test to discriminate against the alternative hypothesis H 1

8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Example 1: Particle identification test statistic = measured specific energy loss (for fixed momentum) e. g. H 0 = e; H 1 = π Assumption: Only electrons and pions are in sample Task: Select a sample of electrons (“signal”) by requiring t < t cut Selected pions are background. Probabilities to select electrons and pions (“efficiencies”) are : How to choose t cut ? large t cut : large signal efficiency, much background small t cut : small signal efficiency, little background (= large purity)

8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Number of accepted particles: (can only be used if ) Probability that a particle with observed value of test statistic t is an electron: a e, a π = 1 – a e are prior probabilities for electrons and pions, respectively → must be known (for example from MC simulation) Purity :

Example 2: Counting experiment An experiment counts events of a certain type. Events comprise (on average) v B background and v S signal events The observed number of events is n Test statistic: Poisson probability null hypothesis : Example: v B = 5.6, n = Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 Question: Probability to observe 18 or more events if one expects on average 5.6 and H 0 is true

p-value is often expressed in terms of equivalent (Gaussian) std. deviations Convention: if p-value (CL B ) < “5σ” (5.7∙10 -7 ), then the background hypothesis is rejected (“discovery”) Note: - That does not mean that one detects a signal with 1-p=0.99… (no statement is made about H 1 ) - Requires exact understanding of expected background 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10

Choice of the critical region How does one choose a critical region of a statistical test in an optimal way? If one defines a significance level α for a null hypothesis, one wants the largest possible power 1 – β. In other words: For a fixed efficiency ε = 1 – α, we want the largest possible purity For a one dimensional test statistic: 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 trivial, if is monotonic function In that case the value of t cut fixes 1 – β

More difficult: a) more complicated p.d.f. of test statistic a simple cut might not be optimal b) multidimensional test statistic 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10 WcWc t1t1 t2t2

→ 1 – β will be maximal, if one selects the region Ω c such that the ratio is maximised. Equivalently: Select acceptance region 1 – Ω c such that is minimal Neyman-Pearson lemma: The acceptance region giving the highest power (and signal purity) for a given significance level α (or selection efficiency ε = 1 – α) is the region in t-space such that: where t c is determined by the desired efficiency The quantity is called likelihood ratio 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10

→ reduction of n-dimensional test statistic to one-dim. statistic r Important property of LR: n independent measurements with test statistics t 1,…,t n : and → Example: Counting experiment 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10

Combination of many counting experiments with different purities: or = weighted sum of events 8. Statistical tests 8.1 Hypotheses K. Desch – Statistical methods of data analysis SS10