Presentation is loading. Please wait.

Presentation is loading. Please wait.

880.P20 Winter 2006 Richard Kass 1 Hypothesis Testing The goal of hypothesis testing is to set up a procedure(s) to allow us to decide if a model is acceptable.

Similar presentations


Presentation on theme: "880.P20 Winter 2006 Richard Kass 1 Hypothesis Testing The goal of hypothesis testing is to set up a procedure(s) to allow us to decide if a model is acceptable."— Presentation transcript:

1 880.P20 Winter 2006 Richard Kass 1 Hypothesis Testing The goal of hypothesis testing is to set up a procedure(s) to allow us to decide if a model is acceptable in light of our experimental observations. Example: A theory predicts that BR(Higgs     )= 2x10 -  and you measure (4  2) x10 - . The hypothesis we want to test is “are experiment and theory consistent?” Hypothesis testing does not have to compare theory and experiment. Example: CLEO measures the  c lifetime to be (180  7)fs while SELEX measures (198  7)fs. The hypothesis we want to test is “are the lifetime results from CLEO and SELEX consistent?” There are two types of hypotheses tests: parametric and non-parametric Parametric: compares the values of parameters (e.g. does the mass of proton = mass of electron ?) Non-parametric: deals with the shape of a distribution (e.g. is angular distribution consistent with being flat?) Consider the case of neutron decay. Suppose we have two theories that both predict the energy spectrum of the electron emitted in the decay of the neutron. Here a parametric test might not be able to distinguish between the two theories since both theories might predict the same average energy of the emitted electron. However a non-parametric test would be able to distinguish between the two theories as the shape of the energy spectrum differs for each theory.

2 880.P20 Winter 2006 Richard Kass 2 Hypothesis testing A procedure for using hypothesis testing: a)Measure (or calculate) something b)Find something that you wish to compare with your measurement (theory, experiment) c)Form a hypothesis (e.g. my measurement, x, is consistent with the PDG value) H 0 : x=x PDG H 0 is called the “null hypothesis” d)Calculate the confidence level that the hypothesis is true e)Accept or reject the hypothesis depending on some minimum acceptable confidence level Problems with the above procedure a)What is a confidence level ? b)How do you calculate a confidence level? c)What is an acceptable confidence level ? How would we test the hypothesis “the space shuttle is safe?” Is 1 explosion per 10 launches safe? Or 1 explosion per 1000 launches? A working definition of the confidence level: The probability of the event happening by chance. Example: Suppose we measure some quantity (X) and we know that it is described by a gaussian pdf with  =0 and  =1. What is the confidence level for measuring X  2 (i.e.  2  from the mean)? Thus we would say that the confidence level for measuring X  2 is 0.025 or 2.5% and we would expect to get a value of X  2 one out of 40 tries if the underlying pdf is gaussian.

3 880.P20 Winter 2006 Richard Kass 3 Hypothesis testing Cautions A few cautions about using confidence limits a) You must know the underlying pdf to calculate the limits. Example: suppose we have a scale of known accuracy (  = 10 gm ) and we weigh something to be 20 gm. Assuming a gaussian pdf we could calculate a 2.5% chance that our object weighs ≤ 0 gm?? We must make sure that the probability distribution is defined in the region where we are trying to extract information. b) What does a confidence level really mean? Classical vs Baysian viewpoints Two types of errors associated with hypothesis testing: Type I: reject H 0 when it is true Type II: accept H 0 when it is false f(x|  0 ) x xcxc area=prob. of a Type I error f(x|  1 ) x xcxc area=prob. of a Type II error For the case where H 0 :  0 and the alternative H 1 :  1 we can calculate the probability of a Type I or Type II error. Assume we reject H 0 if x>x c.

4 880.P20 Winter 2006 Richard Kass 4 Hypothesis Testing-Gaussian Variables We wish to test if a quantity we have measured (  =average of n measurements ) is consistent with a known mean (  0 ). H 0 : 0.9  0.2=0.33 If acceptable CL=5%, then we would reject H 0

5 880.P20 Winter 2006 Richard Kass 5 Hypothesis Testing-Gaussian Variables Do charge 2/3 quarks exist? Another variation of the quark problem H 0 : 0.9  0.2=0.67 If acceptable CL=5%, then we would accept H 0 If acceptable CL=5%, then we would reject H 0 H 0 : our charge measurements are consistent with q=1/3 quarks

6 880.P20 Winter 2006 Richard Kass 6 Hypothesis Testing-Gaussian Variables Tests when both means are unknown but come from a gaussian pdf: Example: Do two experiments agree with each other? CLEO measures the  c lifetime to be (180  7)fs while SELEX measures (198  7)fs. Thus 7% of the time we should expect the experiments to disagree at this level. n and m are the number of measurements for each mean H 0 : 180  7=198  7 If acceptable CL=5%, then we would accept H 0

7 880.P20 Winter 2006 Richard Kass 7 Hypothesis Testing-non-Gaussian Variables Assume we have a bunch of measurements (x i ’s) that are NOT from a gaussian pdf. (e.g. they could be from a poisson distribution) We can calculate a quantity that looks like a  2 which compares our data (d i ) with corresponding predictions (PR(x i )) from a pdf: K. Pearson showed that for a wide variety of pdfs (e.g. Poisson) the above test statistic becomes distributed according to a  2 pdf with n-1 dof. For example, the data are from a poisson distribution with mean= . If we have a total of N events in our sample then we predict: NP(m,  ) events for a given value of m (= 0, 1, 2, 3, 4….) At the same time, we have observed N(m) events. If NP(m,  ) >5 for all m then the following is approximately a  2 with n-1 dof.

8 880.P20 Winter 2006 Richard Kass 8 Hypothesis Testing-Pearson’s  2 Test The following is the numbers of neutrino events detected in 10 second intervals by the IMB experiment on 23 February 1987 around which time the supernova S1987a was first seen by experimenters: #events0123456789 #intervals1024860307581530001 Assuming the data is described by a Poisson distribution. Calculate the average & compute the average number events expected in an interval. = 0.777 if we include interval with 9 events We can calculate a  2 assuming the data are described by a Poisson distribution: The predicted number of intervals is given by: #events0123456789 #intervals predicted1064823318821620.30.030.0030.0003 Note: we use  2 =prediction for a Poisson There are 7 (= 9-2) DOF’s here and the probability of  2 /D.O.F. = 3.6/7 is high (≈80%), indicating a good fit to a Poisson Reject H 0 : “interval with 9 events is from Poisson with =0.77”


Download ppt "880.P20 Winter 2006 Richard Kass 1 Hypothesis Testing The goal of hypothesis testing is to set up a procedure(s) to allow us to decide if a model is acceptable."

Similar presentations


Ads by Google