School on Data Science in (Astro)particle Physics


Statistical Methods for Particle Physics
Problem 2: statistical test for discovery
www.lip.pt/data-science-2018/
www.pp.rhul.ac.uk/~cowan/stat/lip18/prob_sheet_2.pdf
School on Data Science in (Astro)particle Physics, LIP Lisboa, 12-14 March, 2018
Glen Cowan, RHUL Physics
G. Cowan LIP 2018 / Problem 2

Problem 2 – discovering a small signal
Materials at www.pp.rhul.ac.uk/~cowan/stat/invisibles/
The problem concerns searching for a signal such as Dark Matter by counting events. Suppose signal and background events are each characterized by a variable x (0 ≤ x ≤ 1), distributed according to pdfs f(x|s) and f(x|b) respectively. As a first step, test the background hypothesis for each event: if x < xcut, reject the background hypothesis.

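Testing the outcome of the full experiment
In the full experiment we will find n events in the signal region (x < xcut), and we can model this with a Poisson distribution:
$P(n \mid s, b) = \frac{(s+b)^n}{n!} e^{-(s+b)}$
Suppose the total expected numbers of events in 0 ≤ x ≤ 1 are btot = 100 and stot = 10; the expected numbers in x < xcut are s and b. Suppose that for a given xcut, b = 0.5 and we observe nobs = 3 events. Find the p-value of the hypothesis that s = 0:
$p = P(n \ge n_{\rm obs} \mid s = 0, b) = 1 - \sum_{n=0}^{n_{\rm obs}-1} \frac{b^n}{n!} e^{-b}$
and the corresponding significance:
$Z = \Phi^{-1}(1 - p)$
where $\Phi^{-1}$ is the quantile (inverse cumulative distribution) of the standard Gaussian.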
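As a minimal sketch of the calculation requested above, the following Python snippet evaluates the p-value as the Poisson probability to observe nobs or more events when only background (mean b) is present, and converts it to a significance with the standard Gaussian quantile. The function names are illustrative and not part of the course materials; only the Python standard library is assumed.

```python
import math
from statistics import NormalDist


def poisson_p_value(n_obs, b):
    """Return P(n >= n_obs) for a Poisson variable with mean b (s = 0)."""
    # Sum the probabilities for n = 0 .. n_obs - 1 and take the complement.
    prob_below = sum(math.exp(-b) * b**n / math.factorial(n) for n in range(n_obs))
    return 1.0 - prob_below


def significance(p):
    """Convert a p-value to a one-sided Gaussian significance Z."""
    return NormalDist().inv_cdf(1.0 - p)


# Numbers from the problem statement: b = 0.5 expected, nobs = 3 observed.
p = poisson_p_value(3, 0.5)
Z = significance(p)
print(f"p-value = {p:.4f}, Z = {Z:.2f}")
```

For very small p-values, 1 - p loses precision in floating point, so a production analysis would typically use a dedicated survival-function routine instead.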

Experimental sensitivity
To characterize the experimental sensitivity we can give the median significance of a test of s = 0, assuming s and b are both present. For s « b this can be approximated by
$\mathrm{med}[Z \mid s+b] \approx s/\sqrt{b}$
A better approximation is:
$\mathrm{med}[Z \mid s+b] \approx \sqrt{2\left((s+b)\ln(1 + s/b) - s\right)}$
Try this for xcut = 0.1 and, if you have time, write a small program to maximize the median Z with respect to xcut. We will discuss these formulae in later lectures, including methods for treating uncertainty in b.
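The small program suggested above might look like the following sketch. It uses the commonly quoted forms of the two approximations, s/√b and √(2((s+b) ln(1+s/b) − s)). The transcript does not give the pdfs of x, so the shapes f(x|s) = 3(1−x)² and f(x|b) = 3x² are assumed here purely for illustration; with them, s = stot(1 − (1−xcut)³) and b = btot·xcut³. The function names and the grid scan are likewise illustrative choices.

```python
import math

S_TOT, B_TOT = 10.0, 100.0  # total expected signal and background (from the problem)


def expected_counts(x_cut):
    """Expected s and b in x < x_cut under the ASSUMED pdfs 3(1-x)^2 and 3x^2."""
    s = S_TOT * (1.0 - (1.0 - x_cut) ** 3)
    b = B_TOT * x_cut ** 3
    return s, b


def z_simple(s, b):
    """Approximation intended for s << b."""
    return s / math.sqrt(b)


def z_improved(s, b):
    """Improved approximation for the median significance."""
    return math.sqrt(2.0 * ((s + b) * math.log(1.0 + s / b) - s))


# Evaluate both approximations at x_cut = 0.1.
s, b = expected_counts(0.1)
print(f"x_cut = 0.1: s = {s:.3f}, b = {b:.3f}")
print(f"  s/sqrt(b) = {z_simple(s, b):.2f}, improved = {z_improved(s, b):.2f}")

# Coarse grid scan for the cut that maximizes the improved median Z.
grid = [i / 1000.0 for i in range(1, 1000)]
best_x_cut = max(grid, key=lambda x: z_improved(*expected_counts(x)))
best_z = z_improved(*expected_counts(best_x_cut))
print(f"best x_cut = {best_x_cut:.3f}, median Z = {best_z:.2f}")
```

Under the assumed pdfs, xcut = 0.1 gives s = 2.71 and b = 0.1, so s is not small compared to b and the two approximations differ substantially. A grid scan is used rather than a numerical optimizer to keep the sketch dependency-free.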

Using the x values
Instead of just counting events with x < xcut, we can define a statistic that takes into account all the values of x, i.e., the data are n, x1, ..., xn. Later we will discuss ways of doing this with the likelihood ratio Ls+b/Lb, which leads to a statistic that depends on every xi. Using www.pp.rhul.ac.uk/~cowan/stat/invisibles/mc/invisibleMC.cc, find the distribution of this statistic under the “b” and “s+b” hypotheses. From these find the median, assuming the s+b hypothesis, of the significance of the b (i.e., s = 0) hypothesis. Compare with the result from the experiment based only on counting n events.
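The exercise above can be prototyped with a toy Monte Carlo before turning to invisibleMC.cc. The sketch below is not a port of that program; it rests on two labeled assumptions. First, the pdfs are taken to be f(x|s) = 3(1−x)² and f(x|b) = 3x², which the transcript does not specify. Second, the statistic is taken to be q = −2 ln(Ls+b/Lb) for the extended likelihood, which reduces to q = 2·stot − 2 Σ ln(1 + stot·f(xi|s) / (btot·f(xi|b))). All function names are hypothetical.

```python
import math
import random
from statistics import NormalDist, median

S_TOT, B_TOT = 10.0, 100.0  # total expected signal and background (from the problem)


def sample_poisson(mean, rng):
    """Knuth's multiplication method; adequate for means of order 100."""
    limit, k, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def sample_x(is_signal, rng):
    """Inverse-transform sampling from the ASSUMED pdfs 3(1-x)^2 and 3x^2."""
    root = (1.0 - rng.random()) ** (1.0 / 3.0)  # cube root of u in (0, 1]
    return 1.0 - root if is_signal else root


def q_statistic(xs):
    """q = -2 ln(L_{s+b} / L_b) for the assumed pdfs; small q is signal-like."""
    total = 0.0
    for x in xs:
        x = max(x, 1e-12)  # guard against division by zero at x = 0
        ratio = (S_TOT / B_TOT) * ((1.0 - x) / x) ** 2  # s_tot f_s / (b_tot f_b)
        total += math.log1p(ratio)
    return 2.0 * S_TOT - 2.0 * total


def run_experiments(n_exp, with_signal, rng):
    """Simulate n_exp experiments and return the list of q values."""
    qs = []
    for _ in range(n_exp):
        xs = [sample_x(False, rng) for _ in range(sample_poisson(B_TOT, rng))]
        if with_signal:
            xs += [sample_x(True, rng) for _ in range(sample_poisson(S_TOT, rng))]
        qs.append(q_statistic(xs))
    return qs


rng = random.Random(12345)
q_b = run_experiments(2000, False, rng)
q_sb = run_experiments(2000, True, rng)

# p-value of the b hypothesis at the median q expected under s+b.
med_sb = median(q_sb)
n_tail = sum(q <= med_sb for q in q_b)
if n_tail > 0:
    z_med = NormalDist().inv_cdf(1.0 - n_tail / len(q_b))
    print(f"median q (s+b) = {med_sb:.2f}, median Z = {z_med:.2f}")
else:
    z_bound = NormalDist().inv_cdf(1.0 - 1.0 / len(q_b))
    print(f"median q (s+b) = {med_sb:.2f}, no b experiments in tail: Z > {z_bound:.2f}")
```

If no background-only experiment falls below the s+b median, the sample is too small to resolve the p-value and only a lower bound on Z can be quoted; many more background-only experiments are then needed before comparing with the counting result.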