G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 2: Multivariate Methods (I) 清华大学高能物理研究中心 2010 年 4 月 12—16.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

AP STUDY SESSION 2.
1
05/11/2006Prof. dr hab. Elżbieta Richter-Wąs Physics Program of the experiments at L arge H adron C ollider Lecture 4.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
STATISTICS Joint and Conditional Distributions
STATISTICS HYPOTHESES TEST (I)
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
UNITED NATIONS Shipment Details Report – January 2006.
David Burdett May 11, 2004 Package Binding for WS CDL.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
Experimental Particle Physics PHYS6011 Joel Goldstein, RAL 1.Introduction & Accelerators 2.Particle Interactions and Detectors (2) 3.Collider Experiments.
Chapter 7 Sampling and Sampling Distributions
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
Statistical Issues at LHCb Yuehong Xie University of Edinburgh (on behalf of the LHCb Collaboration) PHYSTAT-LHC Workshop CERN, Geneva June 27-29, 2007.
Solve Multi-step Equations
Break Time Remaining 10:00.
Turing Machines.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
Detection Chia-Hsin Cheng. Wireless Access Tech. Lab. CCU Wireless Access Tech. Lab. 2 Outlines Detection Theory Simple Binary Hypothesis Tests Bayes.
PP Test Review Sections 6-1 to 6-6
Bright Futures Guidelines Priorities and Screening Tables
Bellwork Do the following problem on a ½ sheet of paper and turn in.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
Hypothesis Tests: Two Independent Samples
Top Quark Mass Measurements at Hadron Colliders G. WATTS (UW/SEATTLE, CPPM) For the DZERO, CDF, CMS, and ATLAS collaborations July 15, 2014.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
Adding Up In Chunks.
Computing and Statistical Data Analysis / Stat 4
Topics in statistical data analysis for particle physics
Artificial Intelligence
1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Analyzing Genes and Genomes
Introduction to the LHCb masterclass exercise
1 Titre de la diapositive SDMO Industries – Training Département MICS KERYS 09- MICS KERYS – WEBSITE.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Clock will move after 1 minute
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Simple Linear Regression Analysis
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan SUSSP65, St Andrews, August 2009 / Statistical Methods 3 page 1 Statistical Methods in Particle Physics Lecture 3: Multivariate Methods.
G. Cowan Lectures on Statistical Data Analysis Lecture 7 page 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 3: Multivariate Methods (II) 清华大学高能物理研究中心 2010 年 4 月 12—16.
G. Cowan CLASHEP 2011 / Topics in Statistical Data Analysis / Lecture 21 Topics in Statistical Data Analysis for HEP Lecture 2: Statistical Tests CERN.
1 Glen Cowan Multivariate Statistical Methods in Particle Physics Machine Learning and Multivariate Statistical Methods in Particle Physics Glen Cowan.
G. Cowan IDPASC School of Flavour Physics, Valencia, 2-7 May 2013 / Statistical Analysis Tools 1 Statistical Analysis Tools for Particle Physics IDPASC.
1 Introduction to Statistics − Day 2 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
Kinematics of Top Decays in the Dilepton and the Lepton + Jets channels: Probing the Top Mass University of Athens - Physics Department Section of Nuclear.
Recent progress in multivariate methods for particle physics
Computing and Statistical Data Analysis Stat 5: Multivariate Methods
SUSSP65, St Andrews, August 2009 / Statistical Methods 3
Machine Learning and Multivariate Statistical Methods in Particle Physics Glen Cowan RHUL Physics RHUL Computer Science Seminar.
Presentation transcript:

G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 2: Multivariate Methods (I) 清华大学高能物理研究中心 2010 年 4 月 12—16 日 Glen Cowan Physics Department Royal Holloway, University of London

G. Cowan Statistical Methods in Particle Physics2 Outline of lectures Day #1: Introduction Review of probability and Monte Carlo Review of statistics: parameter estimation Day #2: Multivariate methods (I) Event selection as a statistical test Cut-based, linear discriminant, neural networks Day #3: Multivariate methods (II) More multivariate classifiers: BDT, SVM,... Day #4: Significance tests for discovery and limits Including systematics using profile likelihood Day #5: Bayesian methods Bayesian parameter estimation and model selection

G. Cowan Statistical Methods in Particle Physics3 Day #2: outline Multivariate methods for HEP Event selection as a statistical test Neyman-Pearson lemma and likelihood ratio test Some multivariate classifiers Cut-based event selection Linear classifiers Neural networks Probability density estimation methods

G. Cowan Statistical Methods in Particle Physicspage 4

G. Cowan Statistical Methods in Particle Physicspage 5

G. Cowan Statistical Methods in Particle Physicspage 6 The Large Hadron Collider Counter-rotating proton beams in 27 km circumference ring pp centre-of-mass energy 14 TeV Detectors at 4 pp collision points: ATLAS CMS LHCb (b physics) ALICE (heavy ion physics) general purpose

G. Cowan Statistical Methods in Particle Physicspage 7 The ATLAS detector 2100 physicists 37 countries 167 universities/labs 25 m diameter 46 m length 7000 tonnes ~10 8 electronic channels

G. Cowan Statistical Methods in Particle Physicspage 8 LHC event production rates most events (boring) interesting very interesting (~1 out of every ) mildly interesting

G. Cowan Statistical Methods in Particle Physicspage 9 LHC data At LHC, ~10 9 pp collision events per second, mostly uninteresting do quick sifting, record ~200 events/sec single event ~ 1 Mbyte 1 “year”  10 7 s, pp collisions / year 2  10 9 events recorded / year (~2 Pbyte / year) For new/rare processes, rates at LHC can be vanishingly small e.g. Higgs bosons detectable per year could be ~10 3 → 'needle in a haystack' For Standard Model and (many) non-SM processes we can generate simulated data with Monte Carlo programs (including simulation of the detector).

G. Cowan Statistical Methods in Particle Physicspage 10 A simulated SUSY event in ATLAS high p T muons high p T jets of hadrons missing transverse energy p p

G. Cowan Statistical Methods in Particle Physicspage 11 Background events This event from Standard Model ttbar production also has high p T jets and muons, and some missing transverse energy. → can easily mimic a SUSY event.

G. Cowan Statistical Methods in Particle Physicspage 12 A simulated event PYTHIA Monte Carlo pp → gluino-gluino......

G. Cowan Statistical Methods in Particle Physicspage 13 Event selection as a statistical test For each event we measure a set of numbers: x 1 = jet p T x 2 = missing energy x 3 = particle i.d. measure,... follows some n-dimensional joint probability density, which depends on the type of event produced, i.e., was it E.g. hypotheses H 0, H 1,... Often simply “signal”, “background”

G. Cowan Statistical Methods in Particle Physicspage 14 Finding an optimal decision boundary In particle physics usually start by making simple “cuts”: x i < c i x j < c j Maybe later try some other type of decision boundary: H0H0 H0H0 H0H0 H1H1 H1H1 H1H1

G. Cowan Statistical Methods in Particle Physicspage 15

G. Cowan Statistical Methods in Particle Physicspage 16

G. Cowan Statistical Methods in Particle Physicspage 17

G. Cowan Statistical Methods in Particle Physicspage 18

G. Cowan Statistical Methods in Particle Physicspage 19

G. Cowan Statistical Methods in Particle Physicspage 20 Two distinct event selection problems In some cases, the event types in question are both known to exist. Example: separation of different particle types (electron vs muon) Use the selected sample for further study. In other cases, the null hypothesis H 0 means "Standard Model" events, and the alternative H 1 means "events of a type whose existence is not yet established" (to do so is the goal of the analysis). Many subtle issues here, mainly related to the heavy burden of proof required to establish presence of a new phenomenon. Typically require p-value of background-only hypothesis below ~ 10  (a 5 sigma effect) to claim discovery of "New Physics".

G. Cowan Statistical Methods in Particle Physicspage 21 Using classifier output for discovery y f(y)f(y) y N(y)N(y) Normalized to unity Normalized to expected number of events excess? signal background search region Discovery = number of events found in search region incompatible with background-only hypothesis. p-value of background-only hypothesis can depend crucially distribution f(y|b) in the "search region". y cut

G. Cowan Statistical Methods in Particle Physicspage 22 Example of a "cut-based" study In the 1990s, the CDF experiment at Fermilab (Chicago) measured the number of hadron jets produced in proton-antiproton collisions as a function of their momentum perpendicular to the beam direction: Prediction low relative to data for very high transverse momentum. "jet" of particles

G. Cowan Statistical Methods in Particle Physicspage 23 High p T jets = quark substructure? Although the data agree remarkably well with the Standard Model (QCD) prediction overall, the excess at high p T appears significant: The fact that the variable is "understandable" leads directly to a plausible explanation for the discrepancy, namely, that quarks could possess an internal substructure. Would not have been the case if the variable plotted was a complicated combination of many inputs.

G. Cowan Statistical Methods in Particle Physicspage 24 High p T jets from parton model uncertainty Furthermore the physical understanding of the variable led one to a more plausible explanation, namely, an uncertain modeling of the quark (and gluon) momentum distributions inside the proton. When model adjusted, discrepancy largely disappears: Can be regarded as a "success" of the cut-based approach. Physical understanding of output variable led to solution of apparent discrepancy.

G. Cowan Statistical Methods in Particle Physicspage 25

G. Cowan Statistical Methods in Particle Physicspage 26

G. Cowan Statistical Methods in Particle Physicspage 27

G. Cowan Statistical Methods in Particle Physicspage 28

G. Cowan Statistical Methods in Particle Physicspage 29

G. Cowan Statistical Methods in Particle Physicspage 30

G. Cowan Statistical Methods in Particle Physicspage 31

G. Cowan Statistical Methods in Particle Physicspage 32

G. Cowan Statistical Methods in Particle Physicspage 33

G. Cowan Statistical Methods in Particle Physicspage 34

G. Cowan Statistical Methods in Particle Physicspage 35

G. Cowan Statistical Methods in Particle Physicspage 36

G. Cowan Statistical Methods in Particle Physicspage 37 Neural network example from LEP II Signal: e  e  → W  W  (often 4 well separated hadron jets) Background: e  e  → qqgg (4 less well separated hadron jets) ← input variables based on jet structure, event shape,... none by itself gives much separation. Neural network output: (Garrido, Juste and Martinez, ALEPH )

G. Cowan Statistical Methods in Particle Physicspage 38 Some issues with neural networks In the example with WW events, goal was to select these events so as to study properties of the W boson. Needed to avoid using input variables correlated to the properties we eventually wanted to study (not trivial). In principle a single hidden layer with an sufficiently large number of nodes can approximate arbitrarily well the optimal test variable (likelihood ratio). Usually start with relatively small number of nodes and increase until misclassification rate on validation data sample ceases to decrease. Often MC training data is cheap -- problems with getting stuck in local minima, overtraining, etc., less important than concerns of systematic differences between the training data and Nature, and concerns about the ease of interpretation of the output.

G. Cowan Statistical Methods in Particle Physicspage 39 Overtraining training sample independent test sample If decision boundary is too flexible it will conform too closely to the training points → overtraining. Monitor by applying classifier to independent test sample.

G. Cowan Statistical Methods in Particle Physicspage 40 validation sample training sample Monitoring overtraining We can monitor the misclassification rate (or value of the error function) as a function of some parameter related to the level of flexibility of the decision boundary, such as the number of nodes in the hidden layer. For the data sample used to train the network, the error rate continues to decrease, but for an independent validation sample, it will level off and even increase. error rate number of nodes

G. Cowan Statistical Methods in Particle Physicspage 41

G. Cowan Statistical Methods in Particle Physicspage 42

G. Cowan Statistical Methods in Particle Physics43

G. Cowan Statistical Methods in Particle Physics44

G. Cowan Statistical Methods in Particle Physics45

G. Cowan Statistical Methods in Particle Physics46

G. Cowan Statistical Methods in Particle Physics47

G. Cowan Statistical Methods in Particle Physics48

G. Cowan Statistical Methods in Particle Physics49

G. Cowan Statistical Methods in Particle Physics50

G. Cowan Statistical Methods in Particle Physics51

G. Cowan Statistical Methods in Particle Physics52

G. Cowan Statistical Methods in Particle Physics53

G. Cowan Statistical Methods in Particle Physics54

G. Cowan Statistical Methods in Particle Physicspage 55 Summary Information from many variables can be used to distinguish between event types. Try to exploit as much information as possible. Try to keep method as simple as possible. Often start with: cuts, linear classifiers And then try less simple methods: neural networks Tomorrow we will see some more multivariate classifiers: Probability density estimation methods Boosted Decision Trees Support Vector Machines